METHODS OF DIAGNOSIS OF LUNG CANCER, COMPOSITIONS AND METHODS
OF SCREENING FOR MODULATORS OF LUNG CANCER 



CROSS-REFERENCES TO RELATED APPLICATIONS 

This application is related to USSN 60/284,770, filed April 18, 2001; USSN 
60/290,492, filed May 10, 2001; USSN 60/334,370, filed November 29, 2001; USSN 
60/339,245, filed November 9, 2001; USSN 60/350,666, filed November 13, 2001; and 
USSN 60/xxx,xxx, filed April 12, 2002 (Docket OMNI-002P); each of which is incorporated
herein by reference in its entirety. 

FIELD OF THE INVENTION 
The invention relates to the identification of nucleic acid and protein expression 
profiles and nucleic acids, products, and antibodies thereto that are involved in lung cancer;
and to the use of such expression profiles and compositions in diagnosis and therapy of lung 
cancer. The invention further relates to methods for identifying and using agents and/or 
targets that inhibit lung cancer or related conditions. 

BACKGROUND OF THE INVENTION

Lung cancer is the second most commonly occurring cancer in the United States and 
is the leading cause of cancer-related death. It is estimated that there are over 160,000 new 
cases of lung cancer in the United States every year. Of those who are diagnosed with lung 
cancer, 86 percent will die within five years. Lung cancer is the most common visceral
cancer in men and accounts for nearly one third of all cancer deaths in both men and women.
In fact, lung cancer accounts for 7% of all deaths, due to any cause, in both men and women. 

Smoking is the primary cause of lung cancer, with more than 80% of lung cancers 
resulting from smoking. About 400 to 500 separate gaseous substances are present in the 
smoke of a non-filter cigarette. The most noteworthy substances include nitrogen oxides,
hydrogen cyanide, formaldehyde, benzene, and toluene. The particles present in cigarette
smoke contain at least 3,500 individual compounds such as nicotine, tobacco alkaloids 
(nornicotine, anatabine, anabasine), polycyclic aromatic hydrocarbons (e.g., benzo(a)pyrene, 
B(a)P), naphthalenes, aromatic amines, phenols, and tobacco-specific nitrosamines. 
Tobacco-specific nitrosamines are formed during tobacco curing and processing, and
are suspected of causing lung cancer in humans. In rodent studies, regardless of where or
how it is applied, the tobacco-specific nitrosamine known as NNK produces lung adenomas
and lung adenocarcinomas. The tobacco-specific nitrosamine known as NNAL also produces
lung adenocarcinomas in rodents.

Many of the chemicals found in cigarette smoke also affect the nonsmoker inhaling
"secondhand" or sidestream smoke. Indeed, the smoke inhaled by nonsmokers has a
chemical composition similar to the smoke inhaled by smokers, but, importantly, the
carcinogenic tobacco-specific nitrosamines are present in higher
concentrations in secondhand smoke. For this and other reasons, "passive smoking" is an
important cause of lung cancer, causing as many as 3,000 lung cancer deaths in nonsmokers
each year.

In addition to smoking, other factors thought to be causes of lung cancer include
on-the-job exposure to carcinogens such as asbestos and uranium, exposure to chemical hazards
such as radon, polycyclic aromatic hydrocarbons, chromium, nickel, and inorganic arsenic,
genetic factors, and diet.

Histological classification of the various lung cancers defines the types of cancer that
begin in the lung. See, e.g., Travis, et al. (1999) Histological Typing of Lung and Pleural
Tumours (International Histological Classification of Tumours, No. 1). Four major cell types
make up more than 88% of all primary lung neoplasms. These are: squamous or epidermoid
carcinoma, small cell (also called oat cell) carcinoma, adenocarcinoma, and large cell (also
called large cell anaplastic) carcinoma. The remainder include undifferentiated carcinomas,
carcinoids, bronchial gland tumors, and other rarer types. The various cell types have
different natural histories and responses to therapy, and, thus, a correct histologic diagnosis is
the first step of effective treatment.

Small cell lung cancer (SCLC) accounts for 18-25% of all lung cancers. It occurs
less frequently than non-small cell lung cancer, but generally spreads to distant organs more
rapidly than non-small cell lung cancer. In general, at the time of presentation small cell lung
cancers have already spread beyond the bounds where surgery with curative intent
can be undertaken. However, if identified early enough, these cancers are often responsive to
chemotherapy and thoracic radiation treatment.

Non-small cell lung cancers (NSCLC) are the more frequently occurring form of lung 
cancer. They comprise squamous cell carcinoma, adenocarcinoma, and large cell carcinoma 
and account for more than 75% of all lung cancers. Non-small cell tumors that are localized
at the time of presentation can sometimes be cured with surgery and/or radiotherapy, but
usually they are not identified until significant metastasis has occurred, and metastatic
tumors are typically not very responsive to surgical, chemotherapy, or radiation treatment.

The screening of asymptomatic persons at high risk for lung cancer has often proven
ineffective. In general, only 5 to 15 percent of lung cancer patients have their disease
detected while they are asymptomatic. Of course, early detection and treatment are critical
factors in the fight against lung cancer. The average survival rate is 49% for those whose
cancer is detected early, before the cancer has spread from the lung. Lung cancer often
spreads outside of the lung, and it may have spread to the bones or brain by the time it is
diagnosed. While the prognosis may be better for lung cancers that are detected early,
because of the lack of effective curative treatments, early detection does not necessarily alter
the total death rate from lung cancer.

Thus, methods for diagnosis and prognosis of lung cancer and effective treatment of
lung cancer would be desirable. Accordingly, provided herein are methods that can be used
in diagnosis and prognosis of lung cancer. Further provided are methods that can be used to 
screen candidate therapeutic agents for the ability to modulate, e.g., treat, lung cancer. 
Additionally, provided herein are molecular targets and compositions for therapeutic 
intervention in lung disease and other metastatic cancers. 


SUMMARY OF THE INVENTION 
The present invention provides nucleotide sequences of genes that are up- and
down-regulated in lung cancer cells. Such genes are useful for diagnostic purposes, and also as
targets for screening for therapeutic compounds that modulate lung cancer, such as
antibodies. The methods of detecting nucleic acids of the invention or their encoded proteins
can be used for a number of purposes. Examples include early detection of lung cancers,
monitoring and early detection of relapse following treatment of lung cancers, monitoring
response to therapy of lung cancers, determining prognosis of lung cancers, directing therapy
of lung cancers, selecting patients for postoperative chemotherapy or radiation therapy,
selecting therapy, determining tumor prognosis, treatment, or response to treatment, and early
detection of precancerous lesions of the lung. Examples of benign or precancerous lesions
include: atelectasis, emphysema, bronchitis, chronic obstructive pulmonary disease, fibrosis,
hypersensitivity pneumonitis (HP), interstitial pulmonary fibrosis (IPF), asthma, and
bronchiectasis. Other aspects of the invention will become apparent to the skilled artisan by
the following description of the invention.

In one aspect, the present invention provides a method of detecting a lung
cancer-associated transcript in a cell from a patient, the method comprising contacting a biological
sample from the patient with a polynucleotide that selectively hybridizes to a sequence at
least 80% identical to a sequence as shown in Tables 1A-16. Alternatively, the sample may
be contacted with a specific binding reagent, e.g., an antibody.

In one embodiment, the polynucleotide selectively hybridizes to a sequence at least 
95% identical to a sequence as shown in Tables 1A-16. In another embodiment, the 
polynucleotide comprises a sequence as shown in Tables 1A-16.

In one embodiment, the biological sample is a tissue sample, or a body fluid. In 
another embodiment, the biological sample comprises isolated nucleic acids, e.g., mRNA. 

In one embodiment, the polynucleotide is labeled, e.g., with a fluorescent label. In 
one embodiment, the polynucleotide is immobilized on a solid surface. In one embodiment, 
the patient is undergoing a therapeutic regimen to treat lung cancer. In another embodiment,
the patient is suspected of having lung cancer. In one embodiment, the patient is a primate, 
e.g., a human. 

In one embodiment, the method further comprises the step of amplifying nucleic acids 
before the step of contacting the biological sample with the polynucleotide. 

In another aspect, the present invention provides a method of monitoring the efficacy
of a therapeutic treatment of lung cancer, the method comprising the steps of: (i) providing a
biological sample from a patient undergoing the therapeutic treatment; and (ii) determining
the level of a lung cancer-associated transcript in the biological sample by contacting the
biological sample with a polynucleotide that selectively hybridizes to a sequence at least 80%
identical to a sequence as shown in Tables 1A-16, thereby monitoring the efficacy of the
therapy. Alternatively, the sample may be evaluated for protein, e.g., by contacting the sample
with an antibody.

In one embodiment, the method further comprises the step of: (iii) comparing the
level of the lung cancer-associated transcript to a level of the lung cancer-associated
transcript in a biological sample from the patient prior to, or earlier in, the therapeutic
treatment. Alternatively, the sample may be evaluated for comparison of protein.

In another aspect, the present invention provides a method of monitoring the efficacy
of a therapeutic treatment of lung cancer, the method comprising the steps of: (i) providing a
biological sample from a patient undergoing the therapeutic treatment; and (ii) determining
the level of a lung cancer-associated antibody in the biological sample by contacting the
biological sample with a polypeptide encoded by a polynucleotide that selectively hybridizes
to a sequence at least 80% identical to a sequence as shown in Tables 1A-16, wherein the
polypeptide specifically binds to the lung cancer-associated antibody, thereby monitoring the
efficacy of the therapy.

In one embodiment, the method further comprises the step of: (iii) comparing the 
level of the lung cancer-associated antibody to a level of the lung cancer-associated antibody 
in a biological sample from the patient prior to, or earlier in, the therapeutic treatment. 

In another aspect, the present invention provides a method of monitoring the efficacy
of a therapeutic treatment of lung cancer, the method comprising the steps of: (i) providing a
biological sample from a patient undergoing the therapeutic treatment; and (ii) determining
the level of a lung cancer-associated polypeptide in the biological sample by contacting the
biological sample with an antibody, wherein the antibody specifically binds to a polypeptide
encoded by a polynucleotide that selectively hybridizes to a sequence at least 80% identical
to a sequence as shown in Tables 1A-16, thereby monitoring the efficacy of the therapy.

In one embodiment, the method further comprises the step of: (iii) comparing the
level of the lung cancer-associated polypeptide to a level of the lung cancer-associated
polypeptide in a biological sample from the patient prior to, or earlier in, the therapeutic
treatment.

In one aspect, the present invention provides an isolated nucleic acid molecule
consisting of a polynucleotide sequence as shown in Tables 1A-16. In one embodiment, an
expression vector or cell comprises the isolated nucleic acid. In one aspect, the present
invention provides an isolated polypeptide which is encoded by a nucleic acid molecule
having a polynucleotide sequence as shown in Tables 1A-16.

In another aspect, the present invention provides an antibody that specifically binds to
an isolated polypeptide which is encoded by a nucleic acid molecule having a polynucleotide
sequence as shown in Tables 1A-16. In one embodiment, the antibody is conjugated to an
effector component, e.g., a fluorescent label, a radioisotope, or a cytotoxic chemical. In one
embodiment, the antibody is an antibody fragment. In another embodiment, the antibody is
humanized.

In one aspect, the present invention provides a method of detecting lung cancer in a
patient, the method comprising contacting a biological sample from the patient with an 
antibody or protein as described herein. 






WO 02/086443 PCT/US02/12476 

In another aspect, the present invention provides a method of detecting antibodies 
specific to a lung cancer gene in a patient, the method comprising contacting a biological 
sample from the patient with a polypeptide encoded by a nucleic acid that comprises a sequence
from Tables 1A-16. 

In another aspect, the present invention provides a method for identifying a compound
that modulates a lung cancer-associated polypeptide, the method comprising the steps of: (i)
contacting the compound with a lung cancer-associated polypeptide, the polypeptide encoded
by a polynucleotide that selectively hybridizes to a sequence at least 80% identical to a
sequence as shown in Tables 1A-16; and (ii) determining the functional effect of the
compound upon the polypeptide.

In one embodiment, the functional effect is a physical effect, an enzymatic effect, or a 
chemical effect. In one embodiment, the polypeptide is expressed in a eukaryotic host cell or 
cell membrane. In another embodiment, the polypeptide is recombinant. In one 
embodiment, the functional effect is determined by measuring ligand binding to the
polypeptide.

In another aspect, the present invention provides a method of inhibiting proliferation
or another critical process of a lung cancer-associated cell to treat lung cancer in a patient, the
method comprising the step of administering to the patient a therapeutically effective amount
of a compound identified as described herein. In one embodiment, the compound is an
antibody.

In another aspect, the present invention provides a drug screening assay comprising
the steps of: (i) administering a test compound to a mammal having lung cancer or a cell
isolated therefrom; and (ii) comparing the level of gene expression of a polynucleotide that
selectively hybridizes to a sequence at least 80% identical to a sequence as shown in Tables
1A-16 in a treated cell or mammal with the level of gene expression of the polynucleotide in
a control cell or mammal, wherein a test compound that modulates the level of expression of
the polynucleotide is a candidate for the treatment of lung cancer.

In one embodiment, the control is a mammal with lung cancer or a cell therefrom that 
has not been treated with the test compound. In another embodiment, the control is a normal 
cell or mammal, or a cell or mammal having a non-malignant lung disease.

In another aspect, the present invention provides a method for treating a mammal 
having lung cancer comprising administering a compound identified by the assay described 
herein. 






In another aspect, the present invention provides a pharmaceutical composition for
treating a mammal having lung cancer, the composition comprising a compound identified by
the assay described herein and a physiologically acceptable excipient.

DETAILED DESCRIPTION OF THE INVENTION

In accordance with the objects outlined above, the present invention provides novel
methods for diagnosis and treatment of lung disease or cancer, as well as methods for
screening for compositions which modulate lung cancer. "Treatment, monitoring, detection
or modulation of lung disease or cancer" includes treatment, monitoring, detection, or
modulation of lung disease in those patients who have lung disease (whether malignant or
non-malignant, e.g., emphysema, bronchitis, or fibrosis) as well as patients with lung cancers
in which gene expression from a gene in Tables 1A-16 is increased or decreased, indicating
that the subject is more likely to have disease. In particular, while these targets are identified
primarily from lung cancer samples, these same targets are likely to be similarly found in
analyses of other medical conditions. These other conditions may result from similar
pathological processes which affect similar tissues, e.g., lung cancer, small cell lung
carcinoma (oat cell carcinoma), non-small cell carcinomas (e.g., squamous cell carcinoma,
adenocarcinoma, large cell lung carcinoma, carcinoid, granulomatous), fibrosis (idiopathic
pulmonary fibrosis (IPF), hypersensitivity pneumonitis (HP), interstitial pneumonitis,
nonspecific idiopathic pneumonitis (NSIP)), chronic obstructive pulmonary disease (COPD,
e.g., emphysema, chronic bronchitis), asthma, bronchiectasis, and esophageal cancer. See,
e.g., the NCI webpage and USSN 60/347,349 and USSN 60/xxx,xxx (docket LFBR-001-1P,
filed March 29, 2002), each of which is incorporated herein by reference. The treatment may
be of the lung cancer or related condition itself, or treatment of metastasis.

In particular, identification of markers selectively expressed on these cancers allows
for use of that expression in diagnostic, prognostic, or therapeutic methods. As such, the
invention defines various compositions, e.g., nucleic acids, polypeptides, antibodies, and
small molecule agonists/antagonists, which will be useful to selectively identify those
markers. For example, therapeutic methods may take the form of protein therapeutics which
use the marker expression for selective localization or modulation of function (for those
markers which have a causative disease effect), for vaccines, identification of binding
partners, or antagonism, e.g., using antisense or RNAi. The markers may be useful for
molecular characterization of subsets of lung diseases, which subsets may actually require
very different treatments. Moreover, the markers may also be important in diseases related to
the specific cancers, e.g., those which affect similar tissues in non-malignant diseases, or have
similar mechanisms of induction/maintenance. Metastatic processes or characteristics may
also be targeted. Diagnostic and prognostic uses are made available, e.g., to subset related
but distinct diseases, or to determine treatment strategy. The detection methods may be based
upon nucleic acid, e.g., PCR or hybridization techniques, or protein, e.g., ELISA, imaging,
IHC, etc. The diagnosis may be qualitative or quantitative, and may detect increases or
decreases in expression levels.

Tables 1A-16 provide unigene cluster identification numbers for the nucleotide
sequence of genes that exhibit increased or decreased expression in lung cancer samples. The
tables also provide an exemplar accession number that provides a nucleotide sequence that is
part of the unigene cluster. In Table 1A, genes marked as "target 1" or "target 2" are
particularly useful as therapeutic targets. Genes marked as "target 3" are particularly useful
as diagnostic markers. Genes marked as "chron" are upregulated in chronically diseased lung
(e.g., emphysema, bronchitis, fibrosis) relative to lung tumors and normal tissue. In certain
analyses, the ratio for the "chron" category was determined using the 70th percentile of
chronically diseased lung samples divided by the 90th percentile of normal lung samples.
The ratio for the targets was determined using the 70th percentile of lung tumor samples
divided by the 90th percentile of normal lung samples.
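
By way of illustration only, the percentile ratio described above may be sketched as follows. The function names, the linear-interpolation percentile rule, and the input format (one expression value per sample) are assumptions made for this example; the specification does not state which percentile convention was used in the analyses.

```python
from typing import Sequence


def percentile(values: Sequence[float], pct: float) -> float:
    """Return the pct-th percentile using linear interpolation between ranks.

    The interpolation rule is an assumption; other conventions exist.
    """
    if not values:
        raise ValueError("values must not be empty")
    if not 0.0 <= pct <= 100.0:
        raise ValueError("pct must be between 0 and 100")
    ordered = sorted(values)
    # Fractional 0-based rank of the requested percentile in the sorted list.
    rank = (len(ordered) - 1) * pct / 100.0
    lower = int(rank)
    upper = min(lower + 1, len(ordered) - 1)
    fraction = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * fraction


def expression_ratio(
    disease: Sequence[float],
    normal: Sequence[float],
    disease_pct: float = 70.0,
    normal_pct: float = 90.0,
) -> float:
    """Ratio of the 70th percentile of disease samples (tumor or chronically
    diseased lung) to the 90th percentile of normal lung samples."""
    denominator = percentile(normal, normal_pct)
    if denominator == 0:
        raise ZeroDivisionError("normal-sample percentile is zero")
    return percentile(disease, disease_pct) / denominator
```

Under these assumptions, a ratio well above 1 would indicate that a gene is expressed at a higher level in most disease samples than in nearly all normal samples.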


Definitions 

The term "lung cancer protein" or "lung cancer polynucleotide" or "lung
cancer-associated transcript" refers to nucleic acid and polypeptide polymorphic variants, alleles,
mutants, and interspecies homologs that: (1) have a nucleotide sequence that has greater than
about 60% nucleotide sequence identity, 65%, 70%, 75%, 80%, 85%, 90%, preferably 91%,
92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% or greater nucleotide sequence identity,
preferably over a region of at least about 25, 50, 100, 200, 500, 1000, or
more nucleotides, to a nucleotide sequence of or associated with a unigene cluster of Tables
1A-16; (2) bind to antibodies, e.g., polyclonal antibodies, raised against an immunogen
comprising an amino acid sequence encoded by a nucleotide sequence of or associated with a
unigene cluster of Tables 1A-16, and conservatively modified variants thereof; (3)
specifically hybridize under stringent hybridization conditions to a nucleic acid sequence, or
the complement thereof, of Tables 1A-16 and conservatively modified variants thereof; or (4)
have an amino acid sequence that has greater than about 60% amino acid sequence identity,
65%, 70%, 75%, 80%, 85%, 90%, preferably 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%,
or 99% or greater amino acid sequence identity, preferably over a region of at
least about 25, 50, 100, 200, 500, 1000, or more amino acids, to an amino acid sequence
encoded by a nucleotide sequence of or associated with a unigene cluster of Tables 1A-16. A
polynucleotide or polypeptide sequence is typically from a mammal including, but not
limited to, primate, e.g., human; rodent, e.g., rat, mouse, hamster; cow, pig, horse, sheep, or
other mammal. A "lung cancer polypeptide" and a "lung cancer polynucleotide" include
both naturally occurring and recombinant forms.

A "full length" lung cancer protein or nucleic acid refers to a lung cancer polypeptide
or polynucleotide sequence, or a variant thereof, that contains the elements normally
contained in one or more naturally occurring, wild type lung cancer polynucleotide or
polypeptide sequences. The "full length" may be prior to, or after, various stages of
post-translational processing or splicing, including alternative splicing.

"Biological sample" as used herein is a sample of biological tissue or fluid that
contains nucleic acids or polypeptides, e.g., of a lung cancer protein, polynucleotide, or
transcript. Such samples include, but are not limited to, tissue isolated from primates, e.g.,
humans, or rodents, e.g., mice and rats. Biological samples may also include sections of
tissues such as biopsy and autopsy samples, frozen sections taken for histologic purposes,
archival materials, blood, plasma, serum, sputum, stool, tears, mucus, hair, skin, etc.
Biological samples also include explants and primary and/or transformed cell cultures
derived from patient tissues. A biological sample is typically obtained from a eukaryotic
organism, most preferably a mammal such as a primate, e.g., chimpanzee or human; cow;
dog; cat; a rodent, e.g., guinea pig, rat, mouse; rabbit; or other mammal; or a bird; reptile;
or fish. Livestock and domestic animals are of interest.

"Providing a biological sample" means to obtain a biological sample for use in 
methods described in this invention. Most often, this will be done by removing a sample of 
cells from an animal, but can also be accomplished by using previously isolated cells (e.g., 
isolated by another person, at another time, and/or for another purpose), or by performing the
methods of the invention in vivo. Archival tissues or materials, having treatment or outcome
history, will be particularly useful. 

The terms "identical" or percent "identity," in the context of two or more nucleic
acids or polypeptide sequences, refer to two or more sequences or subsequences that are the
same or have a specified percentage of amino acid residues or nucleotides that are the same
(e.g., about 60% identity, preferably 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%,
95%, 96%, 97%, 98%, 99%, or higher identity over a specified region, when compared and
aligned for maximum correspondence over a comparison window or designated region) as
measured using, e.g., the BLAST or BLAST 2.0 sequence comparison algorithms with default
parameters described below, or by manual alignment and visual inspection (see, e.g., NCBI
web site http://www.ncbi.nlm.nih.gov/BLAST/ or the like). Such sequences are then said to
be "substantially identical." This definition also refers to, or may be applied to, the
complement of a test sequence. The definition also includes sequences that have deletions
and/or insertions, substitutions, and naturally occurring, e.g., polymorphic or allelic variants,
and man-made variants. As described below, the preferred algorithms can account for gaps
and the like. Preferably, identity exists over a region that is at least about 25 amino acids or
nucleotides in length, or more preferably over a region that is 50-100 amino acids or
nucleotides in length.
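
For illustration, percent identity over an already aligned region may be computed as in the following minimal sketch. The function name, the use of "-" as the gap character, and the choice to count gapped columns in the denominator are assumptions made for this example; alignment programs differ in how they treat gaps when reporting identity.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent of alignment columns in which both sequences carry the same residue.

    Both inputs must already be aligned (equal length, gaps marked with `gap`).
    Columns where both sequences have a gap are ignored; columns where only one
    sequence has a gap count as non-identical (an assumed convention).
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    matches = 0
    compared = 0
    for res_a, res_b in zip(aligned_a.upper(), aligned_b.upper()):
        if res_a == gap and res_b == gap:
            continue  # column carries no residue in either sequence
        compared += 1
        if res_a == res_b:
            matches += 1
    if compared == 0:
        raise ValueError("no comparable columns in the alignment")
    return 100.0 * matches / compared
```

A pair of sequences could then be tested against any of the recited thresholds, e.g., `percent_identity(a, b) >= 80.0`, provided the aligned region also meets the preferred minimum length.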

For sequence comparison, typically one sequence acts as a reference sequence, to
which test sequences are compared. When using a sequence comparison algorithm, test and
reference sequences are entered into a computer, subsequence coordinates are designated, if
necessary, and sequence algorithm program parameters are designated. Preferably, default
program parameters can be used, or alternative parameters can be designated. The sequence
comparison algorithm then calculates the percent sequence identities for the test sequences
relative to the reference sequence, based on the program parameters.

A "comparison window", as used herein, includes reference to a segment of
contiguous positions selected from the group consisting typically of from 20 to 600, usually
about 50 to about 200, more usually about 100 to about 150, in which a sequence may be
compared to a reference sequence of the same number of contiguous positions after the two
sequences are optimally aligned. Methods of alignment of sequences for comparison are
well-known in the art. Optimal alignment of sequences for comparison can be conducted,
e.g., by the local homology algorithm of Smith and Waterman (1981) Adv. Appl. Math.
2:482, by the homology alignment algorithm of Needleman and Wunsch (1970) J. Mol. Biol.
48:443, by the search for similarity method of Pearson and Lipman (1988) Proc. Nat'l. Acad.
Sci. USA 85:2444, by computerized implementations of these algorithms (GAP, BESTFIT,
FASTA, and TFASTA in the Wisconsin Genetics Software Package, Genetics Computer
Group, 575 Science Dr., Madison, WI), or by manual alignment and visual inspection (see,
e.g., Ausubel, et al. (eds. 1995 and supplements) Current Protocols in Molecular Biology).
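
As an illustration of the local homology approach, the following is a minimal sketch of a Smith-Waterman-style local alignment score. It is a simplification and not any of the packaged implementations named above: it uses a linear gap penalty, and the function name and the scoring values (match, mismatch, gap) are assumptions chosen for the example.

```python
def local_alignment_score(
    seq_a: str,
    seq_b: str,
    match: int = 2,
    mismatch: int = -1,
    gap: int = -1,
) -> int:
    """Best local alignment score between seq_a and seq_b (dynamic programming).

    Each cell holds the best score of any local alignment ending at that pair
    of positions; scores are floored at zero so an alignment may start anywhere.
    """
    previous = [0] * (len(seq_b) + 1)
    best = 0
    for i in range(1, len(seq_a) + 1):
        current = [0] * (len(seq_b) + 1)
        for j in range(1, len(seq_b) + 1):
            substitution = match if seq_a[i - 1] == seq_b[j - 1] else mismatch
            current[j] = max(
                0,                             # start a new local alignment
                previous[j - 1] + substitution,  # align the two residues
                previous[j] + gap,             # gap in seq_b
                current[j - 1] + gap,          # gap in seq_a
            )
            best = max(best, current[j])
        previous = current
    return best
```

Only two rows of the scoring matrix are kept in memory, which is sufficient when the score alone is needed; recovering the aligned residues would require retaining the full matrix and tracing back from the best cell.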

Preferred examples of algorithms that are suitable for determining percent sequence
identity and sequence similarity include the BLAST and BLAST 2.0 algorithms, which are
described in Altschul, et al. (1997) Nuc. Acids Res. 25:3389-3402 and Altschul, et al. (1990)
J. Mol. Biol. 215:403-410. BLAST and BLAST 2.0 are used, with the parameters described
herein, to determine percent sequence identity for the nucleic acids and proteins of the
invention. Software for performing BLAST analyses is publicly available through the
National Center for Biotechnology Information (http://www.ncbi.nlm.nih.gov/). This
algorithm involves first identifying high scoring sequence pairs (HSPs) by identifying short
words of length W in the query sequence, which either match or satisfy some positive-valued
threshold score T when aligned with a word of the same length in a database sequence. T is
referred to as the neighborhood word score threshold (Altschul, et al., supra). These initial
neighborhood word hits act as seeds for initiating searches to find longer HSPs containing
them. The word hits are extended in both directions along each sequence for as far as the
cumulative alignment score can be increased. Cumulative scores are calculated using, e.g.,
for nucleotide sequences, the parameters M (reward score for a pair of matching residues;
always > 0) and N (penalty score for mismatching residues; always < 0). For amino acid
sequences, a scoring matrix is used to calculate the cumulative score. Extension of the word
hits in each direction is halted when: the cumulative alignment score falls off by the
quantity X from its maximum achieved value; the cumulative score goes to zero or below,
due to the accumulation of one or more negative-scoring residue alignments; or the end of
either sequence is reached. The BLAST algorithm parameters W, T, and X determine the
sensitivity and speed of the alignment. The BLASTN program (for nucleotide sequences)
uses as defaults a wordlength (W) of 11, an expectation (E) of 10, M=5, N=-4, and a
comparison of both strands. For amino acid sequences, the BLASTP program uses as
defaults a wordlength of 3, an expectation (E) of 10, and the BLOSUM62 scoring matrix
(see Henikoff and Henikoff (1992) Proc. Natl. Acad. Sci. USA 89:10915), alignments (B) of
50, expectation (E) of 10, M=5, N=-4, and a comparison of both strands.
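
The extension and halting rules just described can be sketched, for one direction only and without gaps, as follows. This is an illustrative simplification and not the BLAST implementation: the function name and the default value of X are assumptions, the seed word is assumed to match exactly, and only M=5 and N=-4 are taken from the defaults recited above.

```python
from typing import Tuple


def extend_seed_right(
    query: str,
    subject: str,
    q_pos: int,
    s_pos: int,
    word_len: int,
    reward: int = 5,     # M: score for a pair of matching residues
    penalty: int = -4,   # N: score for a pair of mismatching residues
    x_drop: int = 20,    # X: illustrative value, not a documented default
) -> Tuple[int, int]:
    """Extend an exact word hit to the right without gaps.

    Returns (best_score, best_length), where best_length counts the seed word
    plus the extension that achieved the maximum cumulative score.
    """
    if q_pos + word_len > len(query) or s_pos + word_len > len(subject):
        raise ValueError("seed word does not fit inside both sequences")
    score = word_len * reward  # the seed is assumed to match exactly
    best_score, best_length = score, word_len
    offset = word_len
    # Halt when the end of either sequence is reached.
    while q_pos + offset < len(query) and s_pos + offset < len(subject):
        same = query[q_pos + offset] == subject[s_pos + offset]
        score += reward if same else penalty
        offset += 1
        if score > best_score:
            best_score, best_length = score, offset
        # Halt when the score falls off by X from its maximum, or reaches zero.
        if best_score - score >= x_drop or score <= 0:
            break
    return best_score, best_length
```

A larger X lets the extension pass through a short run of mismatches to reach further matching residues, at the cost of more computation; a smaller X stops sooner, which is the sensitivity and speed trade-off noted above.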

The BLAST algorithm also performs a statistical analysis of the similarity between
two sequences (see, e.g., Karlin and Altschul (1993) Proc. Nat'l. Acad. Sci. USA
90:5873-5877). One measure of similarity provided by the BLAST algorithm is the smallest sum
probability (P(N)), which provides an indication of the probability by which a match between
two nucleotide or amino acid sequences would occur by chance. For example, a nucleic acid
is considered similar to a reference sequence if the smallest sum probability in a comparison
of the test nucleic acid to the reference nucleic acid is less than about 0.2, more preferably
less than about 0.01, and most preferably less than about 0.001. Log values may be large
negative numbers, e.g., 5, 10, 20, 30, 40, 40, 70, 90, 110, 150, 170, etc.
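
The probability thresholds above may be applied as in the following short sketch. The function name and the returned labels are hypothetical and are used only to show how the three cutoffs nest.

```python
def similarity_tier(smallest_sum_probability: float) -> str:
    """Classify a smallest sum probability P(N) against the recited cutoffs."""
    p = smallest_sum_probability
    if not 0.0 <= p <= 1.0:
        raise ValueError("a probability must lie between 0 and 1")
    if p < 0.001:
        return "most preferred"   # less than about 0.001
    if p < 0.01:
        return "more preferred"   # less than about 0.01
    if p < 0.2:
        return "similar"          # less than about 0.2
    return "not similar"
```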

An indication that two nucleic acid sequences are substantially identical is that the 
polypeptide encoded by the first nucleic acid is immunologically cross reactive with the 
antibodies raised against the polypeptide encoded by the second nucleic acid. Thus, a 
polypeptide is typically substantially identical to a second polypeptide, e.g., where the two 
peptides differ only by conservative substitutions. Another indication that two nucleic acid
sequences are substantially identical is that the two molecules or their complements hybridize 
to each other under stringent conditions. Yet another indication that two nucleic acid 
sequences are substantially identical is that the same primers can be used to amplify the 
sequences. 

A "host cell" is a naturally occurring cell or a transformed cell that contains an
expression vector and supports the replication or expression of the expression vector. Host
cells may be cultured cells, explants, cells in vivo, and the like. Host cells may be
prokaryotic cells such as E. coli, or eukaryotic cells such as yeast, insect, amphibian, or
mammalian cells such as CHO, HeLa, and the like (see, e.g., the American Type Culture
Collection catalog or web site, www.atcc.org).

The terms "isolated," "purified," or "biologically pure" refer to material that is
substantially or essentially free from components that normally accompany it as found in its
native state. Purity and homogeneity are typically determined using analytical chemistry
techniques such as polyacrylamide gel electrophoresis or high performance liquid
chromatography. A protein or nucleic acid that is the predominant species present in a
preparation is substantially purified. In particular, an isolated nucleic acid is separated from
some open reading frames that naturally flank the gene and encode proteins other than the
protein encoded by the gene. The term "purified" in some embodiments denotes that a nucleic
acid or protein gives rise to essentially one band in an electrophoretic gel. Preferably, it means
that the nucleic acid or protein is at least about 85% pure, more preferably at least 95% pure,
and most preferably at least 99% pure. "Purify" or "purification" in other embodiments
means removing at least one contaminant or component from the composition to be purified.
In this sense, purification does not require that the purified compound be homogeneous, e.g.,
100% pure.

The terms "polypeptide," "peptide" and "protein" are used interchangeably herein to
refer to a polymer of amino acid residues. The terms apply to amino acid polymers in which
one or more amino acid residues is an artificial chemical mimetic of a corresponding naturally
occurring amino acid, as well as to naturally occurring amino acid polymers, those containing
modified residues, and non-naturally occurring amino acid polymers.

The term "amino acid" refers to naturally occurring and synthetic amino acids, as well
as amino acid analogs and amino acid mimetics that function similarly to the naturally
occurring amino acids. Naturally occurring amino acids are those encoded by the genetic
code, as well as those amino acids that are later modified, e.g., hydroxyproline,
γ-carboxyglutamate, and O-phosphoserine. Amino acid analogs refer to compounds that have
the same basic chemical structure as a naturally occurring amino acid, e.g., an α carbon that is
bound to a hydrogen, a carboxyl group, an amino group, and an R group, e.g., homoserine,
norleucine, methionine sulfoxide, methionine methyl sulfonium. Such analogs may have
modified R groups (e.g., norleucine) or modified peptide backbones, but retain the same basic
chemical structure as a naturally occurring amino acid. Amino acid mimetics refer to
chemical compounds that have a structure that is different from the general chemical
structure of an amino acid, but that function similarly to another amino acid.

Amino acids may be referred to herein by either their commonly known three letter
symbols or by the one-letter symbols recommended by the IUPAC-IUB Biochemical
Nomenclature Commission. Nucleotides, likewise, may be referred to by their commonly
accepted single-letter codes.

"Conservatively modified variants" applies to both amino acid and nucleic acid 

sequences. With respect to particular nucleic acid sequences, conservatively modified
variants refer to those nucleic acids which encode identical or essentially identical amino
acid sequences, or where the nucleic acid does not encode an amino acid sequence, to
essentially identical or associated, e.g., naturally contiguous, sequences. Because of the
degeneracy of the genetic code, a large number of functionally identical nucleic acids encode
most proteins. For instance, the codons GCA, GCC, GCG, and GCU each encode the amino
acid alanine. Thus, at each position where an alanine is specified by a codon, the codon can
be altered to another of the corresponding codons described without altering the encoded
polypeptide. Such nucleic acid variations are "silent variations," which are one species of
conservatively modified variations. Every nucleic acid sequence herein which encodes a
polypeptide also describes silent variations of the nucleic acid. In certain contexts each
codon in a nucleic acid (except AUG, which is ordinarily the only codon for methionine, and
TGG, which is ordinarily the only codon for tryptophan) can be modified to yield a
functionally similar molecule. Accordingly, a silent variation of a nucleic acid which
encodes a polypeptide is implicit in a described sequence with respect to the expression
product, but not necessarily with respect to actual probe sequences.
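
The notion of a silent variation can be illustrated with a short Python sketch. The codon table below is deliberately partial and covers only the three amino acids named above; the table, the function names, and the use of RNA notation throughout (UGG for the tryptophan codon written as TGG above) are assumptions made for the example.

```python
from itertools import product

# Partial codon table (RNA notation), limited to the residues named in the text.
SYNONYMOUS_CODONS = {
    "A": ("GCA", "GCC", "GCG", "GCU"),  # alanine: four interchangeable codons
    "M": ("AUG",),                      # methionine: ordinarily a single codon
    "W": ("UGG",),                      # tryptophan: ordinarily a single codon
}
CODON_TO_AMINO_ACID = {
    codon: residue
    for residue, codons in SYNONYMOUS_CODONS.items()
    for codon in codons
}


def translate(rna: str) -> str:
    """Translate an RNA coding sequence using the partial table above."""
    if len(rna) % 3 != 0:
        raise ValueError("sequence length must be a multiple of three")
    return "".join(CODON_TO_AMINO_ACID[rna[i:i + 3]] for i in range(0, len(rna), 3))


def silent_variations(rna: str) -> list[str]:
    """List every sequence (including the input) encoding the same polypeptide."""
    choices = [SYNONYMOUS_CODONS[residue] for residue in translate(rna)]
    return ["".join(codons) for codons in product(*choices)]
```

For a sequence encoding Met-Ala-Trp, only the alanine position can vary, so the function returns four sequences that all translate to the same polypeptide.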

As to amino acid sequences, one of skill will recognize that an individual substitution,
deletion or addition to a nucleic acid, peptide, polypeptide, or protein sequence which
alters, adds or deletes a single amino acid or a small percentage of amino acids in the encoded
sequence is a "conservatively modified variant" where the alteration results in the substitution
of an amino acid with a chemically similar amino acid. Conservative substitution tables
providing functionally similar amino acids are well known in the art. Such conservatively
modified variants are in addition to and do not exclude polymorphic variants, interspecies
homologs, and alleles of the invention. Typically, conservative substitutions for one
another include: 1) Alanine (A), Glycine (G); 2) Aspartic acid (D), Glutamic acid (E); 3) Asparagine
(N), Glutamine (Q); 4) Arginine (R), Lysine (K); 5) Isoleucine (I), Leucine (L), Methionine
(M), Valine (V); 6) Phenylalanine (F), Tyrosine (Y), Tryptophan (W); 7) Serine (S),
Threonine (T); and 8) Cysteine (C), Methionine (M) (see, e.g., Creighton, Proteins (1984)).
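
The eight substitution groups listed above lend themselves to a simple lookup. The following Python sketch is illustrative only: the function names are hypothetical, the comparison assumes two ungapped peptides of equal length, and methionine is kept in both groups 5 and 8 exactly as listed.

```python
# The eight groups from the text, as one-letter codes.
CONSERVATIVE_GROUPS = (
    frozenset("AG"),    # 1) Alanine, Glycine
    frozenset("DE"),    # 2) Aspartic acid, Glutamic acid
    frozenset("NQ"),    # 3) Asparagine, Glutamine
    frozenset("RK"),    # 4) Arginine, Lysine
    frozenset("ILMV"),  # 5) Isoleucine, Leucine, Methionine, Valine
    frozenset("FYW"),   # 6) Phenylalanine, Tyrosine, Tryptophan
    frozenset("ST"),    # 7) Serine, Threonine
    frozenset("CM"),    # 8) Cysteine, Methionine
)


def is_conservative_substitution(first: str, second: str) -> bool:
    """True if two different residues share at least one group."""
    if first == second:
        return False  # identical residues are not a substitution
    return any(first in group and second in group for group in CONSERVATIVE_GROUPS)


def differ_only_conservatively(peptide_a: str, peptide_b: str) -> bool:
    """True if every aligned position is identical or conservatively substituted."""
    if len(peptide_a) != len(peptide_b):
        return False  # this sketch does not handle insertions or deletions
    return all(
        a == b or is_conservative_substitution(a, b)
        for a, b in zip(peptide_a, peptide_b)
    )
```

Because group membership is not transitive across groups, cysteine and leucine are not treated as interchangeable even though each shares a group with methionine.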

Macromolecular structures such as polypeptide structures can be described in terms of
various levels of organization. For a general discussion of this organization, see, e.g.,
Alberts, et al. (1994) Molecular Biology of the Cell (3rd ed.) and Cantor and Schimmel (1980)
Biophysical Chemistry Part I: The Conformation of Biological Macromolecules. "Primary
structure" refers to the amino acid sequence of a particular peptide. "Secondary structure"
refers to locally ordered, three dimensional structures within a polypeptide. These structures
are commonly known as domains. Domains are portions of a polypeptide that often form a
compact unit of the polypeptide and are typically 25 to approximately 500 amino acids long.
Typical domains are made up of sections of lesser organization such as stretches of β-sheet
and α-helices. "Tertiary structure" refers to the complete three dimensional structure of a
polypeptide monomer. "Quaternary structure" refers to the three dimensional structure
formed, usually by the noncovalent association of independent tertiary units. Anisotropic
terms are also known as energy terms.

"Nucleic acid" or "oligonucleotide" or "polynucleotide" or grammatical equivalents
used herein means at least two nucleotides covalently linked together. Oligonucleotides are
typically from about 5, 6, 7, 8, 9, 10, 12, 15, 25, 30, 40, 50 or more nucleotides in length, up
to about 100 nucleotides in length. Nucleic acids and polynucleotides are polymers of any
length, including longer lengths, e.g., 200, 300, 500, 1000, 2000, 3000, 5000, 7000, 10,000,
etc. A nucleic acid of the present invention will generally contain phosphodiester bonds,
although in some cases, nucleic acid analogs are included that may have at least one different
linkage, e.g., phosphoramidate, phosphorothioate, phosphorodithioate, or
O-methylphosphoroamidite linkages (see Eckstein (1992) Oligonucleotides and Analogues: A
Practical Approach Oxford University Press); and peptide nucleic acid backbones and
linkages. Other analog nucleic acids include those with positive backbones, non-ionic
backbones, and non-ribose backbones, including those described in U.S. Patent Nos.
5,235,033 and 5,034,506, and Chapters 6 and 7, in Sanghvi and Cook, eds. Carbohydrate
Modifications in Antisense Research. ACS Symposium Series 580. Nucleic acids containing
one or more carbocyclic sugars are also included within one definition of nucleic acids.
Modifications of the ribose-phosphate backbone may be done for a variety of reasons, e.g., to
increase the stability and half-life of such molecules in physiological environments or as
probes on a biochip. Mixtures of naturally occurring nucleic acids and analogs can be made;
alternatively, mixtures of different nucleic acid analogs, and mixtures of naturally occurring
nucleic acids and analogs may be made.

Particularly preferred are peptide nucleic acids (PNA), which include peptide nucleic
acid analogs. These backbones are substantially non-ionic under neutral conditions, in
contrast to the highly charged phosphodiester backbone of naturally occurring nucleic acids.
This results in two advantages. First, the PNA backbone exhibits improved hybridization
kinetics. PNAs have larger changes in the melting temperature (Tm) for mismatched versus
perfectly matched basepairs. DNA and RNA typically exhibit a 2-4° C drop in Tm for an
internal mismatch. With the non-ionic PNA backbone, the drop is closer to 7-9° C.
Similarly, due to their non-ionic nature, hybridization of the bases attached to these
backbones is relatively insensitive to salt concentration. In addition, PNAs are not degraded
by cellular enzymes, and thus can be more stable.

The nucleic acids may be single stranded or double stranded, as specified, or contain 
portions of both double stranded and single stranded sequence. As will be appreciated by those 
in the art, the depiction of a single strand also defines the sequence of the complementary 






WO 02/086443 PCT/US02/12476 
strand; thus the sequences described herein also provide the complement of the sequence.
The nucleic acid may be DNA, both genomic and cDNA, RNA, or a hybrid, where the
nucleic acid may contain combinations of deoxyribo- and ribo-nucleotides, and combinations
of bases, including uracil, adenine, thymine, cytosine, guanine, inosine, xanthine,
hypoxanthine, isocytosine, isoguanine, etc. "Transcript" typically refers to a naturally
occurring RNA, e.g., a pre-mRNA, hnRNA, or mRNA. As used herein, the term
"nucleoside" includes nucleotides and nucleoside and nucleotide analogs, and modified
nucleosides such as amino modified nucleosides. In addition, "nucleoside" includes
non-naturally occurring analog structures. Thus, e.g., the individual units of a peptide nucleic
acid, each containing a base, are referred to herein as a nucleoside.
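
The statement that the depiction of a single strand also defines its complement can be made concrete with a minimal Python sketch. It assumes a DNA sequence written 5' to 3' using only the four standard bases; the function name is hypothetical, and modified bases such as inosine are outside the scope of the example.

```python
# Watson-Crick pairing for the four standard DNA bases (upper and lower case).
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(dna: str) -> str:
    """Return the complementary strand, also written 5' to 3'."""
    unexpected = set(dna.upper()) - set("ACGT")
    if unexpected:
        raise ValueError(f"unsupported bases: {sorted(unexpected)}")
    # Complement each base, then reverse so the result reads 5' to 3'.
    return dna.translate(_COMPLEMENT)[::-1]
```

Applying the function twice returns the original sequence, which is one way to see that either strand fully specifies the other.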

A "label" or a "detectable moiety" is a composition detectable by spectroscopic,
photochemical, biochemical, immunochemical, physiological, chemical, or other physical
means. For example, useful labels include 32P, fluorescent dyes, electron-dense reagents,
enzymes (e.g., as commonly used in an ELISA), biotin, digoxigenin, or haptens and proteins
or other entities which can be made detectable, e.g., by incorporating a radiolabel into the
peptide or used to detect antibodies specifically reactive with the peptide. The labels may be
incorporated into the cancer nucleic acids, proteins, and antibodies. Many methods known in
the art for conjugating the antibody to the label may be employed, including those methods
described by Hunter, et al. (1962) Nature 144:945; David, et al. (1974) Biochemistry
13:1014-1021; Pain, et al. (1981) J. Immunol. Meth. 40:219-230; and Nygren (1982) J.
Histochem. and Cytochem. 30:407-412.

An "effector" or "effector moiety" or "effector component" is a molecule that is
bound (or linked, or conjugated), either covalently, through a linker or a chemical bond, or
noncovalently, through ionic, van der Waals, electrostatic, or hydrogen bonds, to an antibody.
The "effector" can be a variety of molecules including, e.g., detection moieties including
radioactive compounds, fluorescent compounds, an enzyme or substrate, and tags such as epitope
tags; a toxin; activatable moieties; a chemotherapeutic agent; a lipase; an antibiotic; or a
radioisotope emitting "hard," e.g., beta, radiation.

A "labeled nucleic acid probe or oligonucleotide" is one that is bound, either
covalently, through a linker or a chemical bond, or noncovalently, through ionic, van der
Waals, electrostatic, or hydrogen bonds to a label such that the presence of the probe may be
detected by detecting the presence of the label bound to the probe. Alternatively, methods
using high affinity interactions may achieve the same results where one of a pair of binding
partners binds to the other, e.g., biotin and streptavidin.

As used herein a "nucleic acid probe or oligonucleotide" is a nucleic acid capable of
binding to a target nucleic acid of complementary sequence through one or more types of
chemical bonds, usually through complementary base pairing, e.g., through hydrogen bond
formation. As used herein, a probe may include natural (i.e., A, G, C, or T) or modified bases
(7-deazaguanosine, inosine, etc.). In addition, the bases in a probe may be joined by a
linkage other than a phosphodiester bond, preferably one that does not functionally interfere
with hybridization. Thus, e.g., probes may be peptide nucleic acids in which the constituent
bases are joined by peptide bonds rather than phosphodiester linkages. Probes may bind
target sequences lacking complete complementarity with the probe sequence depending upon
the stringency of the hybridization conditions. The probes are preferably directly labeled,
e.g., with isotopes, chromophores, lumiphores, chromogens, or indirectly labeled, e.g., with
biotin to which a streptavidin complex may later bind. By assaying for the presence or
absence of the probe, one can detect the presence or absence of the select sequence or
subsequence. Diagnosis or prognosis may be based at the genomic level, or at the level of
RNA or protein expression.

The term "recombinant" when used with reference, e.g., to a cell, or nucleic acid,
protein, or vector, indicates that the cell, nucleic acid, protein or vector, has been modified by
the introduction of a heterologous nucleic acid or protein or the alteration of a native nucleic
acid or protein, or that the cell is derived from a cell so modified. Thus, e.g., recombinant
cells express genes that are not found within the native (non-recombinant) form of the cell or
express native genes that are otherwise abnormally expressed, under expressed or not
expressed at all. By the term "recombinant nucleic acid" herein is meant nucleic acid,
originally formed in vitro, in general, by the manipulation of nucleic acid, e.g., using
polymerases and endonucleases, in a form not normally found in nature. In this manner,
operable linkage of different sequences is achieved. Thus an isolated nucleic acid, in a linear
form, or an expression vector formed in vitro by ligating DNA molecules that are not
normally joined, are both considered recombinant for the purposes of this invention. It is
understood that once a recombinant nucleic acid is made and reintroduced into a host cell or
organism, it will replicate non-recombinantly, i.e., using the in vivo cellular machinery of the
host cell rather than in vitro manipulations; however, such nucleic acids, once produced
recombinantly, although subsequently replicated non-recombinantly, are still considered
recombinant for the purposes of the invention. Similarly, a "recombinant protein" is a protein
made using recombinant techniques, i.e., through the expression of a recombinant nucleic
acid as depicted above.

The term "heterologous" when used with reference to portions of a nucleic acid
indicates that the nucleic acid comprises two or more subsequences that are not normally
found in the same relationship to each other in nature. For instance, the nucleic acid is
typically recombinantly produced, having two or more sequences, e.g., from unrelated genes
arranged to make a new functional nucleic acid, e.g., a promoter from one source and a
coding region from another source. Similarly, a heterologous protein will often refer to two
or more subsequences that are not found in the same relationship to each other in nature (e.g.,
a fusion protein).

A "promoter" is typically an array of nucleic acid control sequences that direct
transcription of a nucleic acid. As used herein, a promoter includes necessary nucleic acid
sequences near the start site of transcription, such as, in the case of a polymerase II type
promoter, a TATA element. A promoter also optionally includes distal enhancer or repressor
elements, which can be located as much as several thousand base pairs from the start site of
transcription. A "constitutive" promoter is a promoter that is active under most
environmental and developmental conditions. An "inducible" promoter is a promoter that is
active under environmental or developmental regulation. The term "operably linked" refers
to a functional linkage between a nucleic acid expression control sequence (such as a
promoter, or array of transcription factor binding sites) and a second nucleic acid sequence,
e.g., wherein the expression control sequence directs transcription of the nucleic acid
corresponding to the second sequence.

An "expression vector" is a nucleic acid construct, generated recombinantly or
synthetically, with a series of specified nucleic acid elements that permit transcription of a
particular nucleic acid in a host cell. The expression vector can be part of a plasmid, virus, or
nucleic acid fragment. Typically, the expression vector includes a nucleic acid to be
transcribed in operable linkage to a promoter.

The phrase "selectively (or specifically) hybridizes to" refers to the binding,
duplexing, or hybridizing of a molecule selectively to a particular nucleotide sequence under
stringent hybridization conditions when that sequence is present in a complex mixture (e.g.,
total cellular or library DNA or RNA).

The phrase "stringent hybridization conditions" refers to conditions under which a
probe will hybridize to its target subsequence, typically in a complex mixture of nucleic
acids, but to essentially no other sequences. Stringent conditions are sequence-dependent and
will be different in different circumstances. Longer sequences hybridize specifically at
higher temperatures. An extensive guide to the hybridization of nucleic acids is found in
"Overview of principles of hybridization and the strategy of nucleic acid assays" in Tijssen
(1993) Techniques in Biochemistry and Molecular Biology: Hybridization with Nucleic Acid
Probes (vol. 24) Elsevier. Generally, stringent conditions are selected to be about 5-10° C
lower than the thermal melting point (Tm) for the specific sequence at a defined ionic strength
and pH. The Tm is the temperature (under defined ionic strength, pH, and nucleic acid
concentration) at which 50% of the probes complementary to the target hybridize to the target
sequence at equilibrium (as the target sequences are present in excess, at Tm, 50% of the
probes are occupied at equilibrium). Stringent conditions will be those in which the salt
concentration is less than about 1.0 M sodium ion, typically about 0.01 to 1.0 M sodium ion
concentration (or other salts) at pH 7.0 to 8.3 and the temperature is at least about 30° C for
short probes (e.g., 10 to 50 nucleotides) and at least about 60° C for long probes (e.g., greater
than 50 nucleotides). Stringent conditions may also be achieved with the addition of destabilizing
agents such as formamide. For selective or specific hybridization, a positive signal is
typically at least two times background, preferably 10 times background hybridization.
Exemplary stringent hybridization conditions are often: 50% formamide, 5x SSC, and 1%
SDS, incubating at 42° C, or, 5x SSC, 1% SDS, incubating at 65° C, with wash in 0.2x SSC,
and 0.1% SDS at 65° C. For PCR, a temperature of about 36° C is typical for low stringency
amplification, although annealing temperatures may vary between about 32° C and 48° C
depending on primer length. For high stringency PCR amplification, a temperature of about
62° C is typical, although high stringency annealing temperatures can range from about 50° C
to about 65° C, depending on the primer length and specificity. Typical cycle conditions for
both high and low stringency amplifications include a denaturation phase of 90° C - 95° C for
0.5 - 2 min., an annealing phase lasting 0.5 - 2 min., and an extension phase of about 72° C
for 1 - 2 min. Protocols and guidelines for low and high stringency amplification reactions
are provided, e.g., in Innis, et al. (1990) PCR Protocols. A Guide to Methods and
Applications.
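
Several of the numerical guidelines in the preceding paragraph can be summarized in a short Python sketch. It is offered as an illustration only: the function names are hypothetical, the Tm is assumed to have been measured or estimated elsewhere, and each "about" is treated as an exact value.

```python
def stringent_temperature_window(tm_celsius: float) -> tuple[float, float]:
    """Temperatures 10 to 5 degrees C below the Tm, as (low, high)."""
    return (tm_celsius - 10.0, tm_celsius - 5.0)


def minimum_stringent_temperature(probe_length: int) -> float:
    """Lower temperature bound quoted in the text for short and long probes."""
    if probe_length > 50:
        return 60.0  # long probes: greater than 50 nucleotides
    if probe_length >= 10:
        return 30.0  # short probes: 10 to 50 nucleotides
    raise ValueError("the text gives no guideline for probes under 10 nucleotides")


def is_positive_signal(signal: float, background: float, min_fold: float = 2.0) -> bool:
    """True if the signal is at least min_fold times background (2x by default)."""
    if background <= 0:
        raise ValueError("background must be positive")
    return signal / background >= min_fold
```

Passing `min_fold=10.0` applies the preferred criterion of ten times background rather than the typical two times.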

Nucleic acids that do not hybridize to each other under stringent conditions are still
substantially identical if the polypeptides which they encode are substantially identical. This
occurs, e.g., when a copy of a nucleic acid is created using the maximum codon degeneracy
permitted by the genetic code. In such cases, the nucleic acids typically hybridize under
moderately stringent hybridization conditions. Exemplary "moderately stringent
hybridization conditions" include a hybridization in a buffer of 40% formamide, 1 M NaCl,
1% SDS at 37° C, and a wash in 1X SSC at 45° C. A positive hybridization is at least twice
background. Alternative hybridization and wash conditions can be utilized to provide
conditions of similar stringency. Additional guidelines for determining hybridization
parameters are provided in numerous references, e.g., Ausubel, et al. (ed.) Current Protocols in
Molecular Biology Lippincott.

The phrase "functional effects" in the context of assays for testing compounds that
modulate activity of a lung cancer protein includes the determination of a parameter that is
indirectly or directly under the influence of the lung cancer protein or nucleic acid, e.g., a
physiological, enzymatic, functional, physical, or chemical effect, such as the ability to
decrease lung cancer. It includes ligand binding activity; cell viability; cell growth on soft
agar; anchorage dependence; contact inhibition and density limitation of growth; cellular
proliferation; cellular transformation; growth factor or serum dependence; tumor specific
marker levels; invasiveness into Matrigel; tumor growth and metastasis in vivo; mRNA and
protein expression in cells undergoing metastasis; and other characteristics of lung cancer
cells. "Functional effects" include in vitro, in vivo, and ex vivo activities.

By "determining the functional effect" is meant assaying for a compound that
increases or decreases a parameter that is indirectly or directly under the influence of a lung
cancer protein sequence, e.g., physiological, functional, enzymatic, physical, or chemical
effects. Such functional effects can be measured by many means known to those skilled in
the art, e.g., changes in spectroscopic characteristics (e.g., fluorescence, absorbance,
refractive index), hydrodynamic (e.g., shape), chromatographic, or solubility properties for
the protein; measuring inducible markers or transcriptional activation of the lung cancer
protein; measuring binding activity or binding assays, e.g., binding to antibodies or other
ligands; and measuring cellular proliferation. Determination of the functional effect of a
compound on lung cancer can also be performed using lung cancer assays known to those of
skill in the art such as in vitro assays, e.g., cell growth on soft agar; anchorage
dependence; contact inhibition and density limitation of growth; cellular proliferation;
cellular transformation; growth factor or serum dependence; tumor specific marker levels;
invasiveness into Matrigel; tumor growth and metastasis in vivo; mRNA and protein
expression in cells undergoing metastasis; and other characteristics of lung cancer cells. The
functional effects can be evaluated by many means known to those skilled in the art, e.g.,
microscopy for quantitative or qualitative measures of alterations in morphological features,
measurement of changes in RNA or protein levels for lung cancer-associated sequences,
measurement of RNA stability, identification of downstream or reporter gene expression
(CAT, luciferase, β-gal, GFP, and the like), e.g., via chemiluminescence, fluorescence,
colorimetric reactions, antibody binding, inducible markers, and ligand binding assays.

"Inhibitors," "activators," and "modulators" of lung cancer polynucleotide and
polypeptide sequences are used to refer to activating, inhibitory, or modulating molecules or
compounds identified using in vitro and in vivo assays of lung cancer polynucleotide and
polypeptide sequences. Inhibitors are compounds that, e.g., bind to, partially or totally block
activity, decrease, prevent, delay activation, inactivate, desensitize, or down regulate the
activity or expression of lung cancer proteins, e.g., antagonists. Antisense or inhibitory
nucleic acids may seem to inhibit expression and subsequent function of the protein.
"Activators" are compounds that increase, open, activate, facilitate, enhance activation,
sensitize, agonize, or up regulate lung cancer protein activity. Inhibitors, activators, or
modulators also include genetically modified versions of lung cancer proteins, e.g., versions
with altered activity, as well as naturally occurring and synthetic ligands, antagonists,
agonists, antibodies, small chemical molecules and the like. Such assays for inhibitors and
activators include, e.g., expressing the lung cancer protein in vitro, in cells, or cell
membranes, applying putative modulator compounds, and then determining the functional
effects on activity, as described above. Activators and inhibitors of lung cancer can also be
identified by incubating lung cancer cells with the test compound and determining increases
or decreases in the expression of 1 or more lung cancer proteins, e.g., 1, 2, 3, 4, 5, 10, 15, 20,
25, 30, 40, 50 or more lung cancer proteins, such as lung cancer proteins encoded by the
sequences set out in Tables 1A-16.

Samples or assays comprising lung cancer proteins that are treated with a potential
activator, inhibitor, or modulator are compared to control samples without the inhibitor,
activator, or modulator to examine the extent of inhibition. Control samples (untreated with
inhibitors) are assigned a relative protein activity value of 100%. Inhibition of a polypeptide
is achieved when the activity value relative to the control is about 80%, preferably 50%, more
preferably 25-0%. Activation of a lung cancer polypeptide is achieved when the activity
value relative to the control (untreated with activators) is 110%, more preferably 150%, more
preferably 200-500% (i.e., two to five fold higher relative to the control), more preferably
1000-3000% higher.
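
The comparison to an untreated control described above amounts to simple arithmetic, sketched here in Python. The function names are hypothetical, and the sketch makes two assumptions not stated in the text: that "about 80%" and "110%" are applied as inclusive cutoffs, and that values between them are reported as neither inhibition nor activation.

```python
def relative_activity_percent(treated: float, control: float) -> float:
    """Activity of the treated sample with the control assigned a value of 100%."""
    if control <= 0:
        raise ValueError("control activity must be positive")
    return 100.0 * treated / control


def classify_modulation(treated: float, control: float) -> str:
    """Label a test compound by the broadest thresholds quoted in the text."""
    percent = relative_activity_percent(treated, control)
    if percent <= 80.0:   # inhibition: about 80% of control or lower
        return "inhibition"
    if percent >= 110.0:  # activation: 110% of control or higher
        return "activation"
    return "neither"
```

The preferred ranges (for example 50% or 25-0% for inhibition) could be added as further tiers in the same way.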

The phrase "changes in cell growth" refers to any change in cell growth and
proliferation characteristics in vitro or in vivo, such as cell viability, formation of foci,
anchorage independence, semi-solid or soft agar growth, changes in contact inhibition and
density limitation of growth, loss of growth factor or serum requirements, changes in cell
morphology, gaining or losing immortalization, gaining or losing tumor specific markers,
ability to form or suppress tumors when injected into suitable animal hosts, and/or
immortalization of the cell. See, e.g., Freshney (1994) Culture of Animal Cells a Manual of
Basic Technique pp. 231-241 (3rd ed.).

"Tumor cell" refers to precancerous, cancerous, and normal cells in a tumor. 
"Cancer cells," "transformed" cells, or "transformation" in tissue culture, refers to 
spontaneous or induced phenotypic changes that do not necessarily involve the uptake of new
genetic material. Although transformation can arise from infection with a transforming virus
and incorporation of new genomic DNA, or uptake of exogenous DNA, it can also arise
spontaneously or following exposure to a carcinogen, thereby mutating an endogenous gene.
Transformation is associated with phenotypic changes, such as immortalization of cells,
aberrant growth control, nonmorphological changes, and/or malignancy (see, Freshney
(1994) Culture of Animal Cells a Manual of Basic Technique (3rd ed.)).

"Antibody" refers to a polypeptide comprising a framework region from an
immunoglobulin gene or fragments thereof that specifically binds and recognizes an antigen.
The recognized immunoglobulin genes include the kappa, lambda, alpha, gamma, delta,
epsilon, and mu constant region genes, as well as the myriad immunoglobulin variable region
genes. Light chains are classified as either kappa or lambda. Heavy chains are classified as
gamma, mu, alpha, delta, or epsilon, which in turn define the immunoglobulin classes, IgG,
IgM, IgA, IgD, and IgE, respectively. Typically, the antigen-binding region of an antibody or
its functional equivalent will be most critical in specificity and affinity of binding. See Paul,
Fundamental Immunology.

An exemplary immunoglobulin (antibody) structural unit comprises a tetramer. Each
tetramer is composed of two identical pairs of polypeptide chains, each pair having one
"light" (about 25 kD) and one "heavy" chain (about 50-70 kD). The N-terminus of each
chain defines a variable region of about 100 to 110 or more amino acids primarily responsible
for antigen recognition. The terms variable light chain (VL) and variable heavy chain (VH)
refer to these light and heavy chains respectively.

Antibodies exist, e.g., as intact immunoglobulins or as a number of well-characterized
fragments produced by digestion with various peptidases. Thus, e.g., pepsin digests an
antibody below the disulfide linkages in the hinge region to produce F(ab)'2, a dimer of Fab
which itself is a light chain joined to VH-CH1 by a disulfide bond. The F(ab)'2 may be
reduced under mild conditions to break the disulfide linkage in the hinge region, thereby
converting the F(ab)'2 dimer into an Fab' monomer. The Fab' monomer is essentially Fab
with part of the hinge region (see Paul (ed. 1999) Fundamental Immunology (4th ed.)). While
various antibody fragments are defined in terms of the digestion of an intact antibody, one of
skill will appreciate that such fragments may be synthesized de novo either chemically or by
using recombinant DNA methodology. Thus, the term antibody, as used herein, also includes
antibody fragments either produced by the modification of whole antibodies, or those
synthesized de novo using recombinant DNA methodologies (e.g., single chain Fv) or those
identified using phage display libraries (see, e.g., McCafferty, et al. (1990) Nature
348:552-554).

For preparation of antibodies, e.g., recombinant, monoclonal, or polyclonal
antibodies, many techniques known in the art can be used (see, e.g., Kohler and Milstein
(1975) Nature 256:495-497; Kozbor, et al. (1983) Immunology Today 4:72; Cole, et al.
(1985), pp. 77-96 in Monoclonal Antibodies and Cancer Therapy; Coligan (1991 and
supplements) Current Protocols in Immunology; Harlow and Lane (1988) Antibodies, A
Laboratory Manual; and Goding (1986) Monoclonal Antibodies: Principles and Practice (2d
ed.)). Techniques for the production of single chain antibodies (U.S. Patent 4,946,778) can
be adapted to produce antibodies to polypeptides of this invention. Also, transgenic mice, or
other organisms such as other mammals, may be used to express humanized antibodies.
Alternatively, phage display technology can be used to identify antibodies and heteromeric
Fab fragments that specifically bind to selected antigens (see, e.g., McCafferty, et al. (1990)
Nature 348:552-554; Marks, et al. (1992) Biotechnology 10:779-783).

A "chimeric antibody" is an antibody molecule in which, e.g., (a) the constant region,
or a portion thereof, is altered, replaced, or exchanged so that the antigen binding site
(variable region) is linked to a constant region of a different or altered class, effector
function, and/or species, or an entirely different molecule which confers new properties to the
chimeric antibody, e.g., an enzyme, toxin, hormone, growth factor, drug, etc.; or (b) the
variable region, or a portion thereof, is altered, replaced, or exchanged with a variable region
having a different or altered antigen specificity.



Identification of lung cancer-associated sequences 

In one aspect, the expression levels of genes are determined in different patient
samples for which diagnosis information is desired, to provide expression profiles. An
expression profile of a particular sample is essentially a "fingerprint" of the state of the
sample; while two states may have any particular gene similarly expressed, the evaluation of
a number of genes simultaneously allows the generation of a gene expression profile that is
characteristic of the state of the cell. That is, normal tissue may be distinguished from
cancerous or metastatic cancerous tissue, or metastatic cancerous tissue can be compared
with tissue from surviving cancer patients. By comparing expression profiles of tissue in
known different lung cancer states, information regarding which genes are important
(including both up- and down-regulation of genes) in each of these states is obtained.
Molecular profiling may distinguish subtypes of a currently collective disease designation,
e.g., different forms of lung cancer (chronic disease, adenocarcinoma, etc.).
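
As an illustration of treating an expression profile as a "fingerprint", the following minimal sketch assigns a sample to whichever reference state its profile correlates with most strongly. The function names, the use of Pearson correlation, and all expression values are assumptions made for illustration only; the text above does not prescribe a particular similarity measure.

```python
import math
from typing import Dict, List


def pearson(x: List[float], y: List[float]) -> float:
    """Pearson correlation between two equal-length expression vectors."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("profiles must have the same length (at least 2 genes)")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        raise ValueError("a profile with constant expression has no defined correlation")
    return cov / math.sqrt(var_x * var_y)


def closest_state(sample: List[float], references: Dict[str, List[float]]) -> str:
    """Return the name of the reference state whose profile best matches the sample."""
    return max(references, key=lambda state: pearson(sample, references[state]))


# Hypothetical reference profiles over the same four genes (arbitrary units).
references = {
    "normal": [1.0, 1.0, 5.0, 5.0],
    "tumor": [5.0, 6.0, 1.0, 1.0],
}
sample = [4.5, 5.5, 1.5, 0.5]
print(closest_state(sample, references))
```

Because the comparison uses many genes at once, a single gene that happens to be similarly expressed in two states does not by itself change the assignment.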

The identification of sequences that are differentially expressed in lung cancer versus
non-lung cancer tissue allows the use of this information in a number of ways. For example,
a particular treatment regime may be evaluated, e.g., to determine whether a chemotherapeutic
drug acts to down-regulate lung cancer, and thus tumor growth or recurrence, in a particular
patient. Alternatively, a treatment step may induce other markers which may be used as targets to
destroy tumor cells. Similarly, diagnosis and treatment outcomes may be determined or confirmed
by comparing patient samples with the known expression profiles. Malignant disease may be
compared to non-malignant conditions. Metastatic tissue can also be analyzed to determine
the stage of lung cancer in the tissue, or origin of primary tumor, e.g., metastasis from a
remote primary site. Furthermore, these gene expression profiles (or individual genes) allow
screening of drug candidates with an eye to mimicking or altering a particular expression
profile; e.g., screening can be done for drugs that suppress the lung cancer expression profile.
This may be done by making biochips comprising sets of the important lung cancer genes,
which can then be used in these screens. PCR methods may be applied with selected primer
pairs, and analysis may be of RNA or of genomic sequences. These methods can also be
done on a protein basis; that is, protein expression levels of the lung cancer proteins can be
evaluated for diagnostic purposes or to screen candidate agents. In addition, the lung cancer
nucleic acid sequences can be administered for gene therapy purposes, including the
administration of antisense nucleic acids, or the lung cancer proteins (including antibodies
and other modulators thereof) administered as therapeutic drugs or as protein or DNA
vaccines.

Thus the present invention provides nucleic acid and protein sequences that are
differentially expressed in lung cancer relative to normal tissues and/or non-malignant lung
disease, or in different types of lung disease, herein termed "lung cancer sequences." As
outlined below, lung cancer sequences include those that are up-regulated (i.e., expressed at a
higher level) in lung cancer, as well as those that are down-regulated (i.e., expressed at a
lower level). In a preferred embodiment, the lung cancer sequences are from humans;
however, as will be appreciated by those in the art, lung cancer sequences from other
organisms may be useful in animal models of disease and drug evaluation; thus, other lung
cancer sequences are provided, from vertebrates, including mammals, including rodents (rats,
mice, hamsters, guinea pigs, etc.), primates, farm animals (including sheep, goats, pigs, cows,
horses, etc.) and pets (dogs, cats, etc.). Lung cancer sequences from other organisms may be
obtained using the techniques outlined below.

Lung cancer sequences can include both nucleic acid and amino acid sequences. As
will be appreciated by those in the art and is more fully outlined below, lung cancer nucleic
acid sequences are useful in a variety of applications, including diagnostic applications,
which will detect naturally occurring nucleic acids, as well as screening applications; e.g.,
biochips comprising nucleic acid probes or PCR microtiter plates with selected probes to the
lung cancer sequences can be generated.

A lung cancer sequence can be initially identified by substantial nucleic acid and/or
amino acid sequence homology to the lung cancer sequences outlined herein. Such
homology can be based upon the overall nucleic acid or amino acid sequence, and is
generally determined as outlined below, e.g., using homology programs or hybridization
conditions.

For identifying lung cancer-associated sequences, the lung cancer screen typically
includes comparing genes identified in different tissues, e.g., normal and cancerous tissues,
cancer and non-malignant conditions, non-malignant conditions and normal tissues, or tumor
tissue samples from patients who have metastatic disease vs. non-metastatic tissue. Other
suitable tissue comparisons include comparing lung cancer samples with metastatic cancer
samples from other cancers, such as breast, other gastrointestinal cancers, prostate, ovarian,
etc. Samples of non-metastatic disease tissue and tissue undergoing metastasis are applied to
biochips comprising nucleic acid probes. The samples are first microdissected, if applicable,
and treated as is known in the art for the preparation of mRNA. Suitable biochips are
commercially available, e.g., from Affymetrix, Santa Clara, CA. Gene expression profiles as
described herein are generated and the data analyzed.

In one embodiment, the genes showing changes in expression as between normal and
disease states are compared to genes expressed in other normal tissues, preferably normal
lung, but also including, and not limited to, colon, heart, brain, liver, breast, kidney, muscle,
prostate, small intestine, large intestine, spleen, bone, and/or placenta. In a preferred
embodiment, those genes identified during the lung cancer screen that are expressed in
significant amounts in other tissues (e.g., essential organs) are removed from the profile,
although in some embodiments, this is not necessary (e.g., where organs may be dispensable
at a later stage of life). That is, when screening for drugs, it is usually preferable that the
target expression be disease specific, to minimize possible side effects on other organs.
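
The filtering step described above can be sketched as follows. This is a minimal illustration only: the gene names, tissue list, expression values, and the cutoff for "significant amounts" are all hypothetical, and the text does not specify a numerical threshold.

```python
from typing import Dict, List


def filter_disease_specific(
    candidates: List[str],
    normal_expression: Dict[str, Dict[str, float]],
    essential_tissues: List[str],
    max_level: float,
) -> List[str]:
    """Keep candidates expressed at or below max_level in every essential tissue.

    A gene with no measurement for a tissue is treated as not expressed there
    (an assumption of this sketch; a real screen might instead flag it).
    """
    kept = []
    for gene in candidates:
        levels = normal_expression.get(gene, {})
        if all(levels.get(tissue, 0.0) <= max_level for tissue in essential_tissues):
            kept.append(gene)
    return kept


# Hypothetical candidates from a screen and their expression in normal tissues.
candidates = ["GENE_A", "GENE_B", "GENE_C"]
normal_expression = {
    "GENE_A": {"heart": 5.0, "brain": 2.0, "liver": 1.0},
    "GENE_B": {"heart": 250.0, "brain": 3.0, "liver": 4.0},
    "GENE_C": {"heart": 8.0, "brain": 9.0, "liver": 300.0},
}
specific = filter_disease_specific(
    candidates, normal_expression, ["heart", "brain", "liver"], max_level=50.0
)
print(specific)
```

Leaving a tissue out of `essential_tissues` corresponds to the embodiments in which removal is not considered necessary for that organ.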
In a preferred embodiment, lung cancer sequences are those that are up-regulated in
lung cancer; that is, the expression of these genes is higher in cancerous tissue than in normal
lung or other tissue. "Up-regulation" as used herein means, when the ratio is presented as a
number greater than one, that the ratio is greater than one, preferably 1.5 or greater, more
preferably 2.0 or greater. Another embodiment is directed to sequences up-regulated in
non-malignant conditions relative to normal. Unigene cluster identification numbers and
accession numbers herein are for the GenBank sequence database and the sequences of the
accession numbers are hereby expressly incorporated by reference. GenBank is known in the
art, see, e.g., Benson, DA, et al. (1998) Nucleic Acids Research 26:1-7 and
http://www.ncbi.nlm.nih.gov/. Sequences are also available in other databases, e.g.,
European Molecular Biology Laboratory (EMBL) and DNA Database of Japan (DDBJ).
In some situations, the sequences may be derived from assembly of
available sequences or be predicted from genomic DNA using exon prediction algorithms,
such as FGENESH (Salamov and Solovyev (2000) Genome Res. 10:516-522). In other
situations, sequences have been derived from cloning and sequencing of isolated nucleic
acids.

In another preferred embodiment, lung cancer sequences are those that are down-regulated
in the lung cancer; that is, the expression of these genes is lower in cancerous tissue
than in normal lung or other tissue. "Down-regulation" as used herein means, when the ratio is
presented as a number greater than one, that the ratio is greater than one, preferably 1.5 or
greater, more preferably 2.0 or greater, or, when the ratio is presented as a number less than
one, that the ratio is less than one, preferably 0.5 or less, more preferably 0.25 or less.
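
The ratio definitions above can be illustrated with a short sketch. It assumes the ratio is computed as cancerous-tissue expression divided by normal-tissue expression, and it uses the "preferably" thresholds from the text (1.5 and 0.5) as defaults; the function name and the example values are hypothetical.

```python
def classify_regulation(
    tumor_level: float,
    normal_level: float,
    up_threshold: float = 1.5,
    down_threshold: float = 0.5,
) -> str:
    """Classify a gene from its tumor/normal expression ratio.

    Ratios at or above up_threshold count as up-regulated, ratios at or below
    down_threshold count as down-regulated, and anything between is unchanged.
    """
    if tumor_level < 0 or normal_level <= 0:
        raise ValueError("expression levels must be non-negative and normal_level non-zero")
    ratio = tumor_level / normal_level
    if ratio >= up_threshold:
        return "up-regulated"
    if ratio <= down_threshold:
        return "down-regulated"
    return "unchanged"


print(classify_regulation(300.0, 100.0))  # ratio 3.0
print(classify_regulation(20.0, 100.0))   # ratio 0.2
print(classify_regulation(110.0, 100.0))  # ratio 1.1
```

Passing `up_threshold=2.0` and `down_threshold=0.25` applies the "more preferably" cutoffs instead.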


Informatics 

The ability to identify genes that are over- or under-expressed in lung cancer can
additionally provide high-resolution, high-sensitivity datasets which can be used in the areas
of diagnostics, therapeutics, drug development, pharmacogenetics, protein structure,
biosensor development, and other related areas. For example, the expression profiles can be
used in diagnostic or prognostic evaluation of patients with lung cancer. As another
example, subcellular toxicological information can be generated to better direct drug structure
and activity correlation (see Anderson (1998) Pharmaceutical Proteomics: Targets,
Mechanism, and Function, paper presented at the BBC Proteomics conference, Coronado, CA
(June 11-12, 1998)). Subcellular toxicological information can also be utilized in a biological
sensor device to predict the likely toxicological effect of chemical exposures and likely
tolerable exposure thresholds (see U.S. Patent No. 5,811,231). Similar advantages accrue
from datasets relevant to other biomolecules and bioactive agents (e.g., nucleic acids,
saccharides, lipids, drugs, and the like).

Thus, in another embodiment, the present invention provides a database that includes
at least one set of assay data. The data contained in the database is acquired, e.g., using array
analysis either singly or in a library format. The database can be in a form in which data can
be maintained and transmitted, but is preferably an electronic database. The electronic
database of the invention can be maintained on any electronic device allowing for the storage
of and access to the database, such as a personal computer, but is preferably distributed on a
wide area network, such as the World Wide Web.

The focus of the present section on databases that include peptide sequence data is for 
clarity of illustration only. It will be apparent to those of skill in the art that similar databases 
can be assembled for assay data acquired using an assay of the invention. 

The compositions and methods for identifying and/or quantitating the relative and/or
absolute abundance of a variety of molecular and macromolecular species from a biological
sample representing lung cancer, i.e., the identification of lung cancer-associated sequences
described herein, provide an abundance of information, which can be correlated with
pathological conditions, predisposition to disease, drug testing, therapeutic monitoring,
gene-disease causal linkages, identification of correlates of immunity and physiological status,
among others. Although the data generated from the assays of the invention is suited for
manual review and analysis, in a preferred embodiment, data processing using high-speed
computers is utilized.

An array of methods for indexing and retrieving biomolecular information is known
in the art. For example, U.S. Patents 6,023,659 and 5,966,712 disclose a relational database
system for storing biomolecular sequence information in a manner that allows sequences to
be catalogued and searched according to one or more protein function hierarchies. U.S.
Patent 5,953,727 discloses a relational database having sequence records containing
information in a format that allows a collection of partial-length DNA sequences to be
catalogued and searched according to association with one or more sequencing projects for
obtaining full-length sequences from the collection of partial-length sequences. U.S. Patent
5,706,498 discloses a gene database retrieval system for retrieving a gene
sequence similar to a sequence data item in a gene database based on the degree of similarity
between a key sequence and a target sequence. U.S. Patent 5,538,897 discloses a method
using mass spectroscopy fragmentation patterns of peptides to identify amino acid sequences
in computer databases by comparison of predicted mass spectra with experimentally-derived
mass spectra using a closeness-of-fit measure. U.S. Patent 5,926,818 discloses a
multi-dimensional database comprising a functionality for multi-dimensional data analysis
described as on-line analytical processing (OLAP), which entails the consolidation of
projected and actual data according to more than one consolidation path or dimension. U.S.
Patent 5,295,261 reports a hybrid database structure in which the fields of each database
record are divided into two classes, navigational and informational data, with navigational
fields stored in a hierarchical topological map which can be viewed as a tree structure or as
the merger of two or more such tree structures.

See also Mount, et al. (2001) Bioinformatics; Durbin, et al. (eds., 1999) Biological
Sequence Analysis: Probabilistic Models of Proteins and Nucleic Acids; Baxevanis and
Ouellette (eds., 1998) Bioinformatics: A Practical Guide to the Analysis of Genes and
Proteins; Rashidi and Buehler (1999) Bioinformatics: Basic Applications in Biological
Science and Medicine; Setubal, et al. (eds., 1997) Introduction to Computational Molecular
Biology; Misener and Krawetz (eds., 2000) Bioinformatics: Methods and Protocols; Higgins
and Taylor (eds., 2000) Bioinformatics: Sequence, Structure, and Databanks: A Practical
Approach; Brown (2001) Bioinformatics: A Biologist's Guide to Biocomputing and the
Internet; Han and Kamber (2000) Data Mining: Concepts and Techniques; and
Waterman (1995) Introduction to Computational Biology: Maps, Sequences, and Genomes.

The present invention provides a computer database comprising a computer and
software for storing in computer-retrievable form assay data records cross-tabulated, e.g.,
with data specifying the source of the target-containing sample from which each sequence
specificity record was obtained.

In an exemplary embodiment, at least one of the sources of target-containing sample
is from a control tissue sample known to be free of pathological disorders. In a variation, at
least one of the sources is a known pathological tissue specimen, e.g., a neoplastic lesion or
another tissue specimen to be analyzed for lung cancer. In another variation, the assay
records cross-tabulate one or more of the following parameters for each target species in a
sample: (1) a unique identification code, which can include, e.g., a target molecular structure
and/or characteristic separation coordinate (e.g., electrophoretic coordinates); (2) sample
source; and (3) absolute and/or relative quantity of the target species present in the sample.
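
One way to picture assay records cross-tabulated by the three parameters listed above is a small relational table. The sketch below uses Python's built-in sqlite3 module with an in-memory database; the table name, column names, identifiers, and quantities are hypothetical and are not taken from the specification.

```python
import sqlite3
from typing import Dict

# In-memory database holding one row per (target, sample source) pair.
conn = sqlite3.connect(":memory:")
conn.execute(
    """
    CREATE TABLE assay_record (
        target_id     TEXT NOT NULL,   -- (1) unique identification code
        sample_source TEXT NOT NULL,   -- (2) sample source
        quantity      REAL NOT NULL,   -- (3) absolute or relative quantity
        PRIMARY KEY (target_id, sample_source)
    )
    """
)

# Hypothetical records: two targets measured in a control and a tumor sample.
rows = [
    ("T001", "control_lung", 12.0),
    ("T001", "tumor_lung", 48.0),
    ("T002", "control_lung", 30.0),
    ("T002", "tumor_lung", 9.0),
]
conn.executemany("INSERT INTO assay_record VALUES (?, ?, ?)", rows)


def cross_tabulate(connection: sqlite3.Connection) -> Dict[str, Dict[str, float]]:
    """Return {target_id: {sample_source: quantity}} for all stored records."""
    table: Dict[str, Dict[str, float]] = {}
    query = (
        "SELECT target_id, sample_source, quantity "
        "FROM assay_record ORDER BY target_id, sample_source"
    )
    for target_id, source, quantity in connection.execute(query):
        table.setdefault(target_id, {})[source] = quantity
    return table


crosstab = cross_tabulate(conn)
print(crosstab["T001"])
```

The composite primary key reflects the assumption that each target is recorded at most once per sample source.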

The invention also provides for the storage and retrieval of a collection of target data
in a computer data storage apparatus, which can include magnetic disks, optical disks,
magneto-optical disks, DRAM, SRAM, SGRAM, SDRAM, RDRAM, DDR RAM, magnetic
bubble memory devices, and other data storage devices, including CPU registers and on-CPU
data storage arrays. Typically, the target data records are stored as a bit pattern in an array of
magnetic domains on a magnetizable medium or as an array of charge states or transistor gate
states, such as an array of cells in a DRAM device (e.g., each cell comprised of a transistor
and a charge storage area, which may be on the transistor). In one embodiment, the invention
provides such storage devices, and computer systems built therewith, comprising a bit pattern
encoding a protein expression fingerprint record comprising unique identifiers for at least 10
target data records cross-tabulated with target source.

When the target is a peptide or nucleic acid, the invention preferably provides a
method for identifying related peptide or nucleic acid sequences, comprising performing a
computerized comparison between a peptide or nucleic acid sequence assay record stored in
or retrieved from a computer storage device or database and at least one other sequence. The
comparison can include a sequence analysis or comparison algorithm or computer program
embodiment thereof (e.g., FASTA, TFASTA, GAP, BESTFIT) and/or the comparison may
be of the relative amount of a peptide or nucleic acid sequence in a pool of sequences
determined from a polypeptide or nucleic acid sample of a specimen.

The invention also preferably provides a magnetic disk, such as an IBM-compatible
(DOS, Windows, Windows95/98/2000, Windows NT, OS/2) or other format (e.g., Linux,
SunOS, Solaris, AIX, SCO Unix, VMS, MV, Macintosh, etc.) floppy diskette or hard (fixed,
Winchester) disk drive, comprising a bit pattern encoding data from an assay of the invention
in a file format suitable for retrieval and processing in a computerized sequence analysis,
comparison, or relative quantitation method.

The invention also provides a network, comprising a plurality of computing devices
linked via a data link, such as an Ethernet cable (coax or 10BaseT), telephone line, ISDN
line, wireless network, optical fiber, or other suitable signal transmission medium, whereby at
least one network device (e.g., computer, disk array, etc.) comprises a pattern of magnetic
domains (e.g., magnetic disk) and/or charge domains (e.g., an array of DRAM cells)
composing a bit pattern encoding data acquired from an assay of the invention.

The invention also provides a method for transmitting assay data that includes
generating an electronic signal on an electronic communications device, such as a modem,
ISDN terminal adapter, DSL, cable modem, ATM switch, or the like, wherein the signal
includes (in native or encrypted format) a bit pattern encoding data from an assay or a
database comprising a plurality of assay results obtained by the method of the invention.

In a preferred embodiment, the invention provides a computer system for comparing a
query target to a database containing an array of data structures, such as an assay result
obtained by the method of the invention, and ranking database targets based on the degree of
identity and gap weight to the target data. A central processor is preferably initialized to load
and execute the computer program for alignment and/or comparison of the assay results.
Data for a query target is entered into the central processor via an I/O device. Execution of
the computer program results in the central processor retrieving the assay data from the data
file, which comprises a binary description of an assay result.

The target data or record and the computer program can be transferred to secondary
memory, which is typically random access memory (e.g., DRAM, SRAM, SGRAM, or
SDRAM). Targets are ranked according to the degree of correspondence between a selected
assay characteristic (e.g., binding to a selected affinity moiety) and the same characteristic of
the query target and results are output via an I/O device. For example, a central processor
can be a conventional computer (e.g., Intel Pentium, PowerPC, Alpha, PA-8000, SPARC,
MIPS 4400, MIPS 10000, VAX, etc.); a program can be a commercial or public domain
molecular biology software package (e.g., UWGCG Sequence Analysis Software, Darwin); a
data file can be an optical or magnetic disk, a data server, a memory device (e.g., DRAM,
SRAM, SGRAM, SDRAM, EPROM, bubble memory, flash memory, etc.); an I/O device can
be a terminal comprising a video display and a keyboard, a modem, an ISDN terminal
adapter, an Ethernet port, a punched card reader, a magnetic strip reader, or other suitable I/O
device.

The invention also preferably provides the use of a computer system, such as that
described above, which comprises: (1) a computer; (2) a stored bit pattern encoding a
collection of peptide sequence specificity records obtained by the methods of the invention,
which may be stored in the computer; (3) a comparison target, such as a query target; and (4)
a program for alignment and comparison, typically with rank-ordering of comparison results
on the basis of computed similarity values.
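
The rank-ordering of stored records against a query target can be sketched as below. This is only an illustration: it substitutes the similarity ratio from Python's standard difflib module for a true alignment program such as FASTA or BESTFIT, it does not model gap weights, and the record names and peptide sequences are invented for the example.

```python
from difflib import SequenceMatcher
from typing import Dict, List, Tuple


def similarity(a: str, b: str) -> float:
    """Similarity in [0, 1] based on matching blocks (a stand-in for an alignment score)."""
    return SequenceMatcher(None, a, b, autojunk=False).ratio()


def rank_targets(query: str, database: Dict[str, str]) -> List[Tuple[str, float]]:
    """Return (record name, similarity) pairs ordered from most to least similar."""
    scored = [(name, similarity(query, sequence)) for name, sequence in database.items()]
    return sorted(scored, key=lambda item: item[1], reverse=True)


# Hypothetical stored peptide records.
database = {
    "rec1": "MKTAYIAKQR",
    "rec2": "MKTAYIAKQL",
    "rec3": "GGGGSGGGGS",
}
ranking = rank_targets("MKTAYIAKQR", database)
for name, score in ranking:
    print(name, round(score, 2))
```

In a fuller system the `similarity` function would be replaced by a call to whichever alignment program is in use, while the ranking step would stay the same.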

Characteristics of lung cancer-associated proteins

Lung cancer proteins of the present invention may be classified as secreted proteins,
transmembrane proteins or intracellular proteins. In one embodiment, the lung cancer protein
is an intracellular protein. Intracellular proteins may be found in the cytoplasm and/or in the
nucleus. Intracellular proteins are involved in all aspects of cellular function and replication
(including, e.g., signaling pathways); aberrant expression of such proteins often results in
unregulated or dysregulated cellular processes (see, e.g., Alberts (ed. 1994) Molecular
Biology of the Cell (3d ed.)). For example, many intracellular proteins have enzymatic
activity such as protein kinase activity, protein phosphatase activity, protease activity,
nucleotide cyclase activity, polymerase activity and the like. Intracellular proteins also serve
as docking proteins that are involved in organizing complexes of proteins, or targeting
proteins to various subcellular localizations, and are involved in maintaining the structural
integrity of organelles.

An increasingly appreciated concept in characterizing proteins is the presence in the
proteins of one or more structural motifs for which defined functions have been attributed. In
addition to the highly conserved sequences found in the enzymatic domain of proteins, highly
conserved sequences have been identified in proteins that are involved in protein-protein
interaction. For example, Src-homology-2 (SH2) domains bind tyrosine-phosphorylated
targets in a sequence dependent manner. PTB domains, which are distinct from SH2
domains, also bind tyrosine-phosphorylated targets. SH3 domains bind to proline-rich
targets. In addition, PH domains, tetratricopeptide repeats and WD domains, to name only a
few, have been shown to mediate protein-protein interactions. Some of these may also be
involved in binding to phospholipids or other second messengers. As will be appreciated by
one of ordinary skill in the art, these motifs can be identified on the basis of amino acid
sequence; thus, an analysis of the sequence of proteins may provide insight into both the
enzymatic potential of the molecule and the molecules with which the protein may associate.
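
As a simple illustration of identifying a motif from amino acid sequence alone, the sketch below scans a protein for the proline-rich P-x-x-P pattern that is commonly described as a core of SH3-binding sites. The pattern choice, function name, and test sequence are assumptions for illustration; they are not taken from the specification, and practical domain annotation relies on profile methods such as the hidden Markov models in Pfam rather than a short pattern.

```python
import re
from typing import List, Tuple

# A lookahead is used so that overlapping occurrences are all reported.
PXXP_PATTERN = re.compile(r"(?=(P..P))")


def find_pxxp(sequence: str) -> List[Tuple[int, str]]:
    """Return (0-based start position, matched residues) for each P-x-x-P occurrence."""
    return [(m.start(), m.group(1)) for m in PXXP_PATTERN.finditer(sequence.upper())]


# Hypothetical peptide containing two overlapping P-x-x-P occurrences.
hits = find_pxxp("MAPPLPPRNRPRL")
print(hits)
```

A hit only suggests a candidate interaction site; it does not establish that the protein binds an SH3 domain.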

One useful database is Pfam (protein families), which is a large collection of multiple
sequence alignments and hidden Markov models covering many common protein domains.
Versions are available via the internet from Washington University in St. Louis, the Sanger
Center in England, and the Karolinska Institute in Sweden (see, e.g., Bateman, et al. (2000)
Nuc. Acids Res. 28:263-266; Sonnhammer, et al. (1997) Proteins 28:405-420; Bateman, et al.
(1999) Nuc. Acids Res. 27:260-262; and Sonnhammer, et al. (1998) Nuc. Acids Res. 26:320-322).

In another embodiment, the lung cancer sequences are transmembrane proteins.
Transmembrane proteins are molecules that span a phospholipid bilayer of a cell. They may
have an intracellular domain, an extracellular domain, or both. The intracellular domains of
such proteins may have a number of functions including those already described for
intracellular proteins. For example, the intracellular domain may have enzymatic activity
and/or may serve as a binding site for additional proteins. Frequently the intracellular
domain of transmembrane proteins serves both roles. For example, certain receptor tyrosine
kinases have both protein kinase activity and SH2 domains. In addition, autophosphorylation
of tyrosines on the receptor molecule itself creates binding sites for additional SH2 domain
containing proteins.

Transmembrane proteins may contain from one to many transmembrane domains.
For example, receptor tyrosine kinases, certain cytokine receptors, receptor guanylyl cyclases
and receptor serine/threonine protein kinases contain a single transmembrane domain.
However, various other proteins including channels, pumps, and adenylyl cyclases contain
numerous transmembrane domains. Many important cell surface receptors such as G protein
coupled receptors (GPCRs) are classified as "seven transmembrane domain" proteins, as they
contain 7 membrane spanning regions. Characteristics of transmembrane domains include
approximately 17 consecutive hydrophobic amino acids that may be followed by charged
amino acids. Therefore, upon analysis of the amino acid sequence of a particular protein, the
localization and number of transmembrane domains within the protein may be predicted (see,
e.g., PSORT web site http://psort.nibb.ac.jp/).
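
The idea of predicting transmembrane domains from a run of roughly 17 hydrophobic residues can be sketched with a sliding-window hydropathy scan. The sketch below uses the Kyte-Doolittle hydropathy scale; the 17-residue window follows the text, while the cutoff of 1.6, the function name, and the test sequence are assumptions of this example. It is not the method used by PSORT.

```python
from typing import List, Tuple

# Kyte-Doolittle hydropathy values for the 20 standard amino acids.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def predict_tm_segments(
    sequence: str, window: int = 17, threshold: float = 1.6
) -> List[Tuple[int, int]]:
    """Return merged (start, end) spans, 0-based and end-exclusive, of windows
    whose mean hydropathy is at least threshold.

    Residues outside the standard 20 are scored as 0.0 (an assumption).
    """
    seq = sequence.upper()
    segments: List[Tuple[int, int]] = []
    for start in range(len(seq) - window + 1):
        chunk = seq[start:start + window]
        mean = sum(KYTE_DOOLITTLE.get(aa, 0.0) for aa in chunk) / window
        if mean >= threshold:
            end = start + window
            if segments and start <= segments[-1][1]:
                # Overlapping or adjacent windows belong to the same segment.
                segments[-1] = (segments[-1][0], end)
            else:
                segments.append((start, end))
    return segments


# Hypothetical protein: charged flanks around one 20-residue hydrophobic stretch.
hydrophobic_core = "LLIVALLVIALLLIVAVLLI"
protein = "MKRDE" * 4 + hydrophobic_core + "DEKRS" * 4
segments = predict_tm_segments(protein)
print(len(segments), segments)
```

Because windows that only partly overlap the hydrophobic stretch can still pass the cutoff, the reported span may extend a few residues into the flanking regions; the count of segments is the more robust output of this simple scan.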

The extracellular domains of transmembrane proteins are diverse; however, conserved
motifs are found repeatedly among various extracellular domains. Conserved structure
and/or functions have been ascribed to different extracellular motifs. Many extracellular
domains are involved in binding to other molecules. In one aspect, extracellular domains are
found on receptors. Factors that bind the receptor domain include circulating ligands, which
may be peptides, proteins, or small molecules such as adenosine and the like. For example,
growth factors such as EGF, FGF, and PDGF are circulating growth factors that bind to their
cognate receptors to initiate a variety of cellular responses. Other factors include cytokines,
mitogenic factors, hormones, neurotrophic factors and the like. Extracellular domains also
bind to cell-associated molecules. In this respect, they may mediate cell-cell interactions.
Cell-associated ligands can be tethered to the cell, e.g., via a glycosylphosphatidylinositol
(GPI) anchor, or may themselves be transmembrane proteins. Extracellular domains may
also associate with the extracellular matrix and contribute to the maintenance of the cell
structure.

Lung cancer proteins that are transmembrane are particularly preferred in the present
invention as they are readily accessible targets for extracellular immunotherapeutics, as are
described herein. In addition, as outlined below, transmembrane proteins can also be useful
in imaging modalities. Antibodies may be used to label such readily accessible proteins in
situ or in histological analysis. Alternatively, antibodies can also label intracellular proteins,
in which case analytical samples are typically permeabilized to provide access to intracellular
proteins. In addition, some membrane proteins can be processed to release a soluble protein,
or to expose a residual fragment. Released soluble proteins may be useful diagnostic
markers, and processed residual protein fragments may be useful lung markers of disease.

It will also be appreciated by those in the art that a transmembrane protein can be 
made soluble by removing transmembrane sequences, e.g., through recombinant methods. 
Furthermore, transmembrane proteins that have been made soluble can be made to be 
secreted through recombinant means by adding an appropriate signal sequence. 

In another embodiment, the lung cancer proteins are secreted proteins, the secretion of
which can be either constitutive or regulated. These proteins may have a signal peptide or
signal sequence that targets the molecule to the secretory pathway. Secreted proteins are
involved in numerous physiological events; e.g., if circulating, they often serve to transmit
signals to various other cell types. The secreted protein may function in an autocrine manner
(acting on the cell that secreted the factor), a paracrine manner (acting on cells in close
proximity to the cell that secreted the factor), an endocrine manner (acting on cells at a
distance, e.g., secretion into the blood stream), or an exocrine manner (secretion, e.g., through a duct or
to an adjacent epithelial surface, as with sweat glands, sebaceous glands, pancreatic ducts, lacrimal
glands, mammary glands, wax producing glands of the ear, etc.). Thus secreted molecules
often find use in modulating or altering numerous aspects of physiology. Lung cancer
proteins that are secreted proteins are particularly preferred in the present invention as they
serve as good targets for diagnostic markers, e.g., for blood, plasma, serum, or stool tests.
Those which are enzymes may be antibody or small molecule targets. Others may be useful
as vaccine targets, e.g., via CTL mechanisms.

Use of lung cancer nucleic acids 

As described above, a lung cancer sequence is initially identified by substantial nucleic
acid and/or amino acid sequence homology or linkage to the lung cancer sequences outlined
herein. Such homology can be based upon the overall nucleic acid or amino acid sequence,
and is generally determined as outlined below, using either homology programs or
hybridization conditions. Typically, linked sequences on an mRNA are found on the same
molecule.

The lung cancer nucleic acid sequences of the invention, e.g., the sequences in Tables
1A-16, can be fragments of larger genes, i.e., they are nucleic acid segments. "Genes" in this
context includes coding regions, non-coding regions, and mixtures of coding and non-coding
regions. Accordingly, as will be appreciated by those in the art, using the sequences provided
herein, extended sequences, in either direction, of the lung cancer genes can be obtained,
using techniques well known in the art for cloning either longer sequences or the full length
sequences; see Ausubel, et al., supra. Much can be done by informatics and many sequences
can be clustered to include multiple sequences corresponding to a single gene, e.g., systems
such as UniGene (see http://www.ncbi.nlm.nih.gov/UniGene/).

Once a lung cancer nucleic acid is identified, it can be cloned and, if necessary, its
constituent parts recombined to form the entire lung cancer nucleic acid coding regions or the
entire mRNA sequence. Once isolated from its natural source, e.g., contained within a
plasmid or other vector or excised therefrom as a linear nucleic acid segment, the
recombinant lung cancer nucleic acid can be further used as a probe to identify and isolate
other lung cancer nucleic acids, e.g., extended coding regions. It can also be used as a
"precursor" nucleic acid to make modified or variant lung cancer nucleic acids and proteins.

The lung cancer nucleic acids of the present invention are used in several ways. In a
first embodiment, nucleic acid probes to the lung cancer nucleic acids are made and attached
to biochips to be used in screening and diagnostic methods, as outlined below, or for
administration, e.g., for gene therapy, RNAi, vaccine, and/or antisense applications.
Alternatively, the lung cancer nucleic acids that include coding regions of lung cancer
proteins can be put into expression vectors for the expression of lung cancer proteins, again
for screening purposes or for administration to a patient.

In a preferred embodiment, nucleic acid probes to lung cancer nucleic acids (both the
nucleic acid sequences outlined in the figures and/or the complements thereof) are made.
The nucleic acid probes attached to the biochip are designed to be substantially
complementary to the lung cancer nucleic acids, i.e., the target sequence (either the target
sequence of the sample or to other probe sequences, e.g., in sandwich assays), such that
hybridization of the target sequence and the probes of the present invention occurs. As
outlined below, this complementarity need not be perfect; there may be any number of base
pair mismatches which will interfere with hybridization between the target sequence and the
single stranded nucleic acids of the present invention. However, if the number of mutations
is so great that no hybridization can occur under even the least stringent of hybridization
conditions, the sequence is not a complementary target sequence. Thus, by "substantially
complementary" herein is meant that the probes are sufficiently complementary to the target
sequences to hybridize under appropriate reaction conditions, particularly high stringency
conditions, as outlined herein.
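
By way of illustration only, the notion of a probe that is complementary to a target
sequence, with a limited number of mismatches tolerated, can be sketched in Python as
follows. The function names and the max_mismatches threshold are hypothetical and are not
part of the disclosure; actual hybridization depends on the stringency conditions used and
not on a simple mismatch count.

```python
# Illustrative sketch only: names and the mismatch threshold are assumptions.
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return "".join(COMPLEMENT[base] for base in reversed(seq.upper()))


def count_mismatches(probe, target_site):
    """Count positions where the probe departs from the perfect complement
    of the target site. Both sequences are written 5' to 3'."""
    if len(probe) != len(target_site):
        raise ValueError("probe and target site must have the same length")
    perfect_probe = reverse_complement(target_site)
    return sum(1 for p, q in zip(probe.upper(), perfect_probe) if p != q)


def is_substantially_complementary(probe, target_site, max_mismatches=2):
    """Crude stand-in for 'hybridizes under the chosen conditions'."""
    return count_mismatches(probe, target_site) <= max_mismatches
```

A lower max_mismatches value loosely corresponds to a higher stringency requirement in
this simplified model.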

A nucleic acid probe is generally single stranded but can be partially single and
partially double stranded. The strandedness of the probe is dictated by the structure,
composition, and properties of the target sequence. In general, the nucleic acid probes range
from about 8 to about 100 bases long, with from about 10 to about 80 bases being preferred,
and from about 30 to about 50 bases being particularly preferred. That is, generally
complements of ORFs or whole genes are not used. In some embodiments, nucleic acids of
lengths up to hundreds of bases can be used.

In a preferred embodiment, more than one probe per sequence is used, with either
overlapping probes or probes to different sections of the target being used. That is, two,
three, four or more probes, with three being preferred, are used to build in a redundancy for a
particular target. The probes can be overlapping (i.e., have some sequence in common), or
separate. In some cases, PCR primers may be used to amplify signal for higher sensitivity.
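
As a minimal sketch of the redundancy described above, the following Python fragment
selects several probes against one target, either overlapping or separate. The function
tile_probes and its defaults (three probes of 40 bases) are illustrative assumptions chosen
from within the ranges stated in this section; they are not a prescribed design procedure.

```python
# Illustrative sketch only: function names and defaults are assumptions.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def tile_probes(target, probe_len=40, n_probes=3, overlap=0):
    """Return up to n_probes (start, probe) pairs for one target.

    Each probe is the reverse complement of a window of the target.
    overlap > 0 gives overlapping probes; overlap == 0 gives separate,
    adjacent probes. Windows that would run past the target are dropped.
    """
    step = probe_len - overlap
    if probe_len <= 0 or step <= 0:
        raise ValueError("probe_len must be positive and larger than overlap")
    probes = []
    for i in range(n_probes):
        start = i * step
        window = target[start:start + probe_len]
        if len(window) < probe_len:
            break
        probes.append((start, reverse_complement(window)))
    return probes
```

For a 120-base target, the defaults give three adjacent probes starting at positions 0, 40,
and 80, while overlap=10 gives starts at 0, 30, and 60.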

As will be appreciated by those in the art, nucleic acids can be attached or
immobilized to a solid support in a wide variety of ways. By "immobilized" and grammatical
equivalents herein is meant that the association or binding between the nucleic acid probe and
the solid support is sufficient to be stable under the conditions of binding, washing, analysis,
and removal as outlined below. The binding can typically be covalent or non-covalent. By
"non-covalent binding" and grammatical equivalents herein is typically meant one or more of
electrostatic, hydrophilic, and hydrophobic interactions. Included in non-covalent binding is
the covalent attachment of a molecule, such as streptavidin, to the support and the
non-covalent binding of the biotinylated probe to the streptavidin. By "covalent binding" and
grammatical equivalents herein is meant that the two moieties, the solid support and the
probe, are attached by at least one bond, including sigma bonds, pi bonds and coordination
bonds. Covalent bonds can be formed directly between the probe and the solid support or can
be formed by a cross linker or by inclusion of a specific reactive group on either the solid
support or the probe or both molecules. Immobilization may also involve a combination of
covalent and non-covalent interactions.

In general, the probes are attached to a biochip in a wide variety of ways, as will be
appreciated by those in the art. As described herein, the nucleic acids can either be
synthesized first, with subsequent attachment to the biochip, or can be directly synthesized on
the biochip.

The biochip comprises a suitable solid substrate. By "substrate" or "solid support" or
other grammatical equivalents herein is meant a material that can be modified for the
attachment or association of the nucleic acid probes and is amenable to at least one detection
method. Often the substrate may contain discrete individual sites appropriate for individual
partitioning and identification. As will be appreciated by those in the art, the number of
possible substrates is very large, and includes, but is not limited to, glass and modified or
functionalized glass, plastics (including acrylics, polystyrene and copolymers of styrene and
other materials, polypropylene, polyethylene, polybutylene, polyurethanes, Teflon, etc.),
polysaccharides, nylon or nitrocellulose, resins, silica or silica-based materials including
silicon and modified silicon, carbon, metals, inorganic glasses, plastics, etc. In general, the
substrates allow optical detection and do not appreciably fluoresce. A preferred substrate is
described in US application entitled Reusable Low Fluorescent Plastic Biochip, U.S.
Application Serial No. 09/270,214, filed March 15, 1999, herein incorporated by reference in
its entirety.

Generally the substrate is planar, although as will be appreciated by those in the art,
other configurations of substrates may be used as well. For example, the probes may be
placed on the inside surface of a tube, for flow-through sample analysis to minimize sample
volume. Similarly, the substrate may be flexible, such as a flexible foam, including closed
cell foams made of particular plastics.

In a preferred embodiment, the surface of the biochip and the probe may be
derivatized with chemical functional groups for subsequent attachment of the two. Thus, e.g.,
the biochip is derivatized with a chemical functional group including, but not limited to,
amino groups, carboxy groups, oxo groups and thiol groups, with amino groups being
particularly preferred. Using these functional groups, the probes can be attached using
functional groups on the probes. For example, nucleic acids containing amino groups can be
attached to surfaces comprising amino groups, e.g., using linkers as are known in the art; e.g.,
homo- or hetero-bifunctional linkers as are well known (see 1994 Pierce Chemical Company
catalog, technical section on cross-linkers, pages 155-200). In addition, in some cases,
additional linkers, such as alkyl groups (including substituted and heteroalkyl groups) may be
used.

In this embodiment, oligonucleotides are synthesized, and then attached to the surface
of the solid support. Either the 5' or 3' terminus may be attached to the solid support, or
attachment may be via linkage to an internal nucleoside.

In another embodiment, the immobilization to the solid support may be very strong,
yet non-covalent. For example, biotinylated oligonucleotides can be made, which bind to
surfaces covalently coated with streptavidin, resulting in attachment.

Alternatively, the oligonucleotides may be synthesized on the surface, as is known in
the art. For example, photoactivation techniques utilizing photopolymerization compounds
and techniques are used. In a preferred embodiment, the nucleic acids can be synthesized in
situ, using known photolithographic techniques, such as those described in WO 95/25116;
WO 95/35505; U.S. Patent Nos. 5,700,637 and 5,445,934; and references cited within, all of
which are expressly incorporated by reference; these methods of attachment form the basis of
the Affymetrix GeneChip™ technology.

Often, amplification-based assays are performed to measure the expression level of
lung cancer-associated sequences. These assays are typically performed in conjunction with
reverse transcription. In such assays, a lung cancer-associated nucleic acid sequence acts as a
template in an amplification reaction (e.g., Polymerase Chain Reaction, or PCR). In a
quantitative amplification, the amount of amplification product will be proportional to the
amount of template in the original sample. Comparison to appropriate controls provides a
measure of the amount of lung cancer-associated RNA. Methods of quantitative
amplification are well known to those of skill in the art. Detailed protocols for quantitative
PCR are provided, e.g., in Innis, et al. (1990) PCR Protocols, A Guide to Methods and
Applications.
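
The proportionality between amplification product and starting template can be
illustrated with a simplified calculation. The sketch below assumes an idealized exponential
model, in which the product after c cycles equals the starting amount multiplied by
(1 + E) raised to the power c, and assumes that a threshold cycle is recorded for both the
sample and a control. The model, the function name, and the efficiency parameter E are
assumptions made for illustration and are not taken from the protocols cited above.

```python
# Illustrative sketch only: an idealized exponential amplification model.
def relative_template_amount(ct_sample, ct_control, efficiency=1.0):
    """Estimate the sample/control ratio of starting template.

    ct_sample, ct_control: cycle at which each reaction reaches the same
        product threshold (hypothetical measured values).
    efficiency: per-cycle amplification efficiency E, where 1.0 means the
        product doubles every cycle.
    """
    if not 0.0 < efficiency <= 1.0:
        raise ValueError("efficiency must be in the interval (0, 1]")
    # A sample reaching the threshold earlier started with more template.
    return (1.0 + efficiency) ** (ct_control - ct_sample)
```

Under this model, a sample that reaches the threshold three cycles before the control, at
perfect doubling, started with about eight times as much template.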

In some embodiments, a TaqMan based assay is used to measure expression.
TaqMan based assays use a fluorogenic oligonucleotide probe that contains a 5' fluorescent
dye and a 3' quenching agent. The probe hybridizes to a PCR product, but cannot itself be
extended due to a blocking agent at the 3' end. When the PCR product is amplified in
subsequent cycles, the 5' nuclease activity of the polymerase, e.g., AmpliTaq, results in the
cleavage of the TaqMan probe. This cleavage separates the 5' fluorescent dye and the 3'
quenching agent, thereby resulting in an increase in fluorescence as a function of
amplification (see, e.g., literature provided by Perkin-Elmer, e.g., www2.perkin-elmer.com).

Other suitable amplification methods include, but are not limited to, ligase chain
reaction (LCR) (see Wu and Wallace (1989) Genomics 4:560, Landegren, et al. (1988)
Science 241:1077, and Barringer, et al. (1990) Gene 89:117), transcription amplification
(Kwoh, et al. (1989) Proc. Natl. Acad. Sci. USA 86:1173), self-sustained sequence
replication (Guatelli, et al. (1990) Proc. Natl. Acad. Sci. USA 87:1874), dot PCR, and linker
adapter PCR, etc.

Expression of lung cancer proteins from nucleic acids 

In a preferred embodiment, lung cancer nucleic acids, e.g., encoding lung cancer
proteins, are used to make a variety of expression vectors to express lung cancer proteins
which can then be used in screening assays, as described below. Expression vectors and
recombinant DNA technology are well known to those of skill in the art (see, e.g., Ausubel,
supra, and Fernandez and Hoeffler (eds 1999) Gene Expression Systems) and are used to
express proteins. The expression vectors may be either self-replicating extrachromosomal
vectors or vectors which integrate into a host genome. Generally, these expression vectors
include transcriptional and translational regulatory nucleic acid operably linked to the nucleic
acid encoding the lung cancer protein. The term "control sequences" refers to DNA
sequences used for the expression of an operably linked coding sequence in a particular host
organism. Control sequences that are suitable for prokaryotes, e.g., include a promoter,
optionally an operator sequence, and a ribosome binding site. Eukaryotic cells are known to
utilize promoters, polyadenylation signals, and enhancers.

Nucleic acid is "operably linked" when it is placed into a functional relationship with
another nucleic acid sequence. For example, DNA for a presequence or secretory leader is
operably linked to DNA for a polypeptide if it is expressed as a preprotein that participates in
the secretion of the polypeptide; a promoter or enhancer is operably linked to a coding
sequence if it affects the transcription of the sequence; or a ribosome binding site is operably
linked to a coding sequence if it is positioned so as to facilitate translation. Generally,
"operably linked" means that the DNA sequences being linked are contiguous, and, in the
case of a secretory leader, contiguous and in reading phase. However, enhancers do not have
to be contiguous. Linking is typically accomplished by ligation at convenient restriction
sites. If such sites do not exist, synthetic oligonucleotide adaptors or linkers are used in
accordance with conventional practice. Transcriptional and translational regulatory nucleic
acid will generally be appropriate to the host cell used to express the lung cancer protein.
Numerous types of appropriate expression vectors, and suitable regulatory sequences are
known in the art for a variety of host cells.

In general, transcriptional and translational regulatory sequences may include, but are
not limited to, promoter sequences, ribosomal binding sites, transcriptional start and stop
sequences, translational start and stop sequences, and enhancer or activator sequences. In a
preferred embodiment, the regulatory sequences include a promoter and transcriptional start
and stop sequences.

Promoter sequences may be either constitutive or inducible promoters. The promoters
may be either naturally occurring promoters or hybrid promoters. Hybrid promoters, which
combine elements of more than one promoter, are also known in the art, and are useful in the
present invention.

In addition, an expression vector may comprise additional elements. For example, the
expression vector may have two replication systems, thus allowing it to be maintained in two
organisms, e.g., in mammalian or insect cells for expression and in a prokaryotic host for
cloning and amplification. Furthermore, for integrating expression vectors, the expression
vector often contains at least one sequence homologous to the host cell genome, and
preferably two homologous sequences which flank the expression construct. The integrating
vector may be directed to a specific locus in the host cell by selecting the appropriate
homologous sequence for inclusion in the vector. Constructs for integrating vectors are well
known in the art (e.g., Fernandez and Hoeffler, supra).

In addition, in a preferred embodiment, the expression vector contains a selectable
marker gene to allow the selection of transformed host cells. Selection genes are well known
in the art and will vary with the host cell used.

The lung cancer proteins of the present invention are usually produced by culturing a
host cell transformed with an expression vector containing nucleic acid encoding a lung
cancer protein, under the appropriate conditions to induce or cause expression of the lung
cancer protein. Conditions appropriate for lung cancer protein expression will vary with the
choice of the expression vector and the host cell, and will be easily ascertained by one skilled
in the art through routine experimentation or optimization. For example, the use of
constitutive promoters in the expression vector will require optimizing the growth and
proliferation of the host cell, while the use of an inducible promoter requires the appropriate
growth conditions for induction. In addition, in some embodiments, the timing of the harvest
is important. For example, the baculoviral systems used in insect cell expression are lytic
viruses, and thus harvest time selection can be crucial for product yield.

Appropriate host cells include yeast, bacteria, archaebacteria, fungi, and insect and
animal cells, including mammalian cells. Of particular interest are Saccharomyces cerevisiae
and other yeasts, E. coli, Bacillus subtilis, Sf9 cells, C129 cells, 293 cells, Neurospora, BHK,
CHO, COS, HeLa cells, HUVEC (human umbilical vein endothelial cells), THP1 cells (a
macrophage cell line) and various other human cells and cell lines.

In a preferred embodiment, the lung cancer proteins are expressed in mammalian
cells. Mammalian expression systems are also known in the art, and include retroviral and
adenoviral systems. Of particular use as mammalian promoters are the promoters from
mammalian viral genes, since the viral genes are often highly expressed and have a broad
host range. Examples include the SV40 early promoter, mouse mammary tumor virus LTR
promoter, adenovirus major late promoter, herpes simplex virus promoter, and the CMV
promoter (see, e.g., Fernandez and Hoeffler, supra). Typically, transcription termination and
polyadenylation sequences recognized by mammalian cells are regulatory regions located 3'
to the translation stop codon and thus, together with the promoter elements, flank the coding
sequence. Examples of transcription terminator and polyadenylation signals include those
derived from SV40.

The methods of introducing exogenous nucleic acid into mammalian hosts, as well as
other hosts, are well known in the art, and will vary with the host cell used. Techniques
include dextran-mediated transfection, calcium phosphate precipitation, polybrene mediated
transfection, protoplast fusion, electroporation, viral infection, encapsulation of the
polynucleotide(s) in liposomes, and direct microinjection of the DNA into nuclei.

In a preferred embodiment, lung cancer proteins are expressed in bacterial systems. 
Promoters from bacteriophage may also be used and are known in the art. In addition, 
synthetic promoters and hybrid promoters are also useful; e.g., the tac promoter is a hybrid of 
the trp and lac promoter sequences. Furthermore, a bacterial promoter can include naturally 
occurring promoters of non-bacterial origin that have the ability to bind bacterial RNA 
polymerase and initiate transcription. In addition to a functioning promoter sequence, an 
efficient ribosome binding site is desirable. The expression vector may also include a signal 
peptide sequence that provides for secretion of the lung cancer protein in bacteria. The 
protein is either secreted into the growth media (gram-positive bacteria) or into the 
periplasmic space, located between the inner and outer membrane of the cell (gram-negative 
bacteria). The bacterial expression vector may also include a selectable marker gene to allow 
for the selection of bacterial strains that have been transformed. Suitable selection genes 
include genes which render the bacteria resistant to drugs such as ampicillin, 
chloramphenicol, erythromycin, kanamycin, neomycin and tetracycline. Selectable markers 
also include biosynthetic genes, such as those in the histidine, tryptophan and leucine 
biosynthetic pathways. These components are assembled into expression vectors. Expression 
vectors for bacteria are well known in the art, and include vectors for Bacillus subtilis, E. 
coli, Streptococcus cremoris, and Streptococcus lividans, among others (e.g., Fernandez and 
Hoeffler, supra). The bacterial expression vectors are transformed into bacterial host cells 
using techniques well known in the art, such as calcium chloride treatment, electroporation, 
and others. 

In one embodiment, lung cancer proteins are produced in insect cells. Expression
vectors for the transformation of insect cells, and in particular, baculovirus-based expression
vectors, are well known in the art.

In a preferred embodiment, lung cancer protein is produced in yeast cells. Yeast
expression systems are well known in the art, and include expression vectors for
Saccharomyces cerevisiae, Candida albicans and C. maltosa, Hansenula polymorpha,
Kluyveromyces fragilis and K. lactis, Pichia guilliermondii and P. pastoris,
Schizosaccharomyces pombe, and Yarrowia lipolytica.

The lung cancer protein may also be made as a fusion protein, using techniques well
known in the art. Thus, e.g., for the creation of monoclonal antibodies, if the desired epitope
is small, the lung cancer protein may be fused to a carrier protein to form an immunogen.
Alternatively, the lung cancer protein may be made as a fusion protein to increase expression
for affinity purification purposes, or for other reasons. For example, when the lung cancer
protein is a lung cancer peptide, the nucleic acid encoding the peptide may be linked to other
nucleic acid for expression purposes.

In a preferred embodiment, the lung cancer protein is purified or isolated after
expression. Lung cancer proteins may be isolated or purified in a variety of appropriate
ways. Standard purification methods include electrophoretic, molecular, immunological and
chromatographic techniques, including ion exchange, hydrophobic, affinity, and
reverse-phase HPLC chromatography, and chromatofocusing. For example, the lung cancer
protein may be purified using a standard anti-lung cancer protein antibody column.
Ultrafiltration and diafiltration techniques, in conjunction with protein concentration, are also
useful. For general guidance in suitable purification techniques, see Scopes (1982) Protein
Purification. The degree of purification necessary will vary depending on the use of the lung
cancer protein. In some instances no purification will be necessary.

Once expressed and purified if necessary, the lung cancer proteins and nucleic acids
are useful in a number of applications. They may be used as immunoselection reagents, as
vaccine reagents, as screening agents, therapeutic entities, for production of antibodies, as
transcription or translation inhibitors, etc.

Variants of lung cancer proteins

In one embodiment, the lung cancer proteins are derivative or variant lung cancer
proteins as compared to the wild-type sequence. That is, as outlined more fully below, the
derivative lung cancer peptide will often contain at least one amino acid substitution, deletion
or insertion, with amino acid substitutions being particularly preferred. The amino acid
substitution, insertion or deletion may occur at a particular residue within the lung cancer
peptide.

Also included within one embodiment of lung cancer proteins of the present invention
are amino acid sequence variants. These variants typically fall into one or more of three
classes: substitutional, insertional or deletional variants. These variants ordinarily are
prepared by site specific mutagenesis of nucleotides in the DNA encoding the lung cancer
protein, using cassette or PCR mutagenesis or other techniques, to produce DNA encoding
the variant, and thereafter expressing the DNA in recombinant cell culture as outlined above.
However, variant lung cancer protein fragments having up to about 100-150 residues may be
prepared by in vitro synthesis. Amino acid sequence variants are characterized by the
predetermined nature of the variation, a feature that sets them apart from naturally occurring
allelic or interspecies variation of the lung cancer protein amino acid sequence. The variants
typically exhibit a similar qualitative biological activity as the naturally occurring analogue,
although variants can also be selected which have modified characteristics as will be more
fully outlined below.

While the site or region for introducing an amino acid sequence variation is often
predetermined, the mutation per se need not be predetermined. For example, in order to
optimize the performance of a mutation at a given site, random mutagenesis may be
conducted at the target codon or region and the expressed lung cancer variants screened for
the optimal combination of desired activity. Techniques exist for making substitution
mutations at predetermined sites in DNA having a known sequence, e.g., M13 primer
mutagenesis and PCR mutagenesis. Screening of mutants is often done using assays of lung
cancer protein activities.
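
The idea of fixing the site while leaving the mutation itself open can be illustrated in
silico. The following Python sketch enumerates every alternative codon at one predetermined
codon position of a coding sequence, which is one way of listing the candidates that random
mutagenesis at a target codon could produce. The function name and the example sequence
are hypothetical; the sketch does not model any laboratory mutagenesis protocol.

```python
# Illustrative sketch only: enumerates candidate sequences, nothing more.
from itertools import product


def codon_variants(coding_seq, codon_index):
    """Return all coding sequences that differ from coding_seq only at the
    codon with the given zero-based index (63 alternatives per codon)."""
    seq = coding_seq.upper()
    if len(seq) % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    start = codon_index * 3
    if not 0 <= start < len(seq):
        raise IndexError("codon_index is outside the coding sequence")
    original = seq[start:start + 3]
    variants = []
    for bases in product("ACGT", repeat=3):
        codon = "".join(bases)
        if codon != original:
            # Replace only the target codon; flanking codons are unchanged.
            variants.append(seq[:start] + codon + seq[start + 3:])
    return variants
```

For example, codon_variants("ATGGCCTAA", 1) lists the 63 sequences in which only the
middle codon GCC has been replaced; each would then be expressed and screened.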

Amino acid substitutions are typically of single residues; insertions usually will be on
the order of from about 1 to 20 amino acids, although considerably larger insertions may be
occasionally tolerated. Deletions generally range from about 1 to about 20 residues, although
in some cases deletions may be much larger.

Substitutions, deletions, insertions or any combination thereof may be used to arrive
at a final derivative. Generally these changes are done on a few amino acids to minimize the
alteration of the molecule. Larger changes may be tolerated in certain circumstances. When
small alterations in the characteristics of a lung cancer protein are desired, substitutions are
generally made in accordance with the amino acid substitution chart provided in the
definition section.

Variants typically exhibit essentially the same qualitative biological activity and will
elicit the same immune response as a naturally-occurring analog, although variants also are
selected to modify the characteristics of lung cancer proteins as needed. Alternatively, the
variant may be designed or reorganized such that a biological activity of the lung cancer
protein is altered. For example, glycosylation sites may be added, altered, or removed.

Covalent modifications of lung cancer polypeptides are included within the scope of
this invention. One type of covalent modification includes reacting targeted amino acid
residues of a lung cancer polypeptide with an organic derivatizing agent that is capable of
reacting with selected side chains or the N- or C-terminal residues of a lung cancer
polypeptide. Derivatization with bifunctional agents is useful, for instance, for crosslinking
lung cancer polypeptides to a water-insoluble support matrix or surface for use in a method
for purifying anti-lung cancer polypeptide antibodies or screening assays, as is more fully
described below. Commonly used crosslinking agents include, e.g.,
1,1-bis(diazoacetyl)-2-phenylethane, glutaraldehyde, N-hydroxysuccinimide esters, e.g.,
esters with 4-azidosalicylic acid, homobifunctional imidoesters, including disuccinimidyl
esters such as 3,3'-dithiobis(succinimidylpropionate), bifunctional maleimides such as
bis-N-maleimido-1,8-octane and agents such as
methyl-3-((p-azidophenyl)dithio)propioimidate.

Other modifications include deamidation of glutaminyl and asparaginyl residues to
the corresponding glutamyl and aspartyl residues, respectively, hydroxylation of proline and
lysine, phosphorylation of hydroxyl groups of serinyl, threonyl or tyrosyl residues,
methylation of the α-amino groups of lysine, arginine, and histidine side chains (Creighton
(1983) Proteins: Structure and Molecular Properties, pp. 79-86), acetylation of the N-terminal
amine, and amidation of any C-terminal carboxyl group.

Another type of covalent modification of the lung cancer polypeptide encompassed by
this invention is an altered native glycosylation pattern of the polypeptide. "Altering the
native glycosylation pattern" is intended herein to mean adding to or deleting one or more
carbohydrate moieties of a native sequence lung cancer polypeptide. Glycosylation patterns
can be altered in many ways. For example, the use of different cell types to express lung
cancer-associated sequences can result in different glycosylation patterns.

Addition of glycosylation sites to lung cancer polypeptides may also be accomplished
by altering the amino acid sequence thereof. The alteration may be made, e.g., by the
addition of, or substitution by, one or more serine or threonine residues to the native sequence
lung cancer polypeptide (for O-linked glycosylation sites). The lung cancer amino acid
sequence may optionally be altered through changes at the DNA level, particularly by
mutating the DNA encoding the lung cancer polypeptide at preselected bases such that
codons are generated that will translate into the desired amino acids.

Another means of increasing the number of carbohydrate moieties on the lung cancer
polypeptide is by chemical or enzymatic coupling of glycosides to the polypeptide. Such
methods are described in the art, e.g., in WO 87/05330, and in Aplin and Wriston (1981)
CRC Crit. Rev. Biochem., pp. 259-306.

Removal of carbohydrate moieties present on the lung cancer polypeptide may be
accomplished chemically or enzymatically or by mutational substitution of codons encoding
for amino acid residues that serve as targets for glycosylation. Chemical deglycosylation
techniques are known in the art and described, for instance, by Hakimuddin, et al. (1987)
Arch. Biochem. Biophys. 259:52 and by Edge, et al. (1981) Anal. Biochem. 118:131.
Enzymatic cleavage of carbohydrate moieties on polypeptides can be achieved by the use of a
variety of endo- and exo-glycosidases as described by Thotakura, et al. (1987) Meth.
Enzymol. 138:350.

Another type of covalent modification of lung cancer polypeptides comprises linking the
lung cancer polypeptide to one of a variety of nonproteinaceous polymers, e.g., polyethylene
glycol, polypropylene glycol, or polyoxyalkylenes, in the manner set forth in U.S. Patent
Nos. 4,640,835; 4,496,689; 4,301,144; 4,670,417; 4,791,192, or 4,179,337.

Lung cancer polypeptides of the present invention may also be modified in a way to
form chimeric molecules comprising a lung cancer polypeptide fused to another,
heterologous polypeptide or amino acid sequence. In one embodiment, such a chimeric
molecule comprises a fusion of a lung cancer polypeptide with a tag polypeptide which
provides an epitope to which an anti-tag antibody can selectively bind. The epitope tag is
generally placed at the amino- or carboxyl-terminus of the lung cancer polypeptide. The
presence of such epitope-tagged forms of a lung cancer polypeptide can be detected using an
antibody against the tag polypeptide. Also, provision of the epitope tag enables the lung
cancer polypeptide to be readily purified by affinity purification using an anti-tag antibody or
another type of affinity matrix that binds to the epitope tag. In an alternative embodiment,
the chimeric molecule may comprise a fusion of a lung cancer polypeptide with an
immunoglobulin or a particular region of an immunoglobulin. For a bivalent form of the
chimeric molecule, such a fusion could be to the Fc region of an IgG molecule.

Various tag polypeptides and their respective antibodies are well known and examples
include poly-histidine (poly-his) or poly-histidine-glycine (poly-his-gly) tags; HIS6 and metal
chelation tags; the flu HA tag polypeptide and its antibody 12CA5 (Field, et al. (1988) Mol.
Cell. Biol. 8:2159-2165); the c-myc tag and the 8F9, 3C7, 6E10, G4, B7 and 9E10 antibodies
thereto (Evan, et al. (1985) Molecular and Cellular Biology 5:3610-3616); and the Herpes
Simplex virus glycoprotein D (gD) tag and its antibody (Paborsky, et al. (1990) Protein
Engineering 3(6):547-553). Other tag polypeptides include the Flag-peptide (Hopp, et al.
(1988) BioTechnology 6:1204-1210); the KT3 epitope peptide (Martin, et al. (1992) Science
255:192-194); tubulin epitope peptide (Skinner, et al. (1991) J. Biol. Chem. 266:15163-
15166); and the T7 gene 10 protein peptide tag (Lutz-Freyermuth, et al. (1990) Proc. Nat'l
Acad. Sci. USA 87:6393-6397).

Also included are other lung cancer proteins of the lung cancer family, and lung
cancer proteins from other organisms, which are cloned and expressed as outlined below.
Thus, probe or degenerate polymerase chain reaction (PCR) primer sequences may be used to
find other related lung cancer proteins from primates or other organisms. As will be
appreciated by those in the art, particularly useful probe and/or PCR primer sequences
include unique areas of the lung cancer nucleic acid sequence. As is generally known in the
art, preferred PCR primers are from about 15 to about 35 nucleotides in length, with from
about 20 to about 30 being preferred, and may contain inosine as needed. PCR reaction
conditions are well known in the art (e.g., Innis, PCR Protocols, supra).
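
By way of illustration only, the primer length ranges stated above may be checked programmatically. The following Python sketch is not part of the disclosed methods; the function name classify_primer_length, the reading of "about" as exact inclusive bounds, the label "more preferred" for the narrower range, and the use of the letter I to denote inosine are all assumptions made for this example.

```python
# Illustrative sketch: classify a candidate PCR primer by its length.
# Assumptions (not from the specification): the "about" ranges are treated
# as exact inclusive bounds, and "I" is used as the symbol for inosine.

ALLOWED_BASES = set("ACGTI")


def classify_primer_length(primer: str) -> str:
    """Return 'more preferred' (20-30 nt), 'preferred' (15-35 nt),
    or 'outside range' for the given primer sequence."""
    seq = primer.upper()
    unknown = set(seq) - ALLOWED_BASES
    if unknown:
        raise ValueError(f"unexpected base symbols: {sorted(unknown)}")
    n = len(seq)
    if 20 <= n <= 30:
        return "more preferred"
    if 15 <= n <= 35:
        return "preferred"
    return "outside range"
```

Under these assumptions, a 24-nucleotide primer falls in the narrower range, while a 16-nucleotide primer falls only in the broader range.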

Antibodies to lung cancer proteins 

In a preferred embodiment, when a lung cancer protein is to be used to generate
antibodies, e.g., for immunotherapy or immunodiagnosis, the lung cancer protein should
share at least one epitope or determinant with the full length protein. By "epitope" or
"determinant" herein is typically meant a portion of a protein which will generate and/or bind
an antibody or T-cell receptor in the context of MHC. Thus, in most instances, antibodies
made to a smaller lung cancer protein will be able to bind to the full-length protein,
particularly linear epitopes. In a preferred embodiment, the epitope is unique; that is,
antibodies generated to a unique epitope show little or no cross-reactivity.

Methods of preparing polyclonal antibodies are well known (e.g., Coligan, supra; and
Harlow and Lane, supra). Polyclonal antibodies can be raised in a mammal, e.g., by one or
more injections of an immunizing agent and, if desired, an adjuvant. Typically, the
immunizing agent and/or adjuvant will be injected in the mammal by multiple subcutaneous
or intraperitoneal injections. The immunizing agent may include a protein encoded by a
nucleic acid of Tables 1A-16 or fragment thereof or a fusion protein thereof. It may be useful
to conjugate the immunizing agent to a protein known to be immunogenic in the mammal
being immunized. Immunogenic proteins include, e.g., keyhole limpet hemocyanin, serum
albumin, bovine thyroglobulin, and soybean trypsin inhibitor. Adjuvants include, e.g.,
Freund's complete adjuvant and MPL-TDM adjuvant (monophosphoryl Lipid A, synthetic
trehalose dicorynomycolate). The immunization protocol may be selected by one skilled in
the art.

The antibodies may, alternatively, be monoclonal antibodies. Monoclonal antibodies
may be prepared using hybridoma methods, such as those described by Kohler and Milstein
(1975) Nature 256:495. In a hybridoma method, a mouse, hamster, or other appropriate host
animal, is typically immunized with an immunizing agent to elicit lymphocytes that produce
or are capable of producing antibodies that will specifically bind to the immunizing agent.
Alternatively, the lymphocytes may be immunized in vitro. The immunizing agent will
typically include a polypeptide encoded by a nucleic acid of the tables, or fragment thereof,
or a fusion protein thereof. Generally, either peripheral blood lymphocytes ("PBLs") are
used if cells of human origin are desired, or spleen cells or lymph node cells are used if
non-human mammalian sources are desired. The lymphocytes are then fused with an
immortalized cell line using a suitable fusing agent, such as polyethylene glycol, to form a
hybridoma cell (Goding (1986) Monoclonal Antibodies: Principles and Practice, pp. 59-103).
Immortalized cell lines are usually transformed mammalian cells, particularly myeloma cells
of rodent, bovine, or primate origin. Usually, rat or mouse myeloma cell lines are employed.
The hybridoma cells may be cultured in a suitable culture medium that preferably contains
one or more substances that inhibit the growth or survival of the unfused, immortalized cells.
For example, if the parental cells lack the enzyme hypoxanthine guanine phosphoribosyl
transferase (HGPRT or HPRT), the culture medium for the hybridomas typically will include
hypoxanthine, aminopterin, and thymidine ("HAT medium"), which substances prevent the
growth of HGPRT-deficient cells.

In one embodiment, the antibodies are bispecific antibodies. Bispecific antibodies are
typically monoclonal, preferably human or humanized, antibodies that have binding
specificities for at least two different antigens or that have binding specificities for two
epitopes on the same antigen. In one embodiment, one of the binding specificities is for a
protein encoded by a nucleic acid of the tables or a fragment thereof, the other one is for any
other antigen, and preferably for a cell-surface protein or receptor or receptor subunit,
preferably one that is tumor specific. Alternatively, tetramer-type technology may create
multivalent reagents.

In a preferred embodiment, the antibodies to lung cancer protein are capable of
reducing or eliminating a biological function of a lung cancer protein, in a naked form or
conjugated to an effector moiety. That is, the addition of anti-lung cancer protein antibodies
(either polyclonal or preferably monoclonal) to lung cancer tissue (or cells containing lung
cancer) may reduce or eliminate the lung cancer. Generally, at least a 25% decrease in
activity, growth, size or the like is preferred, with at least about 50% being particularly
preferred and about a 95-100% decrease being especially preferred.

In a preferred embodiment the antibodies to the lung cancer proteins are humanized
antibodies (e.g., Xenerex Biosciences, Medarex, Inc., Abgenix, Inc., Protein Design Labs,
Inc.). Humanized forms of non-human (e.g., murine) antibodies are chimeric molecules of
immunoglobulins, immunoglobulin chains or fragments thereof (such as Fv, Fab, Fab',
F(ab')2 or other antigen-binding subsequences of antibodies) which contain minimal
sequence derived from non-human immunoglobulin. Humanized antibodies include human
immunoglobulins (recipient antibody) in which residues from a complementarity determining
region (CDR) of the recipient are replaced by residues from a CDR of a non-human species
(donor antibody) such as mouse, rat or rabbit having the desired specificity, affinity and
capacity. In some instances, Fv framework residues of a human immunoglobulin are
replaced by corresponding non-human residues. Humanized antibodies may also comprise
residues which are found neither in the recipient antibody nor in the imported CDR or
framework sequences. In general, a humanized antibody will comprise substantially all of at
least one, and typically two, variable domains, in which all or substantially all of the CDR
regions correspond to those of a non-human immunoglobulin and all or substantially all of
the framework (FR) regions are those of a human immunoglobulin consensus sequence. A
humanized antibody optimally will also comprise at least a portion of an
immunoglobulin constant region (Fc), typically that of a human immunoglobulin (Jones, et
al. (1986) Nature 321:522-525; Riechmann, et al. (1988) Nature 332:323-329; and Presta
(1992) Curr. Op. Struct. Biol. 2:593-596). Humanization can be performed following the
method of Winter and co-workers (Jones, et al. (1986) Nature 321:522-525; Riechmann, et al.
(1988) Nature 332:323-327; Verhoeyen, et al. (1988) Science 239:1534-1536), by
substituting rodent CDRs or CDR sequences for corresponding sequences of a human
antibody. Accordingly, such humanized antibodies are chimeric antibodies (U.S. Patent No.
4,816,567), wherein substantially less than an intact human variable domain has been
substituted by corresponding sequence from a non-human species.

Human-like antibodies can also be produced using various techniques known in the
art, including phage display libraries (Hoogenboom and Winter (1991) J. Mol. Biol. 227:381;
Marks, et al. (1991) J. Mol. Biol. 222:581). The techniques of Cole, et al. and Boerner, et al.
are also available for the preparation of human monoclonal antibodies (Cole, et al. (1985)
Monoclonal Antibodies and Cancer Therapy, p. 77 and Boerner, et al. (1991) J. Immunol.
147(1):86-95). Similarly, human antibodies can be made by introducing human
immunoglobulin loci into transgenic animals, e.g., mice in which the endogenous
immunoglobulin genes have been partially or completely inactivated. Upon challenge,
human antibody production is observed, which closely resembles that seen in humans in
nearly all respects, including gene rearrangement, assembly, and antibody repertoire. This
approach is described, e.g., in U.S. Patent Nos. 5,545,807; 5,545,806; 5,569,825; 5,625,126;
5,633,425; 5,661,016, and in the following scientific publications: Marks, et al. (1992)
Bio/Technology 10:779-783; Lonberg, et al. (1994) Nature 368:856-859; Morrison (1994)
Nature 368:812-13; Fishwild, et al. (1996) Nature Biotechnology 14:845-51; Neuberger
(1996) Nature Biotechnology 14:826; and Lonberg and Huszar (1995) Intern. Rev. Immunol.
13:65-93.

By immunotherapy is meant treatment of lung cancer with an antibody raised against
a lung cancer protein. As used herein, immunotherapy can be passive or active. Passive
immunotherapy as defined herein is the passive transfer of antibody to a recipient (patient).
Active immunization is the induction of antibody and/or T-cell responses in a recipient
(patient). Induction of an immune response is the result of providing the recipient with an
antigen to which antibodies are raised. The antigen may be provided by injecting a
polypeptide against which antibodies are desired to be raised into a recipient, or contacting
the recipient with a nucleic acid capable of expressing the antigen and under conditions for
expression of the antigen, leading to an immune response.

In a preferred embodiment the lung cancer proteins against which antibodies are
raised are secreted proteins as described above. Without being bound by theory, antibodies
used for treatment may bind the secreted protein and prevent it from binding to its receptor,
thereby inactivating the secreted lung cancer protein.

In another preferred embodiment, the lung cancer protein to which antibodies are
raised is a transmembrane protein. Without being bound by theory, antibodies used for
treatment may bind the extracellular domain of the lung cancer protein and prevent it from
binding to other proteins, such as circulating ligands or cell-associated molecules. The
antibody may cause down-regulation of the transmembrane lung cancer protein. The
antibody may be a competitive, non-competitive or uncompetitive inhibitor of protein binding
to the extracellular domain of the lung cancer protein. The antibody may be an antagonist of
the lung cancer protein or may prevent activation of a transmembrane lung cancer protein, or
may induce or suppress a particular cellular pathway. In some embodiments, when the
antibody prevents the binding of other molecules to the lung cancer protein, the antibody
prevents growth of the cell. The antibody may also be used to target or sensitize the cell to
cytotoxic agents, including, but not limited to TNF-α, TNF-β, IL-1, IFN-γ, and IL-2, or
chemotherapeutic agents including 5FU, vinblastine, actinomycin D, cisplatin, methotrexate,
and the like. In some instances the antibody may belong to a sub-type that activates serum
complement when complexed with the transmembrane protein thereby mediating cytotoxicity
or antibody-dependent cell-mediated cytotoxicity (ADCC). Thus, lung cancer may be treated by
administering to a patient antibodies directed against the transmembrane lung cancer protein.
Antibody-labeling may activate a co-toxin, localize a toxin payload, or otherwise provide
means to locally ablate cells.

In another preferred embodiment, the antibody is conjugated to an effector moiety.
The effector moiety can be various molecules, including labeling moieties such as radioactive
labels or fluorescent labels, or can be a therapeutic moiety. In one aspect the therapeutic
moiety is a small molecule that modulates the activity of a lung cancer protein. In another
aspect the therapeutic moiety may modulate an activity of molecules associated with or in
close proximity to a lung cancer protein. The therapeutic moiety may inhibit enzymatic or
signaling activity such as protease or collagenase activity associated with lung cancer.

In a preferred embodiment, the therapeutic moiety can also be a cytotoxic agent. In
this method, targeting the cytotoxic agent to lung cancer tissue or cells results in a reduction
in the number of afflicted cells, thereby reducing symptoms associated with lung cancer.
Cytotoxic agents are numerous and varied and include, but are not limited to, cytotoxic drugs
or toxins or active fragments of such toxins. Suitable toxins and their corresponding
fragments include diphtheria A chain, exotoxin A chain, ricin A chain, abrin A chain, curcin,
crotin, phenomycin, enomycin, saporin, auristatin, and the like. Cytotoxic agents also include
radiochemicals made by conjugating radioisotopes to antibodies raised against lung cancer
proteins, or binding of a radionuclide to a chelating agent that has been covalently attached to
the antibody. Targeting the therapeutic moiety to transmembrane lung cancer proteins not
only serves to increase the local concentration of therapeutic moiety in the lung cancer
afflicted area, but also serves to reduce deleterious side effects that may be associated with
the untargeted therapeutic moiety.

In another preferred embodiment, the lung cancer protein against which the antibodies
are raised is an intracellular protein. In this case, the antibody may be conjugated to a protein
or other entity which facilitates entry into the cell. In one case, the antibody enters the cell by
endocytosis. In another embodiment, a nucleic acid encoding the antibody is administered to
the individual or cell. Moreover, wherein the lung cancer protein can be targeted within a
cell, e.g., the nucleus, an antibody thereto may contain a signal for that target localization, e.g.,
a nuclear localization signal.

The lung cancer antibodies of the invention specifically bind to lung cancer proteins.
By "specifically bind" herein is meant that the antibodies bind to the protein with a Kd of at
least about 0.1 mM, more usually at least about 1 μM, preferably at least about 0.1 μM or
better, and most preferably, 0.01 μM or better. Selectivity of binding to the specific target
and not to related other sequences is also important.


Detection of lung cancer sequence for diagnostic and therapeutic applications 

In one aspect, the RNA expression levels of genes are determined for different
cellular states in the lung cancer phenotype. Expression levels of genes in normal tissue (e.g.,
not undergoing lung cancer), in lung cancer tissue (and in some cases, for varying severities
of lung cancer that relate to prognosis, as outlined below), or in non-malignant disease are
evaluated to provide expression profiles. A gene expression profile of a particular cell state
or point of development is essentially a "fingerprint" of the state of the cell. While two states
may have a particular gene similarly expressed, the evaluation of a number of genes
simultaneously allows the generation of a gene expression profile that is reflective of the state
of the cell. By comparing expression profiles of cells in different states, information
regarding which genes are important (including both up- and down-regulation of genes) in
each of these states is obtained. Then, diagnosis may be performed or confirmed to
determine whether a tissue sample has the gene expression profile of normal or cancerous
tissue. This will provide for molecular diagnosis of related conditions.

"Differential expression," or grammatical equivalents as used herein, refers to
qualitative or quantitative differences in the temporal and/or cellular gene expression
patterns within and among cells and tissue. Thus, a differentially expressed gene can
qualitatively have its expression altered, including an activation or inactivation, in, e.g.,
normal versus lung cancer tissue. Genes may be turned on or turned off in a particular state,
relative to another state thus permitting comparison of two or more states. A qualitatively
regulated gene will exhibit an expression pattern within a state or cell type which is
detectable by standard techniques. Some genes will be expressed in one state or cell type, but
not in both. Alternatively, the difference in expression may be quantitative, e.g., in that
expression is increased or decreased; i.e., gene expression is either upregulated, resulting in
an increased amount of transcript, or downregulated, resulting in a decreased amount of
transcript. The degree to which expression differs need only be large enough to quantify via
standard characterization techniques as outlined below, such as by use of Affymetrix
GeneChip™ expression arrays, Lockhart (1996) Nature Biotechnology 14:1675-1680, hereby
expressly incorporated by reference. Other techniques include, but are not limited to,
quantitative reverse transcriptase PCR, northern analysis and RNase protection. As outlined
above, preferably the change in expression (i.e., upregulation or downregulation) is typically
at least about 50%, more preferably at least about 100%, more preferably at least about
150%, more preferably at least about 200%, with from 300 to at least 1000% being especially
preferred.
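
As a non-limiting illustration, the percentage thresholds recited above can be applied to a pair of measured expression levels. The Python sketch below is an example only. The specification does not state how the percentage change is computed; the sketch assumes a symmetric definition (larger level divided by smaller level, minus one, times 100) so that up- and downregulation of the same magnitude give the same value. The function names percent_change and preference_tier are hypothetical.

```python
# Illustrative sketch: map a change in expression onto the recited tiers.
# Assumption (not from the specification): the percentage change is computed
# symmetrically as (larger / smaller - 1) * 100, so a 4-fold increase and a
# 4-fold decrease both count as a 300% change.

THRESHOLDS = [
    (300.0, "especially preferred"),
    (200.0, "at least about 200%"),
    (150.0, "at least about 150%"),
    (100.0, "at least about 100%"),
    (50.0, "at least about 50%"),
]


def percent_change(reference: float, sample: float) -> float:
    """Symmetric percentage change between two positive expression levels."""
    if reference <= 0 or sample <= 0:
        raise ValueError("expression levels must be positive")
    high, low = max(reference, sample), min(reference, sample)
    return (high / low - 1.0) * 100.0


def preference_tier(change_pct: float) -> str:
    """Return the highest recited tier that the change satisfies."""
    for threshold, label in THRESHOLDS:
        if change_pct >= threshold:
            return label
    return "below 50%"
```

Under this assumed definition, expression levels of 100 and 400 (in arbitrary units) differ by 300% regardless of which is the reference, which places the gene in the especially preferred tier.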

Evaluation may be at the gene transcript or the protein level. The amount of gene
expression may be monitored using nucleic acid probes to the RNA or DNA equivalent of the
gene transcript, and the quantification of gene expression levels, or, alternatively, the final
gene product itself (protein) can be monitored, e.g., with antibodies to the lung cancer protein
and standard immunoassays (ELISAs, etc.) or other techniques, including mass spectroscopy
assays, 2D gel electrophoresis assays, etc. Proteins corresponding to lung cancer genes, e.g.,
those identified as being important in a lung cancer or disease phenotype, can be evaluated in
a lung cancer diagnostic test. In a preferred embodiment, gene expression monitoring is
performed simultaneously on a number of genes.

The lung cancer nucleic acid probes may be attached to biochips as outlined herein for
the detection and quantification of lung cancer sequences in a particular cell. The assays are
further described below in the example. PCR techniques can be used to provide greater
sensitivity. Multiple protein expression monitoring can be performed as well. Similarly,
these assays may be performed on an individual basis as well.

In a preferred embodiment nucleic acids encoding the lung cancer protein are
detected. Although DNA or RNA encoding the lung cancer protein may be detected, of
particular interest are methods wherein an mRNA encoding a lung cancer protein is detected.
Probes to detect mRNA can be a nucleotide/deoxynucleotide probe that is complementary to
and hybridizes with the mRNA and includes, but is not limited to, oligonucleotides, cDNA or
RNA. Probes also should contain a detectable label, as defined herein. In one method the
mRNA is detected after immobilizing the nucleic acid to be examined on a solid support such
as nylon membranes and hybridizing the probe with the sample. Following washing to
remove the non-specifically bound probe, the label is detected. In another method detection
of the mRNA is performed in situ. In this method permeabilized cells or tissue samples are
contacted with a detectably labeled nucleic acid probe for sufficient time to allow the probe
to hybridize with the target mRNA. Following washing to remove the non-specifically bound
probe, the label is detected. For example, a digoxygenin labeled riboprobe (RNA probe) that
is complementary to the mRNA encoding a lung cancer protein is detected by binding the
digoxygenin with an anti-digoxygenin secondary antibody and developed with nitro blue
tetrazolium and 5-bromo-4-chloro-3-indolyl phosphate.

In a preferred embodiment, various proteins from the three classes of proteins as
described herein (secreted, transmembrane or intracellular proteins) are used in diagnostic
assays. The lung cancer proteins, antibodies, nucleic acids, modified proteins and cells
containing lung cancer sequences are used in diagnostic assays. This can be performed on an
individual gene or corresponding polypeptide level. In a preferred embodiment, the
expression profiles are used, preferably in conjunction with high throughput screening
techniques to allow monitoring for expression profile genes and/or corresponding
polypeptides.

As described and defined herein, lung cancer proteins, including intracellular,
transmembrane, or secreted proteins, find use as markers of lung cancer, e.g., for prognostic
or diagnostic purposes. Detection of these proteins in putative lung cancer tissue allows for
detection, prognosis, or diagnosis of lung cancer or similar disease, and perhaps for selection
of therapeutic strategy. In one embodiment, antibodies are used to detect lung cancer
proteins. A preferred method separates proteins from a sample by electrophoresis on a gel
(typically a denaturing and reducing protein gel, but may be another type of gel, including
isoelectric focusing gels and the like). Following separation of proteins, the lung cancer
protein is detected, e.g., by immunoblotting with antibodies raised against the lung cancer
protein. Methods of immunoblotting are well known to those of ordinary skill in the art.

In another preferred method, antibodies to the lung cancer protein find use in in situ
imaging techniques, e.g., in histology (e.g., Asai (ed. 1993) Methods in Cell Biology:
Antibodies in Cell Biology, volume 37). In this method cells are contacted with from one to
many antibodies to the lung cancer protein(s). Following washing to remove non-specific
antibody binding, the presence of the antibody or antibodies is detected. In one embodiment
the antibody is detected by incubating with a secondary antibody that contains a detectable
label, e.g., multicolor fluorescence or confocal imaging. In another method the primary
antibody to the lung cancer protein(s) contains a detectable label, e.g., an enzyme marker that
can act on a substrate. In another preferred embodiment each one of multiple primary
antibodies contains a distinct and detectable label. This method finds particular use in
simultaneous screening for a plurality of lung cancer proteins. Many other histological
imaging techniques are also provided by the invention.

In a preferred embodiment the label is detected in a fluorometer which has the ability
to detect and distinguish emissions of different wavelengths. In addition, a fluorescence
activated cell sorter (FACS) can be used in the method.

In another preferred embodiment, antibodies find use in diagnosing lung cancer from
blood, serum, plasma, stool, and other samples. Such samples, therefore, are useful as
samples to be probed or tested for the presence of lung cancer proteins. Antibodies can be
used to detect a lung cancer protein by previously described immunoassay techniques
including ELISA, immunoblotting (western blotting), immunoprecipitation, BIACORE
technology and the like. Conversely, the presence of antibodies may indicate an immune
response against an endogenous lung cancer protein or vaccine.

In a preferred embodiment, in situ hybridization of labeled lung cancer nucleic acid
probes to tissue arrays is done. For example, arrays of tissue samples, including lung cancer
tissue and/or normal tissue, are made. In situ hybridization (see, e.g., Ausubel, supra) is then
performed. When comparing the fingerprints between an individual and a standard, the
skilled artisan can make a diagnosis, a prognosis, or a prediction based on the findings. It is
further understood that the genes which indicate the diagnosis may differ from those which
indicate the prognosis and molecular profiling of the condition of the cells may lead to
distinctions between responsive or refractory conditions or may be predictive of outcomes.

In a preferred embodiment, the lung cancer proteins, antibodies, nucleic acids,
modified proteins and cells containing lung cancer sequences are used in prognosis assays.
As above, gene expression profiles can be generated that correlate to lung cancer, clinical,
pathological, or other information, in terms of long term prognosis. Again, this may be done
on either a protein or gene level, with the use of genes being preferred. Single or multiple
genes may be useful in various combinations. As above, lung cancer probes may be attached
to biochips for the detection and quantification of lung cancer sequences in a tissue or patient.
The assays proceed as outlined above for diagnosis. PCR methods may provide more
sensitive and accurate quantification.


Assays for therapeutic compounds 

In a preferred embodiment, the proteins, nucleic acids, and antibodies as described
herein are used in drug screening assays. The lung cancer proteins, antibodies, nucleic acids,
modified proteins and cells containing lung cancer sequences are used in drug screening
assays or by evaluating the effect of drug candidates on a "gene expression profile" or
expression profile of polypeptides. In a preferred embodiment, the expression profiles are
used, preferably in conjunction with high throughput screening techniques to allow
monitoring for expression profile genes after treatment with a candidate agent (e.g.,
Zlokarnik, et al. (1998) Science 279:84-8; Heid (1996) Genome Res. 6:986-94).

In a preferred embodiment, the lung cancer proteins, antibodies, nucleic acids,
modified proteins and cells containing the native or modified lung cancer proteins are used in
screening assays. That is, the present invention provides novel methods for screening for
compositions which modulate the lung cancer phenotype or an identified physiological
function of a lung cancer protein. As above, this can be done on an individual gene level or
by evaluating the effect of drug candidates on a "gene expression profile". In a preferred
embodiment, the expression profiles are used, preferably in conjunction with high throughput
screening techniques to allow monitoring for expression profile genes after treatment with a
candidate agent, see Zlokarnik, supra.

Having identified differentially expressed genes herein, a variety of assays may be
performed. In a preferred embodiment, assays may be run on an individual gene or protein
level. That is, having identified a particular gene with altered regulation in lung cancer, test
compounds can be screened for the ability to modulate gene expression or for binding to the
lung cancer protein. "Modulation" thus includes an increase or a decrease in gene
expression. The preferred amount of modulation will depend on the original change of the
gene expression in normal tissue versus tissue undergoing lung cancer, with changes of at least
10%, preferably 50%, more preferably 100-300%, and in some embodiments 300-1000% or
greater. Thus, if a gene exhibits a 4-fold increase in lung cancer tissue compared to normal
tissue, a decrease of about four-fold is often desired; similarly, a 10-fold decrease in lung
cancer tissue compared to normal tissue often provides a target value of a 10-fold increase in
expression to be induced by the test compound.
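
The relationship between the observed change and the desired modulation described above can be illustrated with a short calculation. The Python sketch below is an example only; the function name target_modulation and its return format are hypothetical, and it assumes that the desired effect of a test compound is simply the inverse of the fold change observed between normal and lung cancer tissue.

```python
# Illustrative sketch: derive the desired direction and fold of modulation
# from expression levels measured in normal and lung cancer tissue.
# Assumption (not from the specification): the target is the exact inverse
# of the observed fold change.

def target_modulation(normal_level: float, cancer_level: float) -> tuple[str, float]:
    """Return (direction, fold) that a test compound would need to induce."""
    if normal_level <= 0 or cancer_level <= 0:
        raise ValueError("expression levels must be positive")
    if cancer_level > normal_level:
        # Gene is upregulated in cancer, so a decrease is sought.
        return ("decrease", cancer_level / normal_level)
    if cancer_level < normal_level:
        # Gene is downregulated in cancer, so an increase is sought.
        return ("increase", normal_level / cancer_level)
    return ("none", 1.0)
```

For example, a gene measured at 1.0 in normal tissue and 4.0 in lung cancer tissue yields a target of a four-fold decrease, while a gene measured at 10.0 in normal tissue and 1.0 in lung cancer tissue yields a target of a ten-fold increase.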

The amount of gene expression may be monitored using nucleic acid probes and the
quantification of gene expression levels, or, alternatively, the gene product itself can be
monitored, e.g., through the use of antibodies to the lung cancer protein and standard
immunoassays. Proteomics and separation techniques may also allow quantification of
expression.

In a preferred embodiment, the gene or protein expression of a number of
entities, i.e., an expression profile, is monitored simultaneously. Such profiles will typically
involve a plurality of those entities described herein.

In this embodiment, the lung cancer nucleic acid probes are attached to biochips as
outlined herein for the detection and quantification of lung cancer sequences in a particular
cell. Alternatively, PCR may be used. Thus, a series, e.g., of microtiter plates, may be used
with dispensed primers in desired wells. A PCR reaction can then be performed and analyzed
for each well.

Expression monitoring can be performed to identify compounds that modify the
expression of one or more lung cancer-associated sequences, e.g., a polynucleotide sequence
set out in the tables. Generally, in a preferred embodiment, a test compound is added to the
cells prior to analysis. Moreover, screens are also provided to identify agents that modulate
lung cancer, modulate lung cancer proteins, bind to a lung cancer protein, or interfere with
the binding of a lung cancer protein and an antibody, substrate, or other binding partner.

The term "test compound" or "drug candidate" or "modulator" or grammatical
equivalents as used herein describes a molecule, e.g., protein, oligopeptide, small organic
molecule, polysaccharide, polynucleotide, etc., to be tested for the capacity to directly or
indirectly alter the lung cancer phenotype or the expression of a lung cancer sequence, e.g., a
nucleic acid or protein sequence. In preferred embodiments, modulators alter expression
profiles of nucleic acids or proteins provided herein. In one embodiment, the modulator
suppresses a lung cancer phenotype, e.g., to a normal or non-malignant tissue fingerprint. In
another embodiment, a modulator induces a lung cancer phenotype. Generally, a plurality of
assay mixtures are run in parallel with different agent concentrations to obtain a differential
response to the various concentrations. Typically, one of these concentrations serves as a
negative control, i.e., at zero concentration or below the level of detection.
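
By way of example only, a set of agent concentrations for parallel assay mixtures, including a zero-concentration negative control, might be laid out as follows. The Python sketch below assumes a serial dilution scheme, which the specification does not prescribe; the function name concentration_series and its parameters are hypothetical, and the concentration units are arbitrary.

```python
# Illustrative sketch: build a concentration series for parallel assay
# mixtures, with a zero-concentration negative control.
# Assumption (not from the specification): concentrations are generated by
# serial dilution from a top concentration by a fixed dilution factor.

def concentration_series(top: float, dilution_factor: float, n_nonzero: int) -> list[float]:
    """Return concentrations in ascending order, starting with the 0.0 control."""
    if top <= 0 or dilution_factor <= 1 or n_nonzero < 1:
        raise ValueError("need top > 0, dilution_factor > 1, n_nonzero >= 1")
    nonzero = [top / dilution_factor ** i for i in range(n_nonzero)]
    return [0.0] + sorted(nonzero)
```

A call such as concentration_series(100.0, 10.0, 3) gives four assay mixtures: the negative control and three ten-fold dilution steps up to the top concentration.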

In one aspect, a modulator will neutralize the effect of a lung cancer protein. By
"neutralize" is meant that activity of a protein and the consequent effect on the cell is
inhibited or blocked.

In certain embodiments, combinatorial libraries of potential modulators will be
screened for an ability to bind to a lung cancer polypeptide or to modulate activity.
Conventionally, new chemical entities with useful properties are generated by identifying a
chemical compound (called a "lead compound") with some desirable property or activity,
e.g., inhibiting activity, creating variants of the lead compound, and evaluating the property
and activity of those variant compounds. Often, high throughput screening (HTS) methods
are employed for such an analysis.

In one preferred embodiment, high throughput screening methods involve providing a
library containing a large number of potential therapeutic compounds (candidate
compounds). Such "combinatorial chemical libraries" are then screened in one or more
assays to identify those library members (particular chemical species or subclasses) that
display a desired characteristic activity. The compounds thus identified can serve as
conventional "lead compounds" or can themselves be used as potential or actual therapeutics.

A combinatorial chemical library is a collection of diverse chemical compounds 
generated by either chemical synthesis or biological synthesis by combining a number of 
chemical "building blocks" such as reagents. For example, a linear combinatorial chemical
library, such as a polypeptide (e.g., mutein) library, is formed by combining a set of chemical
building blocks called amino acids in every possible way for a given compound length (i.e.,
the number of amino acids in a polypeptide compound). Millions of chemical compounds 
can be synthesized through such combinatorial mixing of chemical building blocks (Gallop, 
et al. (1994) J. Med. Chem. 37(9): 1233-1251). 
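
By way of illustration only, the size of such a linear library grows as the number of
building blocks raised to the power of the compound length. The sketch below assumes the
20 standard amino acids as building blocks; the function names are hypothetical.

```python
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # one-letter codes of the 20 standard amino acids


def library_size(length, alphabet=AMINO_ACIDS):
    """Number of distinct linear sequences of the given length."""
    return len(alphabet) ** length


def enumerate_library(length, alphabet=AMINO_ACIDS):
    """Lazily yield every sequence of the given length (practical only for short lengths)."""
    return ("".join(p) for p in product(alphabet, repeat=length))


dipeptides = list(enumerate_library(2))
pentapeptide_count = library_size(5)
```

Under these assumptions there are 400 dipeptides and 3,200,000 pentapeptides, consistent
with the statement that millions of compounds can be obtained by combinatorial mixing.
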

Preparation and screening of combinatorial chemical libraries is well known to those
of skill in the art. Such combinatorial chemical libraries include, but are not limited to,
peptide libraries (see, e.g., U.S. Patent No. 5,010,175, Furka (1991) Pept. Prot. Res.
37:487-493, Houghton, et al. (1991) Nature 354:84-88), peptoids (PCT Publication No. WO
91/19735), encoded peptides (PCT Publication WO 93/20242), random bio-oligomers (PCT
Publication WO 92/00091), benzodiazepines (U.S. Pat. No. 5,288,514), diversomers such as
hydantoins, benzodiazepines and dipeptides (Hobbs, et al. (1993) Proc. Nat. Acad. Sci. USA
90:6909-6913), vinylogous polypeptides (Hagihara, et al. (1992) J. Amer. Chem. Soc.
114:6568), nonpeptidal peptidomimetics with a Beta-D-Glucose scaffolding (Hirschmann, et
al. (1992) J. Amer. Chem. Soc. 114:9217-9218), analogous organic syntheses of small
compound libraries (Chen, et al. (1994) J. Amer. Chem. Soc. 116:2661), oligocarbamates
(Cho, et al. (1993) Science 261:1303), and/or peptidyl phosphonates (Campbell, et al. (1994)
J. Org. Chem. 59:658). See, generally, Gordon, et al. (1994) J. Med. Chem. 37:1385. Further
examples include nucleic acid libraries (see, e.g., Stratagene, Corp.), peptide nucleic acid
libraries (see, e.g., U.S. Patent 5,539,083), antibody libraries (see, e.g., Vaughn, et al.
(1996) Nature Biotechnology 14(3):309-314, and PCT/US96/10287), carbohydrate libraries
(see, e.g., Liang, et al. (1996) Science 274:1520-1522, and U.S. Patent No. 5,593,853), and
small organic molecule libraries (see, e.g., benzodiazepines, Baum (1993) C&EN, Jan 18,
page 33; isoprenoids, U.S. Patent No. 5,569,588; thiazolidinones and metathiazanones, U.S.
Patent No. 5,549,974; pyrrolidines, U.S. Patent Nos. 5,525,735 and 5,519,134; morpholino
compounds, U.S. Patent No. 5,506,337; benzodiazepines, U.S. Patent No. 5,288,514; and the
like).

Devices for the preparation of combinatorial libraries are commercially available (see,
e.g., 357 MPS, 390 MPS, Advanced Chem Tech, Louisville, KY; Symphony, Rainin,
Woburn, MA; 433A, Applied Biosystems, Foster City, CA; 9050 Plus, Millipore, Bedford,
MA).

A number of well known robotic systems have also been developed for solution phase
chemistries. These systems include automated workstations like the automated synthesis
apparatus developed by Takeda Chemical Industries, LTD. (Osaka, Japan) and many robotic
systems utilizing robotic arms (Zymate II, Zymark Corporation, Hopkinton, Mass.; Orca,
Hewlett-Packard, Palo Alto, Calif.), which mimic the manual synthetic operations performed
by a chemist. The above devices, with appropriate modification, are suitable for use with the
present invention. In addition, numerous combinatorial libraries are themselves
commercially available (see, e.g., ComGenex, Princeton, N.J.; Asinex, Moscow, RU; Tripos,
Inc., St. Louis, MO; ChemStar, Ltd, Moscow, RU; 3D Pharmaceuticals, Exton, PA; Martek
Biosciences, Columbia, MD, etc.).

The assays to identify modulators are amenable to high throughput screening. 
Preferred assays thus detect modulation of lung cancer gene transcription, polypeptide 
expression, and polypeptide activity. 

High throughput assays for evaluating the presence, absence, quantification, or other
properties of particular nucleic acids or protein products are well known to those of skill in
the art. Binding assays and reporter gene assays are similarly well known. Thus,
e.g., U.S. Patent No. 5,559,410 discloses high throughput screening methods for proteins,
U.S. Patent No. 5,585,639 discloses high throughput screening methods for nucleic acid
binding (i.e., in arrays), while U.S. Patent Nos. 5,576,220 and 5,541,061 disclose high
throughput methods of screening for ligand/antibody binding.

In addition, high throughput screening systems are commercially available (see, e.g.,
Zymark Corp., Hopkinton, MA; Air Technical Industries, Mentor, OH; Beckman
Instruments, Inc., Fullerton, CA; Precision Systems, Inc., Natick, MA, etc.). These systems
typically automate procedures, including sample and reagent pipetting, liquid dispensing,
timed incubations, and final readings of the microplate in detector(s) appropriate for the
assay. These configurable systems provide high throughput and rapid start up as well as a
high degree of flexibility and customization. The manufacturers of such systems provide
detailed protocols for various high throughput systems. Thus, e.g., Zymark Corp. provides
technical bulletins describing screening systems for detecting the modulation of gene
transcription, ligand binding, and the like.

In one embodiment, modulators are proteins, often naturally occurring proteins or
fragments of naturally occurring proteins. Thus, e.g., cellular extracts containing proteins, or
random or directed digests of proteinaceous cellular extracts, may be used. In this way
libraries of proteins may be made for screening in the methods of the invention. Particularly
preferred in this embodiment are libraries of bacterial, fungal, viral, and mammalian proteins,
with the latter being preferred, and human proteins being especially preferred. Particularly
useful test compounds will be directed to the class of proteins to which the target belongs,
e.g., substrates for enzymes or ligands and receptors.

In a preferred embodiment, modulators are peptides of from about 5 to about 30
amino acids, with from about 5 to about 20 amino acids being preferred, and from about 7 to
about 15 being particularly preferred. The peptides may be digests of naturally occurring
proteins, random peptides, or "biased" random peptides. By "randomized" or grammatical
equivalents herein is meant that the nucleic acid or peptide consists of essentially random
sequences of nucleotides and amino acids, respectively. Since these random peptides (or
nucleic acids, discussed below) are often chemically synthesized, they may incorporate a
nucleotide or amino acid at any position. The synthetic process can be designed to generate
randomized proteins or nucleic acids, to allow the formation of all or most of the possible
combinations over the length of the sequence, thus forming a library of randomized candidate
bioactive proteinaceous agents.

In one embodiment, the library is fully randomized, with no sequence preferences or
constants at any position. In a preferred embodiment, the library is biased. That is, some
positions within the sequence are either held constant, or are selected from a limited number
of possibilities. In a preferred embodiment, the nucleotides or amino acid residues are
randomized within a defined class, e.g., of hydrophobic amino acids, hydrophilic residues,
sterically biased (either small or large) residues, towards the creation of nucleic acid binding
domains, the creation of cysteines for cross-linking, prolines for SH-3 domains, serines,
threonines, tyrosines or histidines for phosphorylation sites, etc.
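
By way of illustration only, a biased library can be described by a per-position template in
which each position is held constant, restricted to a class of residues, or left fully random.
The following is a minimal sketch; the template, the residue classes chosen, and the function
name are hypothetical, and class definitions vary between sources.

```python
import random

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids
HYDROPHOBIC = "AVLIMFW"               # an illustrative hydrophobic class (assumption)


def biased_peptide(template, rng):
    """Draw one peptide; template[i] lists the residues allowed at position i."""
    return "".join(rng.choice(allowed) for allowed in template)


# Position 1 held constant (cysteine), positions 2-3 hydrophobic,
# position 4 limited to S/T/Y, positions 5-6 fully randomized.
template = ["C", HYDROPHOBIC, HYDROPHOBIC, "STY", AMINO_ACIDS, AMINO_ACIDS]

rng = random.Random(0)  # seeded so that the sketch is reproducible
library = [biased_peptide(template, rng) for _ in range(1000)]
```

A fully randomized library corresponds to a template in which every position allows all
residues.
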

Modulators of lung cancer can also be nucleic acids, as defined above.

As described above generally for proteins, nucleic acid modulating agents may be
naturally occurring nucleic acids, random nucleic acids, or "biased" random nucleic acids.
Digests of procaryotic or eucaryotic genomes may be used as is outlined above for proteins.

In a preferred embodiment, the candidate compounds are organic chemical moieties, a 
wide variety of which are available in the literature. 

After a candidate agent has been added and the cells allowed to incubate for some
period of time, the sample containing a target sequence is analyzed. If required, the target
sequence is prepared using known techniques. For example, the sample may be treated to
lyse the cells, using known lysis buffers, electroporation, etc., with purification and/or
amplification such as PCR performed as appropriate. For example, an in vitro transcription
with labels covalently attached to the nucleotides is performed. Generally, the nucleic acids
are labeled with biotin-FITC or PE, or with Cy3 or Cy5.

In a preferred embodiment, the target sequence is labeled with, e.g., a fluorescent, a
chemiluminescent, a chemical, or a radioactive signal, to provide a means of detecting the
target sequence's specific binding to a probe. The label also can be an enzyme, such as
alkaline phosphatase or horseradish peroxidase, which when provided with an appropriate
substrate produces a product that can be detected. Alternatively, the label can be a labeled
compound or small molecule, such as an enzyme inhibitor, that binds but is not catalyzed or
altered by the enzyme. The label also can be a moiety or compound, such as an epitope tag
or biotin, which specifically binds to streptavidin. For the example of biotin, the streptavidin
is labeled as described above, thereby providing a detectable signal for the bound target
sequence. Unbound labeled streptavidin is typically removed prior to analysis.

Nucleic acid assays can be direct hybridization assays or can comprise "sandwich
assays", which include the use of multiple probes, as is generally outlined in U.S. Patent Nos.
5,681,702, 5,597,909, 5,545,730, 5,594,117, 5,591,584, 5,571,670, 5,580,731, 5,624,802,
5,635,352, 5,594,118, 5,359,100, 5,124,246 and 5,681,697, all of which are hereby
incorporated by reference. In this embodiment, in general, the target nucleic acid is prepared
as outlined above, and then added to the biochip comprising a plurality of nucleic acid
probes, under conditions that allow the formation of a hybridization complex.

A variety of hybridization conditions may be used in the present invention, including
high, moderate and low stringency conditions as outlined above. The assays are generally
run under stringency conditions which allow formation of the label probe hybridization
complex only in the presence of target. Stringency can be controlled by altering a step
parameter that is a thermodynamic variable, including, but not limited to, temperature,
formamide concentration, salt concentration, chaotropic salt concentration, pH, organic
solvent concentration, etc.

These parameters may also be used to control non-specific binding, as is generally
outlined in U.S. Patent No. 5,681,697. Thus it may be desirable to perform certain steps at
higher stringency conditions to reduce non-specific binding.

The reactions outlined herein may be accomplished in a variety of ways. Components
of the reaction may be added simultaneously, or sequentially, in different orders, with
preferred embodiments outlined below. In addition, the reaction may include a variety of
other reagents. These include salts, buffers, neutral proteins, e.g., albumin, detergents, etc.,
which may be used to facilitate optimal hybridization and detection, and/or reduce
non-specific or background interactions. Reagents that otherwise improve the efficiency of the
assay, such as protease inhibitors, nuclease inhibitors, anti-microbial agents, etc., may also be
used as appropriate, depending on the sample preparation methods and purity of the target.

The assay data are analyzed to determine the expression levels, and changes in
expression levels as between states, of individual genes, forming a gene expression profile.
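
By way of illustration only, changes in expression levels as between two states can be
summarized per gene as a log2 ratio. This is a minimal sketch rather than the analysis used
herein; the gene names, the signal values, and the pseudocount (added to avoid division by
zero) are assumptions.

```python
import math


def expression_profile(state_a, state_b, pseudocount=1.0):
    """Return {gene: log2 ratio of state_b to state_a} for genes measured in both states."""
    profile = {}
    for gene in sorted(set(state_a) & set(state_b)):
        ratio = (state_b[gene] + pseudocount) / (state_a[gene] + pseudocount)
        profile[gene] = math.log2(ratio)
    return profile


# Hypothetical hybridization signals (arbitrary units) for three genes.
normal = {"GENE_A": 99.0, "GENE_B": 399.0, "GENE_C": 49.0}
tumor = {"GENE_A": 399.0, "GENE_B": 99.0, "GENE_C": 49.0}

profile = expression_profile(normal, tumor)
```

In this sketch a positive value indicates higher expression in the second state, a negative
value lower expression, and zero no change.
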

Screens are performed to identify modulators of the lung cancer phenotype. In one
embodiment, screening is performed to identify modulators that can induce or suppress a
particular expression profile, thus preferably generating the associated phenotype. In another
embodiment, e.g., for diagnostic applications, having identified differentially expressed genes
important in a particular state, screens can be performed to identify modulators that alter
expression of individual genes. In another embodiment, screening is performed to identify
modulators that alter a biological function of the expression product of a differentially
expressed gene. Again, having identified the importance of a gene in a particular state,
screens are performed to identify agents that bind and/or modulate the biological activity of
the gene product, or evaluate genetic polymorphisms.

Genes can be screened for those that are induced in response to a candidate agent.
After identifying a modulator based upon its ability to suppress a lung cancer expression
pattern leading to a normal expression pattern, or to modulate a single lung cancer gene
expression profile so as to mimic the expression of the gene from normal tissue, a screen as
described above can be performed to identify genes that are specifically modulated in
response to the agent. Comparing expression profiles between normal tissue and
agent-treated lung cancer tissue reveals genes that are not expressed in normal tissue or lung
cancer tissue, but are expressed in agent-treated tissue. These agent-specific sequences can be
identified and used by methods described herein for lung cancer genes or proteins. In
particular, these sequences and the proteins they encode find use in marking or identifying
agent-treated cells. In addition, antibodies can be raised against the agent-induced proteins
and used to target novel therapeutics to the treated lung cancer tissue sample.
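
By way of illustration only, agent-specific sequences can be found by a set comparison of
the three expression profiles. The sketch below is hypothetical: the gene names, the
expression values, and the detection threshold used to call a gene "expressed" are
assumptions.

```python
def agent_specific_genes(normal, cancer, treated, threshold=1.0):
    """Genes expressed in agent-treated tissue but in neither normal nor untreated cancer tissue."""

    def expressed(profile):
        # A gene counts as expressed when its level meets the detection threshold.
        return {gene for gene, level in profile.items() if level >= threshold}

    return expressed(treated) - expressed(normal) - expressed(cancer)


# Hypothetical expression levels (arbitrary units).
normal = {"G1": 5.0, "G2": 0.0, "G3": 0.0}
cancer = {"G1": 0.2, "G2": 8.0, "G3": 0.0}
treated = {"G1": 4.0, "G2": 1.5, "G3": 6.0}

specific = agent_specific_genes(normal, cancer, treated)
```

Here only G3 is reported, since G1 is expressed in normal tissue and G2 in untreated lung
cancer tissue.
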

Thus, in one embodiment, a test compound is administered to a population of lung
cancer cells that have an associated lung cancer expression profile. By "administration" or
"contacting" herein is meant that the candidate agent is added to the cells in such a manner as
to allow the agent to act upon the cell, whether by uptake and intracellular action, or by
action at the cell surface. In some embodiments, nucleic acid encoding a proteinaceous
candidate agent (i.e., a peptide) may be put into a viral construct such as an adenoviral or
retroviral construct, and added to the cell, such that expression of the peptide agent is
accomplished, e.g., PCT US97/01019. Regulatable gene therapy systems can also be used.

Once a test compound has been administered to the cells, the cells can be washed if
desired and are allowed to incubate under preferably physiological conditions for some
period of time. The cells are then harvested and a new gene expression profile is generated,
as outlined herein.

Thus, e.g., lung cancer or non-malignant tissue may be screened for agents that
modulate, e.g., induce or suppress, a lung cancer phenotype. A change in at least one gene,
preferably many, of the expression profile indicates that the agent has an effect on lung
cancer activity. By defining such a signature for the lung cancer phenotype, screens for new
drugs that alter the phenotype can be devised. With this approach, the drug target need not be
known and need not be represented in the original expression screening platform, nor does
the level of transcript for the target protein need to change.

Measurement of lung cancer polypeptide activity, or of lung cancer or the lung cancer
phenotype, can be performed using a variety of assays. For example, the effects of the test
compounds upon the function of the metastatic polypeptides can be measured by examining
parameters described above. A suitable physiological change that affects activity can be used
to assess the influence of a test compound on the polypeptides of this invention. When the
functional consequences are determined using intact cells or animals, one can also measure a
variety of effects such as, in the case of lung cancer associated with tumors, tumor growth,
tumor metastasis, neovascularization, hormone release, transcriptional changes to both known
and uncharacterized genetic markers (e.g., northern blots), changes in cell metabolism such as
cell growth or pH changes, and changes in intracellular second messengers such as cGMP. In
the assays of the invention, mammalian lung cancer polypeptide is typically used, e.g.,
mouse, preferably human.

Assays to identify compounds with modulating activity can be performed in vitro.
For example, a lung cancer polypeptide is first contacted with a potential modulator and
incubated for a suitable amount of time, e.g., from 0.5 to 48 hours. In one embodiment, the
lung cancer polypeptide levels are determined in vitro by measuring the level of protein or
mRNA. The level of protein is typically measured using immunoassays such as western
blotting, ELISA and the like with an antibody that selectively binds to the lung cancer
polypeptide or a fragment thereof. For measurement of mRNA, amplification, e.g., using
PCR or LCR, or hybridization assays, e.g., northern hybridization, RNAse protection, or dot
blotting, are preferred. The level of protein or mRNA is typically detected using directly or
indirectly labeled detection agents, e.g., fluorescently or radioactively labeled nucleic acids,
radioactively or enzymatically labeled antibodies, and the like, as described herein.

Alternatively, a reporter gene system can be devised using a lung cancer protein
promoter operably linked to a reporter gene such as luciferase, green fluorescent protein,
CAT, or β-gal. The reporter construct is typically transfected into a cell. After treatment
with a potential modulator, the amount of reporter gene transcription, translation, or activity
is measured according to standard techniques known to those of skill in the art.

In a preferred embodiment, as outlined above, screens may be done on individual
genes and gene products (proteins). That is, having identified a particular differentially
expressed gene as important in a particular state, screening of modulators of the expression of
the gene or the gene product itself can be done. The gene products of differentially expressed
genes are sometimes referred to herein as "lung cancer proteins." The lung cancer protein
may be a fragment, or alternatively, be the full-length protein corresponding to a fragment
shown herein.

In one embodiment, screening for modulators of expression of specific genes is
performed. Typically, the expression of only one or a few genes is evaluated. In another
embodiment, screens are designed to first find compounds that bind to differentially
expressed proteins. These compounds are then evaluated for the ability to modulate
differentially expressed activity. Moreover, once initial candidate compounds are identified,
variants can be further screened to better evaluate structure-activity relationships.

In a preferred embodiment, binding assays are done. In general, purified or isolated
gene product is used; that is, the gene products of one or more differentially expressed
nucleic acids are made. For example, antibodies are generated to the protein gene products,
and standard immunoassays are run to determine the amount of protein present. Alternatively,
cells comprising the lung cancer proteins can be used in the assays.

Thus, in a preferred embodiment, the methods comprise combining a lung cancer
protein and a candidate compound, and determining the binding of the compound to the lung
cancer protein. Preferred embodiments utilize the human lung cancer protein, although other
mammalian proteins may also be used, e.g., for the development of animal models of human
disease. In some embodiments, as outlined herein, variant or derivative lung cancer proteins
may be used.

Generally, in a preferred embodiment of the methods herein, the lung cancer protein
or the candidate agent is non-diffusably bound to an insoluble support, preferably having
isolated sample receiving areas (e.g., a microtiter plate, an array, etc.). The insoluble
supports may be made of any composition to which the compositions can be bound, that is
readily separated from soluble material, and that is otherwise compatible with the overall
method of screening. The surface of such supports may be solid or porous and of a convenient
shape. Examples of suitable insoluble supports include microtiter plates, arrays, membranes
and beads. These are typically made of glass, plastic (e.g., polystyrene), polysaccharides,
nylon or nitrocellulose, teflon™, etc. Microtiter plates and arrays are especially convenient
because a large number of assays can be carried out simultaneously, using small amounts of
reagents and samples. The particular manner of binding of the composition is typically not
crucial so long as it is compatible with the reagents and overall methods of the invention,
maintains the activity of the composition, and is nondiffusable. Preferred methods of binding
include the use of antibodies (which do not sterically block either the ligand binding site or
activation sequence when the protein is bound to the support), direct binding to "sticky" or
ionic supports, chemical crosslinking, the synthesis of the protein or agent on the surface, etc.
Following binding of the protein or agent, excess unbound material is removed by washing.
The sample receiving areas may then be blocked through incubation with bovine serum
albumin (BSA), casein or other innocuous protein or other moiety.

In a preferred embodiment, the lung cancer protein is bound to the support, and a test
compound is added to the assay. Alternatively, the candidate agent is bound to the support
and the lung cancer protein is added. Novel binding agents include specific antibodies,
non-natural binding agents identified in screens of chemical libraries, peptide analogs, etc. Of
particular interest are screening assays for agents that have a low toxicity for human cells. A
wide variety of assays may be used for this purpose, including labeled in vitro protein-protein
binding assays, electrophoretic mobility shift assays, immunoassays for protein binding,
functional assays (phosphorylation assays, etc.) and the like.

The determination of the binding of the test modulating compound to the lung cancer
protein may be done in a number of ways. In a preferred embodiment, the compound is
labeled, and binding determined directly, e.g., by attaching all or a portion of the lung cancer
protein to a solid support, adding a labeled candidate agent (e.g., a fluorescent label), washing
off excess reagent, and determining whether the label is present on the solid support. Various
blocking and washing steps may be utilized as appropriate.

In some embodiments, only one of the components is labeled, e.g., the proteins (or
proteinaceous candidate compounds) can be labeled. Alternatively, more than one
component can be labeled with different labels, e.g., 125I for the proteins and a fluorophor for
the compound. Proximity reagents, e.g., quenching or energy transfer reagents, are also
useful.

In one embodiment, the binding of the test compound is determined by competitive
binding assay. The competitor may be a binding moiety known to bind to the target molecule
(i.e., a lung cancer protein), such as an antibody, peptide, binding partner, ligand, etc. Under
certain circumstances, there may be competitive binding between the compound and the
binding moiety, with the binding moiety displacing the compound. In one embodiment, the
test compound is labeled. Either the compound, or the competitor, or both, is added first to
the protein for a time sufficient to allow binding, if present. Incubations may be performed at
a temperature which facilitates optimal activity, typically between 4 and 40° C. Incubation
periods are typically optimized, e.g., to facilitate rapid high throughput screening. Typically
between 0.1 and 1 hour will be sufficient. Excess reagent is generally removed or washed
away. The second component is then added, and the presence or absence of the labeled
component is followed, to indicate binding.

In a preferred embodiment, the competitor is added first, followed by a test
compound. Displacement of the competitor is an indication that the test compound is binding
to the lung cancer protein and thus is capable of binding to, and potentially modulating, the
activity of the lung cancer protein. In this embodiment, either component can be labeled.
Thus, e.g., if the competitor is labeled, the presence of label in the wash solution indicates
displacement by the agent. Alternatively, if the test compound is labeled, the presence of the
label on the support indicates displacement.

In an alternative embodiment, the test compound is added first, with incubation and
washing, followed by the competitor. The absence of binding by the competitor may indicate
that the test compound is bound to the lung cancer protein with a higher affinity. Thus, if the
test compound is labeled, the presence of the label on the support, coupled with a lack of
competitor binding, may indicate that the test compound is capable of binding to the lung
cancer protein.

In a preferred embodiment, the methods comprise differential screening to identify
agents that are capable of modulating the activity of the lung cancer proteins. In one
embodiment, the methods comprise combining a lung cancer protein and a competitor in a
first sample. A second sample comprises a test compound, a lung cancer protein, and a
competitor. The binding of the competitor is determined for both samples, and a change, or
difference in binding between the two samples, indicates the presence of an agent capable of
binding to the lung cancer protein and potentially modulating its activity. That is, if the
binding of the competitor is different in the second sample relative to the first sample, the
agent is capable of binding to the lung cancer protein.
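
By way of illustration only, the comparison of competitor binding between the two samples
can be expressed as a fractional change relative to the first sample. The following is a
minimal sketch; the function names, the replicate counts, and the 20% cut-off are
hypothetical and would in practice be set from the assay's controls.

```python
from statistics import mean


def competitor_binding_change(first_sample, second_sample):
    """Fractional change in competitor binding in the second sample relative to the first."""
    reference = mean(first_sample)
    if reference == 0:
        raise ValueError("competitor binding in the first sample must be non-zero")
    return (mean(second_sample) - reference) / reference


def is_candidate_binder(first_sample, second_sample, min_change=0.2):
    """Flag the test compound when competitor binding differs by at least min_change."""
    return abs(competitor_binding_change(first_sample, second_sample)) >= min_change


# Hypothetical bound-competitor signal (e.g., counts) from replicate wells.
without_compound = [1000.0, 1020.0, 980.0]   # first sample: protein + competitor
with_compound = [400.0, 420.0, 380.0]        # second sample: test compound added

change = competitor_binding_change(without_compound, with_compound)
flagged = is_candidate_binder(without_compound, with_compound)
```

In this sketch competitor binding falls by 60% in the second sample, so the test compound
is flagged as a candidate binder.
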

Alternatively, differential screening is used to identify drug candidates that bind to the
native lung cancer protein, but cannot bind to modified lung cancer proteins. The structure of
the lung cancer protein may be modeled, and used in rational drug design to synthesize agents
that interact with that site. Drug candidates that affect the activity of a lung cancer protein
are also identified by screening drugs for the ability to either enhance or reduce the activity of
the protein.

Positive controls and negative controls may be used in the assays. Preferably control
and test samples are run in at least triplicate to obtain statistically significant results.
Incubation of all samples is for a time sufficient for the binding of the agent to the protein.
Following incubation, samples are washed free of non-specifically bound material and the
amount of bound, generally labeled, agent determined. For example, where a radiolabel is
employed, the samples may be counted in a scintillation counter to determine the amount of
bound compound.

A variety of other reagents may be included in the screening assays. These include
reagents like salts, neutral proteins, e.g., albumin, detergents, etc., which may be used to
facilitate optimal protein-protein binding and/or reduce non-specific or background
interactions. Also, reagents that otherwise improve the efficiency of the assay, such as
protease inhibitors, nuclease inhibitors, anti-microbial agents, etc., may be used. The mixture
of components may be added in an order that provides for the requisite binding.

In a preferred embodiment, the invention provides methods for screening for a
compound capable of modulating the activity of a lung cancer protein. The methods
comprise adding a test compound, as defined above, to a cell comprising lung cancer
proteins. Preferred cell types include almost any cell. The cells contain a recombinant
nucleic acid that encodes a lung cancer protein. In a preferred embodiment, a library of
candidate agents is tested on a plurality of cells.

In one aspect, the assays are evaluated in the presence or absence of, or with previous or
subsequent exposure to, physiological signals, e.g., hormones, antibodies, peptides, antigens,
cytokines, growth factors, action potentials, pharmacological agents including
chemotherapeutics, radiation, carcinogenics, or other cells (e.g., cell-cell contacts). In another
example, the determinations are made at different stages of the cell cycle process.

In this way, compounds that modulate lung cancer agents are identified. Compounds
with pharmacological activity are able to enhance or interfere with the activity of the lung
cancer protein. Once identified, similar structures are evaluated to identify critical structural
features of the compound.

In one embodiment, a method of inhibiting lung cancer cell division is provided. The
method comprises administration of a lung cancer inhibitor. In another embodiment, a
method of inhibiting lung cancer is provided. The method may comprise administration of a
lung cancer inhibitor. In a further embodiment, methods of treating cells or individuals with
lung cancer are provided, e.g., comprising administration of a lung cancer inhibitor.

In one embodiment, a lung cancer inhibitor is an antibody as discussed above. In 
another embodiment, the lung cancer inhibitor is an antisense molecule. 

A variety of cell growth, proliferation, viability, and metastasis assays are known to
those of skill in the art, as described below.

Soft agar growth or colony formation in suspension 
Normal cells require a solid substrate to attach and grow. When the cells are
transformed, they lose this phenotype and grow detached from the substrate. For example,
transformed cells can grow in stirred suspension culture or suspended in semi-solid media,
such as semi-solid or soft agar. The transformed cells, when transfected with tumor
suppressor genes, regenerate normal phenotype and require a solid substrate to attach and
grow. Soft agar growth or colony formation in suspension assays can be used to identify
modulators of lung cancer sequences, which when expressed in host cells, inhibit abnormal
cellular proliferation and transformation. A therapeutic compound would reduce or eliminate
the host cells' ability to grow in stirred suspension culture or suspended in semi-solid media,
such as semi-solid or soft agar.

Techniques for soft agar growth or colony formation in suspension assays are
described in Freshney (1994) Culture of Animal Cells: A Manual of Basic Technique (3rd ed.),
herein incorporated by reference. See also, the methods section of Garkavtsev, et al. (1996),
supra, herein incorporated by reference.

Contact inhibition and density limitation of growth

Normal cells typically grow in a flat and organized pattern in a petri dish until they
touch other cells. When the cells touch one another, they are contact inhibited and stop
growing. When cells are transformed, however, the cells are not contact inhibited and
continue to grow to high densities in disorganized foci. Thus, the transformed cells grow to a
higher saturation density than normal cells. This can be detected morphologically by the
formation of a disoriented monolayer of cells or rounded cells in foci within the regular
pattern of normal surrounding cells. Alternatively, labeling index with (3H)-thymidine at
saturation density can be used to measure density limitation of growth. See Freshney (1994),
supra. The transformed cells, when transfected with tumor suppressor genes, regenerate a
normal phenotype, become contact inhibited, and grow to a lower density.

In this assay, labeling index with (3H)-thymidine at saturation density is a preferred
method of measuring density limitation of growth. Transformed host cells are transfected
with a lung cancer-associated sequence and are grown for 24 hours at saturation density in
non-limiting medium conditions. The percentage of cells labeling with (3H)-thymidine is
determined autoradiographically. See, Freshney (1994), supra.
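
As a minimal illustration of the labeling-index readout described above, the percentage of
labeled cells can be computed from autoradiographic counts. The function name and the
example counts below are hypothetical and are not taken from this disclosure.

```python
def labeling_index(labeled_cells: int, total_cells: int) -> float:
    """Return the percentage of scored cells that are (3H)-thymidine positive."""
    if total_cells <= 0:
        raise ValueError("total_cells must be positive")
    if not 0 <= labeled_cells <= total_cells:
        raise ValueError("labeled_cells must lie between 0 and total_cells")
    return 100.0 * labeled_cells / total_cells


# Hypothetical counts from autoradiographs of control and transfected cultures.
control_index = labeling_index(labeled_cells=180, total_cells=400)
transfected_index = labeling_index(labeled_cells=40, total_cells=400)
print(f"control: {control_index:.1f}%  transfected: {transfected_index:.1f}%")
```

A lower labeling index in the transfected culture than in the control would be consistent
with restored density limitation of growth.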

Growth factor or serum dependence 
Transformed cells typically have a lower serum dependence than their normal
counterparts (see, e.g., Temin (1966) J. Natl. Cancer Inst. 37:167-175; Eagle, et al. (1970) J.
Exp. Med. 131:836-879; Freshney, supra). This is in part due to release of various growth
factors by the transformed cells. Growth factor or serum dependence of transformed host
cells can be compared with that of control.

Tumor specific marker levels

Tumor cells release a greater amount of certain factors (hereinafter "tumor
specific markers") than their normal counterparts. For example, plasminogen activator (PA)
is released from human glioma at a higher level than from normal brain cells (see, e.g.,
Gullino, "Angiogenesis, tumor vascularization, and potential interference with tumor growth"
in Mihich (ed. 1985) Biological Responses in Cancer, pp. 178-184). Similarly, tumor
angiogenesis factor (TAF) is released at a higher level in tumor cells than their normal
counterparts. See, e.g., Folkman (1992) "Angiogenesis and Cancer" in Sem. Cancer Biol.

Various techniques which measure the release of these factors are described in
Freshney (1994), supra. Also, see, Unkeless, et al. (1974) J. Biol. Chem. 249:4295-4305;
Strickland and Beers (1976) J. Biol. Chem. 251:5694-5702; Whur, et al. (1980) Br. J. Cancer
42:305-312; Gullino, "Angiogenesis, tumor vascularization, and potential interference with
tumor growth" in Mihich (ed. 1985) Biological Responses in Cancer, pp. 178-184; Freshney
Anticancer Res. 5:111-130 (1985).

Invasiveness into Matrigel 

The degree of invasiveness into Matrigel or some other extracellular matrix
constituent can be used as an assay to identify compounds that modulate lung cancer-associated
sequences. Tumor cells exhibit a good correlation between malignancy and
invasiveness of cells into Matrigel or some other extracellular matrix constituent. In this
assay, tumorigenic cells are typically used as host cells. Expression of a tumor suppressor
gene in these host cells would decrease invasiveness of the host cells.

Techniques described in Freshney (1994), supra, can be used. Briefly, the level of
invasion of host cells can be measured by using filters coated with Matrigel or some other
extracellular matrix constituent. Penetration into the gel, or through to the distal side of the
filter, is rated as invasiveness, and rated histologically by number of cells and distance
moved, or by prelabeling the cells with 125I and counting the radioactivity on the distal side of
the filter or bottom of the dish. See, e.g., Freshney (1984), supra.

Tumor growth in vivo 

Effects of lung cancer-associated sequences on cell growth can be tested in transgenic
or immune-suppressed mice. Knock-out transgenic mice can be made, in which the lung
cancer gene is disrupted or in which a lung cancer gene is inserted. Knock-out transgenic
mice can be made by insertion of a marker gene or other heterologous gene into the
endogenous lung cancer gene site in the mouse genome via homologous recombination.
Such mice can also be made by substituting the endogenous lung cancer gene with a mutated
version of the lung cancer gene, or by mutating the endogenous lung cancer gene, e.g., by
exposure to carcinogens.

A DNA construct is introduced into the nuclei of embryonic stem cells. Cells
containing the newly engineered genetic lesion are injected into a host mouse embryo, which
is re-implanted into a recipient female. Some of these embryos develop into chimeric mice
that possess germ cells partially derived from the mutant cell line. Therefore, by breeding the
chimeric mice it is possible to obtain a new line of mice containing the introduced genetic
lesion (see, e.g., Capecchi, et al. (1989) Science 244:1288). Chimeric targeted mice can be
derived according to Hogan, et al. (1988) Manipulating the Mouse Embryo: A Laboratory
Manual, Cold Spring Harbor Laboratory and Robertson (ed. 1987) Teratocarcinomas and
Embryonic Stem Cells: A Practical Approach, IRL Press, Washington, D.C.

Alternatively, various immune-suppressed or immune-deficient host animals can be
used. For example, a genetically athymic "nude" mouse (see, e.g., Giovanella, et al. (1974) J.
Natl. Cancer Inst. 52:921), a SCID mouse, a thymectomized mouse, or an irradiated mouse
(see, e.g., Bradley, et al. (1978) Br. J. Cancer 38:263; Selby, et al. (1980) Br. J. Cancer 41:52)
can be used as a host. Transplantable tumor cells (typically about 10^6 cells) injected into
isogenic hosts will produce invasive tumors in a high proportion of cases, while normal cells
of similar origin will not. In hosts which developed invasive tumors, cells expressing a lung
cancer-associated sequence are injected subcutaneously. After a suitable length of time,
preferably 4-8 weeks, tumor growth is measured (e.g., by volume or by its two largest
dimensions) and compared to the control. Tumors that show a statistically significant reduction
(using, e.g., Student's t test) are said to have inhibited growth.
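
The comparison described above can be sketched as follows. This is an illustrative sketch
only: the volume formula V = L x W^2 / 2 is a commonly used approximation that is assumed
here rather than specified in this disclosure, and the caliper measurements, function names,
and significance threshold are hypothetical.

```python
from math import sqrt
from statistics import mean, variance


def tumor_volume(length_mm: float, width_mm: float) -> float:
    """Approximate tumor volume (mm^3) from its two largest dimensions.

    Assumes the modified ellipsoid approximation V = L * W^2 / 2.
    """
    return length_mm * width_mm ** 2 / 2.0


def student_t(sample_a, sample_b):
    """Two-sample Student's t statistic (pooled variance) and degrees of freedom."""
    n_a, n_b = len(sample_a), len(sample_b)
    df = n_a + n_b - 2
    pooled_var = ((n_a - 1) * variance(sample_a) + (n_b - 1) * variance(sample_b)) / df
    t_stat = (mean(sample_a) - mean(sample_b)) / sqrt(pooled_var * (1 / n_a + 1 / n_b))
    return t_stat, df


# Hypothetical (length, width) measurements in mm for control and test animals.
control_dims = [(12, 10), (14, 10), (13, 11), (15, 12), (12, 9)]
treated_dims = [(8, 6), (9, 7), (7, 6), (10, 7), (8, 5)]
control = [tumor_volume(length, width) for length, width in control_dims]
treated = [tumor_volume(length, width) for length, width in treated_dims]

t_stat, df = student_t(control, treated)
# Two-sided critical value for alpha = 0.05 at 8 degrees of freedom (standard t table).
T_CRIT_DF8 = 2.306
print(f"t = {t_stat:.2f}, df = {df}, significant = {abs(t_stat) > T_CRIT_DF8}")
```

The pooled-variance form assumes similar variance in both groups; where that assumption is
doubtful, a different test may be more appropriate.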

Polynucleotide modulators of lung cancer

Antisense and RNAi Polynucleotides 

In certain embodiments, the activity of a lung cancer-associated protein is
downregulated, or entirely inhibited, by the use of an antisense or inhibitory polynucleotide,
i.e., a nucleic acid complementary to, and which can preferably hybridize specifically to, a
coding mRNA nucleic acid sequence, e.g., a lung cancer protein mRNA, or a subsequence
thereof. Binding of the antisense polynucleotide to the mRNA reduces the translation and/or
stability of the mRNA.

In the context of this invention, antisense polynucleotides can comprise naturally-occurring
nucleotides, or synthetic species formed from naturally-occurring subunits or their
close homologs. Antisense polynucleotides may also have altered sugar moieties or inter-sugar
linkages. Exemplary among these are the phosphorothioate and other sulfur containing
species which are known for use in the art. Analogs are comprehended by this invention so
long as they function effectively to hybridize with the lung cancer protein mRNA. See, e.g.,
Isis Pharmaceuticals, Carlsbad, CA; Sequitor, Inc., Natick, MA.

Such antisense polynucleotides can readily be synthesized using recombinant means,
or can be synthesized in vitro. Equipment for such synthesis is sold by several vendors,
including Applied Biosystems. The preparation of other oligonucleotides such as
phosphorothioates and alkylated derivatives is also well known to those of skill in the art.

Antisense molecules as used herein include antisense or sense oligonucleotides.
Sense oligonucleotides can, e.g., be employed to block transcription by binding to the
antisense strand. The antisense and sense oligonucleotides comprise a single-stranded nucleic
acid sequence (either RNA or DNA) capable of binding to target mRNA (sense) or DNA
(antisense) sequences for lung cancer molecules. A preferred antisense molecule is for a lung
cancer sequence in the tables, or for a ligand or activator thereof. Antisense or sense
oligonucleotides, according to the present invention, comprise a fragment generally at least
about 14 nucleotides, preferably from about 14 to 30 nucleotides. The ability to derive an
antisense or a sense oligonucleotide, based upon a cDNA sequence encoding a given protein,
is described in, e.g., Stein and Cohen (1988) Cancer Res. 48:2659 and van der Krol, et al.
(1988) BioTechniques 6:958.
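
By way of illustration only, candidate antisense oligonucleotides can be enumerated from a
sense-strand cDNA sequence by taking the reverse complement of each window of the desired
length. The sketch below is not the procedure of the cited references; the function names and
the example sequence are hypothetical, and no selection criteria (e.g., target accessibility
or specificity) are applied.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of a DNA sequence written 5' to 3'."""
    return dna.upper().translate(COMPLEMENT)[::-1]


def antisense_candidates(cdna: str, length: int = 20):
    """Yield (start, oligo) for each antisense DNA oligo of `length` nt.

    `cdna` is the sense (coding) strand; each oligo is complementary to the
    corresponding mRNA window. The 14 nt minimum follows the text above.
    """
    if length < 14:
        raise ValueError("oligonucleotides are generally at least about 14 nucleotides")
    cdna = cdna.upper()
    for start in range(len(cdna) - length + 1):
        yield start, reverse_complement(cdna[start:start + length])


# Hypothetical cDNA fragment; not a sequence from the tables.
cdna = "ATGGCTAGCTTGACCGGTACGATCGA"
for start, oligo in list(antisense_candidates(cdna, length=14))[:3]:
    print(start, oligo)
```

In practice each candidate would then be screened experimentally, since hybridization to the
target mRNA is not guaranteed by sequence complementarity alone.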

RNA interference is a mechanism to suppress gene expression in a sequence specific
manner. See, e.g., Brummelkamp, et al. (2002) Sciencexpress (21 March 2002); Sharp (1999)
Genes Dev. 13:139-141; and Carthew (2001) Curr. Op. Cell Biol. 13:244-248. In mammalian
cells, short, e.g., 21 nt, double stranded small interfering RNAs (siRNA) have been shown to
be effective at inducing an RNAi response. See, e.g., Elbashir, et al. (2001) Nature 411:494-498.
The mechanism may be used to downregulate expression levels of identified genes, e.g.,
for treatment of, or validation of relevance to, disease.

Ribozymes 

In addition to antisense polynucleotides, ribozymes can be used to target and inhibit
transcription of lung cancer-associated nucleotide sequences. A ribozyme is an RNA
molecule that catalytically cleaves other RNA molecules. Different kinds of ribozymes have
been described, including group I ribozymes, hammerhead ribozymes, hairpin ribozymes,
RNase P, and axhead ribozymes (see, e.g., Castanotto, et al. (1994) Adv. in Pharmacology
25:289-317 for a general review of the properties of different ribozymes).

The general features of hairpin ribozymes are described, e.g., in Hampel, et al. (1990)
Nucl. Acids Res. 18:299-304; European Patent Publication No. 0 360 257; U.S. Patent No.
5,254,678. Methods of preparing ribozymes are well known to those of skill in the art (see, e.g., WO
94/26877; Ojwang, et al. (1993) Proc. Natl. Acad. Sci. USA 90:6340-6344; Yamada, et al.
(1994) Human Gene Therapy 1:39-45; Leavitt, et al. (1995) Proc. Natl. Acad. Sci. USA
92:699-703; Leavitt, et al. (1994) Human Gene Therapy 5:1151-120; and Yamada, et al.
(1994) Virology 205:121-126).

Polynucleotide modulators of lung cancer may be introduced into a cell containing the
target nucleotide sequence by formation of a conjugate with a ligand binding molecule, as
described in WO 91/04753. Suitable ligand binding molecules include, but are not limited to,
cell surface receptors, growth factors, other cytokines, or other ligands that bind to cell
surface receptors. Preferably, conjugation of the ligand binding molecule does not
substantially interfere with the ability of the ligand binding molecule to bind to its
corresponding molecule or receptor, or block entry of the sense or antisense oligonucleotide
or its conjugated version into the cell. Alternatively, a polynucleotide modulator of lung
cancer may be introduced into a cell containing the target nucleic acid sequence, e.g., by
formation of a polynucleotide-lipid complex, as described in WO 90/10448. It is
understood that antisense molecules or knock out and knock in models may also be
used in screening assays as discussed above, in addition to methods of treatment.

Thus, in one embodiment, methods of modulating lung cancer in cells or organisms
are provided. In one embodiment, the methods comprise administering to a cell an anti-lung
cancer antibody that reduces or eliminates the biological activity of an endogenous lung
cancer protein. Alternatively, the methods comprise administering to a cell or organism a
recombinant nucleic acid encoding a lung cancer protein. This may be accomplished in any
number of ways. In a preferred embodiment, e.g., when the lung cancer sequence is down-regulated
in lung cancer, such state may be reversed by increasing the amount of lung cancer
gene product in the cell. This can be accomplished, e.g., by overexpressing the endogenous
lung cancer gene or administering a gene encoding the lung cancer sequence, using known
gene-therapy techniques. In a preferred embodiment, the gene therapy techniques include the
incorporation of the exogenous gene using enhanced homologous recombination (EHR), e.g.,
as described in PCT/US93/03868, hereby incorporated by reference in its entirety.
Alternatively, e.g., when the lung cancer sequence is up-regulated in lung cancer, the activity
of the endogenous lung cancer gene is decreased, e.g., by the administration of a lung cancer
antisense or RNAi nucleic acid.

In one embodiment, the lung cancer proteins of the present invention may be used to
generate polyclonal and monoclonal antibodies to lung cancer proteins. Similarly, the lung
cancer proteins can be coupled, using standard technology, to affinity chromatography
columns. These columns may then be used to purify lung cancer antibodies useful for
production, diagnostic, or therapeutic purposes. In a preferred embodiment, the antibodies
are generated to epitopes unique to a lung cancer protein; that is, the antibodies show little or
no cross-reactivity to other proteins. The lung cancer antibodies may be coupled to standard
affinity chromatography columns and used to purify lung cancer proteins. The antibodies
may also be used as blocking polypeptides, as outlined above, since they will specifically
bind to the lung cancer protein.

Methods of identifying variant lung cancer-associated sequences

Without being bound by theory, expression of various lung cancer sequences is
correlated with lung cancer. Accordingly, disorders based on mutant or variant lung cancer
genes may be determined. In one embodiment, the invention provides methods for
identifying cells containing variant lung cancer genes, e.g., determining all or part of the
sequence of at least one endogenous lung cancer gene in a cell. In a preferred embodiment,
the invention provides methods of identifying the lung cancer genotype of an individual, e.g.,
determining all or part of the sequence of at least one lung cancer gene of the individual.
This is generally done in at least one tissue of the individual, and may include the evaluation
of a number of tissues or different samples of the same tissue. The method may include
comparing the sequence of the sequenced lung cancer gene to a known lung cancer gene, i.e.,
a wild-type gene.

The sequence of all or part of the lung cancer gene can then be compared to the
sequence of a known lung cancer gene to determine if any differences exist. This can be
done using known homology programs, such as Bestfit, etc. In a preferred embodiment, the
presence of a difference in the sequence between the lung cancer gene of the patient and the
known lung cancer gene correlates with a disease state or a propensity for a disease state, as
outlined herein.
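
The comparison step can be sketched, purely for illustration, with a general-purpose sequence
matcher from the Python standard library. This is a stand-in and not Bestfit or any other
dedicated homology program; the function name and both example sequences are hypothetical.

```python
from difflib import SequenceMatcher


def sequence_differences(reference: str, sample: str):
    """List differences as (kind, reference_position, reference_bases, sample_bases).

    `kind` is one of "replace", "delete", or "insert"; positions are 0-based
    offsets into the reference sequence.
    """
    reference, sample = reference.upper(), sample.upper()
    matcher = SequenceMatcher(None, reference, sample, autojunk=False)
    return [
        (tag, i1, reference[i1:i2], sample[j1:j2])
        for tag, i1, i2, j1, j2 in matcher.get_opcodes()
        if tag != "equal"
    ]


# Hypothetical wild-type and patient-derived fragments differing at one base.
wild_type = "ATGGCTAGCTTGACCGGTACG"
patient = "ATGGCTAGCTTCACCGGTACG"
for difference in sequence_differences(wild_type, patient):
    print(difference)
```

An empty result indicates that no difference was found in the compared region; any reported
difference would still need to be evaluated for its correlation with a disease state.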

In a preferred embodiment, the lung cancer genes are used as probes to determine the
number of copies of the lung cancer gene in the genome.

In another preferred embodiment, the lung cancer genes are used as probes to
determine the chromosomal localization of the lung cancer genes. Information such as
chromosomal localization finds use in providing a diagnosis or prognosis, in particular when
chromosomal abnormalities such as translocations and the like are identified in the lung
cancer gene locus.

Administration of pharmaceutical and vaccine compositions 

In one embodiment, a therapeutically effective dose of a lung cancer protein or
modulator thereof is administered to a patient. By "therapeutically effective dose" herein is
meant a dose that produces the effects for which it is administered. The exact dose will depend
on the purpose of the treatment, and will be ascertainable by one skilled in the art using
known techniques (e.g., Ansel, et al. (1992) Pharmaceutical Dosage Forms and Drug
Delivery; Lieberman, Pharmaceutical Dosage Forms (vols. 1-3), Dekker, ISBN 0824770846,
082476918X, 0824712692, 0824716981; Lloyd (1999) The Art, Science and Technology of
Pharmaceutical Compounding; and Pickar (1999) Dosage Calculations). Adjustments for
lung cancer degradation, systemic versus localized delivery, and rate of new protease
synthesis, as well as the age, body weight, general health, sex, diet, time of administration,
drug interaction and the severity of the condition may be necessary, and will be ascertainable
with routine experimentation by those skilled in the art.

A "patient" for the purposes of the present invention includes both humans and other
animals, particularly mammals. Thus the methods are applicable to both human therapy and
veterinary applications. In the preferred embodiment the patient is a mammal, preferably a
primate, and in the most preferred embodiment the patient is human.

The administration of the lung cancer proteins and modulators thereof of the present
invention can be done in a variety of ways, including, but not limited to, orally,
subcutaneously, intravenously, intranasally, transdermally, intraperitoneally, intramuscularly,
intrapulmonary, vaginally, rectally, or intraocularly. In some instances, e.g., in the treatment
of wounds and inflammation, the lung cancer proteins and modulators may be directly
applied as a solution or spray.

The pharmaceutical compositions of the present invention comprise a lung cancer
protein in a form suitable for administration to a patient. In the preferred embodiment, the
pharmaceutical compositions are in a water soluble form, such as being present as
pharmaceutically acceptable salts, which is meant to include both acid and base addition
salts. "Pharmaceutically acceptable acid addition salt" refers to those salts that retain the
biological effectiveness of the free bases and that are not biologically or otherwise
undesirable, formed with inorganic acids such as hydrochloric acid, hydrobromic acid,
sulfuric acid, nitric acid, phosphoric acid and the like, and organic acids such as acetic acid,
propionic acid, glycolic acid, pyruvic acid, oxalic acid, maleic acid, malonic acid, succinic
acid, fumaric acid, tartaric acid, citric acid, benzoic acid, cinnamic acid, mandelic acid,
methanesulfonic acid, ethanesulfonic acid, p-toluenesulfonic acid, salicylic acid and the like.
"Pharmaceutically acceptable base addition salts" include those derived from inorganic bases
such as sodium, potassium, lithium, ammonium, calcium, magnesium, iron, zinc, copper,
manganese, aluminum salts and the like. Particularly preferred are the ammonium,
potassium, sodium, calcium, and magnesium salts. Salts derived from pharmaceutically
acceptable organic non-toxic bases include salts of primary, secondary, and tertiary amines,
substituted amines including naturally occurring substituted amines, cyclic amines and basic
ion exchange resins, such as isopropylamine, trimethylamine, diethylamine, triethylamine,
tripropylamine, and ethanolamine.

The pharmaceutical compositions may also include one or more of the following:
carrier proteins such as serum albumin; buffers; fillers such as microcrystalline cellulose,
lactose, corn and other starches; binding agents; sweeteners and other flavoring agents;
coloring agents; and polyethylene glycol.

The pharmaceutical compositions can be administered in a variety of unit dosage
forms depending upon the method of administration. For example, unit dosage forms
suitable for oral administration include, but are not limited to, powder, tablets, pills, capsules
and lozenges. It is recognized that lung cancer protein modulators (e.g., antibodies, antisense
constructs, ribozymes, small organic molecules, etc.) when administered orally, should be
protected from digestion. This is typically accomplished either by complexing the
molecule(s) with a composition to render it resistant to acidic and enzymatic hydrolysis, or by
packaging the molecule(s) in an appropriately resistant carrier, such as a liposome or a
protection barrier. Means of protecting agents from digestion are well known in the art.

The compositions for administration will commonly comprise a lung cancer protein
modulator dissolved in a pharmaceutically acceptable carrier, preferably an aqueous carrier.
A variety of aqueous carriers can be used, e.g., buffered saline and the like. These solutions
are sterile and generally free of undesirable matter. These compositions may be sterilized by
conventional, well known sterilization techniques. The compositions may contain
pharmaceutically acceptable auxiliary substances as required to approximate physiological
conditions such as pH adjusting and buffering agents, toxicity adjusting agents and the like,
e.g., sodium acetate, sodium chloride, potassium chloride, calcium chloride, sodium lactate
and the like. The concentration of active agent in these formulations can vary widely, and
will be selected primarily based on fluid volumes, viscosities, body weight and the like in
accordance with the particular mode of administration selected and the patient's needs (e.g.,
Remington's Pharmaceutical Science (15th ed., 1980) and Hardman, et al. (eds. 1996)
Goodman and Gilman: The Pharmacological Basis of Therapeutics).

Thus, a typical pharmaceutical composition for intravenous administration would be
about 0.1 to 10 mg per patient per day. Dosages from 0.1 up to about 100 mg per patient per
day may be used, particularly when the drug is administered to a secluded site and not into
the blood stream, such as into a body cavity or into a lumen of an organ. Substantially higher
dosages are possible in topical administration. Actual methods for preparing parenterally
administrable compositions will be known or apparent to those skilled in the art, e.g.,
Remington's Pharmaceutical Science and Goodman and Gilman, The Pharmacological Basis
of Therapeutics, supra.

The compositions containing modulators of lung cancer proteins can be administered
for therapeutic or prophylactic treatments. In therapeutic applications, compositions are
administered to a patient suffering from a disease (e.g., a cancer) in an amount sufficient to
cure or at least partially arrest the disease and its complications. An amount adequate to
accomplish this is defined as a "therapeutically effective dose." Amounts effective for this
use will depend upon the severity of the disease and the general state of the patient's health.
Single or multiple administrations of the compositions may be administered depending on the
dosage and frequency as required and tolerated by the patient. In any event, the composition
should provide a sufficient quantity of the agents of this invention to effectively treat the
patient. An amount of modulator that is capable of preventing or slowing the development of
cancer in a mammal is referred to as a "prophylactically effective dose." The particular dose
required for a prophylactic treatment will depend upon the medical condition and history of
the mammal, the particular cancer being prevented, as well as other factors such as age,
weight, gender, administration route, efficiency, etc. Such prophylactic treatments may be
used, e.g., in a mammal who has previously had cancer to prevent a recurrence of the cancer,
or in a mammal who is suspected of having a significant likelihood of developing cancer
based, at least in part, upon gene expression profiles. Vaccine strategies may be used, in
either a DNA vaccine or a protein vaccine form.

It will be appreciated that the present lung cancer protein-modulating compounds can
be administered alone or in combination with additional lung cancer modulating compounds
or with other therapeutic agents, e.g., other anti-cancer agents or treatments.

In numerous embodiments, one or more nucleic acids, e.g., polynucleotides
comprising nucleic acid sequences set forth in the tables, such as antisense or RNAi
polynucleotides or ribozymes, will be introduced into cells, in vitro or in vivo. The present
invention provides methods, reagents, vectors, and cells useful for expression of lung cancer-associated
polypeptides and nucleic acids using in vitro (cell-free), ex vivo, or in vivo (cell or
organism-based) recombinant expression systems.

The particular procedure used to introduce the nucleic acids into a host cell for
expression of a protein or nucleic acid is application specific. Many procedures for
introducing foreign nucleotide sequences into host cells may be used. These include the use
of calcium phosphate transfection, spheroplasts, electroporation, liposomes, microinjection,
plasmid vectors, viral vectors and other well known methods for introducing cloned genomic
DNA, cDNA, synthetic DNA or other foreign genetic material into a host cell (see, e.g.,
Berger and Kimmel, Guide to Molecular Cloning Techniques, Methods in Enzymology
volume 152 (Berger); Ausubel, et al. (eds. 1999) Current Protocols (supplemented through
1999); and Sambrook, et al. (1989) Molecular Cloning - A Laboratory Manual (2nd ed., Vol.
1-3)).

In a preferred embodiment, lung cancer proteins and modulators are administered as
therapeutic agents, and can be formulated as outlined above. Similarly, lung cancer genes
(including the full-length sequence, partial sequences, or regulatory sequences of the
lung cancer coding regions) can be administered in a gene therapy application. These lung
cancer genes can include antisense or inhibitory applications, e.g., as inhibitory RNA or gene
therapy (e.g., for incorporation into the genome) or as antisense compositions.

Lung cancer polypeptides and polynucleotides can also be administered as vaccine
compositions to stimulate HTL, CTL, and antibody responses. Such vaccine compositions
can include, e.g., lipidated peptides (see, e.g., Vitiello, et al. (1995) J. Clin. Invest. 95:341),
peptide compositions encapsulated in poly(DL-lactide-co-glycolide) ("PLG") microspheres
(see, e.g., Eldridge, et al. (1991) Molec. Immunol. 28:287-294; Alonso, et al. (1994) Vaccine
12:299-306; Jones, et al. (1995) Vaccine 13:675-681), peptide compositions contained in
immune stimulating complexes (ISCOMS) (see, e.g., Takahashi, et al. (1990) Nature
344:873-875; Hu, et al. (1998) Clin. Exp. Immunol. 113:235-243), multiple antigen peptide
systems (MAPs) (see, e.g., Tam (1988) Proc. Natl. Acad. Sci. U.S.A. 85:5409-5413; Tam
(1996) J. Immunol. Methods 196:17-32), peptides formulated as multivalent peptides;
peptides for use in ballistic delivery systems, typically crystallized peptides, viral delivery
vectors (Perkus, et al., p. 379 In: Kaufmann (ed. 1996) Concepts in vaccine development;
Chakrabarti, et al. (1986) Nature 320:535; Hu, et al. (1986) Nature 320:537; Kieny, et al.
(1986) AIDS Bio/Technology 4:790; Top, et al. (1971) J. Infect. Dis. 124:148; Chanda, et al.
(1990) Virology 175:535), particles of viral or synthetic origin (see, e.g., Kofler, et al. (1996)
J. Immunol. Methods 192:25; Eldridge, et al. (1993) Sem. Hematol. 30:16; Falo, et al. (1995)
Nature Med. 7:649), adjuvants (Warren, et al. (1986) Annu. Rev. Immunol. 4:369; Gupta, et
al. (1993) Vaccine 11:293), liposomes (Reddy, et al. (1992) J. Immunol. 148:1585; Rock
(1996) Immunol. Today 17:131), or naked or particle absorbed cDNA (Ulmer, et al. (1993)
Science 259:1745; Robinson, et al. (1993) Vaccine 11:957; Shiver, et al., p. 423 In:
Kaufmann (ed. 1996) Concepts in vaccine development; Cease and Berzofsky (1994) Annu.
Rev. Immunol. 12:923 and Eldridge, et al. (1993) Sem. Hematol. 30:16). Toxin-targeted
delivery technologies, also known as receptor mediated targeting, such as those of Avant
Immunotherapeutics, Inc. (Needham, Massachusetts) may also be used.

Vaccine compositions often include adjuvants. Many adjuvants contain a substance
designed to protect the antigen from rapid catabolism, such as aluminum hydroxide or
mineral oil, and a stimulator of immune responses, such as lipid A, Bordetella pertussis or
Mycobacterium tuberculosis derived proteins. Certain adjuvants are commercially available
as, e.g., Freund's Incomplete Adjuvant and Complete Adjuvant (Difco Laboratories, Detroit,
MI); Merck Adjuvant 65 (Merck and Company, Inc., Rahway, NJ); AS-2 (SmithKline
Beecham, Philadelphia, PA); aluminum salts such as aluminum hydroxide gel (alum) or
aluminum phosphate; salts of calcium, iron or zinc; an insoluble suspension of acylated
tyrosine; acylated sugars; cationically or anionically derivatized polysaccharides;
polyphosphazenes; biodegradable microspheres; monophosphoryl lipid A and quil A.
Cytokines, such as GM-CSF, interleukin-2, -7, -12, and other like growth factors, may also be
used as adjuvants.

Vaccines can be administered as nucleic acid compositions wherein DNA or RNA
encoding one or more of the polypeptides, or a fragment thereof, is administered to a patient.
This approach is described, for instance, in Wolff, et al. (1990) Science 247:1465 as well as
U.S. Patent Nos. 5,580,859; 5,589,466; 5,804,566; 5,739,118; 5,736,524; 5,679,647; WO
98/04720; and in more detail below. Examples of DNA-based delivery technologies include
"naked DNA", facilitated (bupivacaine, polymers, peptide-mediated) delivery, cationic lipid
complexes, and particle-mediated ("gene gun") or pressure-mediated delivery (see, e.g., U.S.
Patent No. 5,922,687).

For therapeutic or prophylactic immunization purposes, the peptides of the invention
can be expressed by viral or bacterial vectors. Examples of expression vectors include
attenuated viral hosts, such as vaccinia or fowlpox. This approach involves the use of
vaccinia virus, e.g., as a vector to express nucleotide sequences that encode lung cancer
polypeptides or polypeptide fragments. Upon introduction into a host, the recombinant
vaccinia virus expresses the immunogenic peptide, and thereby elicits an immune response.
Vaccinia vectors and methods useful in immunization protocols are described in, e.g., U.S.
Patent No. 4,722,848. Another vector is BCG (Bacille Calmette Guerin). BCG vectors are
described in Stover, et al. (1991) Nature 351:456-460. A wide variety of other vectors useful
for therapeutic administration or immunization, e.g., adeno and adeno-associated virus
vectors, retroviral vectors, Salmonella typhi vectors, detoxified anthrax toxin vectors, and the
like, will be apparent to those skilled in the art from the description herein (see, e.g., Shata, et
al. (2000) Mol. Med. Today 6:66-71; Shedlock, et al. (2000) J. Leukoc. Biol. 68:793-806;
Hipp, et al. (2000) In Vivo 14:571-85).

Methods for the use of genes as DNA vaccines are well known, and include placing a
lung cancer gene or portion of a lung cancer gene under the control of a regulatable promoter
or a tissue-specific promoter for expression in a lung cancer patient. The lung cancer gene
used for DNA vaccines can encode full-length lung cancer proteins, but more preferably
encodes portions of the lung cancer proteins including peptides derived from the lung cancer
protein. In one embodiment, a patient is immunized with a DNA vaccine comprising a
plurality of nucleotide sequences derived from a lung cancer gene. For example, lung
cancer-associated genes or sequences encoding subfragments of a lung cancer protein are
introduced into expression vectors and tested for their immunogenicity in the context of
Class I MHC and an ability to generate cytotoxic T cell responses. This procedure provides
for production of cytotoxic T cell responses against cells which present antigen, including
intracellular epitopes.

In a preferred embodiment, DNA vaccines include a gene encoding an adjuvant 
molecule with the DNA vaccine. Such adjuvant molecules include cytokines that increase 
the immunogenic response to the lung cancer polypeptide encoded by the DNA vaccine. 
Additional or alternative adjuvants are available. 

In another preferred embodiment lung cancer genes find use in generating animal
models of lung cancer. When the lung cancer gene identified is repressed or diminished in
metastatic tissue, gene therapy technology, e.g., antisense or inhibitory RNA directed
to the lung cancer gene, will also diminish or repress expression of the gene. Animal models
of lung cancer find use in screening for modulators of a lung cancer-associated sequence or
modulators of lung cancer. Similarly, transgenic animal technology including gene knockout
technology, e.g., as a result of homologous recombination with an appropriate gene targeting
vector, will result in the absence or increased expression of the lung cancer protein. When
desired, tissue-specific expression or knockout of the lung cancer protein may be necessary.

It is also possible that the lung cancer protein is overexpressed in lung cancer. As
such, transgenic animals can be generated that overexpress the lung cancer protein.
Depending on the desired expression level, promoters of various strengths can be employed
to express the transgene. Also, the number of copies of the integrated transgene can be
determined and compared for a determination of the expression level of the transgene.
Animals generated by such methods will find use as animal models of lung cancer and are
additionally useful in screening for modulators to treat lung cancer.



Kits for Use in Diagnostic and/or Prognostic Applications 

For use in diagnostic, research, and therapeutic applications suggested above, kits are
also provided by the invention. In diagnostic and research applications such kits may include
at least one of the following: assay reagents, buffers, lung cancer-specific nucleic acids or
antibodies, hybridization probes and/or primers, antisense polynucleotides, ribozymes, RNAi,
dominant negative lung cancer polypeptides or polynucleotides, small molecule inhibitors of
lung cancer-associated sequences, etc. A therapeutic product may include sterile saline or
another pharmaceutically acceptable emulsion and suspension base.

In addition, the kits may include instructional materials containing instructions (e.g.,
protocols) for the practice of the methods of this invention. While the instructional materials
typically comprise written or printed materials they are not limited to such. A medium
capable of storing such instructions and communicating them to an end user is contemplated
by this invention. Such media include, but are not limited to, electronic storage media (e.g.,
magnetic discs, tapes, cartridges, chips), optical media (e.g., CD ROM), and the like. Such
media may include addresses to internet sites that provide such instructional materials.

The present invention also provides for kits for screening for modulators of lung
cancer-associated sequences. Such kits can be prepared from readily available materials and
reagents. For example, such kits can comprise one or more of the following materials: a lung
cancer-associated polypeptide or polynucleotide, reaction tubes, and instructions for testing
lung cancer-associated activity. Optionally, the kit contains biologically active lung cancer
protein. A wide variety of kits and components can be prepared according to the present
invention, depending upon the intended user of the kit and the particular needs of the user.
Diagnosis would typically involve evaluation of a plurality of genes or products. The genes
typically will be selected based on correlations with important parameters in disease which
may be identified in historical or outcome data.
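
By way of illustration only, the selection of genes based on correlation with a disease parameter might be sketched as follows. This is a minimal sketch under stated assumptions, not a method disclosed herein: the function names (pearson, select_genes), the gene labels, the expression values, and the outcome values are all hypothetical, and Pearson correlation is merely one possible measure of correlation.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        return 0.0  # a constant series carries no correlation information
    return cov / sqrt(var_x * var_y)


def select_genes(expression, outcome, top_n):
    """Rank genes by absolute correlation with the outcome; keep the top_n."""
    scored = [(gene, pearson(values, outcome)) for gene, values in expression.items()]
    scored.sort(key=lambda item: abs(item[1]), reverse=True)
    return scored[:top_n]


# Hypothetical expression levels for three genes across four samples.
expression = {
    "gene_A": [1.0, 2.0, 3.0, 4.0],
    "gene_B": [4.0, 3.0, 2.0, 1.0],
    "gene_C": [1.0, 1.0, 1.0, 1.0],
}
# Hypothetical clinical parameter measured for the same four samples.
outcome = [10.0, 20.0, 30.0, 40.0]

panel = select_genes(expression, outcome, top_n=2)
```

Ranking by absolute value retains genes that are either positively or negatively correlated with the parameter, since both may be informative in a diagnostic panel.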
Example 1: Gene Chip Analysis

Molecular profiles of various normal and cancerous tissues were determined and
analyzed using gene chips. RNA was isolated and gene chip analysis was performed as
described (Glynne, et al. (2000) Nature 403:672-676; Zhao, et al. (2000) Genes Dev.
14:981-993).






WO 02/086443 



PCT/US02/12476 



Tables 1A and 1B were previously filed on April 18, 2001 in USSN 60/284,770 (18501-001500US) and on November 29, 2001 in USSN 60/334,370 
(18501-001520US) 
"ESTs, Highly similar to integral membra


0.99 


1.8 


104158 


AA454908 


Hs.8127 


KIAA0144 gene product 


0.96 


1.29 


104558 


R56678 


Hs 88959 


Human DNA sequence from clone 967N21 on


1.23 


7.23 


104689 


AA010665 




ESTs 


0.96 


211 


104733 


AA019498 


Hs.23071 


ESTs 


1.18 


1.88 


104906 


AA055809 


Hs.26802 


Protein kinase domains containing protei 


1.11 


3.15 


104978 


AA088458 


Hs.1 9322 


ESTs; Weakly similar to 1!!! ALU SUBFAMI 


1.64 


2.89 


105012 


AA1 16036 


Hs,9329 


"Homo sapiens mRNA for fls353, complete 


1.19 


3.91 


105175 


AA186804 


Hs.25740 


ESTs; Weakly similar to unknown [S.cerev 


0.9 


4.63 


105263 


AA227926 


Hs.6682 


ESTs 


0.95 


287 


105298 


AA233459 


Hs.26369 


ESTs 


1 


1.13 


105312 


AA233854 


Hs.23348 


S-phase kinase-associated protein 2 (p45 


1.32 


3.01 


105719 


AA291644 


Hs 36793 


Hypothetical protein FLJ23188


1.28 


231 


105743 


AA293300 


Hs.9598 


ESTs 


1 


1 


106012 . 


AA411621 ' 


Hs.8895 


ESTs; same as BFH6? 


0.94 


2.04 


106231 


AA429571 


Hs 38002 


KIAA1 355 protein 


1.04 


1.5 


106540 


AA454607 


Hs.38114 


Hypothetical protein FLJ 1 1 100 


1.26 


2.26 


106575 


AA456039 


Hs.1 05421 


ESTs 


1 


2 


106632 


AA459897 


Hs.1 1950 


GPt-anchored metastasis-associated prote 


0.87 


1.32 


106727 


AA465342 


Hs.34045 


Hypothetical protein FU20764 


0.87 


1.59 


106906 


AA490237 


Hs.222024 


Transcription factor BMAL2 (cycle-like f 


0.61 


1.6 


107059 


AA608545 


Hs.23044 


RAD51 (S. cerevisiae) homolog (E coti Re 


0.48 


267 


107104 


AA609786 


Hs.1 5243 


Nucleolar protein 1 (120kO) 


1.01 


1.44 


107151 


AA621169 


Hs 8687 


ESTs; procollagen l-N proteinase 


0.97 


289 


107284 


S74039 


Hs.291904 


Accessory prote/ns 8AP3iyBAP29 


1.15 


165 


107901 


AA026418 


Hs.91539 


ESTs 


0.72 


3.44 


107922 


AA028028 


Hs,61460 


Ig superfamity receptor LNIR precursor 


1 


248 


107932 


AA029317 


Hs 18878 


Hypothetical protein FU21620 


1 


1 


108695 


AA1 21315 


Hs.70823 


KLAA1077 protein 


0.91 


3.53 


108857 


AA1 33250 


Hs.62180 


ESTs 


1 


1 


108860 


AA1 33334 


Hs.1 29911 


ESTs 


0.73 


7.3 


108990 


AA152296 


Hs.72045 


ESTs 


1 


1 


109166 


AA179845 


Hs.73625 


"RAB6 interacting, kinesin-like (rabkine 


1 


4.55 


109424 


AA227919 


Hs.85962 


Hyaluronan synthase 3


1 


1.28 


109665 


F05012 


Hs.27027 


Hypothetical protein DKFZp762H1311 


1.42 


2 


109970 


H09281 


Hs.13234 


ESTs 


1.13 
110015


H10998 


Hs.7164 


A disintegrin and melaOoproleinase doma 


0.84 


1.95 


110156 


H18957 


Hs.4213 


ESTs 


0.94 


1.41 


110561 


H59617 


Hs.5199 


HSPC150 protein similar to ubiquitirvcon 


0.91 


3,18 


111223 


N68921 


Hs.34806 


ESTs; Weakly similar to neogenin [H.sapi 


0.91 


ai3 


111345 


N89820 


Hs.14559 


Hypothetical protein FLi 10540 


1 


1.25 


111876 


R38239 


Hs.293246 


"ESTs, Weakly similar to putative p1 50 [ 


0.83 


1.27 


111902 


R39191 


Hs. 109445 


KIAA1 020 protein 


0.91 


0.91 


112244 


R51309 


Hs.70823 


K1AA1077 protein 


0.77 


3.01 


112973 


T17271 




"cONA FU13308 fis, done OVARC1001436, 


1 


1 


112989 


T23482 


Hs.89981 


■Diacyiglycerol kinase, zeta (104kD)' 


0.55 


1.03 


113047 


T25867 


Hs.7549 


ESTs 


0.87 


2 


113095 


T40920 


Hs.1 26733 


ESTs 


1 


1 


113531 


T90345 


Hs.16740 


Hypothetical protein FU11036 


0.42 


1.44 


113970 


W86748 


Hs.6109 


ESTs 


1.17 


1.73 


114346 


Z41450 


Hs.1 30489 


"ATPase, aminophosphonpid transporter-! 


0.86 


0.82 


114407 


AA010188 


Hs.103305 


ESTs 


0.8 


1.88 


114471 


AA028074 


Hs.104613 


RP42 homctog 


1.06 


1.34 


114509 


AA043551 


Hs.101799 


K1AA1350 protein 


1.82 


Z32 


115060 


AA253214 


Hs.198249 


"Gap junction protein, beta 5 (connexin 


0.79 


1.49 


115091 


AA255900 


Hs.184523 


K1AA0965 protein 


0.72 


1.92 


115123 


AA256642 


Ks.236894 


"ESTs, High sim to LRP1_hu low density 1 


0.59 


1.97 


115291 


AA279943 


Hs.122579 


ESTs 


1 


1.25 


115506 


AA292537 


Hs.45207 


Hypothetical protein KIAA1335 


1.15 


1.48 


115522 


AA331393 


Hs.47378 


ESTs 


0.5 


129 


115536 


AA347193 


Hs.62180 


ESTs 


1 


1 


115697 


AA411502 


Hs.63325 


Homo sapiens type 11 membrane serine pro 


1 


6.53 


115909 


AA436666 


Hs.59761 


ESTs 


1 


6.98 


115978 


AA447522 


Hs.69517 


Differentially expressed in Fanconi anem 


1 


2.31 


116028 


AA452112 


Hs.42644 


thioredoxin-like


0.99 


1.68 


116107 


AA456968 


Hs.92030 


ESTs 


1.14 


1.8 


116134 


AA460246 


Hs.50441 


CGI-04 protein 


1.11 


1.86 


116157 


AA461063 


Hs.44298 


Hypothetical protein 


0.99 


1.9 


116158 


AA461187 


Hs.61762 


Hypoxia-inducible protein 2 

"Homo sapiens cDNA FU10238 fis, clone H 


0.44 


0.86 


116335 


AA495830 


Hs.87013 


0.62 


3.89 


116483 


C14092 


Hs.76118 


Ubiquitin carboxyUermlna) esterase LI 


1.04 


2.36 


117320 


N23239 


Hs.211092 


LUNX protein; PLUNC(pa)ate lung & nasal 


0.51 


0.64 


117557 


N33920 


Hs.44532 


Diubiquitin 


1.11 


2.63 


117693 


N40939 


Hs.1 121 10 


PTD007 protein 


0.98 


1.79 


117881 


N50073 


Hs.260622 


Butyrate-induced transcript 1 


1 


1.43 


118368 



N64339 


Hs.48956 


ESTs 


0.67 


Z86 


118566 


N68558 


Hs.42824 


Hypothetical protein FU10718 


1.21 


0.83 


118695 


N71781 


Hs.50081 


KIAA1199seeCVA7.doc 


0.88 


1.63 


119780 


W72967 


Hs.1 91381 


ESTs; Weakly similar to hypothetical pro 


1 


1 


119845 


W79920 


Hs.58561 


G protein-coupled receptor 87 

•ESTs, Moderately similar to p53 regulat 


1 


1 


120102 


W95428 


Hs.1 32927 


1 


1 


120104 


W95477 


Hs.180479 


ESTs 


0.69 


3.07 


120486 


AA253400 


Hs.1 37569 


Tumor protein 63 kOa with strong homotog 


1.08 


12.05 


120859 


AA350158 


Hs.1 619 


Achaete-scute complex (Drosophila) homo! 


1 


1 


120880 


AA360240 


Hs.97019 


EST 


1 


1 


120948 


AA397822 


Hs.104650 


Hypothetical protein FU10292 


1.04 


2.15 


120983 


AA398209 


Hs.97587 


EST 


1 


1 


121362 


M405500 


Hs.97932 


Chondromodulin 1 precursor 


1 


1 


121369 


AA405657 


Hs.1 28791 


CGI-09 prolein 


1 


1.8 


121791 


AA423978 


Hs.293317 


'ESTs, Weakly similar to JM27 [H.sapiens 


1 


1 


123005 


M479726 


Hs.1 05577 


ESTs 


1 


1 


123044 


AA481549 


Hs.1 30881 


B-ceP CLUlymphoma 1 1 A (zinc finger pro 


0.95 


1.88 


123160 


AA488687 


Hs.284235 


ESTs 


1.59 


4.98 


123479 


AA599469 


Hs.135056 


clone RP5-850E9 on chromosome 20 


1.19 


1.64 


123571 


AA608956 


Hs.1 12619 


"ESTs, Weakly similar to PQ0109 Purkinje 


1.03 


1.14 


123829 


AA62G697 


Hs.1 12208 


XAGE-1 protein 


1.39 


2.2 


124006 


D60302 


Hs.1 08977 


ESTs 


1 


4.85 


124059 


F13673 


Hs.99769 


ESTs 


1.49 


8.62 


124980 


T15386 


Hs.1 94766 


Seizure related gene 6 (mouse)-Iike 


0.76 


0.77 


125218 


W73561 


Hs.1 10024 


NADH:ublquinone oxidoreductase MLRQ subu 


1.33 


1.77 


125453 


R06041 


Hs.1 8048 


"Melanoma antigen, family A, 10" 


0.8 


1.42 


125759 


AA425587 


Hs.82226 


Glycoprotein (transmembrane) nmb 


1.52 


226 


125972 


AA434562 


Hs.35406 


'ESTs, Highly similar to unnamed protein 


1.05 


2.48 


125994 


H55782 


Hs.270799 


EST 


1 


1.95 


126395 


N7Q192 


Hs.278956 


Hypothetical protein FLJ 12929 


1 


1.35 


126645 


A)1 67942 


Hs.61635 


STEAP1 (Homo sapiens BAC clone RG041D1 1 


1 


2.23 


127221 


A1354332 


Hs.72365 


ESTs 


0.73 


3.27 


127479 


AA513722 


Hs.179729 


collagen; type X; alpha 1 (Schmid metaph 


0.51 


1.94 


128192 


A1204246 




K1AA1 085 protein 


1.8 


3.16 


128610 


L38608 


Hs.1 0247 


activated leucocyte cell adhesion moiecu 


0.89 


0.97 


128777 


U46006 


Hs.1 0526 


Cysteine and gtycine-rich protein 2 


1 


1 


128924 


AA234962 


Hs.26557 


PlakophQin 3 

"Solute carrier family 2 (facilitated gl 


1.3 


197 


129041 


H58873 


Hs.1 69902 


0.84 


Z04 


129099 


H50398 


Hs.1 08660 


"ATP-blnding cassette, sub-family C (CFT 


0.87 


1.04 


129404 


AA172056 


Hs,111128 


ESTs 


1 


1 


129466 


L42583 




"Genbank Homo sapiens keratin 6 isoform 


0.72 


12.67 


129605 


S72493 


Hs.1 15947 


Keratin 16 (focal non-epidermolytic palm 


0.92 


1.5 


129628 


U26727 


Hs.1 174 


'CycHn-dependent kinase inhibitor 2A (m 


0.85 


1.93 


130023 


X13461 


Hs.239600 


Calmodulin^ 3 


0.84 


1.22 


130080 


X14850 


Hs.147097 


"H2A histone family, member X" 


0.98 


1.96 


130385 


AA126474 


Hs.1 55223 


stanniocalcin 2 


1 


1 
130410


V01514 


Hs.155421 


Alpha-fetoprotein 


0.63 


0.63 


130441 


U35835 


Hs.301387 


•Human DNA-PK mRNA, partial cds" 


1.15 


3.65 


130482 


132866 


Hs.1578 


Bacuioviral IAP repeal-containing 5 (sur 


1 


1.88 


130553 


AA430032 


Hs.252587 


Pituitary tumor-transforming 1 


0.92 


1.95 


130577 


M35410 


Hs.162 


Insulin-iike growth factor rinding prole 


1.17 


4.7 


130627 


L23808 


Hs.1695 


Matrix metalloproteinase 12 (macrophage


0.69 


4.05 


130800 


AA223386 


Hs.19574 


ESTs; Weakly similar to katanin p80 subu 


1.13 


2.41 


130939 


AA598689 


Hs.21400 


ESTs 


0.8 


0.89 


131046 


X02530 


Hs.2248 


INTERFERON-GAMMA INDUCED PROTEIN PRECURS 0.8 


1.15 


131244 


D38076 


Hs.24763 


RAN binding protein 1 


1.13 


1.85 


131877 


J04088 


Hs.156346 


Topoisomerase (DNA) II alpha (170kD)


1 


1 


131927 


AA461549 


Hs.34780 


"Doublecortex; Hssencephary, X-iinked ( 


0.81 


0.62 


131965 


W90146 


Hs.35962 


ESTs 


0.74 


3.27 


131978 


D80008 


Hs.36232 


KIAA0186 gene product 


1 


1 


132354 


L05187 


Hs.211913 


Small proline-rich protein 1A


0.69 


1.43 


132543 


AM17152 


Hs.51Q1 


ESTs; Highly similar to protein regulati 


0.79 


4.27 


132632 


N59764 


Hs.5398 


guanine-monophosphate synthetase 
"lamtnin gamma2 chain gene (LAMC2), excn 


1 


1.08 


132653 


U31201 


Hs.54451 


1 


1 


132659 


Z75190 


Hs.54481 


low density lipoprotein receptor-relate 
"Serins (or cysteine) proteinase inhibit 


0.89 


0.89 


132710 


W93726 


Hs.55279 


0.64 


4.41 


132758 


W52432 


Hs.56105 


"ESTs, Weakly simBar to WDNM RAT WDNM1 


1.55 


108 


132767 


L05188 


Hs.231622 


Small proline-rich protein 2B


0.83 ^ 


1.66 


132816 


M74542 


Hs.575 


Aldehyde dehydrogenase 3 


0.55 


0.55 


132990 


AA458761 


Hs.18387 


transcription facto/ AP-2 alpha (actfvat 
"A disintegrin and metaDoproteinase dom 


1 


3.53 


133070 


U69611 


Hs.64311 


1.16 


2 


133282 


U52960 


Hs.286145 


"SRB7 (suppressor of RNA polymerase B, y 


1 


Z7 


133317 


AA215299 


Hs.70830 


U6 snRNA-associated Sm-like protein LSm7 


0.95 


1.42 


133370 


M156897 


Hs.72157 


Homo sapiens mRNA; cDNA DKF2p564l1922 


1.12 


2.55 


133391 


X57579 


Hs.727 


Ksapiens activin beta-A subunit (exon 2 


1.65 


1.76 


133832 


H03387 


Hs.241305 


estrogerKesponsive B box protein (EBBP) 
"Serine (or cysteine) proteinase inhibit 


1.02 


1.39 


134032 


Z81326 


Hs.78589 


1 


1 


134168 


AA398908 


Hs.181634 


"Homo sapiens cDNA: FU 23602 fis, done 


0.95 


1.53 


134218 


AA227480 


Hs.80205 


Pin>2 oncogene 
"""collagen, type XI, alpha 1"* 


1.36 


2.48 


134405 


R67275 


Hs.82772 


0.76 


2.86 


134453 


X70683 


Hs.83484 


SRY (sex determining region Y)-box 4 


lls9 


3.78 


134470 


X54942 


Hs.83758 


CDC28 protein kinase 2 


1.82 


4.11 


134545 


U87459 


Hs.167379 


"Cancer/testis antigen (NY-ESO-1, CTAG1, 


0.82 


0.83 


134781 


M17183 


Hs.89625 


Parathyroid hormone-like hormone 


1 


1 


135002 


U19147 


Hs.272484 


G antigen 6 


1 


1 


100040 


M97935 




AFFX control: STAT1 


0.92 


1.25 


101201 


122524 


Hs.2256 


matrix melalloproteinase 7 (matrflysin; 


2.92 


8.5 


101664 


M60752 


Hs.121017 


H2A histone family, member A 


t 


1 


102025 


U03911 


Hs.78934 


mutS (E rain homoloa 2 /colon cancer 



0.8 


1.61 


102031 


U04898 


. Hs.2156 


RAR-rslated orahan rprpDtor A 


1 


1 


102221 


U24576 




UM domain only 4 


1 


• 1 


102270 


U30255 


Hs.75888 


nhfrcnhtYifiifnnafA rfphvrfmfipnasp 


1.08 


1.43 


102339 


U37022 


Hs.95577 


cvniirvdpnpndpnt kinasp A 



0.88 


1.32 


102391 


U41668 


Hs.77494 


dpnwnnprtfVsfnft kin asp 


1.07 


1.58 


103000 


X51956 


Hs.146580 


prtolase 9* /namma: npnronaU 


0.91 


1.49 


103395 


X94754 


Hs.1 19503 


mpthinnlnp-IRMA <;vnthnta<;p 



0.89 


1.32 


105638 


AA281599 


Hs.20418 


Hnmn ^pntanQ mRNA fhr fhr hictnno H9R* r 


0.91 


1.25 


105726 


AA292328 


Hs.9754 


activating transcription factor 5 


0.94 


1.48 


114841 


AA234722 


Hs.55408 


F^TV Mnrkratptv similar tn P.AI Oil IWLDPPF 


f)7R 


1.56 


115206 


AA262491 


Hs.1 86572 


ESTs 


1 


1 


115906 


AA436616 


Hs.82302 


ESTs 


0.74 


2.52 


119132 


R49046 


Hs.107911 


ATP-hinrifnn rnt^pfrP' Qiih-famih/ R /MnR/ 


1.1 


1.51 


124163 


H30539 


Hs.189838 


ESTs 


} 


1 


126487 


AA482505 


Hs.184601 


solute carrier family 7 (cationic amino 


1,01 


1.46 


127141 


AA307960 


Hs.75478 


KIAA0956 nrotetn 


0.85 


1.4 


128034 


AA905754 


Hs.75103 


tvrosinp 3-mnnfiowopnflSfl/hvntenhpn 5-mn 





1.18 


128609 


AA234365 


Hs.102456 


survival of motor neuron Drotein interac 



1 


1.5 


128895 


R37753 


Hs.106985 


ESTs 


1.7 


2 


130199 


Z48579 


Hs.172028 


a disintegrin and metalloprote^e domain 


1 


1 


130524 


U89995 


Hs.159234 


forkhead box E1 


1 


1 


133000 


U24152 


Hs.62402 


p21/Cdc42/Rac1 -activated kinase 1 (yeast 


1 


1 


133658 


M25756 


Hs.75426 


spcreloaranin II fchromonranln CI 





1 


135047 


AA460466 


Hs.93597 


ESTs 


1 


1 


100053 


M27830 




AFFX control: 28S ribosomal RNA 


0.88 


1.53 


100114 


D00596 


Hs.82962 


thymidylate synthetase


0.68 


1.86 


100128 


D11094 


Hs.61153 


proteasome (prosome; macropain) 26S subu 


1.29 


2.03 


100154 


D14657 


Hs.81892 


K1AA0101 gene product 


0.71 


4.26 


100161 


D14694 


Hs.77329 


phosphatidyiserine synthase 1 


1.02 


. 1.56 


100168 


D14874 


Hs.394 


adrenomedullin


0.46 


1.17 


100187 


D17793 


Hs.78183 


aldo-keto reductase family 1 ; member C3 


1 


1 


100188 


D21063 


Hs.57101 


nTinjchromosome maintenance deficient (S. 


0.97 


1.4 


100217 


D266O0 


Hs.89545 


proteasome (prosome; macropain) subunit; 


1.13 


1.9 


100220 


028364 




""Human mRNA for annexin II, 5UTR (seq 


1.11 


1.53 


100287 


D43950 


Hs.1 600 


chaperonin containing TCP1 ; subunit 5 (e 


1.13 


2.09 


\00297 


D49489 


Hs.182429 


protein disulfide isomerase-related prat 


0.92 


1.78 


100330 


D55716 


Hs.77152 


minichromosome maintenance deficient (S. 


1.07 


1.61 


100355 


D78129 




•""Homo sapiens mRNA for squalene epoxid 


0.96 


1.87 


100364 


D78586 


Hs.154868 


carbamoyl-phosphate synthetase 2; aspart 


1.49 


2.46 


100368 


D79987 


Hs.153479 


extra spm<Se poles; S. cerevlsiae; homo 


0.59 


1.32 


100398 


D84557 


Hs.1 55462 


minichromosome maintenance deficient (mi 


1.08 


1.9 


100438 


D87448 


Hs.91417 


topoisomerase (DNA) II binding protein 


1 


Z15 
100455


D87953 


Hs.75789 


N-myc downstream regulated 


0.91 


1.48 


100491 


HG1153-HT1153 


Nucleoside Diphosphate Kinase Nm23-H2s 


0.99 


1.41 


100518 


HG174-HT174 




DesmoplaWn I 
•"Nexin, GDa-Derived"" 
1.28


163 


129703 


AA401348 


Hs.179999 


ESTs 


0.97 


1.63 


129720 


AA476582 


Hs.12152 


ESTs; Moderately similar to SIGNAL RECOG 


1.09 


1.79 


129850 


N20593 


Hs.56845 


GDP dissodalion Inhibitor 2 


0.74 


1.68 


129896 


AA043021 


Hs.13225 


UDP-Gal:betaGlcNAc beta 1 ;4- galactosylt 


1.43 


4.19 


130069 


AA055896 


Hs.146428 


collagen; type V; alpha 1 


1.17 


1.98 


130405 


H88359 


Hs.1 55396 


nuclear factor (erythrold-derived 2)-!ik 


1.26 


1.79 


130541 


X05608 


Hs.211584 


neurofilament; tight polypeptide (68kD) 


1 


1 


130599 


M91670 


Hs.1 74070 


ubiquitin carrier protein 


1.07 


1.66 


130867 


J04093 


Hs.2056 


UDP glycosyltransferase 1 


1 


4.8 


131009 


AA063596 


Hs.22142 


ESTs; Weakly similar to NAOH-CYTOCHROME 


0.93 


1.05 


131028 


U20240 


Hs.2227 


CCAAT/enhancer binding protein (C/EBP); 


1 


1.23 


131083 


U66661 


Hs.22785 


garnma-aminobutyric add (GABA) A recepto 


1.1 


1.8 


131091 


T35341 


Hs.22880 


ESTs; Highly similar to dipeptidyl pepti 


1.28 


1.98 


131144 


C14412 


Hs.23528 


ESTs; Highly simitar to HSPC038 protein 


1.43 


106 


131148 


C00038 


Hs.23579 


ESTs 


0.88 


3.38 


131164 


Y00503 


Hs.1 82265 


keratin 19 


1.19 


177 


131185 


M25753 


Hs.23960 


cyclin B1


0.86 


3.84 


131219 


C00476 


Hs.24395 


small Inducible cytokine subfamily B (Cy 


0.66. 


196 


131454 


AA455896 


Hs.2699 


glypican 1 


0.99 


1.54 


131687 


L11066 


Hs.3069 


heat shock 70kD protein 9B (mortailn-2) 


1 


1.18 


131689 


AA599653 


Hs 30696 


transcription factor-like 5 (basic helix 


1 


1.95 


131692 


050914 


Hs.30736 


KIAA0124 protein 


1.55 


139 


131786 


AA135554 


Hs.32125 


ESTs 


1 


1.33 


131843 


AA195893 


Hs. '184052 


ESTs; Moderately similar to putative Rab 


0.83 


1.63 


131860 


U02082 


Hs.334 


Oncogene TIM 


1.08 


12 


131884 


H90124 


Hs.3463 


ribosomal protein S23 


1.23 


1.24 


131903 


AA481723 


Hs.3436 


deleted In oral cancer (mouse; homolog) 


0.91 


1.18 


131945 


M87339 


Hs.35120 


reolication factor C (activator 1)4 (37 


1 


18 


131958 


AA093998 


Hs.3566 


ESTs; Highly similar to phosphorylation 


0.87 


1.36 


131964 


W42508 


Hs.3593 


ESTs 


1 


1.25 


132001 


J00277 


Hs.37003 


v-Ha-ras Harvey rat sarcoma viral oncoge 


1.12 


1.43 


132040 


AA146843 


Hs.1 72894 


BH3 interacting domain death agonist 


1 


1.55 


132065 


082226 


Hs.211594 


proteasome (prosome; macropain) 26S subu 


0.89 


1.27 


132109 


AA599801 


Hs.40098 


ESTs 


1 


1.05 


132112 


M1 50661 


Hs.40154 


jumonji (mouse) homolog 


0.99 


1.44 


132123 


AA447123 


Hs.2507O5 


ESTs 


1.06 


146 


132162 


H89551 


Hs.41241 


ESTs 


1.08 


146 


132180 


AA405569 


Hs.418 


fibroblast activation protein; alpha; se 


1.02 


4.56 


132309 


M460917 


Hs.2780 


Jun D proto-oncogene 


1.16 


1.8 


132371 


AA235448 


Hs.46677 


ESTs 


0.8 


1.26 


132618 


AA253330 


Hs.5344 


adaptor-related protein complex 1; gamma 


0.5 


1.49 


132736 


U68019 


Hs.211578 


MAO (mothers against decapentaplegic; Dr 


1.21 


1.81 
132771


AA488432 


Hs.56407 


phosphoserine phosphatase


1 


1.3 


132833 


U78525 


Hs.57783 


eukaryotic translation initiation factor 


0.91 


1.43 


132922 


T23641 


Hs.6066 


K1AA1 112 protein 


1.16 


1.53 


132959 


AA028103 


Hs.61472 


ESTs; Weakly similar to unknown [S.cerev 


1.02 


1.88 


132994 


AA505133 


Hs.7594 


solute carrier famHy 2 (facilitated glu 


0.72 


2.97 


133005 


C21400 


Hs.103329 


K1AA0970 protein 


0.88 


1.34 


133055 


X62535 


Hs.172690 


diacylgtycero) kinase; alpha (BOkD) 


0.93 


1.23 


133083 


N70633 


Hs.6456 


chaperonin containing TCP1; subunit 2 (b 


1.14 


1.76 


133086 


L17131 


Hs.139800 


high-mobility group (nonhistone chromoso 


0.97 


1.43 


133134 


T89703 


Hs.65643 


RNA binding mofif protein 8 


1.1 


1.8 


133195 


AA350744 


Hs.181409 


WAA1007 protein 


2.29 


2.69 


133313 


AA249427 


Hs.70704 


ESTs 


1.07 


1.68 


133331 


T62039 


Hs.1 58675 


ribosomal protein L14 


0.85 


1.18 


133438 


D13370 


Hs.73722 


APEX nudease (multifunctional DMA repal 


0.91 


1.45 


133445 


T99303 


Hs.73797 


guanine nucleotide binding protein (G pr 


0.94 


1.68 


133483 


X52426 


Hs.74070 


keratin 13 


0.85 


1.14 


133492 


L40397 


Hs.74137 


transmembrane trafficking protein 


1.1 


1.69 


133504 


W95070 


Hs.74316 


desmoplakin (DPI; DPJI) 


0.7 


a2i 


133517 


X52947 


Hs.74471 


gap junction protein; alpha 1; 43kO (con 


0.95 


1.3 


133540 


D78151 


Hs.74619 


proteasome (prosome; macropain) 26S subu 


0.91 


1.25 


133594 


107758 


Hs.172589 


nuclear phosphoprotein similar to S. cer 


0.84 


1.29 


133627 


U09587 


Hs.75280 


glycyl-tRNA synthetase


1.09 


1.99 


133671 


T25747 


Hs.75471 


zinc finger protein 146 


1.02 


1.5 


133859 


U86782 


Hs.178761 


26S proteasome-assoctated pad1 homolog 


1.11 


3,33 


133865 


F09315 


Hs.170290 


discs; large (Drosophila) homolog 5 


1.84 


6.7 


133913 


W84712 


Hs.7753 


calumenin 


1.15 


1.86 


133963 


L34587 


Hs.184693 


transcription elongation factor B (Sill) 


1.3 


1.91 


133982 


U47621 


Hs.207251 


nucleolar autoantigen (55kD) simitar to 


1.3 


1.99 


134100 


L07540 


Hs.1 71075 


replication factor C (activator 1) 5 (36 


0.72 


1.65 


134110 


U41060 


Hs.79136 


LIV-1 protein; estrogen regulated 


1.04 


1.62 


134158 


U15174 


Hs.79428 


BCL2fedenowrus E1B 19kD-interacting pro 


1 


1.55 


134161 


U97188 


Hs.79440 


IGF-tl mRNA-binding protein 3 


0.82 


1.95 


134193 


F09570 


Hs.7980 


ESTs 


0.98 


1.48 


134367 


X54199 


Hs.82285 


phosphonteylgrytinamide formyltransfer 


1 


2.8 


134402 


U25165 


Hs.82712 


fragile X menial retardation; autosomal 


1.26 


2 


134457 


D86963 


Hs.174044 


dishevelled 3 (homologous to Drosophila 


1 


1.47 


134469 


X17567 


Hs.83753 


small nuclear ribonucleoprotein potypept 


0.94 


1.57 


134498 


M63180 


Hs.84131 ' 


threonyl-tRNA synthetase


1.2 


2.64 


134501 


W84870 


Hs.211568 


eukaryotic translation initiation factor 


0.84 


1.36 


134507 


M63488 


Hs.84318 


replication protein A1 (70kD) 


1.7 


Z93 


134548 


U41515 


Hs.85215 


Deleted In split-hand/spfit-foot 1 regio 


1.46 


2.73 


134599 


X99226 


Hs.86297 


Fanconi anemia; complementation group A 


1.36 


2.22 


134692 


R73567 


Hs.8850 


a disintegrin and metaitoproteinase doma 


0.77 


1.64 


134693 


N70361 


Hs.8854 


ESTs 


1.09 


1.82 


134806 


Z49099 


Hs.89718 


spermine synthase 


0.98 


1.35 


134821 


Z34974 


Hs.198382 


plakophilin 1 (ectodermal dysplasta/skin 


0.99 


1.4 


134864 


Y08999 


Hs.90370 


actin related protein 2/3 complex; subun 


0.95 


1.42 


134914 


U29615 


Hs.91093 


chitinase 1 (chilotriosidase) 


1.16 


1.29 


134953 


L10678 


Hs.91747 


profilin 2 


0.95 


1.76 


134993 


AA282343 


Hs.9242 


purine-rich element binding protein B 


0.98 


1.73 


135051 


C15324 


Hs.93668 


ESTs 


1.35 


2.11 


135158 


U51711 




Human desmocoJlin-2 mRNA; 3* UTR 


0.86 


1.16 



Table 1B shows the accession numbers for those pkeys in Table 1A lacking UniGene IDs. For each probeset we have listed the gene cluster number from which the
oligonucleotides were designed. Gene clusters were compiled using sequences derived from Genbank ESTs and mRNAs. These sequences were clustered based on sequence
similarity using Clustering and Alignment Tools (DoubleTwist, Oakland, California). The Genbank accession numbers for sequences comprising each cluster are listed in the
Accession column.

Pkey: Unique Eos probeset identifier number
CAT number: Gene cluster number
Accession: Genbank accession numbers
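
As an illustrative aid only, the three-column layout described above could be read into a lookup structure along the following lines. This is a sketch under assumptions: it supposes that each row is whitespace-delimited with the Pkey first, the CAT number second, and the accession numbers thereafter; the function names (parse_table_1b_row, index_by_pkey) are hypothetical; and the sample row uses placeholder values that do not appear in Table 1B.

```python
def parse_table_1b_row(line):
    """Split one whitespace-delimited row into Pkey, CAT number, and accessions."""
    fields = line.split()
    if len(fields) < 3:
        raise ValueError("expected a Pkey, a CAT number, and at least one accession")
    return {
        "pkey": int(fields[0]),    # unique probeset identifier number
        "cat": fields[1],          # gene cluster number, kept as text
        "accessions": fields[2:],  # accession numbers comprising the cluster
    }


def index_by_pkey(lines):
    """Build a lookup from Pkey to the parsed row."""
    return {row["pkey"]: row for row in map(parse_table_1b_row, lines)}


# Placeholder row for illustration; these values are not taken from Table 1B.
rows = ["900001 12345_1 AA000001 AA000002 AA000003"]
index = index_by_pkey(rows)
```

The CAT number is kept as text rather than converted to a number because cluster identifiers may carry a suffix.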



Pkey    CAT number    Accessions



100661 23182 J BE623001 L05096 AA383604 AW965416 N53295 AA460213 AW571519 M603655 

100667 26401_3 L05424 X56794 S66400 X55150 W60071 AW351820 X55938 M83326 BE005289 BE070059 M83324 BE005248 BE069717 BE181648 BE069700
AW606203 BE069721 AW382138 AW803776 BE463954 BE005334 BE005274 T27386 AA932714 AA972695 AW377728 AI632506 T29066
AI783934 AW377727 BE163715 AL047291 AA279047 AA523003 BE008048 BE440141 W23614 BE090519 BE092193 N29181 N20358 N44153
BE546944 T69231 AW377441 AA907406 H50799 AW051416 AI420712 BE620922 AI279161 AA992549 W47198 BE005241 AI342696 H50700
AI969974 AI863855 AA374490 AW130675 AI950633 AA146687 H99482 X55150 BE005414 BE005339 N28294 AI673068 AI887890 AW804171
AI675961 AW804172 AA778841 AL048050 AI127757 AI095568 AW204965 AW468978 W31898 AI052595 AI278771 BE464018 AI081503 AI824196
AA513211 AA411062 AW084376 N48752 AA703209 N35580 AW059918 AA054563 AI280942 T27619 BE621435 N66010 AW589527 AI160414
AA283090 AA962536 H82726 W52115 W45432 W60433 AA577548 AA146714 BE150994 AA054615 AW796025 AW382768 BE565671 C00444
AA054555

100668 26401_3 L05424 X56794 S66400 X55150 W60071 AW351820 X55938 M83326 BE005289 BE070059 M83324 BE005248 BE069717 BE181648 BE069700
AW606203 BE069721 AW382138 AW803776 BE463954 BE005334 BE005274 T27386 AA932714 AA972695 AW377728 AI632506 T29066
AI783934 AW377727 BE163715 AL047291 AA279047 AA523003 BE008048 BE440141 W23614 BE090519 BE092193 N29181 N20358 N44153
BE546944 T69231 AW377441 AA907406 H50799 AW051416 AI420712 BE620922 AI279161 AA992549 W47198 BE005241 AI342696 H50700
AI969974 AI863855 AA374490 AW130675 AI950633 AA146687 H99482 X55150 BE005414 BE005339 N28294 AI673068 AI887890 AW804171
AI675961 AW804172 AA778841 AL048050 AI127757 AI095568 AW204965 AW468978 W31898 AI052595 AI278771 BE464018 AI081503 AI824196
AA513211 AA411062 AW084376 N48752 AA703209 N35580 AW059918 AA054563 AI280942 T27619 BE621435 N66010 AW589527 AI160414
AA283090 AA962536 H82726 W52115 W45432 W60433 AA577548 AA146714 BE150994 AA054615 AW796025 AW382768 BE565671 C00444
AA054555

101332 25130_1 J04088 NM_001067 AF071747 AJ011741 N85424 AL042407 AA218572 BE296748 BE083981 AL040877 AW499918 AW675045 H17813
BE081283 AA670403 AW504327 BE094229 AA104024 AI471482 AI970337 AA737616 AI827444 AW003286 AI742333 AI344044 AI765634
AI948838 AW235336 AW172827 AA095289 BE046383 AT734240 W16699 AI660329 AI289433 AA933778 AW469242 AA468838 AA806983
AA625873 W78031 BE206307 AA550803 AI743147 AI990075 AA948274 AA129533 AI635399 AA605313 AI624669 AW594319 AI221834 AI337434
AA307706 BE550282 AI760467 AI630635 AI221521 AW674314 AW078889 AI933732 AI686969 AI186928 AW074595 AI127485 AL079644
AI910815 H17814 AA310903 AW137854 T19279 AA026682 AA308035 AW383390 AW383389 AW383422 AW383427 AW383395 H09977
AA306247 AA352501 AW403639 F05421 AA224473 AA305321 H93904 AA089612 AW391543 AW402915 AW173382 AW402701 AW403113
R94438 N73126 H93466 AA090928 AA095051 T29025 AW951071 L47277 L47276 AI375913 BE384156 W24652 AA746288 AA568223 BE090591
H93033 N57027 AA504348 AA327653 AW959913 N53767 AA843715 AI453437 AW263710 AI076594 AA583483 AW873194 AW575168 AI128799
AI803319 AL042776 AW074313 AI887722 AI032284 AA447521 AI123885 N29334 AI354911 AW090687 AA236763 AA435535 AA236910
AA047124 AA238734 AW514610 H93467 AA962007 AI446783 AA127259 AI613495 AI686720 AI587374 AA936731 AA702453 AI859767
AA216786 AI251819 AI469227 AA806022 AI092324 N71868 AA988782 AA236919 AA809450 AA227220 AA765284 AI192007 AA768810
AA805794 AA729280 AA805238 AW768817 N71879 AI050686 AA505822 AA668974 AI688160 BE045915 AW466315 AA731314 AA649568
AA834316 AW591901 AW083876 AW294770 AI300266 AI336094 AI560380 AA721755 H09978 D20305 D29155 AW821790 BE150864 F01675
AI457474 AW466316 AA550969 AA630788

100780 458_127 BE561958 BE561728 BE397612 BE514391 BE269037 BE514207 BE562381 BE514256 BE514403 BE514250 BE397832 BE289598 BE559865
BE396881 BE560031 BE514199 BE560037 BE560454 

100830 4002_1 AC004770 W05005 AA356058 AA094281 H29358 T56781 AW875313 L37374 BE312466 BE311755 BE207106 BE293320 BE018115 AW239090
BE548830 AW247547 AA776062 BE397382 AA486713 T10111 T09340 AW498981 BE547280 AA356003 AW581520 AW875331 AA580720
AW875336 BE276873 BE408229 AW188148 BE255166 BE253761 AW793727 AW373141 AW581548 AA471223 AA305950 BE263976 AA626820
BE257409 AW360962 AA090855 C00312 BE312741 BE407213 AA209352 AW298199 AW248553 AW297794 AW731722 BE300586 AW731972
AW615446 BE301599 AW615520 AA486714 AW440257 AA195516 AA564630 AA618079 AW192592 AW474985 AA604580 AI627461 AA765440
AI680394 AL135648 AI683224 AI581126 AW245096 AW194154 H29274 N70363 AA629758 AA580602 AA862006 AI863841 AI097667 AI928583
AI358774 BE243487 AA620553 AA653297 AA292690 T10110 Z38905 AA908544 AA340930 AI185438 T03328 T28844 AI687010 AI864965
AI872575 BE388740 T56780 AW373138 BE258717 AA699671

100906 4312_1 AU076916 BE298110 AW239395 AW672700 NM_003875 U10860 AW651755 BE297958 C03806 AI795876 AA644165 T36030 AW392852
AA446421 AW881866 AI469428 BE548103 T96204 R94457 N78225 AI564549 AW004984 AW780423 AW675448 AW087890 AA971454 AA305698
AA879433 AA535069 AI394371 AA928053 AI378367 N59764 AI364000 AI431285 T81090 AW674657 AW674987 AA897396 AW673412 BE063175
AW674408 AI202011 R00723 AI753769 AI460161 AW079585 AW275744 AI873729 D25791 BE537646 T81139 R00722

100930 16865_1 J04129 NM_002571 AA293088 AA477016 AA404631 T28299 AA476904 AA433965 AA436486 AA495907 AI151391 AA291495 AA402723 W25651
AA706816 AI826712 AW296294 AA293479 AI276581 AW044154 AI080180 AI417985 AI274168 AI474212 AA495908 AA635664 AI092114
AI804952 AA479874 AI597661 AI420511 AA479738 AA421417 AA421247 AA436220 AL047797 M34046 N42277 AA228076 W02698 AI420297
AA434011 AI369971 AA479731 AI865541 AI418020 AA421246 AA452764 AL048051

102221 3861_1 NM_006769 U24576 AW161961 AW160473 AW160465 AW160472 AW161069 AI824831 AW162635 AI990356 AW162477 AW162571 AI520838
AW162352 AW162351 AW162752 AI962216 AI537346 AA853902 H17667 BE045346 BE559802 BE255391 AA985217 AA235051 AI129757
AW366451 T34489 D56106 D56351 AI936579 AW023219 AW889335 AW889120 AW889232 AW889175 BE093702 AW889349 AA147546
AI952998 AA912579 AI143356 AW902211 R64717 AW157236 AI815242 D45274 AW263991 AA442920 AA129965 AL035713 AI923255 AI949082
AI142826 AI684160 AI701987 AI678954 AI827349 BE453635 AW628092 AW302281 AA493203 BE348856 BE536419 AW193969 AW673561
AW592609 AI224044 H43943 AA091912 R49632 R48353 AI568409 R48256 AI198046 H27986 H43899 AI678759 AI680310 AI624220 H17052
AA156410 N56062 AI699430 AA664529 T09406 T10459 AA627506 AI379584 N83831 N88633 AW022651 AA971281 AA248036 AI039197
AI914689 AA973825 AL047305 AA129966 AI798369 AW264348 AI445879 AI658759 N67924 AI933507 AI216121 AI333174 T10972 AI375028
AI185756 AI273778 AA610487 AI797946 AA853903 AA903939 AI338587 AI278494 AW627595 AA904019

101809 32963_1 M86849 AA315280 NM_004004 AA315269 BE142653 AA461400 AW802042 BE152893 AW383155 AA490688 AW117930 AW384563 AW384544
AW384566 AW378307 AW378323 AW839085 AA257102 AW378317 AW276060 AW271245 AW378298 AW384497 AI598114 AW264544 AI018136
AW021810 AA961504 AW086214 AW771489 AW192483 AI290266 AW192488 AW384490 AW007451 AW890895 AA554460 AA613715
AW020066 AI783695 AJ589498 AI917637 AW264471 AW384491 AI816732 AW368530 AW368521 AW368463 AA461087 AI341438 AI970613
AI040737 AI418400 AA947181 AA962716 AI280695 AW769275 AW023591 AI160977 AA055400 N71882 AA490466 AW243772 AW316636
AI076554 AW511702 N69323 H88912 AA257017 AJ952506 H88913 AI912481 AA600714 BE465701 N64149 C00523 N64240 AA677120

102590 15932_1 R61573 BE005029 X98091 AA297307 BE537267 BE566138 BE566139 F11561 BE564795 BE568776 AW064005 BE566479 BE380035 BE567012
BE568634 BE566568 AA298060 BE566043 BE568813 BE568618 AA283070 BE565414 BE566738 BE568585 BE565667 BE566116 BE566433
U62136 AF049140 BE567057 BE567297 BE567403 BE564316 BE567400 BE568854 BE566588 AA448772 AA071363 AW732642 BE564996
AA297763 AA278550 AA421083 AA298184 AA091007 AA984577 AA205916 N28759 AL031291 C15757 C15761 H02728 BE566410 AA129335
AA419499 N87741 BE379689 BE004824 BE379611 D25874 AA148454 AA323654 AW950311 AA448795 AW749423 AA773386 AA773843
AW020327 BE348580 BE504258 BE549990 BE220200 AI673334 AI202679 AA975515 D61421 AI168688 AA102843 AW246821 AI276203
AI074054 AI633824 AI962927 AI148925 N50969 AI308911 AA410994 AW373025 AA148455 H02620 AA688293 AI246318 N22220 AI917777
AI050943 AI097286 AA663794 AW368662 AW627826 AW078734 AI253060 AA749154 AA832236 AJ192358 AW024676 AA448676 AA764891
BE439467 AA661534 AA258061 AI090546 AA995157 AI051011 AA584421 AI026032 AW591338 AW589563 AA776914 AW024684 AA421002
F09219 BE464500 AI383595 AA954244 AA601583 AA737304 AA195549 AA805778 AI055876 AA164942 AW013961 AI672608 AW514211 D59441
AW582574 AA160935 BE566501 BE564612 BE565353 BE568195 BE565447 BE568302 BE566097 BE565470 BE564249 AL036217 AW749424
BE567494 AA102842 AA314761 AV661237 C14211 AA651866 AW798997 AA470605

101977 29073_1 AF112213 AL050318 T24804 AW248136 BE386341 BE263177 W16677 BE250224 BE563669 BE267405 BE546577 AV651354 AV651292
AI346903 AI539128 AI189171 S83364 AW073849 AI816760 AW073309 AI422690 AA296692 AI860301 AI805446 N77735 AI340328 BE092530
AW028742 BE088442 AA657742 AA742438 AW170086 AI038920 AI432379 N36073 AI936194 AA868655 AA983612 AI077505 BE080433
AI375014 AI126547 AJ348244 AI346077 AI748952 N26915 AI753574 AI093341 AI278762 BE092517 N74204 H08158 T58149 AI129303 N58366
AA524456 BE122661 AA542925 AI246120 AI735203 AA706829 AA877544 AI082289 AA926687 N92840 AW249798 AA934763 AW998363
AI128632 N25202 AI240209 AW118892 N80744 R35655 AI342321 AI340141 AW878792 AI857321 H09610 W04601 AW006650 AA126006
AA553675 AI052791 AW059835 AI041906 AA814658 AW002059 AA729483 AI609301 AA994633 AA903651 AI459183 T95072 AW088630
AA126112 AI800091 AI561215 H17502 AW475072 AI819003 AI683272 AI262701 AW793140 T81787 R99586 AI275160 AI310420 AI698929
AA159174 AJ827968 F30305 F30309 AA806662 AI091923 AW878722 AA583430 AW571913 AI674584 AA292533 AI079471 AA642325 AA719050
AW793172 AA305476 AW103745 T23459 N79525 AI784438 AA534551 AW193751 AI074360 BE281214 T32229 W25066 W01205 T63086
AW795348 AI361287 AW795353 AW795349 AA594759 AI400295 D11489 AI370689 AA482356 AA485295 W40151 AA564661 AW300745

Table 2A shows 504 genes down-regulated in lung tumors relative to normal lung and chronically diseased lung. Chronically diseased lung samples represent chronic non-malignant
lung diseases such as fibrosis, emphysema, and bronchitis. These genes were selected from 59680 probesets on the Eos/Affymetrix Hu03 Genechip array. Gene
expression data for each probeset obtained from this analysis was expressed as average intensity (AI), a normalized value reflecting the relative level of mRNA expression.

Pkey: Unique Eos probeset identifier number

ExAccn: Exemplar Accession number, Genbank accession number

UnigeneID: Unigene number

Unigene Title: Unigene gene title

R1: 90th percentile of AI for normal lung samples divided by the 80th percentile of AI for adenocarcinoma and squamous cell carcinoma lung tumor samples.

R2: median of AI for normal lung samples divided by the 90th percentile of AI for adenocarcinoma and squamous cell carcinoma lung tumor samples.

R3: median of AI for normal lung samples minus the 15th percentile of AI for all normal lung, chronically diseased lung and tumor samples, divided by the 90th percentile of AI for adenocarcinoma and squamous cell carcinoma lung tumor samples minus the 15th percentile of AI for all normal lung, chronically diseased lung and tumor samples.

R4: average of AI for normal lung samples divided by average AI for squamous cell carcinoma and adenocarcinoma lung tumors.

R5: median of AI for normal lung samples divided by the 90th percentile of AI for adenocarcinomas.

R6: median of AI for normal lung samples minus the 15th percentile of AI for all normal lung, chronically diseased lung and tumor samples, divided by the 90th percentile of AI for adenocarcinomas minus the 15th percentile of AI for all normal lung, chronically diseased lung and tumor samples.

R7: average of AI for normal lung samples divided by the 90th percentile of AI for squamous cell carcinomas.

R8: median of AI for normal lung samples minus the 15th percentile of AI for all normal lung, chronically diseased lung and tumor samples, divided by the 90th percentile of AI for squamous cell carcinomas minus the 15th percentile of AI for all normal lung, chronically diseased lung and tumor samples.
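
The plain ratios in the legend above (R1, R2 and R4) can be sketched in a few lines. This is a minimal illustration, not the procedure actually used to build the table: the function names, the sample AI values and the percentile convention (linear interpolation between order statistics) are all assumptions, since the text does not state how percentiles were computed.

```python
from statistics import mean, median


def percentile(values, pct):
    """Return the pct-th percentile using linear interpolation.

    The interpolation rule is an assumption; the source does not
    specify a percentile convention.
    """
    if not values:
        raise ValueError("values must not be empty")
    ordered = sorted(values)
    rank = (len(ordered) - 1) * pct / 100.0
    lower = int(rank)
    upper = min(lower + 1, len(ordered) - 1)
    return ordered[lower] + (ordered[upper] - ordered[lower]) * (rank - lower)


def simple_ratios(normal_ai, tumor_ai):
    """Compute R1, R2 and R4 for one probeset from its AI values."""
    return {
        # 90th percentile of normal over 80th percentile of tumor
        "R1": percentile(normal_ai, 90) / percentile(tumor_ai, 80),
        # median of normal over 90th percentile of tumor
        "R2": median(normal_ai) / percentile(tumor_ai, 90),
        # average of normal over average of tumor
        "R4": mean(normal_ai) / mean(tumor_ai),
    }


# Made-up AI values for a single probeset, for illustration only.
normal = [100.0, 200.0, 300.0, 400.0, 500.0]
tumor = [10.0, 20.0, 30.0, 40.0, 50.0]
ratios = simple_ratios(normal, tumor)
print(ratios)
```

Here `tumor_ai` stands for the pooled adenocarcinoma and squamous cell carcinoma samples. A ratio well above 1 indicates lower expression in tumor than in normal lung for that probeset.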



Pkey 


ExAccn 


UnigeneID


Unigene Title 


100095 


Z97171 


Hs.78454 


myocilin; trabecular meshwork inducible


100115


NM_002084


Hs.336920


glutathione peroxidase 3 (plasma)


100138 


U83508 


Hs.2463 


angiopoietin 1


100299 


D49493 


Hs.2171 


growth differentiation factor 10 


100306 


U86749 


Hs.80598 


transcription elongation factor A (SII);


100447 


NM_014767


Hs.74583 


KIAA0275 gene product 


100458 


S74019 


Hs.247979 


Vpre-B 


100862 


AA005247 


Hs.285754 


Hepatocyte Growth Factor Receptor 


100959 


AA359129 


Hs.118127


actin; alpha; cardiac muscle 


101032 


BE206854 


Hs.46039 


phosphoglycerate mutase 2 (muscle)


101081 


AF047347 


Hs.4880 


amyloid beta (A4) precursor protein-bind 


101088 


X70697 


Hs.553 


solute carrier family 6 (neurotransmitte


101125 


AJ250562 


Hs.82749 


transmembrane 4 superfamily member 2 


101180 


U11874 


Hs.846 


interleukin 8 receptor; beta


101308 


L41390




"Homo sapiens core 2 beta-1,6-N-acetylgl 


101330 


L43821 


Hs.80261


enhancer of filamentation 1 (cas-like do


101345 


NMJ06795 


Hs.152175 


Calcitonin receptor-like 


101346 


AI738616


Hs.77348 


hydroxyprostaglandin dehydrogenase 15-(N 


101397 


M26380 


Hs.180878 


lipoprotein lipase 


101414 


NM_000066


Hs.38069 


complement component 8; beta polypeptide 


101435 


NM_001100 


Hs.1288 


actin; alpha 1; skeletal muscle 


101507 


X16896 


Hs.82112 


interleukin 1 receptor; type I


101530 


M29874 


Hs.1360 


cytochrome P450; subfamily IIB (phenobar 


101537 


AI469059 


Hs.184915 


zinc finger protein; Y-linked


101542 


NM_000102


Hs.1363 


cytochrome P450; subfamily XVII (steroid 


101545 


BE246154 


Hs.154210


EDG1; endothelial differentiation, sphin 


101554 


BE207611 


Hs.123078 


thyroid stimulating hormone receptor 


101560 


AW958272 


Hs.83733 


Intercellular adhesion molecule 2, exon 


101574 


M34182 


Hs.158029 


protein kinase; cAMP-dependent; catalyti 


101605 


M37984 


Hs.118845


troponin C; slow 


101621


BE391804 


Hs.62661 


guanylate binding protein 1; interferon- 


101680 


AA299330 


Hs.1042 


Sjogren syndrome antigen A1 (52kD; ribon 


101829 


AW452398 


Hs.129763


solute carrier family 8 (sodium/calcium 


101842 


M93221 


Hs.75182 


mannose receptor; C type 1 


101961 


AW004056 


Hs.168357 


"Hs-TBX2=T-box gene {T -box region} [huma 


101994 


T92248 


Hs.2240 


uteroglobin 


102020 


AU077315 


Hs.154970 


transcription factor CP2


102091 


BE260901 


Hs.83155 


aldehyde dehydrogenase 7 


102112 


AW025430 


Hs.155591 


forkhead box F1 


102190 


AA723157 


Hs.73769 


folate receptor 1 (adult) 


102202 


NM_000507


Hs.574 


fructose-bisphosphatase 1 


102241 


NM_007351


Hs.268107 


Multimerin 


102310 


U33839 




Accession not listed in Genbank 


102397 


U41898 




'Human sodium cotransporter RKST1 mRNA, 


102571 


U60115 


Hs.239069 


"Homo sapiens skeletal muscle LIM-protei 


102620 


AA976427 


Hs.121513 


Human clone W2-6 mRNA from chromosome X 


102636 
AW836591

T32982 

AL110151 

H00083 

BE242256 

D78841 

AW629054 

AL040137 

AA648284 

AW962712 

AA760776 

AW002846 

AA594027 

X80031 

AW976035 

AA774725

AW340720 

AA905730 

NMJM2214 

W27411 

W68800 

N76462 

AI265784 

AA305407

AA464918

AW271273 

AI311238 

NM_004131

W26667 

H39537 

R25513 

AF134803 

W04245 

AA744902 

AW156903 

AW580945 



Hs.300670 

Hs.104864 

Hs.98467 

Hs.98463 

Hs.98610 

Hs.98724 

Hs.98746 

Hs.98849 

Hs.98872 

Hs.98890 

Hs.104936 

Hs.98907 

Hs.121602 

Hs.99081 

Hs.160318 

Hs.98321 

Hs.99489 

Hs.5120 

Hs.105328 

Hs.211869 

Hs.190046 

Hs.194024 

Hs.102679 

Hs.112497

Hs.112724

Hs.112795

Hs.112893 

Hs.120017 

Hs.241519 

Hs.95497 

Hs.107149 

Hs.124685

Hs.108135 

Hs.190266 

Hs.270962 

Hs.10299 

Hs.102415 

Hs.107197 

Hs.285625 

Hs.109284 

Hs.127436

Hs.137274

Hs.26530 

Hs.304389 

Hs.127826 

Hs.335774 

Hs.103118 

Hs.103156 

Hs.102720 

Hs.128797 

Hs.2441 

Hs.125976 

Hs.23964 

Hs.187584 

Hs.126712 

Hs.293977 

Hs.105468 

Hs.191788 

Hs.530 

Hs.292396 

Hs.128970 

Hs.125983 

Hs.128254 

Hs.177576 

Hs.336920 

Hs.12126

Hs.269152 

Hs.145197 

Hs.102308 

Hs.23767 

Hs.104476 

Hs.1051 

Hs.184581 

Hs.75309 

Hs.10683 

Hs.180141 

Hs.107761 

Hs.107767 

Hs.108124 

Hs.330466 



ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

EST 

ESTs 

EST 

ESTs; Moderately similar to bithoraxoid-

EST 

EST 

ESTs; Weakly similar to hypothetical pro

EST 

EST 

ESTs 

phospholemman

Homo sapiens cDNA FLJ14103 fis, clone MA

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

EST 

Homo sapiens clone 24760 mRNA sequence 
ESTs 

olfactory receptor; family 7; subfamily 
ESTs 

ESTs; Weakly similar to GLUCOSE TRANSPOR 

ESTs; Weakly similar to PTB-ASSOCIATED S

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

'yw35g11.s1 Morton Fetal Cochlea Homo sa 
ESTs 

TATA box binding protein (TBP)-associate
ESTs 

calcium channel; voltage-dependent; alph 

ESTs; Weakly similar to SPLICEOSOME ASSO

serum deprivation response {phosphatidyl 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

DKFZP586D0824 protein 

aryl hydrocarbon receptor-interacting pr

KIAA0022 gene product 

HUM525A05B Human placenta polyA+ (TFuji

ESTs; Weakly similar to metalloprotease/

ESTs; Weakly similar to HC1 ORF (M.muscu 

ESTs 

ESTs; Weakly similar to plL2 hypothetica 

aa59b04.s1 NCI_CGAP_GCB1 Homo sapiens c

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

mannosyl (alpha-1;3-)-glycoprotein beta- 
glutathione peroxidase 3 (plasma) 
ESTs; Weakly similar to LR8 [H.sapiens] 
ESTs; Weakly similar to ZINC FINGER PROT 
ESTs 

potassium inwardly-rectifying channel; s 
ESTs; Moderately similar to !!!! ALU SUB
ESTs 
ESTs 

granzyme B (granzyme 2; cytotoxic T-lymp
ESTs

eukaryotic translation elongation factor
ESTs 

cofilin 2 (muscle) 

ESTs; Weakly similar to PUTATIVE RHO/RAC 
ESTs; Moderately similar to CaM-KII inhi 
ribosomal protein L41 
ESTs 



2.34
2.61 



34.40 

49.40 
29.80 



30.80 



9.00 



15.00 
6.67 



31.40 



39.80 



2.65 



35.80 



35.40 
58.00 



29.00 
28.40 



17.00 



59.20 



2.63 



2.35

3.07 
2.48 

2.50 



8.00 



29.00 
32.20 



35.80 

39.60 
28.80 



12.00 



9.00 



7.33 



10.00 



2.36 



3.09 



31.20 



32.20 
41.80 



82.80 

32.40 
70.60 

29.40 

30.80 

37.60 

34.20 

33.60 
29.80 

34.40 



29.40 
37.20 

38.40 



34.40 



41.60 
87.00 



19.00 
2.25
2.92 



3.58 



3.60 
3.14 
2.58



3.95 



13.00 



4.74 



3.14 



6.03 



10.00 



3.12 



2.79 



4.42 
2.58 

4.31 



2.66 



9.00 



11.00 



3.10 



3.17 



3.21 



4.02 
3.76 

3.68 



34.60 
129210
129240 
129262 
129301 
129331 
129381 
129555 
129595 
129513 
129782 
129950 
129958 
129959 
130160 
130259 
130273 
130312 
130436 
130523 
130799 
130885 
131002 
131012 
131031 
131061 
131066 
131082 
131087 
131161 
131179 
131182 
131205 
131277 
131281 
131282 
131285 
131355 
131391 
131461 
131487 
131517 
131545 
131583 
131647 
131675 
131676 
131708 
131717 
131756 
131762 
131821 
131839 
131861 
132015 
132070 
132242 
132334 
132476 
132490 
132533 
132598 
132619 
132652 
132726 
133028 
133071 
133120 
133129 
133147 
133151 
133213 
133276 
133377 
133407 
133535 
133537 
133656 
133689 
133779 
133978 
133985 
134000 
134111 
134185 
134204 



AL039940 

AA361258 

BE222198 

AF182277 

AW167668 

AW245805

X77777 

U09550 

AW978517 

AW016932 

F07783 

R27496 

AL036554 

AA305688 

NM_000328

AW972422 

AF056195 

NM_001928

AA999702

AB028945 

NM_005883

AL050295 

AL039940 

NM_001650

N64328 

AW169287 

AI091121 

AF147709 

AF033382 

AA171388 

AI824144 

NM_003102

AA131466 

AA251716 

X03350 

AI567943

R52804 

AW085781 

AA992841 

F13038 

AB037789 

AL137432 

AK000383 

AA359615 

H15205 

AI126821

S60415 

X94630 

AA443966 

AA744902 

AA017247 

AB014533 

AL096858

AI418006 

BE622641 

AA332697 

AW080704 

AL119844

NM_001290

AI922988 

X80031 

H28855 

N41739 

M52298 

R51604 

BE384932 

NM_003278 

AA428580 

AA026533 

NM_014051

AA903424

AW978439 

AJ131245 

AF017987 

AL134030 

U41518 

BE149455

NM_001872

T58486

AF035718 

L34657 

AW175787 

AI372588

AA285136

AI873257



Hs.202949 

Hs.237868 

Hs.109843 

Hs.330780 

Hs.279772 

Hs.110903

Hs.198726 

Hs.1154 

Hs.172847

Hs.104105 

Hs.1369 

Hs.1378 

Hs.274463 

Hs.267695 

Hs.153614 

Hs.153863

Hs.15430 

Hs.155597 

Hs.214507 

Hs.12696 

Hs.20912 

Hs.22039 

Hs.202949 

Hs.288650 

Hs.268744 

Hs.22588 

Hs.246218 

Hs.22824 

Hs.23735 

Hs.184482

Hs.23912 

Hs.2420 

Hs.23767 

Hs.25227 

Hs.4 

Hs.25274 

Hs.25956 

Hs.26270 

Hs.27263 

Hs.27373 

Hs.263395 

Hs.28564 

Hs.323092 

Hs.30089 

Hs.30509 

Hs.30514 

Hs.30941 

Hs.3107 

Hs.31595 

Hs.107767 

Hs.164577 

Hs.33010 

Hs.184245 

Hs.3731 

Hs.38489 

Hs.42721 

Hs.45033 

Hs.49476 

Hs.4980 

Hs.172510 

Hs.530 

Hs.53447 

Hs.61260 

Hs.55608 

Hs.300842 

Hs.64313 

Hs.65424 

Hs.65551 

Hs.66 

Hs.94896 

Hs.6786 

Hs.69504 

Hs.7239 

Hs.7306 

Hs.284180 

Hs.74602 

Hs.75415 

Hs.75572 

Hs.222566 

Hs.78051 

Hs.78146 

Hs.334841 

Hs.8022 

Hs.301914 

Hs.7994 



KIAA1102 protein

interleukin 7 receptor 2.2}
ESTs 

Human cytochrome P450-IIB (hIIB3) mRNA;
ESTs; Highly similar to CGI-38 protein ( 

claudin 5 (transmembrane protein deleted 2.93

vasoactive intestinal peptide receptor 1 1 60.80

oviductal glycoprotein 1; 120kD 10.00

ESTs; Weakly similar to collagen alpha 1 

EST 9.00 

decay accelerating factor for complement 87.80

annexin A3 44.60

defensin; alpha 1; myeloid-related seque 2.72

UDP-Gal:betaGlcNAc beta 1;3-galactosyltr 42.20

retinitis pigmentosa GTPase regulator 2.54

MAD (mothers against decapentaplegic; Dr 51.60

DKFZP586G1219 protein 

D component of complement (adipsin) 

ESTs 

ESTs 8.00

adenomatous polyposis coli like
KIAA0758 protein

KIAA1102 protein 20.00
aquaporin 4 41.20

ESTs; Moderately similar to KIAA0273 [H. 31.40 

ESTs 29.60 

ESTs; Weakly similar to zinc finger prot 9.00 

ESTs; Weakly similar to p160 myb-binding 

potassium voltage-gated channel; subfami

DKFZP586D0624 protein 

ESTs 

superoxide dismutase 3; extracellular 2.98

ESTs 3.15 

ESTs 32.20 

alcohol dehydrogenase 3 (class I); gamma

ESTs; Moderately similar to putative sev 

DKFZP564D206 protein 8.00

ESTs 10.00 

butyrate response factor 2 (EGF-response 28.80 

Homo sapiens mRNA; cDNA DKFZp56401763 (f

ESTs; Highly similar to semaphorin VIa [ 39.00

ESTs 11.00
ESTs; Weakly similar to dual specificity 10.00

ESTs 2.47 
ESTs 

ESTs 45.80 
calcium channel; voltage-dependent; beta 2.28 
CD97 antigen 
ESTs 

ESTs; Moderately similar to CaM-KII inhi

ESTs 2.87 
KIAA0633 protein 

KIAA0929 protein Msx2 interacting nuclea 54.00
ESTs 
ESTs 

ESTs 2.68 
lacrimal proline rich protein 4.66 
Homo sapiens clone TUA8 Cri-du-chat regi 34.20
LIM binding domain 2 2.66 
ESTs 13.00 
collagen; type IV; alpha 3 (Goodpasture 
ESTs; Moderately similar to kinesin ligh
ESTs

ESTs; Weakly similar to cDNA EST yk484g1 11.43

ESTs 2.37 

ESTs 2.27 

tetranectin (plasminogen-binding protein 2.83

ESTs

interleukin 1 receptor-like 1 6.20

ESTs 

ESTs 31.40 
ESTs 9.00 
SEC24 (S. cerevisiae) related gene fami 41.20

secreted frizzled-related protein 1 50.20
protocadherin 2 (cadherin-like 2)
aquaporin 1 (channel-forming integral pr

Accession not listed in Genbank 2.65

carboxypeptidase B2 (plasma) 90.80

ESTs 

transcription factor 21 2.92

platelet/endothelial cell adhesion molec 
selenium binding protein 1 

TU3A protein 4.49 
Homo sapiens mRNA; cDNA DKFZp586K1220 (f

ESTs; Weakly similar to CGI-69 protein ( 40.80
3.30



3.40 



4.05 
4.09 



3.16 
4.77 
3.54 



3.14 
3.80 



6.40 



4.11 
3.50 

3.86 
3.67 

3.44 



4.03 



3.06 



40.60 



49.20 
34.80 



30.60 



3.78 
3.67 



3.48 



4.02 
3.18 



5.49 



3.69 



3.72 



3.05 



3.35 



3.45 
4.05 
134641 AI092634 Hs.156114

134677 AA251363 Hs.177711 

134745 NM_000685 Hs.89472

134749 T28499 Hs.89485 

134786 T29618 

134825 U33749 

134978 AI829008 

135010 N50465 

135053 AW795190 Hs.93678 

135081 AF069517 Hs.173993 

135091 AA493650 

135135 AA775910 

135203 C15737 

135236 AI636208 

135266 R41179 

135346 NM_000928 Hs.992

135378 AW961818 Hs.24379 

135387 NM_001972 Hs.99863

135388 W27965 Hs.99865 
135402 L12398 Hs.99922 



Hs.89640 
Hs.197764 
Hs.333383 
Hs.92927 



Hs.94367 

Hs.95011 

Hs.269386 

Hs.96901 

Hs.97393 



protein tyrosine phosphatase; non-recept 
ESTs 

angiotensin receptor 1 B 

carbonic anhydrase IV 

angiopoietin 1 receptor; TEK tyrosine ki

thyroid transcription factor 1

ficolin (collagen/fibrinogen domain-cont

ESTs 

ESTs 

RNA binding motif protein 6 
ESTs 

syntrophin; beta 1 (dystrophin-associate

ESTs

ESTs 

Human mRNA for WAA03ffl gene; partial cd 
phospholipase A2; group IB (pancreas)
potassium voltage-gated channel; shaker- 
elastase 2; neutrophil 
EST 

dopamine receptor D4



15.00 



3.05 
2.52 



32.20 
57.80 
31.60 



28.80 



43.00 



37.20 
38.80 



8.00 



182 
4.15 
3.76



3.73 



3.21 



4.31 



4.24 



6.42 



4.21 



Table 2B shows the accession numbers for those primekeys lacking UnigeneIDs for Table 2A. For each probeset we have listed the gene cluster number from which the
oligonucleotides were designed. Gene clusters were compiled using sequences derived from Genbank ESTs and mRNAs. These sequences were clustered based on sequence
similarity using Clustering and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers for sequences comprising each cluster are listed in the
"Accession" column.

Pkey: Unique Eos probeset identifier number
CAT number: Gene cluster number
Accession: Genbank accession numbers



Pkey 

108447 
108550 
108655 
102397 
126303 
125810 
103627 
121366 
114609 
115272 
108338 
108434 
123802 
102310 
102636 
104776 
120504 
113502 
108499 
101308 
108629 
103098 
103241 
103508 
103575 
119514 
121082 
128634 
105817 
121518 
114449 
114648 
121950 
107723 



CAT number Accessions 



43452_-7

120073_1

12752_1

4437L-1

1525933_1

1554054_1

2615_2

280401_1

116777_1

172113_1

112186_1

114012_1



AA079126

AA084857 AA084996
AA099960 AA113013
U41898

D78841 D78880
H00083 R81062
Z48513 Z48512

AI743515 AA405617 AW276705
AA079505 AA079537
AW015947 AA211890 AA279425
AA070773 AA070774
AA078899 AA078782 AA075788
genbank_AA620448 AA620448
NOT_FOUND_entrez_U33839
entrez_U67092 U67092
genbank_AA026349
genbank_AA256837
genbank_T89130 T89130
genbank_AA083103
entrez_L41390 L41390
genbank_AA102425
221 J15
entrez_X76223
entrez_Y10141 Y10141
entrez_Z26256 Z26256
NOT_FOUND_entrez_W37937 W37937
genbank_AA398722 AA398722
AA464918_at AA464918
genbank_AA397825 AA397825
genbank_AA412155 AA412155
genbank_AA020736 AA020736
genbank_AA101056 AA101056
genbank_AA429515 AA429515
genbank_AA015967 AA015967



U33839 

AA026349 
AA256837 

AA083103 

AA102425 

M86361 Z26593 X02850 D13070 AE000659 M17649 M87869 M87871 X61077 M16286 AF018169 X61079 S59351 X60142 AF043169
X76223 

Table 3A shows 452 genes up-regulated in chronically diseased lung relative to normal lung. Chronically diseased lung samples represent chronic non-malignant lung diseases
such as fibrosis, emphysema, and bronchitis. These genes were selected from 59860 probesets on the Eos/Affymetrix Hu03 Genechip array. Gene expression data for each
probeset obtained from this analysis was expressed as average intensity (AI), a normalized value reflecting the relative level of mRNA expression.

Pkey: Unique Eos probeset identifier number

ExAccn: Exemplar Accession number, Genbank accession number

UnigeneID: Unigene number

Unigene Title: Unigene gene title

R1: 80th percentile of AI for chronically diseased lung samples divided by the SOlh percentile of AI for normal lung samples.

R2: 80th percentile of AI for chronically diseased lung samples divided by the 90th percentile of normal lung samples, squamous cell carcinomas and adenocarcinomas.

R3: 70th percentile of AI for chronically diseased lung samples minus the 15th percentile of AI for all normal lung, chronically diseased lung and tumor samples, divided by the 90th percentile of normal lung samples, squamous cell carcinomas and adenocarcinomas minus the 15th percentile of AI for all normal lung, chronically diseased lung and tumor samples.
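
The background-subtracted ratio R3 defined above removes a common floor (the 15th percentile of AI over all samples) from both numerator and denominator before dividing. The following sketch shows that arithmetic for a single probeset. It is an illustration under assumptions: the function names and sample values are invented, and the percentile rule (the "inclusive" linear interpolation of Python's `statistics.quantiles`) is a choice made here, not one stated in the text.

```python
from statistics import quantiles


def pct(values, p):
    """p-th percentile (1 <= p <= 99) with inclusive linear interpolation.

    The interpolation method is an assumption; the source does not state one.
    """
    return quantiles(values, n=100, method="inclusive")[p - 1]


def background_subtracted_r3(normal_ai, diseased_ai, tumor_ai):
    """R3: (70th pct of diseased - background) / (90th pct of normal+tumor - background)."""
    reference = list(normal_ai) + list(tumor_ai)
    everything = reference + list(diseased_ai)
    background = pct(everything, 15)  # 15th percentile over all samples
    numerator = pct(diseased_ai, 70) - background
    denominator = pct(reference, 90) - background
    if denominator <= 0:
        raise ValueError("reference signal does not exceed background")
    return numerator / denominator


# Made-up AI values for one probeset, for illustration only.
normal = [0.0, 10.0, 20.0, 30.0, 40.0, 50.0]
tumor = [60.0, 70.0, 80.0, 90.0, 100.0]
diseased = [100.0 + 10.0 * i for i in range(11)]  # 100, 110, ..., 200
r3 = background_subtracted_r3(normal, diseased, tumor)
print(round(r3, 3))
```

Subtracting the shared background keeps probesets with uniformly low signal from producing large ratios purely through division by a small number; the guard on the denominator makes that failure mode explicit.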



Pkey 


ExAccn 


UnigeneID


Unigene Title 


R1 


R2 


R3 


135423 


U50531 


Hs.138751 


Human BRCA2 region, mRNA sequence CG030 


12.40 






135378 


AW961818 


Hs.24379 


MUM2 protein 






2.13 


135346 


NM_000928 


Hs.992 


phospholipase A2, group IB (pancreas)








135235 


AW298244 


Hs.293507 


ESTs 


12.40 






135057 


U90268


Hs.93810 


cerebral cavernous malformations 1 


11.67 






134951 


BE305081 


Hs.169358 


hypothetical protein




8.00




134799 


M36821 


Hs.89690 


GRO3 oncogene




8.20 




134786 


T29618 


Hs.89640


TEK tyrosine kinase, endothelial (venous 








134772 


NM_000829 


Hs.163697 


glutamate receptor, ionotrophic, AMPA 4


29.80 






134752 


BE246762 


Hs.89499 


arachidonate 5-lipoxygenase






1.93 


134749 


T28499 


Hs.89485 


carbonic anhydrase IV 






2.07 


134696 


BE326276


Hs.8861 


ESTs 








134636 


NM_005582


Hs.87205 


lymphocyte antigen 64 (mouse) homolog, r 


13.60 






134627 


AI018768 


Hs.12482 


glyceronephosphate O-acyltransferase 






1.92 


134622 


AW975159 


Hs.293097 


ESTs, Weakly similar to A55380 faciogeni






1.92 


134570 


U66615 


Hs.172280 


SWI/SNF related, matrix associated, acti


13.20 






134561 


U76421 


Hs.85302 


adenosine deaminase, RNA-specific, B1 (h






1.78 


134468 


NM_001772


Hs.83731 


CD33 antigen (gp67) 




6.20 




134417 


NM_006416


Hs.82921 


solute carrier family 35 (CMP-sialic aci








134343 


D50683


Hs.82028


transforming growth factor, beta recepto 








134323 


BE170651 


Hs.8700 


deleted in liver cancer 1








134300 


NM_001430


Hs.8136 


endothelial PAS domain protein 1 








134299 


AW580939 


Hs.97199 


complement component C1q receptor 








134253 


X52075 


Hs.80738 


sialophorin (gpL115, leukosialin, CD43)


20.60 






134182 


D52059 


Hs.7972 


KIAA0871 protein


12.20 






133985 


L34657


Hs.78146 


platelet/endothelial cell adhesion molec








133978 


AF035718 


Hs.78061 


transcription factor 21 








133835 


AI677897


Hs.76640 


RGC32 protein 








133651 


AI301740 


Hs.173381


dihydropyrimidinase-like 2 








133633 


D21262 


Hs.75337 


nucleolar and coiled-body phosph protein 


15.20 






133565 


AW955776 


Hs.313500


ESTs, Moderately similar to ALU7_HUMAN A








133548 


AW946384


Hs.178112


DNA segment, single copy probe LNS-CAI/L






1.77 


133488 


AA335295 


Hs.74120 


adipose specific 2 








133478 


X83703 


Hs.31432 


cardiac ankyrin repeat protein






2.08 


133337 


AF085983 


Hs.293676 


ESTs 




9.60 




133200 


AB037715 


Hs.183639


hypothetical protein FLJ10210






1.77 


133153 


AF070592 


Hs.66170


HSKM-B protein


30.60 






133130 


AI128606


Hs.6557 


zinc finger protein 161 


22.60 






133120 


NM_003278


Hs.65424


tetranectin (plasminogen-binding protein








132928 


AW168082


Hs.169449


protein kinase C, alpha 


13.80 






132836 


AB023177 


Hs.29900 


KIAA0960 protein








132799 


W73311 


Hs.169407


SAC2 (suppressor of actin mutations 2,


41.60 






132742 


AA025480 


Hs.292812


ESTs, Weakly similar to T33468 hypotheti


40.40 






132548 


X12830 


Hs.193400


interleukin 6 receptor




7.20 




132476 


AL119844


Hs.49476 


Homo sapiens clone TUA8 Cri-du-chat regi




4.76 




132439 


AK001942 


Hs.4863 


hypothetical protein DKFZp566A1524






1.88 


132240 


AB018324 


Hs.42676 


KIAA0781 protein 


21.20 






132210 


NM_007203


Hs.42322 


A kinase (PRKA) anchor protein 2 






1.99 


132199 


AL041299 


Hs.165084 


ESTs 


15.20 






131751 


T96555 


Hs.31562 
Hs.16355


ESTs 


11.02 






111341 


AL157484 


Hs.22483 


Homo sapiens mRNA; cDNA DKFZp762M127 (fr 






1.86 


111280 


AA373527 


Hs.19385 


CGI-58 protein


18.40 






111247 


AW058350 


Hs.16762


Homo sapiens mRNA; cDNA DKFZp554B2062 (f 








111232 


AI247763


Hs.16928


ESTs 


27.60 






110942 


R63503 


Hs.28419 


ESTs 


14.80 






110924 


AW058453 


Hs.12940 


zinc-fingers and homeoboxes 1 


24.71 






110837 


H03109 


Hs.108920 


HT018 protein 






2.18


110824 


AI767183 


Hs.26942


ESTs 


12.20 






110776 


AB032417 


Hs.19545


frizzled (Drosophila) homolog 4 






1.75 


110576 


H60869 


Hs.37889 


ESTs 


13.00 






110369 


AK000768 


Hs.107872 


hypothetical protein FLJ20761




5.60 




110099 


R44557 


Hs.23748 


ESTs 






2.31 


109984 


AI796320 


Hs.10299 


Homo sapiens cDNA FLJ13545 fis, clone PL








109958 


AA001266


Hs.133521 


ESTs 


11.25 






109893 


AA884208 


Hs.30484 


ESTs 






2.68 
109842


AW818436 


Hs.23590 


solute carrier family 16 (monocarboxylic


23.83 




109837 


K00656 


Hs.29792 


ESTs, Weakly similar to I38022 hypotheti






109796 


AJ800515 


Hs.12024 


ESTs 


• 


17.20 


109688


R41900 


Hs.22245 


ESTs 




9.60 


109648 


H17800 


Hs.7154 


ESTs 


22.80 




109613 


H47315 


Hs.27519 


ESTs 






109550


AW021488 


Hs.26981 


ESTs 






109523


AW193342


Hs^4144 


ESTs 






109472 


AK001989 


Hs.91165 


hypothetical protein 




6.00 


109355 


AA524525 


Hs.48297 


DKFZP586C1620 protein 


15.00 




109260 


AW978515 


Hs.131915 


KIAA0863 protein 


25.60 




108781 


AA128654 




gb:zn98g07.s1 Stratagene fetal retina 93


14.20 




108653 


BE219231 


Hs^92653 


ESTs, Weakly similar to T26845 hypotheti


11.00 




108573 


AA086005 




gb:zl84c04.s1 Stratagene colon (937204)


26.00 




108480 


AL133092 


Hs.68055 


hypothetical protein DKFZp434l0428 






108382 


NM_006770


Hs.67726 


macrophage receptor with collagenous str 






108174 


AA055632 


Hs.303070 


ESTs 


15.20 




108138 


AL049990 


Hs.51515 


Homo sapiens mRNA; cDNA DKFZp564G112 (fr




3.60 


108087 


AA045708 


Hs.40545 


ESTs 


15.44 




108048 


AI797341 


Hs.165195 


Homo sapiens cDNA FLJ14237 fis, clone NT




11.40 


108041 


AW204712 


Hs.61957 


ESTs 






107997 


AL049176 


Hs.82223 


chordin-like




4.76 


107994 


AA036811 


Hs.48469 


LIM domains containing 1






107922 


BE153855 


Hs.61460 


Ig superfamily receptor LNIR 


14.20 




107681 


BE379594 


Hs.49136 


ESTs, Moderately similar to ALU7_HUMAN A 


51.80 




107666 


AA010611 


Hs.60418 


EST 


29.20 




107332 


T87750 


Hs.183297 


DKFZP566F2124 protein 


10.73




107292 


BE166479 


Hs.4789 


Homo sapiens serologically defined breas 


32.00 




107230 


AI034467 


Hs.34650 


ESTs 


17.40 




107168 


W57578 


Hs.237955 


RAB7, member RAS oncogene family 


10.43 




107160 


AA314490 


Hs.27669 


KIAA1563 protein 


11.40 




107054 


AI076459 


Hs.15978 


KIAA1272 protein






107029 


AF264750 


Hs.288971 


myeloid/lymphoid or mixed-lineage leukem 


21.40 




106999 


H93281 


Hs.10710 


hypothetical protein FLJ20417


35.80 




106954 


AF128847 


Hs.204038 


indolethylamine N-methyltransferase






106870 


AI983730


Hs.26530 


serum deprivation response (phosphatidyl 






106855 


AW192535 


Hs.19479 


ESTs 


13.40 




106844 


AA485055 


Hs.158213 


sperm associated antigen 6 




7.13 


106820 


NM_016831 


Hs.12592 


period (Drosophila) homolog 3 




7.00 


106818 


AK002135 


Hs.3542 


hypothetical protein FLJ11273


13.00 




106797 


AI768801 


Hs.169943 


Homo sapiens cDNA FLJ13569 fis, clone PL






106773 


AA478109 


Hs.188833 


ESTs 






106747 


NM_007118


Hs.171957 


triple functional domain (PTPRF interact


12.60 




106743 


BE613328


Hs.21938 


hypothetical protein FLJ12492


10.60 




106667 


AW360847 


Hs.16578 


ESTs 






106605 


AW772298 


Hs.21103 


Homo sapiens mRNA; cDNA DKFZp564B076 (fr 






106567 


AW450408 


Hs.86412 


chromosome 9 open reading frame 5 






106562 


AL031846 


Hs.152151 


plakophilin 4






106536 


AA329648


Hs.23804 


ESTs, Weakly similar to PN0099 son3 prot






106533 


AL134708 


Hs.145998 


ESTs 


23.20 




106507 


AA259068 


Hs.267819 


protein phosphatase 1, regulatory (inhib 


15.20 




106490 


AA404265


Hs.115537 


putative dipeptidase 






106474 


BE383668 


Hs.42484 


hypothetical protein FLJ10618 


10.44 




106211 


AA428240 


Hs.126083 


ESTs 




29.80 


105986 


AB037722 


Hs.8707 


KIAA1301 protein 




3.70 


105894 


AI904740


Hs.25691 


receptor (calcitonin) activity modifying 






105847 


AW964490 


Hs.32241 


ESTs, Weakly similar to S65657 alpha-1C-






105803 


AW747996 


Hs.160999 


ESTs, Moderately similar to A56194 throm 






105731 


AA834664


Hs.29131 


nuclear receptor coactivator 2 


10.71 




105729 


K46612 


Hs.293815 


Homo sapiens KSPC285 mRNA, partial cds 






105688 


AI299139


Hs.17517 


ESTs 


23.40 




105510 


Z42047 


Hs.283978 


Homo sapiens PRO2751 mRNA, complete cds


37.20 




105101 


H63202 


Hs.38163 


ESTs 




8.30 


104989 


R65998 


Hs.285243 


hypothetical protein FLJ22029




8.09 


104986 


AW088826 


Hs.117176


poly(A)-binding protein, nuclear 1






104969 


AI670947


Hs.78406 


phosphatidylinositol-4-phosphate 5-kinas




5.40 


104903 


AI436323 


Hs.31141 


Homo sapiens mRNA for KIAA1568 protein, 




7.60 


104896 


AW015318 


Hs.23165 


ESTs 


13.80 




104865 


779340 


Hs.22575 


Homo sapiens cDNA: FLJ21042 fis, clone C






104825 


AA035613 


Hs.141883 


ESTs 






104781 


AA099904 


Hs.21610 


DKFZP434B203 protein 






104776 


AA026349




gb:zj99f01.s1 Soares_pregnant_uterus_NbH




10.20 


104691 


U29590 


Hs.37744 


Homo sapiens beta-1 adrenergic receptor 




5.69 


104667 


AI239923 


Hs.30098 


ESTs 




3.82 


104404 


H58762 




gb:EST00057 HE6W Homo sapiens cDNA clone




4.20 


104392 


AA076049 


Hs.274415 


Homo sapiens cDNA FLJ10229 fis, clone HE


27.20 




104212 


AB002298 


Hs.173035 


KIAA0300 protein 






104074 


AL162039 


Hs.31422 


Homo sapiens mRNA; cDNA DKFZp434M229 (fr


11.20 




103749 


AL135301 


Hs.8768 


hypothetical protein FLJ10849 


10.66 




103845 


AW246253 


Hs.7043 


succinate-CoA ligase, GDP-forming, alpha


12.00 




103554 


AI878826 


Hs.323469 


caveolin 1, caveolae protein, 22kD






103541 


AI815601 


Hs.79197 


CD83 antigen (activated B lymphocytes, i 






103496 


Y09267 


Hs.132821 


flavin containing monooxygenase 2 






103428 


BE383507


Hs.78921 


A kinase (PRKA) anchor protein 1 


11.20 




103353 


X89399 


Hs.119274


RAS p21 protein activator (GTPase activa 


19.80 





1.91



1.89 



1.83 



1.76 



2.05 



2.40 
1.78 
1.76 
2.19



1.94 
1.75 
2.47 



1.92 



1.87 
1.93 



1.91 



1.80 
103295


X81479 


Hs.2375 


103280 


U84722 


Hs.76206 


103100 


NM_005574


Hs.184585 


103025 


NM_002837


Hs.123641 


102898 


M18667


Hs.1867 


102659 


BE245169 


Hs.211610 


102580 


U60808


Hs.152981 


102417 


AA034127 


Hs.153487 


102363 


NM_003734


Hs.198241 


102302 


AA306342 


Hs.69171 


102283 


AW161552 


Hs.83381 


102188 


U20350 


Hs.78913 


102151 


T27013 


Hs.3132 


101957 


L28824


Hs.74101 


101842 


M93221 


Hs.75182 


101771 


NM_002432


Hs.153837 


101764 


AI198550


Hs.81256 


101716 


AF050658 


Hs.2563 


101678 


M62505 


Hs.2161 


101447 


M21305 




101383 


NM_000132


Hs.79345 


101346 


AI738616


Hs.77348 


101345 


NM_005795


Hs.152175 


101336 


NM-00S732 


Hs.75678 


101330 


L43821 


Hs.80261 


101277 


BE297626 


Hs.296049 


101262 


L35854 




101168 


NM_005308


Hs.211569 


101102 


NM_003243


Hs.79059 


101088 


X70697 


Hs.553 


101066 


AW970254 


Hs.889 


100971 


BE379727 


Hs.83213 


100893 


BE245294 


Hs.180789


100770 


W25797.comp 


Hs.177486 


100716 


X89887 


Hs.172350 


100555 


M69181 




100425 


NM_014747


Hs.78748 


100408 


D86640 


Hs.56045 


100382 


D83407 


Hs.156007 


100351 


D64158 




100299 


D49493 


Hs.2171 


100134 


AA305746


Hs.49 


100108 


U09577 


Hs.76873 


100095 


Z97171 


Hs.78454 


100065 







egf-like module containing, mucin-like,
cadherin 5, type 2, VE-cadherin (vascula
LIM domain only 2 (rhombotin-like 1)
protein tyrosine phosphatase, receptor t
progastricsin (pepsinogen C)
CUG triplet repeat RNA-binding protein
CDP-diacylglycerol synthase (phosphatida
signal transducing adaptor molecule (SH3
amine oxidase, copper containing 3 (vasc
protein kinase C-like 2
guanine nucleotide binding protein 11
chemokine (CX3-C) receptor 1
steroidogenic acute regulatory protein
spleen tyrosine kinase
mannose receptor, C type 1
myeloid cell nuclear differentiation ant
S100 calcium-binding protein A4 (calcium
tachykinin, precursor 1 (substance K, su
complement component 5 receptor 1 (C5a l
gb:Human alpha satellite and satellite 3
coagulation factor VIII, procoagulant co
hydroxyprostaglandin dehydrogenase 15-(N
calcitonin receptor-like
FBJ murine osteosarcoma viral oncogene h
enhancer of filamentation 1 (cas-like do
microfibrillar-associated protein 4
gb:Human dystrophin (dp140) mRNA, 5' end
G protein-coupled receptor kinase 5
transforming growth factor, beta recepto
solute carrier family 6 (neurotransmitte
Charcot-Leyden crystal protein
fatty acid binding protein 4, adipocyte
S164 protein

amyloid beta (A4) precursor protein (pro
HIR (histone cell cycle regulation defec
gb:Human nonmuscle myosin heavy chain-B
KIAA0237 gene product
src homology three (SH3) and cysteine ri
Down syndrome critical region gene 1-lik

growth differentiation factor 10
macrophage scavenger receptor 1
hyaluronoglucosaminidase 2
myocilin, trabecular meshwork inducible



3.60 
1.76
2.15



11.00 
25.40 
14.00 

10.86 



16.40 
15.40 



18.80 
504.80 



7.40 



31.00 



19.00 



19.38 

15.40 
11.20 
14.80 
33.00 
16.20 



7.52 



1.78 
2.22

1.75 
2.24 

2.01 

1.91 



11.29 



4.00 
4.24 
6.20 
21.20 



5.40 



1.79 



TABLE 3B shows the accession numbers for those primekeys lacking unigeneID's for Table 3A. For each probeset we have listed the gene cluster number from which the
oligonucleotides were designed. Gene clusters were compiled using sequences derived from Genbank ESTs and mRNAs. These sequences were clustered based on sequence
similarity using Clustering and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers for sequences comprising each cluster are listed in the
"Accession" column.

Pkey: Unique Eos probeset identifier number
CAT number: Gene cluster number
Accession: Genbank accession numbers



Pkey CAT number Accessions

123619 371681_1 AA602964 AA609200
126433 127143_1 AA325606 AA099517 N89423
125831 1522905_1 H04043 D60988 D60337
126816 122973_1 AA248234 AA090985
126852 136135_1 AA399961 AA128347
121059 273450_1 AA393283 AA398628
120637 200885_1 AA811804 AA809404 AA286907 AW977624
122011 7617_-2 AA431082
120934 177521_1 AA226198 AA226513 AA383773
123802 genbank_AA620448 AA620448
116814 genbank_H50834 H50834
118329 genbank_N63520
104404 H58762_at H58762
104776 genbank_AA026349 AA026349
113502 genbank_T89130 T89130
101262 entrez_L35854 L35854
108573 genbank_AA086005 AA086005
101447 entrez_M21305 M21305
124357 genbank_N22401 N22401
108781 genbank_AA128654 AA128654
112794 genbank_R97018 R97018
100351 entrez_D64158 D64158
100555 tigr_HT2245 M69181 M81105 U51039
WO 02/086443 PCT/US02/12476

Table 4A shows 202 genes up-regulated in samples from patients treated with chemotherapy or radiotherapy. These genes were selected from 59680 probesets on the
Eos/Affymetrix Hu03 Genechip array. Gene expression data for each probeset obtained from this analysis was expressed as average intensity (AI), a normalized value reflecting
the relative level of mRNA expression.

Pkey: Unique Eos probeset identifier number
ExAccn: Exemplar Accession number, Genbank accession number
UnigeneID: Unigene number
Unigene Title: Unigene gene title
R1: average of AI for samples from patients treated with chemotherapy or radiotherapy divided by the average of AI for normal lung samples.
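The R1 column of Table 4A is a ratio of two means. A minimal sketch of that calculation follows, assuming the AI values for one probeset are available as plain lists of numbers. The function name `r1_ratio` and the sample values are illustrative only and do not come from the source.

```python
from statistics import mean
from typing import Sequence


def r1_ratio(treated_ai: Sequence[float], normal_ai: Sequence[float]) -> float:
    """Mean AI of treated-patient samples divided by mean AI of normal lung samples.

    Hypothetical helper: the source defines the ratio but not how it was computed.
    """
    normal_mean = mean(normal_ai)
    if normal_mean == 0:
        raise ZeroDivisionError("mean AI of the normal lung samples is zero")
    return mean(treated_ai) / normal_mean


# Illustrative values only: treated mean 50.0, normal mean 2.5.
example_r1 = r1_ratio([40.0, 60.0], [2.0, 3.0])
```

With these made-up inputs the ratio is 20.0, which is the order of magnitude reported for most probesets in the table.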


Pkey | ExAccn | UnigeneID | Unigene Title | R1
100113 | NM_001269 | Hs.84746 | chromosome condensation 1 | 27.20
100187 | D17793 | Hs.78183 | aldo-keto reductase family 1, member C3 | 20.60
100210 | D26361 | Hs.3104 | KIAA0042 gene product | 20.40
100225 | D28539 | Hs.167185 | glutamate receptor, metabotropic 5 | 20.60
100269 | NM_001949 | Hs.1189 | E2F transcription factor 3 | 29.40
100438 | AA013051 | Hs.91417 | topoisomerase (DNA) II binding protein | 23.50
100877 | X80821 | Hs.27973 | KIAA0874 protein | 35.56
100893 | BE245294 | Hs.180789 | S164 protein | 43.40
101273 | Z11933 | Hs.182505 | POU domain, class 3, transcription facto | 21.80
101447 | M21305 |  | gb:Human alpha satellite and satellite 3 | 193.60
101649 | AW959908 | Hs.1690 | heparin-binding growth factor binding pr | 38.40
101724 | L11690 | Hs.620 | bullous pemphigoid antigen 1 (230/240kD) | 198.80
101748 | NM_001944 | Hs.1925 | desmoglein 3 (pemphigus vulgaris antigen | 78.60
101809 | M85849 | Hs.323733 | gap junction protein, beta 2, 26kD (conn | 162.20
101879 | AA176374 | Hs.243886 | nuclear autoantigenic sperm protein (his | 50.00
101915 | AF207881 | Hs.155185 | cytosolic ovarian carcinoma antigen 1 | 26.00
101973 | U41514 | Hs.80120 | UDP-N-acetyl-alpha-D-galactosamine:polyp | 37.20
102025 | U04045 | Hs.78934 | mutS (E. coli) homolog 2 (colon cancer, |
102031 | U04896 | Hs.2156 | RAR-related orphan receptor A | 32.00
102052 | NM_002202 | Hs.505 | ISL1 transcription factor, LIM/homeodoma | 51.20
102391 | AA296874 | Hs.77494 | deoxyguanosine kinase | 13.90
102420 | U44060 | Hs.14427 | Homo sapiens cDNA: FLJ21800 fis, clone H | 28.80
102610 | U65011 | Hs.30743 | preferentially expressed antigen in mela | 110.60
102829 | NM_006183 | Hs.80982 | neurotensin | 116.80
103000 | NM_001975 | Hs.146580 | enolase 2, (gamma, neuronal) | 2.30
103036 | M13509 | Hs.83169 | matrix metalloproteinase 1 (interstitial | 181.40
103507 | AJ000512 | Hs.296323 | serum/glucocorticoid regulated kinase | 49.20
103587 | BE270266 | Hs.82128 | 5T4 oncofetal trophoblast glycoprotein | 86.60
104660 | BE298665 | Hs.14846 | Homo sapiens mRNA; cDNA DKFZp564D016 (fr | 42.60
104896 | AW015318 | Hs.23165 | ESTs | 29.40
105038 | AW503733 | Hs.9414 | KIAA1488 protein | 21.50
105298 | BE387790 | Hs.26369 | hypothetical protein FLJ20287 | 32.80
105510 | Z42047 | Hs.283978 | Homo sapiens PRO2751 mRNA, complete cds | 20.20
105667 | AA767526 | Hs.22030 | paired box gene 5 (B-cell lineage specif | 28.40
106073 | AL157441 | Hs.17834 | downstream neighbor of SON | 25.40
106205 | AW965058 | Hs.111583 | ESTs, Weakly similar to I38022 hypotheti | 32.00
106516 | AL137311 | Hs.234074 | Homo sapiens mRNA; cDNA DKFZp761G02121 ( | 40.60
106533 | AL134708 | Hs.145998 | ESTs | 59.80
106575 | AW970502 | Hs.105421 | ESTs | 43.40
106654 | AW075485 | Hs.286049 | phosphoserine aminotransferase | 50.80
106851 | AI458623 |  | gb:tk04g09.x1 NCI_CGAP_Lu24 Homo sapiens | 53.40
106995 | AB023139 | Hs.37892 | KIAA0922 protein | 20.88
107332 | T87750 | Hs.183297 | DKFZP566F2124 protein | 23.60
107532 | AA443473 | Hs.173684 | Homo sapiens mRNA; cDNA DKFZp762G207 (fr | 57.20
107922 | BE153855 | Hs.61460 | Ig superfamily receptor LNIR | 49.00
108609 | BE409857 | Hs.69499 | hypothetical protein | 19.67
108780 | AU076442 | Hs.117938 | collagen, type XVII, alpha 1 | 48.17
109166 | AA219691 | Hs.73625 | RAB6 interacting, kinesin-like (rabkines | 59.20
109260 | AW978515 | Hs.131915 | KIAA0863 protein | 28.60
109280 | AK001355 | Hs.279610 | hypothetical protein FLJ10493 | 22.80
109292 | AW975746 | Hs.188662 | KIAA1702 protein |
109384 | AA219172 | Hs.86849 | ESTs | 21.00
109415 | U80736 | Hs.110826 | trinucleotide repeat containing 9 | 31.60
109445 | AA232103 | Hs.189915 | ESTs | 24.20
109502 | AW967069 | Hs.211556 | hypothetical protein MGC5487 | 21.40
109633 | AW003785 | Hs.170267 | ESTs | 20.40
109786 | AI989482 | Hs.146286 | kinesin family member 13A | 19.60
109958 | AA001266 | Hs.133521 | ESTs | 24.00
110920 | N47224 | Hs.20521 | HMT1 (hnRNP methyltransferase, S. cerevi | 28.40
110924 | AW058463 | Hs.12940 | zinc-fingers and homeoboxes 1 | 36.00
111084 | H44186 | Hs.15456 | PDZ domain containing 1 | 61.20
111132 | AB037807 | Hs.83293 | hypothetical protein | 24.60
111229 | AW389845 | Hs.110855 | ESTs | 27.20
111337 | AA837396 | Hs.263925 | LIS1-interacting protein NUDE1, rat homo | 48.00
111987 | NM_015310 | Hs.6763 | KIAA0942 protein | 37.80
112046 | AA383343 | Hs.22116 | CDC14 (cell division cycle 14, S. cerevi | 26.80
112268 | W39609 | Hs.22003 | solute carrier family 6 (neurotransmitte | 63.80
112685 | R87650 | Hs.33439 | ESTs, Weakly similar to ALU1_HUMAN ALU | 26.40
112871 | AL110216 | Hs.12285 | ESTs, Weakly similar to I55214 salivary | 47.64
112897 | AW206453 | Hs.3782 | ESTs | 22.00
112973 | AB033023 | Hs.318127 | hypothetical protein FLJ10201 | 65.00
112992 | AL157425 | Hs.133315 | Homo sapiens mRNA; cDNA DKFZp761J1324 (f | 42.00
113073 | N39342 | Hs.103042 | microtubule-associated protein 1B | 55.40
113434 | T91451 | Hs.86538 | ESTs | 22.80
113560 | T91015 | Hs.268626 | ESTs | 22.80
113849 | AA457211 | Hs.8858 | bromodomain adjacent to zinc finger doma | 51.80
113950 | AI287652 | Hs.30504 | Homo sapiens mRNA; cDNA DKFZp434E082 (fr | 28.20
114339 | AA782845 | Hs.22790 | ESTs | 20.20
114365 | H42169 | Hs.18653 | hypothetical protein FLJ14627 | 21.00
114455 | H37908 | Hs.271616 | ESTs, Weakly similar to ALU8_HUMAN ALU S | 25.80
114518 | AW163267 | Hs.106469 | suppressor of var1 (S.cerevisiae) 3-like | 23.60
114824 | AA950961 | Hs.305953 | zinc finger protein 83 (HPF1) | 27.20
114837 | BE244930 | Hs.166895 | ESTs | 30.20
114974 | AW956931 | Hs.179562 | nucleosome assembly protein 1-like 1 | 20.80
115075 | AA814043 | Hs.88045 | ESTs | 30.60
115084 | BE383668 | Hs.42484 | hypothetical protein FLJ10618 | 28.86
115291 | BE545072 | Hs.122579 | hypothetical protein FLJ10461 | 38.00
115313 | AA808001 | Hs.184411 | albumin | 22.60
115697 | D31382 | Hs.63325 | transmembrane protease, serine 4 | 173.60
115909 | AW872527 | Hs.59761 | ESTs, Weakly similar to DAP1_HUMAN DEATH | 27.77
116090 | AI591147 | Hs.61232 | ESTs | 20.80
116107 | AL133916 | Hs.172572 | hypothetical protein FLJ20093 | 164.20
116399 | AA889120 | Hs.110637 | homeo box A10 | 38.00
117099 | H93699 |  | gb:yv16a11.s1 Soares fetal liver spleen | 21.60
117881 | AF161470 | Hs.260622 | butyrate-induced transcript 1 | 49.40
118091 | AW005054 | Hs.47883 | ESTs, Weakly similar to KCC1_HUMAN CALCI | 22.40
118138 | AA374756 | Hs.93560 | Homo sapiens mRNA for KIAA1771 protein, | 22.00
118720 | N73515 |  | gb:za49d07.s1 Soares fetal liver spleen | 20.00
118873 | AI824009 | Hs.44577 | ESTs | 19.40
119126 | R45175 | Hs.117183 | ESTs | 111.20
119717 | AA918317 | Hs.57987 | B-cell CLL/lymphoma 11B (zinc finger pro | 33.00
119940 | AL050097 | Hs.272531 | DKFZP586B0319 protein | 31.00
120266 | AI807264 | Hs.205442 | ESTs, Weakly similar to T34036 hypotheti | 20.20
120515 | AA258356 |  | gb:zr59c10.s1 Soares_NhHMPu_S1 Homo sapi | 25.00
120859 | AA826434 | Hs.1619 | achaete-scute complex (Drosophila) homol | 95.40
120983 | AA398209 | Hs.97587 | EST | 105.20
121054 | AW976570 | Hs.97387 | ESTs | 38.80
121369 | AW450737 | Hs.128791 | CGI-09 protein | 41 £0
122335 | AA443258 | Hs.241551 | chloride channel, calcium activated, fam | 30.80
122612 | AA974832 | Hs.128708 | ESTs | 19.60
123130 | AA487200 |  | gb:ab19f02.s1 Stratagene lung (937210) H | 33.20
123440 | AI733692 | Hs.112488 | ESTs | 23.17
123596 | AA421130 | Hs.112640 | EST | 23.00
123619 | AA602964 |  | gb:no97c02.s1 NCI_CGAP_Pr2 Homo sapiens | 28.80
124008 | AI147155 | Hs.270016 | ESTs | 77.60
124169 | BE079334 | Hs.271630 | ESTs | 22.20
124281 | AI333756 | Hs.111801 | arsenate resistance protein ARS2 | 42.20
124472 | N52517 | Hs.102670 | EST | 32.60
124617 | AW628168 | Hs.152684 | ESTs | 21.80
124631 | NM_014053 | Hs.270594 | FLVCR protein | 30.40
124839 | R55784 | Hs.140942 | ESTs | 21.20
125186 | AA610620 | Hs.181244 | major histocompatibility complex, class | 42.80
125321 | T86652 | Hs.178294 | ESTs | 27.00
125535 | NM_013243 | Hs.22215 | secretogranin III | 23.80
125646 | AA628962 | Hs.75209 | protein kinase (cAMP-dependent, catalyti | 23.20
125684 | AW589427 | Hs.158849 | Homo sapiens cDNA: FLJ21663 fis, clone C | 21.20
125724 | AL360190 | Hs.295978 | Homo sapiens mRNA full length insert cDN | 48.80
125847 | AW161885 | Hs.249034 | ESTs | 31.00
125934 | AA193325 | Hs.32646 | hypothetical protein FLJ21901 | 21.20
126077 | M78772 | Hs.210836 | ESTs | 49.80
126299 | AW979155 | Hs.298275 | amino acid transporter 2 | 21.80
126395 | AI468004 | Hs.278956 | hypothetical protein FLJ12929 | 71.00
126433 | AA325606 |  | gb:EST28707 Cerebellum II Homo sapiens c | 23.20
126509 | R47400 | Hs.23850 | ESTs | 23.80
126538 | AB030S56 | Hs.17377 | coronin, actin-binding protein, 1C | 23.10
126666 | AA648866 | Hs.151999 | ESTs | 38.00
126812 | AB037860 | Hs.173933 | nuclear factor I/A | 20.80
126872 | AW450979 |  | gb:UI-H-BI3-aia-a-12-vVUI.s1 NCI_CGAP_Su | 46.29
127046 | AA321948 | Hs.293968 | ESTs | 22.80
127431 | AW771958 | Hs.175437 | ESTs, Moderately similar to PC4259 ferri | 30.00
127489 | AA650250 | Hs.272076 | ESTs | 20.80
127521 | AW297206 | Hs.164018 | ESTs | 25.20
127742 | AW293496 | Hs.180138 | ESTs | 28.00
127925 | AA805151 | Hs.3628 | mitogen-activated protein kinase kinase | 21.20
127930 | AA809672 | Hs.123304 | ESTs | 20.54
127968 | AA830201 | Hs.124347 | ESTs | 28.20
127987 | AI022103 | Hs.124511 | ESTs | 19.60
128116 | H07103 | Hs.286014 | Homo sapiens, clone IMAGE:3867243, mRNA | 20.40
128609 | NM_003616 | Hs.102456 | survival of motor neuron protein interac | 34.40
128777 | AI878918 | Hs.10526 | cysteine and glycine-rich protein 2 | 53.80
128949 | AA009647 | Hs.8850 | a disintegrin and metalloproteinase doma | 23.00
129168 | AI132988 | Hs.109052 | chromosome 14 open reading frame 2 | 37.60
129404 | AI267700 | Hs.317584 | ESTs | 28.60
129527 | AA769221 | Hs.270847 | delta-tubulin | 40.80
129574 | AA026815 | Hs.11463 | UMP-CMP kinase | 31.20
129598 | N30436 | Hs.11556 | Homo sapiens cDNA FLJ12566 fis, clone NT | 29.60
129785 | H19006 | Hs.184780 | ESTs | 72.20
129970 | AV655806 | Hs.296198 | chromosome 12 open reading frame 4 | 22.20
130149 | AW067805 | Hs.172665 | methylenetetrahydrofolate dehydrogenase | 29.60
130199 | Z48579 | Hs.172028 | a disintegrin and metalloproteinase doma | 27.60
130441 | U63530 | Hs.155637 | protein kinase, DNA-activated, catalytic | 28.36
130466 | W19744 | Hs.180059 | Homo sapiens cDNA FLJ20653 fis, clone KA | 20.20
130482 | AW409701 | Hs.1578 | baculoviral IAP repeat-containing 5 (sur | 22.40
130617 | M90516 | Hs.1674 | glutamine-fructose-6-phosphate transamin | 19.60
130703 | R77776 | Hs.18103 | ESTs | 19.40
130732 | AW890487 | Hs.63984 | cadherin 13, H-cadherin (heart) | 21.40
130887 | NM_001072 | Hs.284239 | UDP glycosyltransferase 1 family, polype | 110.00
131028 | AI879165 | Hs.2227 | CCAAT/enhancer binding protein (C/EBP), | 25.20
131086 | AL035461 | Hs.2281 | chromogranin B (secretogranin 1) | 40.60
131284 | NM_001429 | Hs.25272 | E1A binding protein p300 | 24.60
131775 | AB014548 | Hs.31921 | KIAA0648 protein | 21.00
131860 | BE383676 | Hs.334 | Rho guanine nucleotide exchange factor ( | 33.40
131945 | NM_002916 | Hs.35120 | replication factor C (activator 1) 4 (37 | 60.80
132040 | NM_001196 | Hs.315689 | Homo sapiens cDNA: FLJ22373 fis, clone H | 20.40
132084 | NM_002267 | Hs.3885 | karyopherin alpha 3 (importin alpha 4) | 29.40
132389 | AA310393 | Hs.190044 | ESTs | 32.40
132437 | AA152106 | Hs.4859 | cyclin L ania-6a | 27.40
132550 | AW969253 | Hs.170195 | bone morphogenetic protein 7 (osteogenic | 75.60
132617 | AF037335 | Hs.5338 | carbonic anhydrase XII | 31.36
132632 | AU076916 | Hs.5398 | guanine monphosphate synthetase | 32.40
132672 | W27721 | Hs.54697 | Cdc42 guanine exchange factor (GEF) 9 | 23.40
132742 | AA025480 | Hs.292812 | ESTs, Weakly similar to T33468 hypotheti | 61.20
132771 | Y10275 | Hs.56407 | phosphoserine phosphatase | 22.33
133070 | U92649 | Hs.64311 | a disintegrin and metalloproteinase doma | 23.50
133153 | AF070592 | Hs.66170 | HSKM-B protein | 30.00
133181 | X91662 | Hs.66744 | twist (Drosophila) homolog (acrocephalos | 23.80
133282 | AA449015 | Hs.286145 | SRB7 (suppressor of RNA polymerase B, ye | 51.60
133350 | AI499220 | Hs.71573 | hypothetical protein FLJ10074 | 33.00
133592 | AV652066 | Hs.75113 | general transcription factor IIIA | 82.00
133658 | AA319146 | Hs.75426 | secretogranin II (chromogranin C) |
133865 | AB011155 | Hs.170290 | discs, large (Drosophila) homolog 5 | 69.33
134032 | NM_005025 | Hs.78589 | serine (or cysteine) proteinase inhibito | 33.20
134125 | NM_014781 | Hs.50421 | KIAA0203 gene product | 31.60
134158 | U15174 | Hs.79428 | BCL2/adenovirus E1B 19kD-interacting pro | 30.60
134321 | BE538082 | Hs.8172 | ESTs, Moderately similar to A46010 X-lin | 23.40
134367 | AA339449 | Hs.82285 | phosphoribosylglycinamide formyltransfer | 49.20
134570 | U66615 | Hs.172280 | SWI/SNF related, matrix associated, acti | 20.20
134753 | NM_006482 | Hs.173135 | dual-specificity tyrosine-(Y)-phosphoryl | 20.80
135002 | AA448542 | Hs.251677 | G antigen 7B | 37.60
135029 | H58818 | Hs.187579 | hydroxysteroid (17-beta) dehydrogenase | 53.40
135047 | AL134197 | Hs.93597 | cyclin-dependent kinase 5, regulatory su | 31.60
135345 | X53655 | Hs.99171 | neurotrophin 3 | 28.80



TABLE 4B shows the accession numbers for those primekeys lacking unigeneID's for Table 4A. For each probeset we have listed the gene cluster number from which the
oligonucleotides were designed. Gene clusters were compiled using sequences derived from Genbank ESTs and mRNAs. These sequences were clustered based on sequence
similarity using Clustering and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers for sequences comprising each cluster are listed in the
"Accession" column.

Pkey: Unique Eos probeset identifier number
CAT number: Gene cluster number
Accession: Genbank accession numbers



Pkey CAT number Accessions

123619 371681_1 AA602964 AA609200
126433 127143_1 AA325606 AA099517 N89423
126872 142696_1 AW450979 AA136653 AA136656 AW419381 AA984358 AA492073 BE168945 AA809054 AW238038 BE011212 BE011359 BE011367 BE011368 BE011362 BE011215 BE011365 BE011363
106851 322947_1 AI458623 AA639708 AA485409 R22065 AA485570
118720 genbank_N73515 N73515
120515 genbank_AA258356 AA258356
117099 321871_1 H93699 H97976 H80036
101447 entrez_M21305 M21305
123130 genbank_AA487200 AA487200
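Rows of the "Pkey / CAT number / Accessions" tables can be read into a lookup keyed by Pkey. The sketch below is only an illustration of that data structure: it assumes one whitespace-separated row per probeset, and the function name `parse_cluster_rows` is hypothetical. The sample rows are taken from Tables 3B and 4B.

```python
from typing import Dict, List, Tuple


def parse_cluster_rows(text: str) -> Dict[int, Tuple[str, List[str]]]:
    """Map each Pkey to (CAT number, list of Genbank accessions).

    Assumption: every probeset occupies a single line. Accession lists that
    wrap onto a following line are not reattached by this sketch.
    """
    clusters: Dict[int, Tuple[str, List[str]]] = {}
    for line in text.splitlines():
        fields = line.split()
        # Skip blank lines, legends and the column header row.
        if len(fields) < 2 or not fields[0].isdigit():
            continue
        pkey, cat_number, *accessions = fields
        clusters[int(pkey)] = (cat_number, accessions)
    return clusters


sample_rows = """Pkey CAT number Accessions
123130 genbank_AA487200 AA487200
101447 entrez_M21305 M21305
100555 tigr_HT2245 M69181 M81105 U51039
"""
clusters = parse_cluster_rows(sample_rows)
```

Keying on the Pkey lets a probeset that has no Unigene number in Table 3A or 4A be resolved to its cluster and accession list in one lookup.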
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Table 5A shows 680 genes up-regulated in squamous cefl carcinoma or adenocarcinoma rung tumors relative to norma) lung and chronically diseased lung. These genes were 
selected from 59680 probesets on the Eos/Affymetrix Hu03 Genechip array. Gene expression data for each probeset obtained from this analysis was expressed as average 
intensity (A!), a normalized value reflecting the relative level of mRNA expression. 

Pkey: Unique Eos probeset identifier number

ExAccn: Exemplar Accession number, Genbank accession number

UnigeneID: Unigene number

Unigene Title: Unigene gene title

R1: 70th percentile of AI for squamous cell carcinoma and adenocarcinoma lung tumor samples divided by the 90th percentile of AI for normal and chronically diseased lung samples.

R2: 80th percentile of AI for adenocarcinoma lung tumor samples divided by the 90th percentile of AI for normal and chronically diseased lung samples.

R3: 80th percentile of AI for squamous cell carcinoma lung tumor samples divided by the 90th percentile of AI for normal and chronically diseased lung samples.

R4: 80th percentile of AI for adenocarcinoma lung tumor samples divided by the 80th percentile of AI for squamous cell carcinoma lung tumor samples.

R5: 70th percentile of AI for squamous cell carcinoma and adenocarcinoma lung tumor samples minus the 15th percentile of AI for all normal lung, chronically diseased lung and tumor samples, divided by the 90th percentile of AI for normal and chronically diseased lung samples minus the 15th percentile of AI for all normal lung, chronically diseased lung and tumor samples.
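The five ratios defined in this legend can be illustrated with a minimal sketch for a single probeset. The function names, the grouping of samples into three lists, and the linear-interpolation percentile rule are assumptions made for illustration only; the text does not state which percentile convention was used, and this is not presented as the software actually employed.

```python
def percentile(values, q):
    """Return the q-th percentile (0-100) of values.

    Linear interpolation between ranks is an assumption; the source
    does not specify a percentile convention.
    """
    data = sorted(values)
    if not data:
        raise ValueError("no intensity values supplied")
    pos = (len(data) - 1) * q / 100.0
    lo = int(pos)
    hi = min(lo + 1, len(data) - 1)
    return data[lo] + (data[hi] - data[lo]) * (pos - lo)


def table_5a_ratios(squamous, adeno, non_tumor):
    """Compute R1..R5 for one probeset from average intensity (AI) values.

    squamous, adeno: AI values for the two tumor types.
    non_tumor: AI values for normal and chronically diseased lung.
    """
    tumor = list(squamous) + list(adeno)
    all_samples = tumor + list(non_tumor)
    base = percentile(non_tumor, 90)      # 90th percentile of non-tumor AI
    floor = percentile(all_samples, 15)   # 15th percentile over all samples
    return {
        "R1": percentile(tumor, 70) / base,
        "R2": percentile(adeno, 80) / base,
        "R3": percentile(squamous, 80) / base,
        "R4": percentile(adeno, 80) / percentile(squamous, 80),
        # R5 subtracts the same floor from numerator and denominator;
        # it is undefined when base equals floor.
        "R5": (percentile(tumor, 70) - floor) / (base - floor),
    }


# Hypothetical AI values, chosen only to make the arithmetic easy to follow.
example = table_5a_ratios(
    squamous=[8.0] * 5,
    adeno=[4.0] * 5,
    non_tumor=[0.0] * 5 + [2.0] * 5,
)
```

Subtracting the 15th percentile in R5 acts as a background correction, so a probeset whose signal sits near the overall floor is not credited with a large ratio merely because the denominator is small.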



Pkey 


ExAccn 


UnigeneID


Unigene Title 


R1 


R2 


R3 


R4 


R5 


100035 






AFFX control: GAPDH 










6.76 


100036 






AFFX control: GAPDH 










5.77 


100037 






AFFX control: GAPDH 










5.75 


100071 


A28102 




Human GABAa receptor alpha-3 subunit 




8.00 








100114 


X02308 


Hs.82962 


thymidylate synthetase










5.71 


100154 


H60720 


Hs.81892 


KIAA0101 gene product


3.84 










100187 


D17793 


Hs.78183 


aldo-keto reductase family 1, member C3


3.33 










100188 


AW247090 


Hs.57101 


minichromosome maintenance deficient (S.










4.52 


100202 


BE294407 


Hs.99910 


phosphofructokinase, platelet 










5.49 


100216 


AA489908 


Hs.1390 


proteasome (prosome, macropain) subunit, 










5.67 


100269 


NM_001949


Hs.1189


E2F transcription factor 3 


2.55 










100287 


AU076657 


Hs.1600 


chaperonin containing TCP1, subunit 5 (e










5.66 


100297 


AU077258 


Hs.182429 


protein disulfide isomerase-related prot










3.81 


100330 


AW410976 


Hs.77152 


minichromosome maintenance deficient (S. 
platelet-activating factor acetylhydrola










4.50 


100335 


AW247529 


Hs.6793 


5.07 










100360 


W70171 


Hs.75939 


uridine monophosphate kinase 










4.82 


100372 


NM_014791


Hs.184339


KIAA0175 gene product










3.79 


100474 


NM_000699


Hs.300280 


amylase, alpha 2A; pancreatic 








15.65 




100486 


T19006 


Hs.10842 


RAN, member RAS oncogene family 










5.49 


100491 


056165 


Hs.275163 


non-metastatic cells 2, protein (NM23B) 










4.17 


100516 


D90278 


Hs.11 


carcinoembryonic antigen-related cell ad 




7.20 








100522 


X51501 


Hs.99949 


prolactin-induced protein 








14.20 




100559 


NM_000094


Hs.1640 


collagen, type VII, alpha 1 (epidermolys 


3.10 










100576 


X00356 


Hs.37058 


calcitonin/calcitonin-related polypeptid 








9.30 




100629 


AA015693 


Hs.21291 


mitogen-activated protein kinase kinase








20.60 




100661 


BE623001 


Hs.132748


Homo sapiens ribosomal protein L39 mRNA, 


3.85 










100677 


AA353686 


Hs.57813


zinc ribbon domain containing, 1 




8.60 








100696 


D14887 


Hs.121686


general transcription factor IIA, 1 (37k








10.00 




100709 


N26539 


Hs.100469


myeloid/lymphoid or mixed-lineage leukem 






24.80 






100761 


BE208491 


Hs.295112 


KIAA0618 gene product 




7.60 








100830 


AC004770 


Hs.4756 


flap structure-specific endonuclease 1 










7.99 


100867 


U14622 




gb:Human transketolase-like protein gene




10.20 








100902 


M16029 


Hs.287270 


ret proto-oncogene (multiple endocrine n 




8.00








100906 


AU076916 


Hs.5398 


guanine monphosphate synthetase 










5.16 


100960 


J00124 


Hs.117729


keratin 14 (epidermolysis bullosa simple 


2.57 










101045 


J05614 




gb:Human proliferating cell nuclear anti










4.69 


101061 


NM_000175 


Hs.180532 


glucose phosphate isomerase 










4.19 


101071 


L02840 


Hs.84244 


potassium voltage-gated channel, Shab-re 




12.91 








101124 


L10343 


Hs.112341


protease inhibitor 3, skin-derived (SKAL 


3.12 










101175 


U82671 


Hs.36980 


melanoma antigen, family A, 2 


3.50 










101181 


BE262621 


Hs.73798 


macrophage migration inhibitory factor ( 










5.69 


101204 


L24203 


Hs.82237 


ataxia-telangiectasia group D-associated 


4.08 










101210 


L29301 


Hs.2353 


opioid receptor, mu 1 






6.40 






101216 


AA284166 


Hs.84113 


cyclin-dependent kinase inhibitor 3 (CDK 


2.53 










101228 


AA333387 


Hs.82916 


chaperonin containing TCP1, subunit 6A ( 










7.90 


101233 


AL135173 


Hs.878 


sorbitol dehydrogenase 










4.45 


101273 


Z11933 


Hs.182505 


POU domain, class 3, transcription facto 


8.50 










101342 


U52112 


Hs.182018 


interleukin-1 receptor-associated kinase










4.17 


101346 


AI738616 


Hs.77348 


hydroxyprostaglandin dehydrogenase 15-(N








21.89 




101369 


NM_000892 


Hs.1901 


kallikrein B, plasma (Fletcher factor) 1 








12.80 




101396 


BE267931 


Hs.78996 


proliferating cell nuclear antigen 


3.24 










101431 


BE185289 


Hs.1076 


small proline-rich protein 1B (cornifin)










7.90 


101448 


NM_000424


Hs.195850 


keratin 5 (epidermolysis bullosa simplex 


8.31 










101462 


AL035668 


Hs.73853 


bone morphogenetic protein 2 








38.80 




101466 


BE262660 


Hs.170197 


glutamic-oxaloacetic transaminase 2, mit 










4.01 


101484 


AA053486 


Hs.20315 


interferon-induced protein with tetratri 








12.00 




101502 


M26958




gb:Human parathyroid hormone-related pro
asparagine synthetase 


10.50 










101505 


AA307680 


Hs.75692 










4.46 


101526 


NM_002197


Hs.154721


aconitase 1, soluble


4.02 










101535 


X57152 


Hs.99853 


fibrillarin










4.65 


101577 


M34353 


Hs.1041


v-ros avian UR2 sarcoma virus oncogene h 








9.09 




101649 


AW959908 


Hs.1690 


heparin-binding growth factor binding pr 


54.00 










101663 


NM_003528


Hs.2178 


H2B histone family, member Q 


5.59 










101664 


AA436989 


Hs.121017 


H2A histone family, member A 


7.00 










101669 


L24498


Hs.80409 


growth arrest and DNA-damage-inducible,




7.60 












101695 


M69136 


Hs.135626 


101724 


L11690 


Hs.620 


101748 


NM_001944


Hs.1925 


101759 


M80244 


Hs.184601 


101771 


NM_002432


Hs.153837 


101804 


M86699


Hs.169840 


101809 


M86849 


Hs.323733 


101833 


AU076442 


Hs.117938


101842 


M93221 


Hs.75182 


101851 


BE260964 


Hs.82045 


102002 


NM_002484 


Hs.81469 


102039 


AL134223 


Hs.306098 


102072 


U09410 


Hs.78743 


102083 


T35901 


Hs.75117 


102111 


L36196 


Hs.81884 


102123 


NM_001809 


Hs.1594 


102154 


U17760 


Hs.75517 


102193 


AL036335 


Hs.313 


102217 


AA829978 


Hs.301613 


102224 


NM_002810


Hs.148495 


102234 


AW163390 


Hs.278554 


102251 


NM_004398


Hs.41706 


102305 


AL043202 


Hs.90073 


102330 


BE298063 


Hs.77254 


102340 


U37055 


Hs.278657 


102348 


U37519 


Hs.87539 


102368 


U39817 


Hs.36820 


102394 


NM_003816


Hs.2442 


102404 


MO05429 


Hs.79141 


102537 


U57094 


Hs.50477 


102581 


AU077228 


Hs.77256 


102605 


AI435128 


Hs.181369 


102610 


U65011 


Hs.30743 


102623 


AW249285 


Hs.37110 


102642 


AA205847 


Hs.23016 


102654 


AV649989 


Hs.24385 


102659 


BE245169 


Hs.211610 


102669 


U71207 


Hs.29279 


102672 


U72066 


Hs.29287 


102687 


NM_007019 


Hs.93002 


102696 


BE540274 


Hs.239 


102768 


U82321 




102781 


BE258778 


Hs.108809 


102784 


U85658 


Hs.61798 


102824 


U90916 


Hs.82845 


102829 


NM_006183


Hs.80962 


102888 


AI346201


Hs.76118 


102892 


BE440042 


Hs.83326 


102913 


NM_002275


Hs.80342 


102935 


BE561850 


Hs.80506 


102951 


X15218 


Hs.2969 


102983 


BE387202 


Hs.118638


103023 


AW500470


Hs.117950


103036 


M13509 


Hs.83169 


103038 


AA926960 


Hs.334883 


103060 
8.12






128920 


AA622037 


Hs.166468 


programmed cell death 5 










128924 


BE279383 


Hs.26557 


plakophilin 3










128971 


H05132 


Hs.107510 


ESTs 




1260 






129008 


AL079648 


Hs.301088 


ESTs 




8.80 






129041 


BE382756 


Hs.169902 


solute carrier family 2 (facilitated glu










129075 


BE250162 


Hs.83765 


dihydrofolate reductase 


259 








129105 


AI769160 


Hs.108681 


Homo sapiens brain tumor associated prot 






6.67 




129189 


AB023179 


Hs.9059 


KIAA0962 protein 




8.00 






129229 


AF013758 


Hs.109543 


polyadenylate binding protein-interactin


4.00 








129241 


AI878857


Hs.109706 


hematological and neurological expressed 










129300 


W94197 


Hs.110165 


ribosomal protein L26 homolog


255 








129404 


AI2677G0 


Hs.317584 


ESTs 


18.00 








129457 


X61959 


Hs.207776 


aspartylglucosaminidase


6.50 








129466 


L42583 


Hs.334309 


keratin 6A 


1294 








129494 


AI148976


Hs.112062


ESTs 








11.00 


129605 


AF061812 


Hs.115947


keratin 16 (focal non-epidermolytic palm 










129841 


AI911527 


Hs.11805 


ESTs 








1200 


129665 


AW163331


Hs.118778


KDEL (Lys-Asp-Glu-Leu) endoplasmic retic 










129703 


BE388665 


Hs.179999


Homo sapiens, clone IMAGE:3457003, mRNA










129720 


AA156214 


Hs.12152 


APMCF1 protein 










129748 


M16707 


Hs.123053 


H4 histone, family 2 


3.50 








129890 


AI868872 


Hs.282804 


hypothetical protein FLJ22704










129896 


BE295568 


Hs.13225 


UDP-Gal:betaGlcNAc beta 1,4- galactosylt


256 








129945 


BE514376 


Hs.165998


PAI-1 mRNA-binding protein 










130010 


AA301116 


Hs.142838 


nucleolar phosphoprotein Nopp34 






7.00 




130026 


T40480 


Hs.332112 


EST 




6.40 






130080 


X14850 


Hs.147097 


H2A histone family, member X 










130149 


AW067805 


Hs.172665


methylenetetrahydrofolate dehydrogenase


274 








130285 


M063546 


Hs.75981 


ubiquitin specific protease 14 (tRNA-gua






7.40 




130441 


U63630 


Hs.155637


protein kinase, DNA-activated, catalytic 










130482 


AW409701 


Hs.1578


baculoviral IAP repeat-containing 5 (sur


4.87 








130500 


AB007913 


Hs.158291 


KIAA0444 protein








9.60 


130524 


U89995 


Hs.159234


forkhead box E1 (thyroid transcription f 






13.40 




130541 


X05608 


Hs.211584 


neurofilament, light polypeptide (68kD)






8.20 




130553 


AF062649 


Hs.252587 


pituitary tumor-transforming 1 










130567 


AA383092 


Hs.1608 


replication protein A3 (14kD) 






7.00 




130577 


M69241 


Hs.162 


insulin-like growth factor binding prote 


3.04 








130627 


BE003054 


Hs.1695 


matrix metalloproteinase 12 (macrophage 


3.87 








130648 


AI458165 


Hs.17296 


hypothetical protein MGC2376 








16.20 


130697 


L29472 


Hs.1802


major histocompatibility complex, class








17.80 


130744 


H59696 


Hs.18747


POP7 (processing of precursor, S. cerevi 










130800 


AI187292 


Hs.19574 


hypothetical protein MGC5469 










130867 


NM_001072


Hs.284239


UDP glycosyltransferase 1 family, polype


16.84 








130869 


J03626 


Hs.2057 


uridine monophosphate synthetase (orotat 










130925 


AF093419 


Hs.169378 


multiple PDZ domain protein 








9.60 


130994 


W17044 


Hs.327337


ESTs 




12.40 






131028 


AI879165 


Hs.2227


CCAAT/enhancer binding protein (C/EBP), 


10.21 








131031 


NM_001650 


Hs.288650 


aquaporin 4 








9.80 


131041 


T15767 


Hs.22452 


Homo sapiens mRNA for KIAA1737 protein, 








9.60 


131058 


W28545 


Hs.101514 


hypothetical protein FLJ10342








17.00 


131090 


AI143139 


Hs.2288 


visinin-like 1


274 








131112 


H15302 


Hs.168950


Homo sapiens mRNA; cDNA DKFZp566A1046 (f 






8.80 




131148 


AW953575 


Hs.303125 


p53-induced protein PIGPC1


3.12 








131185 


BE280074 


Hs.23980 


cyclin B1 


3.07 








131200 


BE540516 


Hs.293732 


hypothetical protein MGC3195 


3.07 








131219 


W25005 


Hs.24395 


small inducible cytokine subfamily B (Cy 


287 








131257 


AW339037 


Hs.24908


ESTs 








14.67 


131375 


AW293165 


Hs.143134 


ESTs 






19.20 




131460 


NM_003729


Hs.27076


RNA 3'-terminal phosphate cyclase


3.50 








131476 


AI521663 


Hs.334644 


hypothetical protein FLJ14668


15.00 








131510 


BE245374 


Hs.27842 


hypothetical protein FLJ11210






7.80 




131646 


BE302464 


Hs.30057 


MRS2 (S. cerevisiae)-like, magnesium hom






7.00 




131786 


BE000971 


Hs.306083 


Novel human gene mapping to chomosome 22 


265 








131839 


AB014533 


Hs.33010 


KIAA0633 protein








35.20 


131843 


AA192315 


Hs.184062 


putative Rab5-interacting protein











4.30 
4.56 

4.48 

4.62 
4.04 

6.05 



4.06 



4.46 

4.70 
4.02 
5.71 

4.21 

4.03 

4.65 

3.91 



6.06 



5.28 
4.43 

4.92 



4.11 
131877


J04088 


Hs.156346 


topoisomerase (DNA) II alpha (170kD)


19.00 










131885 


BE502341 


Hs.3402 


ESTs 


6.48 










131921 


AA456093 


Hs.34720 


ESTs 




8.40






131945 


NM_002916 


Hs.35120 


replication factor C (activator 1) 4 (37 


56.00 








131958 


NM_014062


Hs.3566 


ART-4 protein 










3.82 


131965 


W79283 


Hs.35962 


ESTs 


3.03 










132000 


AW247017 


Hs.36978 


melanoma antigen, family A, 3 




9.80 








132040 


NM_001196


Hs.315689


Homo sapiens cDNA: FLJ22373 fis, clone H


3.30 










132109 


AW190902 


Hs.40098 


cysteine knot superfamily 1, BMP antagon


21.00 










132114 


NM_006152


Hs.40202


lymphoid-restricted membrane protein




8.40 








132162 


M315805 


Hs.94560 


desmoglein 2 










12.25 


132164 


AI752235 


Hs.41270 


procollagen-lysine, 2-oxoglutarate 5-dio


170 










132180 


NM_004460


Hs.418 


fibroblast activation protein, alpha 


2.71










132181 


AW981231 


Hs.16773 


Homo sapiens clone TCCCIA00427 mRNA sequ 


183 










132182 


NM_014210


Hs.70499 


ecotropic viral integration site 2A 








1350 




132231 


AA662910 


Hs.42635 


hypothetical protein DKFZp434K2435 


9.50 










132277 


AK001745 


Hs.184628 


hypothetical protein FLJ10883


4.50 










132328 


NM_014787


Hs.44896 


DnaJ (Hsp40) homolog, subfamily B, membe 








9.20 




132394 


AK001680 


Hs.30488 


DKFZP434F091 protein








19.60 




132424 


AA417878 


Hs.48401 


ESTs, Moderately similar to ALU8_HUMAN A






8.60 






132528 


T78736 


Hs.50758 


SMC4 (structural maintenance of chromoso 






27.40 






132543 


BE568452 


Hs.5101 


protein regulator of cytokinesis 1 


4.38 










132544 


L19778 


Hs.51011 


H2A histone family, member P




7.00 








132550 


AW969253 


Hs.170195 


bone morphogenetic protein 7 (osteogenic 


164 










132552 


BE621985 


Hs.296922 


thiopurine S-methyltransferase








15.83 




132581 


AK000631 


Hs.52256 


hypothetical protein FU20624 






6.60 






132617 


AF037335 


Hs.5338


carbonic anhydrase XII 


4.95 










132638 


AI796870 


Hs.54277 


DNA segment on chromosome X (unique) 992 




8.20 








132653 


Z15008 


Hs.54451 


laminin, gamma 2 (nicein (100kD), kalini 


4.38 










132669 


W38586 


Hs.293981 


guanine nucleotide binding protein (G pr 










4.36 


132710 


W74001 


Hs.55279 


serine (or cysteine) proteinase inhibito 


4.60 










132771 


Y10275 


Hs.56407 


phosphoserine phosphatase 


3.71 










132799 


W73311 


Hs.169407 


SAC2 (suppressor of actin mutations 2, 








9.48 




132833 


U78525 


Hs.57783 


eukaryotic translation initiation factor 
tensin 










5.83 


132892 


AW834050 


Hs.9973 








12.00 




132906 


BE613337 


Hs.234896 


geminin 


3.09 










132959 


AW014195 


Hs.61472 


ESTs, Weakly similar to YAE6_YEAST HYPOT










3.87 


132962 


AA576635 


Hs.6153 


CGI-48 protein 


3.50 










132990 


X77343 


Hs.334334 


transcription factor AP-2 alpha (activat 


6.18 










132994 


AA112748


Hs.279905 


clone HQ0310PRO0310p1 


3.19 










133000 


AL042444 


Hs.62402 


p21/Cdc42/Rac1-activated kinase 1 (yeast


2.96 










133050 


X73424 


Hs.63788 


propionyl Coenzyme A carboxylase, beta p 


2.55 










133083 


BE244588 


Hs.6456 


chaperonin containing TCP1, subunit 2 (b 










4.00 


133086 


L17131 


Hs.139800


high-mobility group (nonhistone chromoso 










8.96 


133134 


AF198620 


Hs.65648 


RNA binding motif protein 8A 










4.28 


133155 


M58583 


Hs.662 


cerebellin 1 precursor 








10.60 




133181 


X91662 


Hs.66744 


twist (Drosophila) homolog (acrocephalos


3.00 










133204 


BE267696 


Hs.254105 


enolase 1, (alpha) 










4.63 


133412 


U41493 


Hs.73112 


guanine nucleotide binding protein (G pr 




12.50 








133421 


AF134160 


Hs.7327 


claudin 1


185 










133451 


AW970026 


Hs.73818 


ubiquinol-cytochrome c reductase hinge p










4.66 


133453 


AI659306 


Hs.73826 


protein tyrosine phosphatase, non-recept 




6.80 








133504 


NM_004415 


Hs.74316 


desmoplakin (DPI, DPII)


6.14 










133506 


BE562958 


Hs.74346 


hypothetical protein MGC14353 










4.55 


133615 


M62843 


Hs.75236 


ELAV (embryonic lethal, abnormal vision,








17.80 




133627 


NM_002047


Hs.75280


glycyl-tRNA synthetase










4.85 


133649 


U25849 


Hs.75393 


acid phosphatase 1, soluble 










6.34 


133669 


NM_006925


Hs.166975


splicing factor, arginine/serine-rich 5








14.00 




133749 


L20852 


Hs.10018 


solute carrier family 20 (phosphate tran 






6.11 






133776 


BE268649 


Hs.177766


ADP-ribosyltransferase (NAD+; poly (ADP-










4.91 


133865 


AB011155 


Hs.170290


discs, large (Drosophila) homolog 5 


3.07 










133946 


AJ001258 


Hs.173878 


NIPSNAP, C. elegans, homolog 1










4.60 


133973 


N55540 


Hs.78026 


ESTs, Weakly similar to similar to ankyr 








13.00 




134047 


BE262529 


Hs.78771 


phosphoglycerate kinase 1 










3.85 


134098 


BE513171 


Hs.79086 


mitochondrial ribosomal protein L3 


2.56










134107 


NM_005629


Hs.187958


solute carrier family 6 (neurotransmitte 






8.20 






134112 


AW449809 


Hs.79150 


chaperonin containing TCP1, subunit 4 (d 










4.08 


134158 


U15174 


Hs.79428 


BCL2/adenovirus E1B 19kD-interacting pro


31.00 










134160 


T98152 


Hs.79432 


fibrillin 2 (congenital contractural ara 






24.60 






134168 


AA398908 


Hs.181634 


Homo sapiens cDNA: FLJ23602 fis, clone L










6.71 


134185 


AA285136 


Hs.301914 


neuronal specific transcription factor D 








14.74 




134201 


L35035 


Hs.79886 


ribose 5-phosphate isomerase A (ribose 5 




8.40 








134272 


X76040 


Hs.278614 


protease, serine, 15 


4.50 










134276 


BE083936 


Hs.80976 


antigen identified by monoclonal antibod




9.00 








134353 


AL138201 


Hs.82120 


nuclear receptor subfamily 4, group A, m 








16.40 




134367 


AA339449 


Hs.82285 


phosphoribosylglycinamide formyltransfer


280 










134380 


AU077143 


Hs.179565 


minichromosome maintenance deficient (S.


4.68 










134423 


H53497 


Hs.83006 


CGM39 protein 










3.84 


134469 


AA279661 


Hs.83753 


small nuclear ribonucleoprotein polypept










5.81 


134470 


X54942 


Hs.83758 


CDC28 protein kinase 2 










4.21 


134498 


AW246273 


Hs.84131 


threonyl-tRNA synthetase










7.30 


134502 


BE148534 


Hs.84168 


UV-B repressed sequence, HUR 7 




13.60 








134510 


NM_002757


Hs.250870


mitogen-activated protein kinase kinase








9.70 




134548 


N95406 


Hs.333495 


Deleted in split-hand/split-foot 1 regio










4.63 


134654 


AK001741 


Hs.8739 


hypothetical protein FLJ10879 


6.00 
134724


AF045239 


Hs.321576 


134743 


AA044163 


Hs.89453 


134781 


AA374372 


Hs.89626 


134806 


AD001528


Hs.89718 


134853 


BE268326 


Hs.90280


134859 


D26488 


Hs.90315 


134891 


R51083 


Hs.90787 


134960 


BE246400 


Hs.285176 


134993 


BE409809 


Hs.301005 


135047 


AL134197 


Hs.93597 


135080 


AI761180


Hs.94211 


135103 


NM_003428


Hs.9450 


135145 


AW014729 


Hs.95262 


135184 


U13222 


Hs.96028 


135242


AI583187


Hs.9700 


135286 


AW023482 


Hs.97849 


135289 


AW372569 


Hs.9788 


135355 


AK001652 


Hs.99423 


135371 




Hs.997 


135393 


L11244 


Hs.99885 



ring finger protein 22
potassium large conductance calcium-acti
parathyroid hormone-like hormone
spermine synthase

5-aminoimidazole-4-carboxamide ribonucle

KIAA0007 protein

ESTs

acetyl-Coenzyme A transporter
purine-rich element binding protein B
cyclin-dependent kinase 5, regulatory su
rcd1 (required for cell differentiation,
zinc finger protein 84 (HPF2)
nuclear factor related to kappa B bindin
forkhead box O1
cyclin E1
ESTs

hypothetical protein MGC10924 similar to
ATP-dependent RNA helicase
protease, serine, 22

complement component 4-binding protein,



1100 



4.00 



4.00 

9.50 
5.00 



13.50 
6.46 

10.00 

8.00



25.20 



6.20 
7.40 



11.00 



8.80 



7.00 






4.58 
4.79 



4.48 



4.01 



14.60 



Table 5B shows the accession numbers for those primekeys lacking UnigeneIDs for Table 5A. For each probeset we have listed the gene cluster number from which the
oligonucleotides were designed. Gene clusters were compiled using sequences derived from Genbank ESTs and mRNAs. These sequences were clustered based on sequence
similarity using Clustering and Alignment Tools (DoubleTwist, Oakland, California). The Genbank accession numbers for sequences comprising each cluster are listed in the
"Accession" column.

Pkey: Unique Eos probeset identifier number
CAT number: Gene cluster number
Accession: Genbank accession numbers
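The relationship among the three columns described here (one Pkey, one cluster identifier, one or more member accessions) can be sketched as a simple lookup structure. The function names and the whitespace-delimited row format are assumptions for illustration; the two sample rows are taken from the table entries listed in this section.

```python
def parse_cluster_row(line):
    """Split one 'Pkey CAT-number accession ...' row into its three parts."""
    fields = line.split()
    if len(fields) < 3:
        raise ValueError(
            f"expected Pkey, CAT number and at least one accession: {line!r}"
        )
    return int(fields[0]), fields[1], fields[2:]


def build_lookup(rows):
    """Map each Pkey to its cluster identifier and member accessions."""
    lookup = {}
    for row in rows:
        pkey, cat, accessions = parse_cluster_row(row)
        lookup[pkey] = {"cat": cat, "accessions": accessions}
    return lookup


rows = [
    "119599 genbank_W45552 W45552",
    "100071 entrez_A28102 A28102",
]
lookup = build_lookup(rows)
```

Keying on Pkey mirrors how the table is meant to be read: a probeset without a UnigeneID in Table 5A is resolved to the sequences of its cluster here.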



Pkey 

117079 
124305 
101502 
109792 
126034 
102768 
126345 
127066 
127099 
119243 
125875 
112054 



CAT number 



126992 
122318 
114699 
114793 
108305 
108393 
100867 
123731 
109700 
1207W 
113702 
115113 
101045 
108554 
108573 
119052 
126522 
126605 



1621717_1
242183_1
18202_-6
754958_1
1598157_1
44641_1
1653833_1
1703458_1
244301_1
1774795_1
1566433_1
1538292_1
171411_1
880655_1
292419_1
135322_1
150742_1
111550_1
113411_1
tigr_HT4586
genbank_AA609839
genbank_F09609 F09609
genbank_AA292700
genbank_T97307 T97307
genbank_AA256460
entrez_J05614 J05614
genbank_AA084948
genbank_AA086005
149538_1
416020_1
439280_1



H92325 T97125

AW963221 AA344870 AA344871 H93331
M26958
R49625 F10674
H60340 N91637
U82321 H66077
N49713 N49819 W03810
R25066 R20144 R20145 Z43845
AA347668 AW956810 Z44271 F07065 F07064 R13506
T12603 T12604
H14480 N98295
R43590 F10439
AA210954 AA211007
AI809521 H12174 Z42556
AA429743 AA442754
AA127386 R15644 AA127404
AA158245 AA158235
AA071391 AA069892 AA069891 
AA075211 AA075245 AA075126 AA074946 
U14622 

AA609839 



AA292700 
AA256460 



AA084948 
AA086005 
R10889 R10888
W31912 AI167491

AA676910 AA778853 AA778865 W86800 

103768 46922_1 W42667 AI580740 AI690440 AI561350 AW467906 AW151450 AI825927 AL041716 AI885600 AI742213 AW248624 AI955498 AA033947

AA845593 AI623711 N68583 C00064 AA193567 AW083858 AW163216 AA191595 AA522778 AI628008 AI915518 AA843508 AI926195
AA176265 AW167963 AA992115 W93647 AW103572 AI862994 AI342059 AA911719 AA176155 AA024712 AA069988 AA205591 AI591107
AI199673 AI811766 AJ275B32 AI422233 AI191852 AI096682 AI580124

H44848 H20477 T91695 W47039 AA070055 AA024795 AA328855 AA379248 AA379330 AA385580 W25920 W03688 AA448359 AA093881
AW362477 AA089997 AI350265 W93479 N99688 M932257 AW351469 H68590 AA663402 AA069771 AW087986 AI858420 AA600214
AI970774 AI857712 AI683081 AI885584 AW131150 AI567981 AW002714 AW189973 AW075495 AW168303 AA953714 AW516881 AI357375
AI566663 AW512676 AI570580 AI023690 AA448216 AI079853 AI422707 AA779516 AW026972 AW130082 AW162307 AW438646 AA709332
AW192394 AI167350 AI217879 AI129152 AA719509 AI350480 AA663418 AI003634 AW118546 AA180261 AA442833 AI268625 AA888881
AI038759 AA846723 AI248770 AA993694 AI280335 AI885107 AW518649 AA641563 AA995835 AA582521 AI276744 AA436478 AI017360
AI620763 AI859887 hf73926 AI076327 AI741615 AI160617 AW172819 AI492005 AA677429 M996334 AJ693771 AI950039 AI245629 AI288515
AI866186 T93293 AA173262 AA599779 AI680092 AW439316 AI084555 AI272672 AI583507 AW473219 AA738132 AW473283 AI367492
AA995410 AI689624 AA206353 AI033095 AI040382 AAB73S30 A122W4 AI934840 AJ418680 AA844306 R94503 AA773520 AA843169
AA219425 M629658 AI811719 AW411275 AI590981 W37907 AI591178 AI684051 AA983238 AA669347 AA976239 AA704570 AI628339
AI884391 AI241580 AI003539 AW176687 AA009650 N34566 AI333493 AI186070 AA070827 AA411683 AI280884 AA872023 AA207255
AA021576 N71953 AI885888 AW076039 T15777 AI537673 AW248048 H09554 W93480 W47001 AW079114 AA063160 AA757453 R60788
AI859431 H20478 AA218882 AA757465 AA100995 AI864135 AI934209 M070503 H47008 AA219646 W61039 W93907 AW385050 W37967
W78028 AA189007 AA479136 R93650 AA442312 T30287 AA847628 AA180262 AA009649 C03892 AW149464 AA310963 AA219693
M069747 R29207 M094784 AA293615 AA447848 AI984167 N90393 C05097 N56499 AW292351 AW149681 AW473258 AA629322 AI004409
AW105577 AI954937 AI811070 AA902422 AW514437 AA535460 AA916877 AW517122 AA974657 AA975649 AW517130 AW517129 F31737
W07688 M193645 AA378994 AA489273 F32267 W39303 M021181 N86810 AA406524 AA062553 AA436801 H08985 H15979 N40310
AA436789 AA232172 AW360778 W25862 R60282 AA436530 AA378894 AA187461 AI940535 AA604210 M089514 AA3S0421 N88243 N84281
AA209340 N56174 N88374 AA191088 AW247691 AA249013 AA093111 AA972536 AW298594 AA375893 T12139 W28186 AW243849
AI288629 AA843998 W15260 AI188288 AW248079 R15836

119599 genbank_W45552 W45552 

112382 genbank_R59904 R59904 

105264 genbank_AA227934 AA227934

100071 entrez_A28102 A28102 

123315 714071_1 AA496369 AA496546



Table 6A shows 99 genes up-regulated in nonsmokers with lung cancer relative to smokers with lung cancer. These genes were selected from 59580 probesets on the
Eos/Affymetrix Hu03 Genechip array. Gene expression data for each probeset obtained from this analysis was expressed as average intensity (AI), a normalized value reflecting
the relative level of mRNA expression.

Pkey: Unique Eos probeset identifier number

ExAccn: Exemplar Accession number, Genbank accession number

UnigeneID: Unigene number

Unigene Title: Unigene gene title

R1: average of AI for samples from non-smokers with adenocarcinoma divided by the 90th percentile of AI for samples from smokers with adenocarcinoma

R2: average of AI for samples from non-smokers with squamous cell carcinoma divided by the 90th percentile of AI for samples from smokers with squamous cell
carcinoma
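The R1 and R2 ratios of this legend share one form: the mean AI of the non-smoker samples divided by the 90th percentile of AI for the smoker samples of the same tumor type. A minimal sketch follows; the function names and the linear-interpolation percentile rule are assumptions for illustration, since the text does not specify a percentile convention.

```python
from statistics import fmean


def percentile(values, q):
    """Return the q-th percentile (0-100) of values.

    Linear interpolation between ranks is an assumption; the source
    does not specify a percentile convention.
    """
    data = sorted(values)
    if not data:
        raise ValueError("no intensity values supplied")
    pos = (len(data) - 1) * q / 100.0
    lo = int(pos)
    hi = min(lo + 1, len(data) - 1)
    return data[lo] + (data[hi] - data[lo]) * (pos - lo)


def nonsmoker_ratio(nonsmoker_ai, smoker_ai):
    """Mean AI of non-smoker samples over the 90th percentile of smoker AI.

    Call once with adenocarcinoma samples for R1 and once with
    squamous cell carcinoma samples for R2.
    """
    return fmean(nonsmoker_ai) / percentile(smoker_ai, 90)


# Hypothetical AI values, chosen only to make the arithmetic easy to follow.
r1_example = nonsmoker_ratio([6.0, 10.0], [0.0] * 5 + [2.0] * 5)
```

Using a high percentile of the smoker group in the denominator means a probeset scores well only when the non-smoker average exceeds nearly all smoker samples, not merely their typical level.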



Pkey 


ExAccn 


UnigeneID


Unigene Title


R1 


R2 


MflQ71 


BE379727 


Hs.83213 


fatty acid binding protein 4, adipocyte




3.64 




L17330 


Hs.280 




15.00 






Y12490 


Hs.85092 


thyroid hormone receptor interactor 11




2.46 




AA001021 


Hs.6685 


thyroid hormone receptor interactor 8




12.00 


lUloUb 


AA586894 


Hs.112408 


S100 calcium-binding protein A7 (psorias




2.68


till VI c. 


S82472 




nh'hpla ~nn\-DbiA nnlvmprtw* hflla /fly nn a 

Ul/.UOia *pwrVI»n JA/f Jrl 1 lei C*X7 vela y3A.vfl a 




2.11 


WjlH 4- 


U30930 


Hs.158540 


UDP glycosyltransferase 8 (UDP-galactose


7.50 




102394 


NM_003816


Hs.2442


a disintegrin and metalloproteinase doma


7 <5A 






U92015 






13.50 




103010 


X52509 


Hs.161640


tyrosine aminotransferase 


n.ou 






X98266 








2.50 


103563 


L02911 


Hs.150402 


activin A receptor, type I


q nn 




lQ3o5/ 


AI076795 


Hs.45033 


lacnnidi pruiiiic ncn piuieni 




3.94 


104239 


AB002367 


Hs.21355 


doublecortin and CaM kinase-like 1


U.Dlf 




i cm con 


AW373062 


Hs.83623 


nuclear receptor subfamily 1, group I, m




12.66 


104907 


AA055829 


Hs.196701 


CQTe Woslflu cimilar in A) 1 11 HI IMAM Al i l 






inC4 04 

lUol J 1 


BE514788 


Hs.296244 






2.17 


IUbo/2 


K47233 


Hs.30643 


ESTs


7 on 




IUbb72 


T56887 


Hs.18282 


IflA A1 1 0A nrnfain 

nam! i o*» protein 


11 ^n 




incocn 

1Uo9o0 


AA156238 


Hs.32501 


ESTs




9 7A 


lUb9/i 


Z43846 


Hs.194478 


Unmn canto no mOMA> rnMA nKF7nA1<ini /f 

rtomo sapiens mrciNn, cuinm ur\rt.p*w*iu io/ z \i 


q en 






AA035375 


Hs.57887 


to i s, weawy similar 10 immhu i oo proiei 






108562 


M100796 




gb:zm26c06.s1 Stratagene pancreas (93720 


4C cn 




108599 


AB018549


Hs.69328


MD-2 protein






108663 


BE219231 


Hs.292653 


tSTs, weawy similar to i2bo4D nypotneu 




0 An 


109247 


M314907 


Hs.85950 


ESTs


7 nn 




4ftt>£Qn 

luabou 






ESTs




c nn 


110193 


AI004874 


Hs.310764 


HOmo Sapiens mKNA, CUNA Ul\rZp4J4MU0£ 1" 


10 *^n 




110234 


H24458 


Hs.32085 


COT 

tol 


ic cn 
ID.OU 




110644 


R94207 


Hs.268989 


tSTs, Hignly similar to type [| CALM/Ari 


a nn 




IIUBdO 


AW274992 


Hs.72249 


three-PDZ containing protein similar to


17 nn 
If .uu 




111057 


T79639 


Hs.14629 


ESTs


1R ^n 

ID.OU 




111950 


AF071594 


Hs.110457


Wolf-Hirschhorn syndrome candidate 1


11 nn 




112291 


R53972 


Hs.26026 


COT p. 

to is 




^nn 


4 4 9GGR 


Z43784 


Hs.75893 


ankyrin 3, node of Ranvier (ankyrin G)




2.79


113009 


T23699 


Hs.7246 


ESTs




a cn 


113060 


BE564162 


Hs.250820 


hypothetical protein FLJ14827


9.79 




113073 


N39342 


Hs.103042 


microtubule-associated protein 1B


32.50 




113074 


AK001335 


Hs.31137 


protein tyrosine phosphatase, receptor t 




3.82 


113121 


T48011 


Hs.8764 


EST 




2.21


113125 


AA968672 


Hs.8929 


hypothetical protein FLJ11362


19.50 




113757 


M703095 


Hs.18631 


ESTs 




2.65 


113848 


W52854 


Hs.27099 


hypothetical protein FLJ23293 similar to


6.00 




113884 


AI333076 


Hs.28529 


chromosome 12 open reading frame 2 




6.00 


113936 


W17056 


Hs.83623 


nuclear receptor subfamily 1, group I, m 




4.63 


114875 


AA235609 


Hs.236443 


Homo sapiens mRNA; cDNA DKFZp564N1063 ( 




7.00 


114987 


AA251016 


Hs.87808 


EST 




6.00 


115460 


AW958439 


Hs.38613 


ESTs 




2.27 


115722 


W91892 


Hs.59609 


ESTs 




9.00 


116261 


AA481788 


Hs.190150 


ESTs 


9.50 




116830 


H61037 


Hs.70404 


ESTs, Weakly similar to ALU2_HUMAN ALU


8.50




116970 


AB023179 


Hs.9059 


KIAA0962 protein 


7.50 




117178 


H98675 


Hs.269034 


ESTs 




2.68


117757 


AF088019 


Hs.46732 


EST 


7.50 




118283 


AA287747 


Hs.173012 


ESTs, Weakly similar to A46010 X-linked


16.50 




118384 


AF217525 


Hs.49002 


Down syndrome cell adhesion molecule 




2.50


118657 


AI822106 


Hs.49902 


ESTs 




2.39


120328 


M923278 


Hs.290905 


ESTs, Weakly similar to protease [H.sapi




3.50 


120404 


AB023230 


Hs.96427 


KIAA1013 protein 


7.00 




120524 


AA261852 


Hs.192905 


ESTs 


6.00 




120688 


AW207555 


Hs.97093 


Homo sapiens cDNA: FU23004 fis, clone L 


17.92 





120 











121558 


AA412497 




gb:zl95g12s1 SoaresJesOs_NHT Homo sap 




Z95 


121676 


H56037 


Hs.108146 


ESTs 


10.00 




121936 


A1024600 


Hs.98612 


ESTs 


15.00 




121938 


AA428659 


Hs.98610 


ESTs 


14.00 




122177 


AA435789 


Hs.98833 


EST 


8.93 




123442 


AA299652 


Hs.1 11496 


Homo sapiens cDNA FU11643 fis, done HE 


13.04 




123551 


AA608837 




gb:afD3M2-s1 SoaresJestis_NHT Homo sap 


11.50 




123756 


AA609971 


Hs.1 12795 


EST 


11.00 




123861 


AA620840 




gb:af89gOU1 Soares_testis„NHT Homo sap 




250 


124371 


N24924 


Hs.188601 


ESTs 


6.50 




127477 


BE328720 


Hs.280651 


ESTs 




4.33 


127591 


A1190540 


Hs.131092 


ESTs 




3.02 


128252 


AA455924 


Hs.192228 


ESTs 


7.00 




128426 


A1265784 


Hs.145197 


ESTs 




2.08 


128925 


R67419 


Hs.21851 


Homo sapiens cDNA FU 12900 fis, done NT 




211 


128945 


AJ990506 


Hs.8077 


Homo sapiens mRNA; cDNA DKFZp547E184 (fr 


10.00 




129105 


AI769160 


Hs.108681 


Homo sapiens brain tumor associated prot 


15.50 




129235 


AW977238 


Hs.126084 


KIAA1055 protein 




4.25 


129506 


AB020684 


Hs.11217 


K1AA0877 protein 


6,50 




129595 


U09550 


Hs.1 154 


oviductal glycoprotein 1, 120kD (mucin 9 




10.00 


130160 


AA305688 


Hs.267695 


UDP-Gal:betaQcNAc beta 1,3-galactosyltr 


20.00 




130340 


D82326 


Hs.239106 


solute carrier family 3 (cystine, dibasi 


11.50 




131220 


AB023194 


Hs.300855 


KIAA0977 protein 


17.50 




131430 


AI879148 


Hs.26770 


fatty add binding protein 7, brain 


6.10 




132114 


NM_006152 


Hs.40202 


lymphoid-restncted membrane protein 




6.15 


132458 


AA935315 


Hs.48965 


Homo sapiens cDNA: FU21693 fis, done C 




5.58 


132647 


NlvL006927 


Hs.54432 


sialyltransferase 4B (beta-galadosidase 


7.50 




132655 


D49372 


Hs.54460 


small inducible cytokine subfamily A (Cy 




253 


132682 


AI077500 


Hs.54900 


serologically defined colon cancer aniig 




250 


132747 


AA345241 


Hs.55950 


ESTs, Weakly similar ta KIAA1330 protein 




283 


132812 


R50333 


Hs.92186 


Leman coiled-coi) protein 




3.82 


133337 


AF085983 


Hs.293676 


ESTs 




5.00 


133876 


AL134906 


Hs.771 


phosphorylase, glycogen; liver (Hers dis 




3.00 


134119 


AW157837 


Hs.79226 


fasciculation and elongation protein zet 




206 


134464 


AA302983 


Hs.239720 


CCR4-NOT transcription complex, subunit 




2.27 


134542 


M14156 


Hs.85112 


insuQn-Gke growth factor 1 (somatomedi 




11.50 


135002 


AA448542 


Hs.251677 


G antigen 7B 


87.00 




135305 


AA203555 


Hs.98288 


Homo sapiens cuna flj i4yuo ns, clone kl 




O.OU 



PCT/US02/12476



TABLE 6B shows the accession numbers for those primekeys lacking UnigeneIDs for Table 6A. For each probeset we have listed the gene cluster number from which the 
oligonucleotides were designed. Gene clusters were compiled using sequences derived from Genbank ESTs and mRNAs. These sequences were clustered based on sequence 
similarity using Clustering and Alignment Tools (DoubleTwist, Oakland, California). The Genbank accession numbers for sequences comprising each cluster are listed in the 
"Accession" column. 



Pkey: Unique Eos probeset identifier number 
CAT number: Gene cluster number 
Accession: Genbank accession numbers 



Pkey      CAT number          Accessions
108562    36375_1             AA100796 AF020589 AA074629 AA075946 AA100849 AA085347 AA126309 AA079311 AA079323 AA085274
103439    35330_1             X98266 N41124
123551    genbank_AA608837    AA608837
123861    genbank_AA620840    AA620840
102832    entrez_U92015       U92015
101972    entrez_S82472       S82472
121558    genbank_AA412497    AA412497
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WO 02/086443 PCT/US02/12476 

Table 7A shows 98 genes down-regulated in non-smokers with lung cancer relative to smokers with lung cancer. These genes were selected from 59680 probesets on the 
Eos/Affymetrix Hu03 Genechip array. Gene expression data for each probeset obtained from this analysis was expressed as average intensity (AI), a normalized value reflecting 
the relative level of mRNA expression. 

Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigeneID: Unigene number 

Unigene Title: Unigene gene title 

R1: 90th percentile of AI for samples from smokers with adenocarcinoma divided by the average of AI for samples from non-smokers with adenocarcinoma 

R2: 90th percentile of AI for samples from smokers with squamous cell carcinoma divided by the average of AI for samples from non-smokers with squamous cell 
carcinoma. 
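
The R1 and R2 ratios defined in this legend can be illustrated with a short sketch. It is only one possible reading of the definition: the function names `percentile` and `smoker_ratio` are hypothetical, the sample AI values are invented for illustration, and linear interpolation between ranks is an assumed convention, since the text does not state which percentile method was used.

```python
from statistics import mean


def percentile(values, q):
    """Return the q-th percentile (0-100) of values.

    Assumption: linear interpolation between the two nearest ranks.
    The source does not specify the percentile method.
    """
    ordered = sorted(values)
    if not ordered:
        raise ValueError("percentile() needs at least one value")
    rank = (len(ordered) - 1) * q / 100.0
    lower = int(rank)
    upper = min(lower + 1, len(ordered) - 1)
    return ordered[lower] + (ordered[upper] - ordered[lower]) * (rank - lower)


def smoker_ratio(smoker_ai, nonsmoker_ai):
    """90th percentile of AI in smokers divided by the mean AI in non-smokers."""
    return percentile(smoker_ai, 90) / mean(nonsmoker_ai)


# Hypothetical AI values for one probeset (illustration only, not patent data).
smokers = [10.0, 20.0, 30.0, 40.0, 50.0]
nonsmokers = [1.0, 2.0, 3.0]
ratio = smoker_ratio(smokers, nonsmokers)
print(f"ratio = {ratio:.2f}")
```

The same function would be applied once to the adenocarcinoma samples (R1) and once to the squamous cell carcinoma samples (R2); a large ratio corresponds to lower expression in non-smokers.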



Pkey    ExAccn      UnigeneID   Unigene Title                             R1      R2
100187  D17793      Hs.78183    aldo-keto reductase family 1, member C3           164.10
100380  D82343      Hs.18551    neuroblastoma (nerve tissue) protein              77.40
100576  X00356      Hs.37058    calcitonin/calcitonin-related polypeptid  102.40
100971  BE379727    Hs.83213    fatty acid binding protein 4, adipocyte   463.80
101046  K01160                  (NONE)                                    67ZC0
101066  AW970254    Hs.889      Charcot-Leyden crystal protein            66.00
101175  U82671      Hs.36980    melanoma antigen, family A, 2                     77.20
101497  W05150      Hs.37034    homeo box A5                              62.80
101663  NM_003528   Hs.2178     H2B histone family, member Q              78.00
101677  NM_000715   Hs.1012     complement component 4-binding protein,   186.20
101745  M88700      Hs.150403   dopa decarboxylase (aromatic L-amino aci  80.08
101941  S77583                  gb:HERVK1 07HUMMTV reverse transcriptase  99.20
102125  NM_006456   Hs.288215   sialyltransferase                                 103.10
102242  U27185      Hs.82547    retinoic acid receptor responder (tazaro  67.00
102340  U37055      Hs.278657   macrophage stimulating 1 (hepatocyte gro  71.60
102369  U39840      Hs.299867   hepatocyte nuclear factor 3, alpha                69.70
102457  NM_001394   Hs.2359     dual specificity phosphatase 4            153.00
102669  U71207      Hs.29279    eyes absent (Drosophila) homolog 2                65.70
102796  AL079646    Hs.107019   symplekin; Huntingtin interacting protei          58.80
102829  NM_006183   Hs.80962    neurotensin                                       268.80
103207  X72790                  gb:Human endogenous retrovirus mRNA for   70.00
103242  X76342      Hs.389      alcohol dehydrogenase 7 (class IV), mu o          212.10
103260  X78416      Hs.3155     casein, alpha                                     130.70
103351  X89211                  gb:H.sapiens DNA for endogenous retrovir  64.60
104212  AB002298    Hs.173035   KIAA0300 protein                          66.80
104252  AF002246    Hs.210863   cell adhesion molecule with homology to   63.80
104258  AF007216    Hs.5462     solute carrier family 4, sodium bicarbon  94.40
105024  AA126311    Hs.9879     ESTs                                      68.20
106260  AI097144    Hs.5250     ESTs, Weakly similar to ALU1_HUMAN ALU S          74.60
106440  AA449563    Hs.151393   glutamate-cysteine ligase, catalytic sub          71.10
106566  BE298210                gb:601118016F1 NIH_MGC_17 Homo sapiens c  73.20
106605  AW772298    Hs.21103    Homo sapiens mRNA; cDNA DKFZp564B076 (fr  63.80
106614  AA648459    Hs.335951   hypothetical protein AF301222                     62.30
106654  AW075485    Hs.286049   phosphoserine aminotransferase                    202.40
106999  H93281      Hs.10710    hypothetical protein FLJ20417                     89.60
108700  AA121518    Hs.193540   ESTs, Moderately similar to 2109260A B c          66.40
108810  AW295647    Hs.71331    hypothetical protein MGC5350                      95.50
108857  AK001468    Hs.62180    anillin (Drosophila Scraps homolog), act          63.40
109597  AA989362    Hs.293780   ESTs                                      85.00
109691  T65568      Hs.12860    ESTs                                              58.70
109704  AI743880    Hs.12876    ESTs                                              60.60
110942  R63503      Hs.28419    ESTs                                      76.40
111722  R23924      Hs.23596    EST                                       74.60
112891  T03927      Hs.293147   ESTs, Moderately similar to A46010 X-li   64.80
112992  AL157425    Hs.133315   Homo sapiens mRNA; cDNA DKFZp761J1324 (f          76.70
113073  N39342      Hs.103042   microtubule-associated protein 1B                 120.20
114251  H15261      Hs.21948    ESTs                                      127.20
115230  AA278300    Hs.124292   Homo sapiens cDNA: FLJ23123 fis, clone L  174.00
115291  BE545072    Hs.122579   hypothetical protein FLJ10461                     91.00
115815  AW905328    Hs.180842   ribosomal protein L13                     66.40
115909  AW872527    Hs.59761    ESTs, Weakly similar to DAP1_HUMAN DEATH          226.60
115965  AA001732    Hs.173233   hypothetical protein FLJ10970             82.80
116107  AL133916    Hs.172572   hypothetical protein FLJ20093                     361.60
116552  D20508      Hs.164649   hypothetical protein DKFZp434H247         69.00
116571  D45652                  gb:HUMGS02848 Human adult lung 3' direct  64.20
118466  N66741                  gb:yz33g08.s1 Morton Fetal Cochlea Homo           63.50
120484  AA253170    Hs.96473    EST                                       81.60
120983  AA398209    Hs.97587    EST                                               81.10
121034  AL389951    Hs.271623   nucleoporin 50kD                                  66.20
121423  AW973352    Hs.290585   ESTs                                      64.40
122553  AA451884    Hs.190121   ESTs                                              60.40
122946  AI718702    Hs.308026   major histocompatibility complex, class   188.60
123130  AA487200                gb:ab19f02.s1 Stratagene lung (937210) H          80.20
124472  N52517      Hs.102670   EST                                       71.00
124526  N62096      Hs.293185   ESTs, Weakly similar to JC7328 amino aci          104.90
125489  H49193      Hs.124984   ESTs, Moderately similar to ALU7_HUMAN A          72.00
125731  R61771      Hs.26912    ESTs                                              69.90
125747  NM_002884   Hs.865      RAP1A, member of RAS oncogene family      69.00
126020  H79863      Hs.114243   ESTs                                              62.40
126547  U47732      Hs.84072    transmembrane 4 superfamily member 3              62.80
126966  R38438      Hs.182575   solute carrier family 15 (H+/peptide tra          60.10
127472  AA761378    Hs.192013   ESTs                                      70.20
127610  AA980867    Hs.150271   ESTs, Highly similar to unnamed protein   64.00
127742  AW83498     Hs.180138   ESTs                                      85.20
127987  AI022103    Hs.124511   ESTs                                      96.60
128233  AW889132    Hs.11916    ribokinase                                        78.90
128420  AA650274    Hs.41296    fibronectin leucine rich transmembrane p          106.90
128766  AW160432    Hs.296460   craniofacial development protein 1        66.80
129014  AW935187    Hs.170162   KIAA1357 protein                                  58.53
129215  AB040930    Hs.126085   KIAA1497 protein                          64.20
130090  H97878      Hs.132390   zinc finger protein 36 (KOX 18)           63.80
130385  AW067800    Hs.155223   stanniocalcin 2                                   139.60
130732  AW890487    Hs.63934    cadherin 13, H-cadherin (heart)                   64.60
131025  AB040900    Hs.6189     KIAA1467 protein                          64.40
131241  BE501914    Hs.24654    Homo sapiens cDNA FLJ11640 fis, clone HE  76.20
131775  AB014548    Hs.31921    KIAA0648 protein                          97.80
132240  AB018324    Hs.42676    KIAA0781 protein                                  71.00
132856  NM_001448   Hs.58367    glypican 4                                        88.40
132977  AA093322    Hs.301404   RNA binding motif protein 3               133.20
133749  L20852      Hs.10018    solute carrier family 20 (phosphate tran          59.30
133818  AI110684    Hs.7645     fibrinogen, B beta polypeptide            341.00
134264  AF149297    Hs.8087     NAG-5 protein                                     64.30
134265  M83772      Hs.80876    flavin containing monooxygenase 3                 232.53
134346  X84002      Hs.82037    TATA box binding protein (TBP)-associate  66.00
134395  AA456539    Hs.8262     lysosomal-associated membrane protein 2           75.80
135047  AL134197    Hs.93597    cyclin-dependent kinase 5, regulatory su          108.30
135056  N75765      Hs.93765    lipoma HMGIC fusion partner               71.40
135309  AI564123    Hs.42500    ADP-ribosylation factor-like 5            70.40


TABLE 7B shows the accession numbers for those primekeys lacking UnigeneIDs for Table 7A. For each probeset we have listed the gene cluster number from which the 
oligonucleotides were designed. Gene clusters were compiled using sequences derived from Genbank ESTs and mRNAs. These sequences were clustered based on sequence 
similarity using Clustering and Alignment Tools (DoubleTwist, Oakland, California). The Genbank accession numbers for sequences comprising each cluster are listed in the 
"Accession" column. 

Pkey: Unique Eos probeset identifier number 
CAT number: Gene cluster number 
Accession: Genbank accession numbers 

Pkey      CAT number          Accessions
103207    30635_-4            X72790
106566    120358_1            BE298210 AI672315 AW086489 BE298417 AA455921 AA902537 BE327124 R14963 AA085210 AW274273 AI333584 AI369742 AI039658
                              AI885095 AI476470 AI287650 AI885299 AI985381 AW592624 AW340136 AI266556 AA456390 AI310815 AA484951
116571    genbank_D45652      D45652
118466    genbank_N66741      N66741
101046    entrez_K01160       K01160
101941    entrez_S77583       S77583
103351    entrez_X89211       X89211
123130    genbank_AA487200    AA487200
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WO 02/086443 PCT/US02/12476 

Table 8A shows 1720 genes either up or down-regulated in lung tumors or chronically diseased lung relative to a broad collection of over 40 distinct normal body tissues. 
Chronically diseased lung samples represent chronic non-malignant lung diseases such as fibrosis, emphysema, and bronchitis. These genes were selected from 39494 
probesets on the Eos/Affymetrix Hu02 Genechip array. Gene expression data for each probeset obtained from this analysis was expressed as average intensity (AI), a 
normalized value reflecting the relative level of mRNA expression. 

Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigeneID: Unigene number 

Unigene Title: Unigene gene title 

R1: 70th percentile of AI for lung tumors divided by 90th percentile of AI for normal lung 

R2: 70th percentile of AI for chronically diseased lung divided by 90th percentile of AI for normal lung 
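
As a hedged illustration of the two ratios defined in this legend, the sketch below computes R1 and R2 for a single probeset. The names `percentile` and `table_8a_ratios` are hypothetical, the AI values are invented, and the linear-interpolation percentile is an assumption, because the text does not say how percentiles were calculated.

```python
def percentile(values, q):
    """Return the q-th percentile (0-100) of values.

    Assumption: linear interpolation between the two nearest ranks.
    The source does not specify the percentile method.
    """
    ordered = sorted(values)
    if not ordered:
        raise ValueError("percentile() needs at least one value")
    rank = (len(ordered) - 1) * q / 100.0
    lower = int(rank)
    upper = min(lower + 1, len(ordered) - 1)
    return ordered[lower] + (ordered[upper] - ordered[lower]) * (rank - lower)


def table_8a_ratios(tumor_ai, chronic_ai, normal_ai):
    """Return (R1, R2) for one probeset.

    R1: 70th percentile of tumor AI / 90th percentile of normal lung AI.
    R2: 70th percentile of chronically diseased lung AI / the same denominator.
    """
    normal_p90 = percentile(normal_ai, 90)
    r1 = percentile(tumor_ai, 70) / normal_p90
    r2 = percentile(chronic_ai, 70) / normal_p90
    return r1, r2


# Hypothetical AI values for one probeset (illustration only, not patent data).
r1, r2 = table_8a_ratios(
    tumor_ai=[2.0, 4.0, 6.0, 8.0, 10.0, 12.0],
    chronic_ai=[1.0, 3.0, 5.0],
    normal_ai=[1.0, 2.0, 3.0],
)
print(f"R1 = {r1:.2f}, R2 = {r2:.2f}")
```

Because both ratios share the 90th percentile of normal lung as the denominator, a value above 1 indicates that the 70th-percentile diseased sample exceeds nearly all normal lung samples, and a value well below 1 indicates the reverse.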
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gb:op33cGS.s1 Scares JIFU_T_GBC_S1 Homos 
ribosomal protein L18a 

gb:or84d07.s1 NCI_CGAP_Lu5 Homo sapiens 
EST, Weakly similar to RL23_HUMAN 60S R 
gb:ou57e08.s1 NCJ_CGAPJir2 Homo sapiens 
gb:os25c1Zs1 NCLCGAP_Kid5 Homo sapiens 
gb:os18c10.s1 NCI_CGAP_Kid5 Homo sapiens 
glyceraldehyde-3-phosphate dehydrogenase 
ribosomal protein, large P2 
gb:ou11b07jc1 Soares_NFLT_GBC_S1 Homos 
PR02047 protein 

gb:ov29f10jc1 Scares Jestis_NHT Homo sap 
EST 

hemoglobin, alpha 2 

gb:ow70h1Zs1 So3res_fetaLBver_sp!een 
ESTs 

gb:qa75h12jc1 SoaresJeta!„heart_NbHH19W 
gb:qa33c06.s1 Soares_NhHMPu_S1 Homo sapi 
gb:am66f03.s1 Barstead spleen HPLRB2 Horn 
gb:am55e09.x1 Johnston frontal cortex Ho 
ribosomal protein L13a 

gb:qb85b12j(1 Soares JetaLheaiLNbHHI 9W 
gb:ox70h06.s1 Soares_NhHMPu_S1 Homo sapi 
gb:cc99g06.x1 SoaresjiregnanLuterus.NbH 
ferritin, light polypeptide 
EST 

CD68 antigen 
ESTs 

ribosomal protein S3A 

gb:qh92bOZx1 Soares_NFl_T_GBC_S1 Homos 
collagen, type I, alpha 2 

gb:qh30g11.x1 Soares_NFU_T_GBC_S1 Homos 
gb:q!72d03.x1 Soares_NhHMPu_S1 Homo sapi 
gb:qu52f11j(1 NCLCGAPJ.ym6 Homo sapiens 
gb:qp55a12jc1 Soares_fetalJung_NbHL19W 
gb:qm01f02j(1 Soares_NhHMPu_S1 Homo sapi 
ribosomal protein S19 

gb:tb17b01.x1 NCLCGAP_Ov37 Homo sapiens 
EST, Weakly similar to RL6_HUMAN 60S Rl 
small nuclear ribonucleoprotein polypept 
gb:qt43b07jc1 SoaresJetalJungJy|bHL19W 
gb:qt27f07.x1 Soaresj)regnanLuterus.NbH 
gb:qo26a07j(1 NCI_CGAP_Lu5 Homo sapiens 
gb:tc05d02.x1 NQ_CGAP_Co16 Homo sapiens 
gb:qt18f09.x1 NCLCGAPJ3C4 Homo sapiens 
gb:qt09d02j(1 NCLCGAPJ3C4 Homo sapiens 
gb:qt09g03jc1 Na_CGAP^GC4 Homo sapiens 
gb:qt94a1 1 jcI NCI CGAP_Co1 4 Homo sapiens 
EST, Weakly similar to R5HU22 ribosomal 
gb:qz08g05jc1 NCLCGAP_CLL1 Homo sapiens 
gb:tg02h05j(1 NCI_CGAP_CLL1 Homo sapiens 
eukaryotic translation elongation factor 
ESTs 

gb:ti60a08j(1 Na_CGAP_Lym12Homosapien 
hemoglobin, alpha 1 

glyceraldehyde-3-phosphate dehydrogenase 
EST, Weakly similar to RL10_HUMAN 60S R 
eukaryotic translation elongation factor 
eukaryotic translation elongation factor 
gb:tj77e12jrf Soares_NSF_F8_9W_OT_PA_P_S2.38 
EST 

gb:tn93d08j<1 NCt_CGAP_Ut2 Homo sapiens 
ESTs, Weakly similar to schlafen4 [M.mu 
anaplastic lymphoma kinase (Ki-1) 
gb:PT2.1_12_E04.r tumor2 Homo sapiens cD 
gb:PT2.1J3_H06.r tumor2 Homo sapiens cD 
gb:PT2.1J5_D07.r tumor2 Homo sapiens cD 
ribosomal protein S3 
eukaryotic translation elongation factor 
KIAA0553 protein 
ESTs 

KIAA1527 protein 

EST, Moderately similar to 138055 myosi 
gb:as51g1lx1 Barstead aorta HPLRB6 Homo 
gb:as47d07.x1 Barstead aorta HPLRB6 Homo 
hypothetical protein FU23045 
gb:tr19a12j(1 Na_CGAP_Ov23 Homo sapiens 
gbw*09c10Jt1 NC1_CGAP_CLL1 Homo sapiens 
gb:wi97a07.x1 NCI_CGAP,Kid12Homosapien 
ESTs 

gb:tr04c11ji1 NCI_CGAP_Ov23 Homo sapiens 
eukaryotic translation elongation factor 
gbwk52c01 x\ NCLCGAP_Pr22 Homo sapiens 
annexin A2 

gb:at48g03.x1 Barstead colon HPLRB7 Homo 
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thymosin, beta 4, X chromosome 
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ESTs 


5.80 


9.03 


316187 
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ESTs 


1.20 


3.96 
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Hs.120257 


ESTs 


4.92 


6.94 


316232 
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Hs.251203 


ESTs 


1.48 
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5.86 
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316291 
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Hs.156704 


ESTs 
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316303 
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Hs.209609 


ESTs 
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316344 


AA744518 


Hs.120610 


ESTs 


3.66 


8.34 


316346 
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Hs.157447 


ESTs 


3.51 


6.69 


316365 
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Hs.210776 


ESTs 
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AJ393378 


Hs.164496 


ESTs 
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6.04 


316519 


AI929097 




gb:od10c1U1 NCI_CGAP_GCB1 Homo sapiens 
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4.64 
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ESTs 
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1.96 
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ESTs 


5.44 
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3.56 


4.37 
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Hs.143707 


ESTs 


0.69 
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ESTs 
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6.64 
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1.98 


151 


317212 


A/866468 
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ESTs 


1.86 


183 


317223 


AW297920 
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1.57 
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Hs.130184 


ESTs 


1.38 


2.28 


317320 


AA927151 


Hs. 130452 


ESTs 


3.58 
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ESTs 
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Hs.1 271 22 


ESTs 


1.00 


Z43 


317571 


AA938663 
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Z48 
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5.30 
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ESTs 
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ESTs 
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ESTs 


£92 


5.22 


^18054 


AW449270 


Hs 232140 


ESTs 
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ESTs 
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1.00 


1.00 


319041  Z44720      Hs.98365    ESTs, Weakly similar to weak similarity   3.38    6.11
319103  H05896      Hs.4993     KIAA1313 protein                          1.00    1.07
319170  R13678      Hs.285306   putative selenocysteine lyase             3.79    5.03
319196  F07953      Hs.16085    putative G-protein coupled receptor       1.00    2.98
319199  F07361      Hs.13306    ESTs                                      3.53    5.66
319242  F11472      Hs.12839    ESTs                                      5.87    7.26
319263  T65331      Hs.81360    Homo sapiens cDNA: FLJ21927 fis, clone H  1.81    1.57
319257  F11802      Hs.6818     ESTs                                      1.10    4.72
319270  R13474      Hs.290263   ESTs                                      4.80    10.40
319279  T65094      Hs.12677    CGM47 protein                             1.50    2.11
319282  AA461358    Hs.12876    ESTs                                      1.00    1.00
319289  W07304      Hs.79059    transforming growth factor, beta recepto  0.18    0.68
319291  W85578      Hs.285243   hypothetical protein FLJ22029             0.26    0.62
319293  F12119      Hs.12583    ESTs                                      3.13    4.50
319312  Z45481                  gb:HSC2QE041 normalized infant brain cDN  1.10    1.00
319370  H54254      Hs.325823   ESTs, Moderately similar to ALU5_HUMAN A  0.16    0.73
319391  R06304      Hs.13911    ESTs                                      1.26    2.43
319398  H67130      Hs.301743   ESTs                                      0.70    0.76
319398  AA359754    Hs.191196   ESTs                                      2.45    3.59
319407  R05329                  gb:ye91b04.r1 Soares fetal liver spleen   2.00    3.54
319425  T82930                  gb:yd39f07.r1 Soares fetal liver spleen   4.28    8.81
319433  R06050      Hs.191198   ESTs                                      6.15    14.13
319437  AA282420    Hs.111991   ESTs, Weakly similar to Y48A5A.1 [C.eleg  3.26    5.68
319466  AI809937    Hs.116417   ESTs                                      1.76    5.65
319471  R06546      Hs.19717    ESTs                                      4.29    4.84
319480  R06933      Hs.184221   ESTs                                      1.00    1.00
319484  T91772                  gb:yd52a1G\s1 Soares fetal liver spleen   2.81    4.88
319486  AI382429    Hs.250799   ESTs                                      2.08    2.82
319508  T99898      Hs.270104   ESTs, Moderately similar to ALU8_HUMAN A  2.80    4.39
319523  T69499      Hs.191184   ESTs                                      1.55    3.25
319545  R83716      Hs.14355    Homo sapiens cDNA FLJ13207 fis, clone NT  1.65    1.19
319546  R09592                  gb:yf23b12.r1 Soares fetal liver spleen   5.11    8.54
319552  AA096106    Hs.20403    ESTs                                      1.89    3.36
319582  T82998      Hs.250154   hypothetical protein FLJ12973             3.48    4.82
319586  D78808      Hs.283683   chromosome 8 open reading frame 4         0.26    0.82
319604  R11679      Hs.297753   vimentin                                  1.68    3.41
319609  AW247514    Hs.12293    hypothetical protein FLJ21103             3.06    4.24
319611  H14957                  gb:ym19c10.r1 Soares infant brain 1NIB H  2.76    4.24
319653  AA770183    Hs.173515   uncharacterized hypothalamus protein HT0  2.51    3.55
319657  R19897      Hs.106604   ESTs                                      5.32    7.68
319658  R13432      Hs.167481   syntrophin, gamma 1                       3.35    5.00
319661  H08035      Hs.21398    ESTs, Moderately similar to A Chain A, H  5.18    11.55
319662  H06382      Hs.21400    ESTs                                      1.58    1.56
319708  R15372      Hs.22664    ESTs                                      1.00    1.22
319742  T77668      Hs.21162    ESTs                                      2.48    3.13
319748  R18178      Hs.295866   Homo sapiens mRNA; cDNA DKFZp434N1923 (f  3.02    4.85
319772  R76633      Hs.22646    ESTs                                      4.36    11.61
319788  AA321932    Hs.117414   KIAA1320 protein                          2.56    3.68
319805  R92857      Hs.271350   likely ortholog of mouse polydom          4.63    6.56
319812  N74880      Hs.264330   N-acylsphingosine amidohydrolase (acid c  0.63    1.32
319834  AA071267                gb:zm61g01.r1 Stratagene fibroblast (937  0.30    0.94
319878  T78517      Hs.13941    ESTs                                      3.99    6.44
319882  AA258981    Hs.291392   ESTs                                      5.09    7.36
319912  T77559      Hs.94109    Homo sapiens cDNA FLJ13634 fis, clone PL  3.24    3.21
319935  H79460      Hs.271722   ESTs, Weakly similar to ALU1_HUMAN ALU S  4.40    9.42
319944  T79248      Hs.133510   ESTs                                      3.31    5.39
319947  AA160967    Hs.14479    Homo sapiens cDNA FLJ14199 fis, clone NT  2.90    4.95
319962  H06350      Hs.135056   Human DNA sequence from clone RP5-850E9   1.81    1.57
320007  AA336314                gb:EST40943 Endometrial tumor Homo sapie  3.42    6.29
320018  T83263                  gb:yd40h09.r1 Soares fetal liver spleen   1.77    5.14
320030  H63789      Hs.296288   ESTs, Weakly similar to KIAA0638 protein  4.10    6.69
320032  AI699772    Hs.292664   ESTs, Weakly similar to A46010 X-linked   3.27    3.27
320040  AA233671    Hs.87164    hypothetical protein FLJ14001             1.81    1.64
320047  T86564      Hs.302256   EST                                       3.38    7.36
320063  AA074108    Hs.120844   FOXJ2 forkhead factor                     5.90    16.73
320096  H58138      Hs.117915   ESTs                                      2.08    4.47
320099  AW411307    Hs.114311   CDC45 (cell division cycle 45, S.cerevis  1.00    1.00
320112  T92107      Hs.188489   ESTs                                      2.27    2.06
320140  H94179      Hs.119023   SMC2 (structural maintenance of chromoso  1.00    1.00
320188  AW419200    Hs.172318   ESTs                                      1.26    1.00
320193  AA831259    Hs.17132    ESTs                                      2.58    6.23
320195  R62203      Hs.24321    Homo sapiens cDNA FLJ12028 fis, clone HE  2.85    4.53
320199  R78659      Hs.29792    ESTs                                      0.40    0.94
320203  AL049227    Hs.124776   Homo sapiens mRNA; cDNA DKFZp564N1116 (f  0.84    1.18
320219  AA327564    Hs.127011   tubulointerstitial nephritis antigen      1.00    1.17
320220  AF054910    Hs.127111   tektin 2 (testicular)                     0.18    1.09
320225  AF058989    Hs.128231   G antigen, family B, 1 (prostate associa  5.26    13.75
320231  H03139      Hs.24683    ESTs                                      1.59    1.93
320260  NM_003608   Hs.131924   G protein-coupled receptor 65             1.38    4.56
320267  AL049337    Hs.132571   Homo sapiens mRNA; cDNA DKFZp564P016 (fr  1.00    1.92
320268  K06019      Hs.151293   Homo sapiens cDNA FLJ10664 fis, clone NT  5.58    5.70
320322  AF077374    Hs.139322   small proline-rich protein 3              1.41    1.01
320325  AI167978    Hs.139851   caveolin 2                                0.05    0.67
320330  AF026004    Hs.141660   chloride channel 2                        2.17    1.26
320339  H10807      Hs.281434   Homo sapiens cDNA FLJ14028 fis, clone HE  1.81    2.32
320388  H16Q65      Hs.31286    ESTs                                      1.00    3.22
320402  R22291      Hs.23368    Homo sapiens clone FLC0578 PRO2852 mRNA,  1.41    1.36
320413  AA203711    Hs.173269   ESTs                                      2.31    3.61
320432  R62786      Hs.124136   ESTs                                      11.25   20.78
320435  AA253352    Hs.293663   ESTs                                      2.22    3.49
320438  W24548      Hs.5669     ESTs                                      3.53    8.14
320448  AI240233    Hs.80887    v-yes-1 Yamaguchi sarcoma viral related   1.42    3.46
320451  R26944      Hs.180777   Homo sapiens mRNA; cDNA DKFZp564M0264 (f  0.87    0.81
320484  AA094436    Hs.296267   follistatin-like 1                        0.65    1.18
320499  R32555      Hs.24321    Homo sapiens cDNA FLJ12028 fis, clone HE  3.44    7.15
320514  AB007978    Hs.158278   KIAA0509 protein                          6.44    13.62
320521  N31464      Hs.24743    hypothetical protein FLJ20171             1.48    1.04
320526  AW374205    Hs.111314   ESTs                                      3.66    7.87
320527  R34672      Hs.324522   ESTs                                      3.16    5.63
320536  AA331732    Hs.137224   ESTs                                      2.83    5.83
320556  AF054177    Hs.14570    hypothetical protein FLJ22530             1.28    1.00
320564  AF056209    Hs.159395   peptidylglycine alpha-amidating monooxyg  1.22    0.81
320587  Z44524      Hs.167456   Homo sapiens mRNA full length insert cDN  1.84    2.44
320635  R54159      Hs.80506    small nuclear ribonucleoprotein polypept  1.00    6.25
320639  AA243258    Hs.7395     hypothetical protein FLJ23162             2.60    2.30
320648  N48521      Hs.26549    Homo sapiens mRNA for KIAA1708 protein,   1.00    1.53
320651  AA489268    Hs.111334   ferritin, light polypeptide               0.14    0.79
320664  AI904216    Hs.91251    hypothetical protein FLJ11198             5.02    8.84
320676  AA132650    Hs.300511   ESTs                                      3.63    5.37
320683  R59291      Hs.26638    ESTs, Weakly similar to unnamed protein   0.37    1.31
320689  AA334609    Hs.171929   ESTs, Weakly similar to A54849 collagen   1.27    1.02
320696  AW135016    Hs.172780   ESTs                                      3.53    4.60
320714  AI445591                gb:yq04a10.r1 Soares fetal liver spleen   1.06    0.85
320727  U96044      Hs.181125   immunoglobulin lambda locus               1.35    1.49
320771  AI793266    Hs.117176   poly(A)-binding protein, nuclear 1        0.04    0.82
320794  AA281993    Hs.91226    ESTs                                      2.96    4.33
320822  AF100780    Hs.194679   WNT1 inducible signaling pathway protein  0.10    0.79
320824  AF120274    Hs.194689   artemin                                   1.16    1.11
320830  AJ132445    Hs.266416   claudin 14                                1.06    1.75
320843  AA317372    Hs.34744    Homo sapiens mRNA; cDNA DKFZp547C136 (fr  1.36    1.47
320849  D60031      Hs.34771    ESTs                                      5.30    7.49
320853  AI473796    Hs.135904   ESTs                                      1.00    1.00
320896  AB002155    Hs.271580   uroplakin 1B                              5.90    2.55
320921  R94038      Hs.199538   inhibin, beta C                           2.20    1.17
320927  AI205786    Hs.213923   ESTs                                      0.18    1.46
320957  AI878933    Hs.92023    core histone macroH2A2.2                  1.67    2.18
320997  H22544                  gb:yn69f11.r1 Soares adult brain N2b5HB5  3.26    3.62
321045  W88483      Hs.293650   ESTs                                      2.25    4.55
321046  H27794      Hs.269055   ESTs                                      2.69    4.25
321052  AW372884    Hs.240770   nuclear cap binding protein subunit 2, 2  2.14    2.56
321059  AI092824    Hs.126465   ESTs                                      1.69    0.53
321062  R87955      Hs.241411   Homo sapiens mRNA full length insert cDN  2.76    5.20
321067  AF131782    Hs.241438   Homo sapiens clone 24941 mRNA sequence    4.79    7.41
321102  AA018306                gb:ze40d08j1 Soares retina N2b4HR Homo    1.79    4.27
321130  H43750      Hs.125494   ESTs                                      1.00    3.14
321142  AI817933    Hs.298351   ASPL protein                              8.73    15.36
321155  AA336635    Hs.99598    hypothetical protein MGC5338              3.04    5.03
321158  AA700289                gb:yu76f11.r1 Soares fetal liver spleen   4.62    8.39
321170  N53742      Hs.172982   ESTs                                      2.21    4.46
321199  AW385512                gb:yy56d10.s1 Soares_multiple_sclerosis_  5.69    8.01
321206  H54178      Hs.226469   Homo sapiens cDNA FLJ12417 fis, clone MA  4.00    7.32
321225  AL080073    Hs.251414   Homo sapiens mRNA; cDNA DKFZp564B1462 (f  4.17    4.63
321236  AW371941    Hs.18192    Ser/Arg-related nuclear matrix protein (  1.00    1.00
321244  AF068654                gb:Homo sapiens isolate AN.1 immunoglobu  2.16    9.13
321270  R83560                  gb:yv76c06.s1 Soares fetal liver spleen   3.80    5.26
321317  AI937060    Hs.6298     KIAA1151 protein                          1.81    1.65
321318  AB033041    Hs.137507   KIAA1215 protein                          1.00    1.00
321325  AB033100    Hs.300646   KIAA protein (similar to mouse paladin)   0.44    0.93
321342  AA127984    Hs.222024   transcription factor BMAL2                4.94    4.93
321356  R93443      Hs.271770   ESTs                                      3.10    4.66
321418  AI739161    Hs.161075   ESTs                                      2.28    2.54
321420  AI368667    Hs.132743   ESTs                                      1.13    0.97
321430  U05890                  gb:H.sapiens (DIG3) mRNA for immunoglobu  2.42    3.35
321453  N50080      Hs.82845    Homo sapiens cDNA: FLJ21930 fis, clone H  1.60    3.11
321467  X13075                  gb:Human 2a12 mRNA for kappa-immunoglobu  0.42    0.72
321468  AA514198    Hs.38540    ESTs                                      2.46    6.50
321491  H70665      Hs.292549   ESTs                                      1.00    1.25
321498  AW295517    Hs.255436   ESTs                                      3.19    6.24
321504  W02356      Hs.268980   ESTs                                      2.28    3.86
321510  AA703650    Hs.255748   ESTs                                      2.14    3.94
321513  H84972      Hs.108551   ESTs                                      2.78    5.37
321516  AI382803    Hs.159235   ESTs                                      3.06    7.19
321565  AI525773    Hs.266514   hypothetical protein FLJ11342             4.89    7.82
321577  H84260                  gb:ys90g04.r1 Soares retina N2b5HR Homo   1.00    1.73
321581  AA019964    Hs.28803    ESTs                                      4.88    6.73
321582  AA143755    Hs.21858    trinucleotide repeat containing 3         1.00    2.08
321587  H95531                  gb:ys76e02.r1 Soares retina N2b4HR Homo   2.26    4.52
321626  AA295430    Hs.96322    hypothetical protein FLJ23560             1.95    3.83
321628  H87064      Hs.161051   ESTs, Moderately similar to ALU6_HUMAN A  0.47    1.02
321642  AW085917    Hs.247084   ESTs                                      1.52    1.38
321669  H95404      Hs.294110   ESTs                                      2.17    2.45
321687  AA625149                gb:af70c12j1 Soares_NhHMPu_S1 Homo sapi   4.31    6.95
321688  H97646      Hs.123158   Homo sapiens cDNA FLJ12830 fis, clone NT  2.82    3.28
321693  AA700017    Hs.173737   ras-related C3 botulinum toxin substrate  0.51    1.08
321700  N55160      Hs.167260   ESTs                                      4.57    7.46
321701  AW390923    Hs.42568    ESTs                                      1.00    1.00
321709
321710 
321775 
321777 
321779 
321829 
321846 
321879 
321883 
321899 
321911 
321949 
321955 
321956 
321987 
321991 
322002 
322035 
322044 
322057 
322060 
322070 
322083 
322091 
322125 
322130 
322147 
322166 
322173 
322178 
322179 
322186 
322196 
322212 
322221 
322277 
322278 
322284 
322288 
322320 
322336 
322339 
322366 
322372 
322374 
322378 
322388 
322416 
322419 
322425 
322431 
322450 
322465 
322467 
322473 
322509 
322523 
322527 
322560 
322566 
322585 
322635 
322641 
322653 
322664 
322687 
322692 
322694 
322708 
322712 
322766 
322770 
322794 
322810 
322818 
322820 
322872 
322882 
322887 
322913 
322926 
322929 
322968 
322971 
322981 



N25847 

N35682 

AI694875 

AI637993

N42729 

081993 

AA281594 

AL109670 

AA426494 

N55158 

AF026944 

R49202 

AI651866 

AL110177

AL133612 

AL133627 

AA328801 

AL137517 

AW340926 

N92197 

AI341937 

U80769 

AF074982 

AI819863 

R93901 

R98978 

AF085919 

AF085958 

H52587 

H56535 

H92891 

H67346 

W87895 

AF087995 

AJ890619 

AI640193 

AF086283 

AI792140

AL037273 

AF086419 

AA308526 

W17348 

AW404274 

W25624 

AI394663 

AF064819 

AI815730

AA223183 

AA248987 

W37943 

AA069222 

M040131 

AA137152 

AF116826 

AA744286 

T52172 

W80398 

AF147359 

AI916847

W87285 

AA837622 

AA679084 

AA007352 

AI828854 

AA011522 

AI110759

AA018117

AI110872

AF113674

AA021328 

AW068805 

AA045796 

AI608591 

AI962276 

AW043782 

AI377755 

AA827228 

AW248508 

AI986306

AI733737

AI825940 

AI365585 

AI905228

C15953 

AA493252 



Hs.108923 

Hs.259743 

Hs.202312 

Hs.202312 

Hs.163835 

Hs.8966 

Hs.87902 

Hs.302809 

Hs.46901 

Hs.29468 

Hs.293797 

Hs.181694 

Hs.195689 

Hs.132882

Hs.272759 

Hs.158923 

Hs.84522 

Hs.306201 



Hs.210322 
Hs.226031 
Hs.106243 

Hs.117767
Hs.114176



Hs.269187 
Hs.211516 
Hs.134877 
Hs.179662 
Hs.226389 

Hs.49265 
Hs.7886 

Hs.76152 

Hs.122492 

Hs.153943

Hs.122116 

Hs.201877 

Hs.247474 

Hs.298442 

Hs.14084 

Hs.34892 

Hs.141892 

Hs.25144 

Hs.286049 

Hs.180340 

Hs.266935 

Hs.302213 

Hs.193197 

Hs.270947 
Hs.269587 



Hs.256042 
Hs.258538 



Hs.60843 

Hs.279812 

Hs.283773 

Hs.23607 

Hs.288467 

Hs.122682 

Hs.38991 

Hs.127444 

Hs.293616 

Hs.120695

Hs.126943 

Hs.279727 

Hs.86149 

Hs.68837 

Hs.211192 

Hs.146245 

Hs.83484 

Hs.212760 

Hs.159577 



RAB38, member RAS oncogene family 
ESTs 

Homo sapiens clone N11 NTera2D1 teratoca 
Homo sapiens clone N11 NTera2D1 teratoca 
ESTs 

tumor endothelial marker 8 

ESTs 

ESTs 

KIAA1462 protein

ESTs 

ESTs 

EST 

ESTs 

ESTs 

KIAA1457 protein 

Homo sapiens mRNA; cDNA DKFZp434K0722 (f
ESTs 

hypothetical protein DKFZp56401278 
gb3cy51b10a1 NCLCGAPJJJ34.1 Homosapie 



Hs.154679 synaptotagmin 1



gb:qt10e03.x1 NCI_CGAP_GC4 Homo sapiens
Homo sapiens mRNA for KIAA1766 protein,
ESTs, Highly similar to KIAA0535 protein 
ESTs 

gb:yq16c12.r1 Soares fetal liver spleen 

ESTs 

ESTs 

gb:yr88b03.r1 Soares fetal liver spleen

gb:yt85d04.r1 Soares_pineal_gland_N3HPG

gb:yt88g03.r1 Soares_pineal_gland_N3HPG

gb:yl94c02*1 Soares_pineal_gland_N3HPG

ESTs 

ESTs 

ESTs 

nucleosome assembly protein 1-like 1
ESTs

gb:zd46f01.r1 Soares_fetal_heart_NbHH19W
ESTs

pellino (Drosophila) homolog 1
gb:zd78d03j1 Soares_fetal_heart_NbHH19W
decorin

gb:zb18c07.x5 Soares_fetal_lung_NbHL19W

hypothetical protein 

ESTs 

ESTs, Moderately similar to Osf2 [M.musc 
DESC1 protein 

hypothetical protein FLJ21032

adaptor-related protein complex 3, mu 1 

ring finger protein 7 

KIAA1323 protein 

ESTs 

ESTs 

phosphoserine aminotransferase
putative protein-tyrosine kinase



ESTs 
ESTs 

gb:Homo sapiens full length insert cDNA

ESTs 

ESTs 

gb:zh69c01.M Soares_fetal_liver_spleen_
gb:zh90h08.r1 Soares_fetal_liver_spleen_
ESTs

striatin, calmodulin-binding protein

gb:zi03g07.r1 Soares_fetal_liver_spleen_

gb:AF074666 Human fetal liver cDNA libra

potassium voltage-gated channel, shaker-

PRO0327 protein 

clone FLB1727 

hypothetical protein FLJ11109

Homo sapiens cDNA FLJ12280 fis, clone MA

ESTs

S100 calcium-binding protein A2

ESTs 

ESTs 

ESTs 

ESTs 

Homo sapiens cDNA FLJ14035 fis, clone HE

phosphoinositol 3-phosphate-binding prot 

ESTs 

ESTs 

ESTs 

SRY (sex determining region Y)-box 4 
hypothetical protein FLJ13649
ESTs 



i no 


1.00 


2Q7 


5_96 


1.00 


1.00 


1 RR 

I.OO 


0 45 

U.HO 


0.90 


0.90 


9 fiQ 


1 RQ 

O.0O 


5 11 

O. 1 1 




6.49 


9.58 


U.iO 


0Q5 
U.00 


0.39 


0 35 


6.20 


10.76 


H.ac 


10 51 


289 


5.47 


0.32 


1.25 


1.00 


1.83 


4.00 


6 47 


2.10 


3.48 


1.00 


1.90 


3.20 


Q67 


1.55 


1 07 

I.Uf 


4.59 


7.68 


9 7ft 

C. 10 


4 52 


i in 

0. IU 


5 59 


1 5Q 


1 75 


2.06 


5.27 


IU. It 


16 4Q 


n qa 

U.OH 


064 


4.09 


6.67 


'i ACL 
O.HO 


4R5 


044 


954 


4 5? 


7 50 

I.OU 


0 15 
U. 10 


OQR 

U.OO 


9 90 


5.04 


149 

O.Hi 


4R4 


n ft? 

M.QC 


9 14 


1 69 


1 QR 

0.90 


1 no 

l.UU 


1 00 

l.UU 


U.00 


9 7R 

C. i 0 


0 71 

U.i 1 


0.70 


9 09 


9 7fi 


9 09 

c. He 


AAA 


ft 50 
o.ou 


11 5fi 
1 1 .00 


0 R1 

U.D 1 


1 14 


7 17 
» .or 


19 07 


4 7H 


10 50 

IU.OU 


1 00 

l.UU 


1.00 


l .UO 


R4Q 


1 90 


5 RO 
o.ou 


1.64 


1 57 

I.O/ 


0.83 


1.00 


1 Qft 

0. 30 


5 22 

O.CC 


5.18 


12.67 


0.*tl 


9 91 


1 no 

l.UU 


1 10 

I.OU 


1 75 


2.03 


1 00 
l.UU 


9 97 
c.ct 


9 75 


5 4Q 

O.HO 


1 95 

l.CO 


1 97 


4 57 


ft fl1 

0.0 1 


1 no 

l.UU 


1 49 

I .HC 


A 1ft 
**. 10 


RQ4 

0.04 


940 


4 R5 


9Q4 
z.oh 


4 64 


0 4R 


0 Ifl 

u.oo 


1 Q9 


2 1R 

C. IO 


4 14 

*t. I*f 


6 75 

V.I o 


1 50 
O.OU 


5 00 
o.uu 


1 AO 

I.OU 


1 79 


1 00 

l.UU 


3.43 


1 9ft 
o.co 


3.86 


1 61 

I.OO 


1 51 

I.OO 


1 51 

1.00 


1.06 


19 OR 


1 Q4 

I.OH 


4.09 


6Q0 
o.ou 


1 90 

l.iU 


1 61 

I.OO 


0 91 
M.c 1 


1 Ql 

1.00 


904 

C\UH 


1 fi! 

I.OO 


5.26 


1.22 


9 60 

C.OU 


9 94 


2.38 


6.61 


409 


5.79 


0 10 
u.ou 


1 14 


2.06 


1.13 


118 


ZOO 


2.28 


2.61 






Hs.49359 
Hs.270471 



WO 02/086443 

322988 C18727 Hs.171941
323003 AI733859 Hs.149089
323013 AA134042 Hs.191451
323025 AL157565 Hs.315369
323032 AW244073 Hs.145946
323052 R21124 Hs.85573
323054 AL119341
323098 AI700025
323102 AL119913 Hs.163615
323155 AL135041
323176 AW071648 Hs.82101
323191 AA195600 Hs.301570
323225 AA205654 Hs.24790
323232 AA148722 Hs.224680
323256 AW003362 Hs.243885
323281 AI697556 Hs.292659
323283 AA256014 Hs.86682
323314 AA226310 Hs.191501
323316 AL134620 Hs.280175
323334 AI336501
323338 R74219
323348 AA233056 Hs.191518
323351 AA704103 Hs.24049
323359 AA234172 Hs.137418
323360 AA716061 Hs.161719
323405 AW139550 Hs.115173
323420 AI672386 Hs.263780
323434 AW081455 Hs.120219
323445 AA253103 Hs.135569
323449 AA282865 Hs.284153
323492 H00978
323501 AA182461
323505 AI652287
323515 AA282274 Hs.256083
323541 AI185116
323545 AI814405
323635 R63117
323675 AA984759 Hs.272168
323678 AL042121 Hs^0880
323691 AA317561 Hs.145599
323693 AW297758 Hs.249721
323746 AW298611 Hs.12808
323774 AA329806 Hs.321056
323856 AA355264 Hs.267604
323857 T18988 Hs.293668
323870 AA341774 Hs.129212
323876 AL042492 Hs.147313
323885 AA344308 Hs.128427
323911 AL043212 Hs.92550
323919 AA862973 Hs.220704
323972 AI869964 Hs.182906
324005 AA610011
32403iS A/472078
324055 AA528794 Hs.128644
324063 AW292740 Hs.272813
324072 AA381829
324092 AW269931 Hs.202473
324095 AW377983 Hs.298140
324129 AI381918 Hs.285833
324132 AW504860 Hs.288836
324214 AA412395 Hs.225740
324227 AA295552 Hs.28631
324266 AL047634 Hs.231913
324275 AA429088 Hs.98523
324281 AL048026 Hs.124675
324290 AA432032 Hs.304420
324303 AL118754
324312 AJ198841
324325 AL138153
324338 AL138357
324341 AW197734 Hs.99807
324343 AW452016 Hs.293232
324371 AA452305 Hs.270319
324382 AW502749 Hs.24724
324384 AA453396 Hs.127656
324385 F28212 Hs.284247
324388 AI924963 Hs.306206
324432 AA464510 Hs.152812
324497 AW152624 Hs.136340
324510 AI148353 Hs.287425
324580 AA492588
324582 AA506935 Hs.132036
324633 AA572994 Hs.325489
324640 AW295832 Hs.134798
324675 AW014734 Hs.157969



Hs.77273 
Hs.23346 



Hs.20887 
Hs.84520 



Hs.104613 
Hs.224569 
Hs.9691 



Hs.208021 
Hs.303662 



Hs.128173 
Hs.300410 
Hs.145078 



ESTs 
ESTs 
ESTs 

Homo sapiens cDNA: FLJ23075 fis, clone L
ESTs 

Homo sapiens DC29 mRNA, complete cds 
Homo sapiens mRNA; cDNA DKFZp547E052 (fr 
ESTs 
ESTs 

gb:DKFZp762K2310j1 762 (synonym: hmel2) 
pleckstrin homology-like domain, family 
ESTs 

KIAA1573 protein 
ESTs 

nuclear autoantigenic sperm protein (his
ESTs

Homo sapiens cDNA: FLJ21578 fis, clone C
ESTs
ESTs

ras homolog gene family, member A
S-phase kinase-associated protein 2 (p45 
ESTs 
ESTs 
ESTs 
ESTs 
ESTs 
ESTs 
ESTs 

ESTs, Weakly similar to NEUROD [H.sapien
Fanconi anemia, complementation group A
hypothetical protein FLJ10392
ESTs

gb:EST382593 MAGE resequences, MAGK Homo 2.21
ESTs

RP42 homolog
ESTs

Homo sapiens cDNA: FLJ23249 fis, clone C
tumor differentially expressed 1 
ESTs 
ESTs 
ESTs 
MARK 

Homo sapiens mRNA; cDNA DKFZp586F1322 (f
hypothetical protein FLJ10450
ESTs
ESTs
ESTs

Homo sapiens BAC clone RP11-335J18 from
ESTs 
ESTs 
ESTs 
ESTs 
ESTs 
ESTs 

dual oxidase 1 

gb:EST94855 Activated T-cells I Homo sap
Homo sapiens cDNA: FLJ22278 fis, clone H
Homo sapiens cDNA: FLJ22502 fis, clone H
Homo sapiens cDNA: FLJ22135 fis, clone H
hypothetical protein FLJ12673
ESTs

Homo sapiens cDNA: FLJ22141 fis, clone H
ESTs
ESTs

ESTs, Weakly similar to T14742 hypotheti
ESTs 

gb:DKFZp761P1910_r1 761 (synonym: hamy2) 
ESTs 
ESTs 

regulator of differentiation (in S. pomb 
ESTs, Weakly similar to unnamed protein
ESTs
ESTs

MFH-amplified sequences with leucine-ric
KIAA1349 protein
KIAA1491 protein
hypothetical protein FLJ11215
ESTs

ESTs, Weakly similar to unnamed protein
Homo sapiens cDNA FLJ11569 fis, clone HE
gb:ng99c08.s1 NCI_CGAP_Thy1 Homo sapiens
ESTs, Weakly similar to ALU1_HUMAN ALU S 
ESTs 

ESTs, Moderately similar to TTL MOUSE TU 
ESTs 



PCT/US02/12476 



0.39 


2.00 


3.28 


1.00 


3.38 


5.68 


0.06 


1.10 


10.18 


21.27 


1.46 


1.90 


3.08 


5.64 


2.31 


4.49 


5.38 


11.64 


2.38 


5.56 


1.06 


1.41 


0.73 


1.24 


5.25 


11.95 


0.45 


1.35 


1.71 


1.83 


1.24 


3.21 


12.68 


15.05 


4.42 


9.61 


2.98 


5.93 


1.98 


3.30 


1.62 


1.00 


1.00 


1.07 


1.43


1.68


0.34 


1.18 


3.01 


3.71 


1.90 


8.81 


0.29 


1.01 


2.27 


1.92 


0.43 


0.80 


3.19 


3.85 


2.70 


3.20 


2.04 


3.31 


21 


3.08 


2.69 


3.40 


1.20 


1.09 


1.25 


1.55 


0.27 


0.72 


3.70 


5.80 


3.33 


5.10 


1.00 


1.00 


2.01 


1.54 


4.11 


5.53 


2.06


3.70


3.42 


8.13 


5.97 


12.51 


3.17 


4.52 


0.36 


1.00 


2.31 


3.33 


4.38 


5.41 


5.80 


10.20 


3.10 


5.14 


5.34 


10.07 


1.00 


5.03 


0.86 


1.00 


0.45 


0.91 


2.82 


5.12 


2.40 


2.52 


1.32 


4.30 


1.40 


1.77 


4.24 


6.21 


6.96 


10.69 


0.81 


0.53 


2.42 


4.05 


3.62 


5.38 


0.14 


0.70 


3.71 


4.34 


0.95 


0.91 


4.06 


5.91 


5.88 


8.25 


0.87 


1.25 


1.28 


1.00 


2.54 


3.46 


5.85 


8.36 


0.76 


1.64 


2.88 


5.69 


1.81 


1.99 


1.00 


1.00 


2.73 


2.17 


0.71 


1.90 


1.00 


1.00 


2.18 


3.50 


5.96 


11.36 


1.92


4.22 


5.48 


11.74 


0.39 


0.73 






324699 AW504732 Hs.21275 hypothetical protein FLJ11011 0.93 0.93
324747 AA603532 Hs.130807 ESTs 1.57 1.81
324748 AA657457 Hs.292385 ESTs 1.55 1.34
324801 AI819924 Hs.14553 sterol O-acyl transferase (acyl-Coenzyme 1.00 6.56
324804 AI692552 gb:wd73f12.x1 NCI_CGAP_Lu24 Homo sapiens 1.00 7.53
324828 AA843926 Hs.124434 ESTs 2.00 a25
324855 AW152305 Hs.122364 ESTs 1.74 3.43
324866 AI541214 Hs.46320 Small proline-rich protein SPRK [human, 1.07 0.95
324871 AW297755 Hs.271923 Homo sapiens cDNA: FLJ22785 fis, clone K 1.68 1.21
324886 AA806794 Hs.131511 ESTs 2.56 5.61
324889 D31010 gb:HUML12147 Human fetal lung Homo sapie 2.20 4.65
324948 AW383618 Hs.265459 ESTs, Moderately similar to ALU2_HUMAN A 5.28 7.05
324953 AI264628 Hs.125428 ESTs 3.37 5.51
324958 AA625076 Hs.132892 protocadherin 20 5.12 9.81
324988 T06997 Hs.121028 hypothetical protein FLJ10549 2.52 1.08
325024 F13254 Hs.78672 laminin, alpha 4 5.24 10.22
325105 H97109 Hs.105421 ESTs 1.00 1.00
325108 AA401863 Hs.22380 ESTs 1.99 2.14
325114 D83901 Hs.315562 ESTs 2.73 3.17
325146 AJ054690 Hs.171176 ESTs 1.86 3.41
325149 D61117 Hs.187646 ESTs 0.42 0.93
325187 AI653682 Hs.197812 ESTs 6.50 11.31


325228 6.18 15.76
325235 2.64 4.12
325328 2.87 4.42
325340 0.29 0.33
325367 16.56 24.29
325373 0.63 1.22
325389 0.88 1.05
325436 5.75 14.14
325471 8.46 17.82
325498 3.32 6.42
325557 5.51 8.28
325559 7.48 21.40
325560 4.08 6.25
325569 4.20 5.24
325585 1.10 1.13
325587 1.00 1.00
325597 2.98 13.40
325639 0.78 0.78
325685 0.46 0.66
325686 0.95 1.55
325735 4.48 9.20
325739 0.59 0.88
325740 2.42 6.61
325792 7.88 9.83
325819 4.74 7.18
325883 2.02 2.64
325895 7.78 15.98
325925 2.04 10.60
325932 4.18 7.36
325941 3.66 9.03
325969 0.61 0.80
325971 4.88 7.42
326025 0.55 1.07
326046 7.21 14.72
326099 3.60 5.98
326108 1.27 1.06
326163 a27 5.70
326165 0.45 1.11
326189 0.13 0.45
326204 5.60 9.00
326230 7.00 12.01
326274 1.00 8.09
326360 9.86 15.35
326393 0.52 0.77
326505 1.00 1.42
326515 1.24 5.84
326589 9.20 13.49
326592 2.77 4.01
326605 2.01 2.53
326692 1.00 1.00
326693 1.00 1.31
326720 0.19 0.65
326742 2.34 7.20
326770 0.25 0.83
326818 3.09 4.56
326936 2.08 3.45
326964 0.41 1.70
326983 2.02 3.80
326991 1.09 1.20
327036 1.00 8.04
327040 3.05 4.22
327053 3.55 6.31
327075 1.59 1.40






327085 2.50 12.57
327130 5.38 8.04
327156 3.74 6.58
327220 1.28 1.54
327224 6.56 12.91
327288 2.61 5.40
327321 2.42 3.11
327332 6.62 10.58
327361 2.69 4.41
327377 2.04 6.72
327396 2.61 4.50
327414 1.00 8.01
327442 5.91 9.65
327467 6.58 18.01
327473 3.79 7.48
327483 4.08 &87
327562 0.68 1.86
327568 1.00 2.00
327606 2.06 3.61
327611 5.90 14.26
327642 4.06 a74
327654 1.05 2.08
327734 1.00 1.00
327775 1.46 11.79
327796 3.47 5.65
327840 3.26 6.64
327940 5.84 15.58
327984 0.36 1.50
328004 1.87 1.42
328021 0.42 0.59
328068 2.83 4.68
328100 3.04 5.39
328101 3.54 5.20
328113 0.72 0.91
328157 5.58 5.16
328196 5.76 11.13
328197 5.98 10.58
328264 3.11 4.88
328299 2.20 3.06
328342 1.49 1.94
328365 1.00 1.00
328369 4.40 7.36
328381 1.86 4.93
328451 5.51 7.56
328481 0.13 0.72
328500 2.71 3.97
328530 5.41 7.62
328600 3.14 10.68
328608 4.56 8.17
328616 2.24 11.91
328623 3.04 5.46
328632 0.70 1.19
328664 3.48 6.60
328666 10.42 26.47
328698 9.68 14.56
328700 2.74 10.22
328708 0.15 0.57
328735 6.23 8.91
328743 3.62 6.54
328806 0.22 0.78
328861 3.68 10.54
328908 5.42 16.36
328933 2.02 5.29
328934 1.73 4.45
328949 3.34 5.41
329005 2.88 7.26
329011 2.52 3.72
329033 1.00 1.03
329037 5.07 6.16
329067 1.98 2.41
329134 2.24 3.25
329157 2.30 11.04
329178 2.64 5.02
329192 6.41 15.27
329194 0.31 0.79
329204 1.60 3.75
329224 2.99 6.11
329228 0.83 0.83
329288 0.63 1.01
329337 1.00 1.00
329541 0.76 1.68
329560 1.34 2.02
329588 1.68 2.22
329643 4.18 11.77
329703 1.00 1.00






00Q7CX 

o7y/o4 








D.iO 


15 50 

IO.OU 








* 


2.09 


5.44 


OOORCn 

jzyobu 








3 11 

O. IO 


10.77 


329993 








7 A3 


14.21 


330020 








o.oo 


13.12 


ooaaoc 








1 39 


5 57 
o.o/ 


OOAACO 








4 31 


7Q7 


oonnoe 
oouuoo 








1.34 


1.76 


OOUUOO 








4 70 


12.46 


ooaaoo 








U.*rt 


1 06 


oo ni (\n 
oouiUU 








347 


4R3 


OOfHAC 
OOUlUO 








Z14 


3.61 


00MA.7 








3 17 

O. 1 / 


U.OJ 


OOAIon 








5R1 

O.OI 


11 HQ 


ooaioo 








4 5fl 


1974 


oononfl 
oo u/Uo 








1.55 


7.62 


OOAORO 








13.10 


23.38 


ooaoaa 








9 R1 

£.01 


4 QA 


O0A01O 








3 nn 


4 41 


OOnORR 
OOUOOO 








0.67 


0.76 


00A070 

OoUo/Z 








4 7fi 


11.82 


MfVIOC 

OOUOuD 


LAAAQ7AQ 


no. 1 0 L 3 1 1 


karvnnhprin slnhn S Hmnnrfin rHnha Rl 


2.14 


2.15 


00A0O7 


ni/icco 
U14ooy 


Ue 1RZ0R7 
nS.lD400» 


IMnHu 1 UO ycl 10 J« U0U6I 


n aa 


1 15 

1. 10 


00 fMCR 
OOIWOO 


L» U040 


Uc 1190/4 


nrAtoacet inhihiiAr 0. eHruHprit/P/i fQkAl 

(JlUlcdbO MlJIIUHUi O, olVilrUCIIVCU ^Or\AL 


1.11 


0.94 


00A/170 
00U4/4 


1 oaoao 

L444U0 


Ue 89907 
n5.0££j r 




1.67 


1.17 


00ft47R 

ooU4/o 


1 0R4RR 
LOO** DO 


Uc 9QRDiQ 




0.46 


1.07 


OOlWyO 


M97R9R 
M47 040 


Uc 9R70.1 Q 


pnHnnpnnt re mtrnuiral nmfpnea 


1.07 


0.95 


oouqyo 


M01 OOQ 


Uc 71R49 


mianino nttrtonftrfp htndinn nrAlpin (CX nr 
yudjiinD iiuuicuuuc uuHJuiy piuiciii pi 


0.97 


0.96 


omens 

OOUOUO 


MR1Qn.fi 
MO J bUO 


Uc fi941 




0.17 


3.66 


O0AG1O 

0oU0l4 


MR0.RR0 


Uc fl1 9RR 
nS.Ol ZOO 




0.60 


1.06 


00AC07 
OOUoO/ 


u »y/oo 


Ue 01 in 


7inA finnor nmteln Q /a rplllllar retrAU 


2.81 


2.07 


330547 


uovyoy 


Ue 1R0R71 
nS.lOOO/1 


tnminnK'iri 0 0_rlin wnan oco 

uypiopnoii t,o-uiOAyyoiiaoo 


3.91 


1.49 


•a once 4 
OOUOOl 


uoyo4u 


Ue 9QQRR7 

ns.zyyoo/ 


ncpdiUCyio llUUcdJ laHUl O, oipild 


1 15 

1. 19 


1 03 

I.VIO 


OOACCQ 

ooUOOO 


U004*Jh 






2.83 


4.79 




IIQAA07 




nK'Unman PP1 hnmA?AA mRKIA 0*l tTR rpnlnn 


2.08 


1.54 


oonpm 

OOUOUl 


1 10001 fi 


Uc R9R4*; 


Hnmn eanlpnc rDNA* FLI91930 ft*; rlnnft H 


0.89 


1.35 


ooacac 
OOUbuo 


YAO410 

Auz4iy 


Ue 7797A 
MS./ f Z/4 


piasniuioycn acuvoux, uiuaiiiooc 


1.87 


1.55 


ooQbUy 


AU4/41 


U. 7C4 4 Q 

hs./oiio 


liKtmtiKn /^or^fwtrl larminol aclQr'ipn 1 1 

UDtquiun caruoxyi-KrTninai esierase li 


1 A3 
1.00 


1.30 


OOflRI 7 
00UD1 / 


VC0CR7 
ADO 00/ 


Uc RCORR 


lAipnrin hpta A 


1.54 


1.15 


oonfion 

00U00U 


Y7RRRQ 
A/ 0003 


Uc 7QHRR 
no. / 3U00 


rottrnlnralhiA 9 FP-hanri ralHiim hinrfin 


1.39 


1.19 


00RRA4 


V077CE 
TU/ / OO 


Uc 0QDQ1 


Qlflfi ralrinm-hiAHinA Airstpin A9 


3.83 


1.13 


OOftRRfii 
00 UOOU 


7RRO0R 
4.00 4 ZO 


Uc 90 An. 


JUIlCUun pioAOylUUlll 


1.25 


0.95 


OOACCA 

00U00U 


A A0/17QCQ 
/\A04/000 


Ue 10Q9Q0 

ns.ioyzyo 


PQTe Woatlu cimibr 1a At 1 17 Hi IMAM Al II ^ 

to l s, weaKiy suniiar id /\lu/_ nuiviniM mlu o 


15 50 


29.07 


000.RQ9 

oouoyi 


A AA17Aj1C 


He R7A9 


FRTe 

CO 1 5 


1.00 


1.00 


oon7n7 


A A100RQ1 

MAioooyi 


Ue 9Q3.RQH 
n5.Z3003U 


PRTc 
CO IS 


0.20 


1.35 


00A71 R 
OOU/lO 


A A9007A7 


Ue 1 1571 

nS.lio/1 


Unmr\ rininnp r>nMA PI H1R7flfte rlnnp HP 

nuino sapiens cuiv\ ru i io» u iis, ciuiic nc 


0.12 


1.40 


0"*n717 
OOU/1 / 


A AOOOQOR 

MA^ooyzD 


Uc K9fi9fi 


llllcgim, Uold O 


6.62 


5.42 


OJU/44 


A AO/ OKRA 


Ue OAOQO 


CO IS 


1.40 


1.65 


ooA7^n 


A A0Q77>IR 

AAZb//40 


Ue 90RRA 


nomo sspiens voiioyc-Qaica suuiuin a wiuic 


0.27 


2.04 


0007A9 
00U/44 


A A/AAQ7Q 


Uc 9RRQ1 
ns.zoo3 1 


rppnntnr /^9lnifnnin\ arfiuih/ rrwlifviriA 

ruteptor ycaii>iioniii/ duuviiy nwuiiyiny 


0.44 


0.90 


00A7jI j1 

00U744 


AA4D0 1 4/ 


Ue 100O0 

nS.l ^obO 


o i Ur-u-yiucose 4,0-uenyQraiase 


0.71 


3.23 


OOA7C1 

OOU/01 


A AXOOOOC 


U«» 00OI0 

iis./bo40 


Unmn rnnlnne nf^MA CI 11 01 AO fip nlftna KIT 

nomo sapiens cuna ru i o iuo ns, cione in i 


i.DD 


1 59 


O0A7CA 

OJU/Ou 


A AvIXQCCO 

AA440D0o 


Ue OA^ICO 

ns.0U40y 


CCTe 

co IS 


0 59 
u.04 


0.90 


00A7C0 

330763 


A A^CAOAA 

AA450Z00 


Uf 07VI007 

nS.Z/4oo/ 


nypouiBucaJ proisin ru/uooo 


n 37 


0Q7 


jju/oo 


uouo » *♦ 


Ue 4Q10R 


FRTc MrvlprAtpIv similar to ALU7 HUMANA 


0.78 


0.84 


00A7QA. 


1 40000 


Ue 1rtRRfY7 
nS.lUOoU/ 


PQTe 


0.23 


3.17 


00 Uo 14 


A AA1C70fl 


Ue 9RR0QR 
MSwcOOOyo 


PQTe Wa*>lfhj cimilar tn trancfnrmaltnn-r 
CO 1 5, VfCdJUy SlMUidr ID UdlKjlUIiTldUUIrf 


0.37 


2.07 


00AQ07 

OOU047 


A AA/4A00O 
AAU4U00Z 


Ue 197A4 

ns.lt/44 


PQTe 

cots 


1.60 


1.00 


OOU044 


AAfl.R0.flO7 
MrtUOOUO/ 


Uc RRRfiO 

ns.ooouo 


PQTe 
CO IS 


0.93 


1.16 


ooaoai 


A A 1 C70 1 a 


Ue OR701O 

nS.lo/ jiy 


enuogenous reiroviraj proiease 


1.02 


1.03 


oonooi 


CA1/>I0 
rUl44o 


Ue 9RA9RR 
nS.Zo4ZOO 


nypoineucaj proiem rw ihuoo similar io 


0.24 


0.88 


OOAQCO 

OOUyW 


unoocc 
HUiiooo 


Hs.Zyoo/ 


coTs 


0 OR 
u.uo 


1 31 
1.0 1 


OOfiQfil 


uiaqqo 
n luyyo 


Ue 71 RA 
riS./ IO*t 


d uisiniegnn anu iticiatiupiuicutx»o uuiiid 


1.29 


1.26 


oouyoo 


nloobo 


Ur> 007/ Q 
nS.Zo/40 


PQTe 

co is 


0.48 


0.96 


331014 

OO IUI*> 


HQfl«;q7 
nsooa/ 


He o,no4n 


hvAAthpiiral nrntpin KIAA1 1 


0.29 


0.74 


001IVR 
OOIU40 


[NObObo 


Ue 101 OCR 

nS.iyioDo 


PQTe 

cols 


0 QQ 


R 5R 


001ACA 
001 UoU 


M7CAQ1 
iM/OUOl 


Ue 1C71/1Q 

nS.lon4o 


Unmn ronlone i^DMA PJ M 1 RRO fie dnna HP 

nomo sapiens cuima rui 1000 ns, clone nc 


1 94 


1 00 


001AQQ 

jjiuyy 


R0RR71 
KOOOi 1 


Ue R0Q07 

nS.oObO/ 


hypothetical protein 


0 75 

U.i 0 


1.03 


JJllt/O 




Ue 910R0 


tzotS 


1 00 


9 75 


0.01 111 
001101 


DC 47 A 7 




nk-wn Q7K A7 e 4 Cnorac infant hrain 1 MIR U 

9o.yyu/0u/.si ooares in rani Drain iwion 


6.04 


10.68 


0011 OK 


f\0 lOso 


Uc A1Q7 
nS.*i 13/ 


PQTe 
CO IS 


0.80 


0.96 


00.1 17fl 
Ool 1 fU 


TO0j(R1 


Uc 1 RQ9Q0 


PQTe 
CO IS 


9 63 


4.29 


00 I I OKI 


TO.0AAR 
IO£<HtO 


Ue RRAT) 


Unman nKIA eomtpnro (mm PAP 75W1*^ An rhr 


1.78 


Z71 


001 1R3 
0O110O 


T4H7R0 


He RARQ 


PQTe 
CO is 


1.00 


3.01 


331903 
0014UO 


TB90.U) 




fWnwc\ 


1.70 


3*80 


001071 

Willi 


A AflCOO/17 
AAU0y04/ 


He R999R 


grycoproiein (transmernurane/ nrno 


1 90 


3 1Q 

0. 1? 


00100.fi 
0010U0 




He R0Q01 
nb.OOyo l 


udcnsnuna ^urosopniia/ nuiuuiuy 


0.31 


1.30 


001007 
00104/ 


AA0R1A7R 
MAtO IU/O 


Ue 10Q991 

ns.iuyzzi 


FQTe 

CO is 


2.09 


2.41 


001341 
00 I0*> 1 




He 90.94 n 


HAinn caniane rDWA PI H'fclQRfic Haap PL 
nuiiiu sapiens i*L/iir\ ru iohsw u<3, uiuiio v u 


0.72 


2.43 


0O10CQ 

ooiooy 




He ^RQfil 


t^lAA1y4RO nmlain 

iwv\ 1*104 proicui 


0.09 


0.91 


0010C0 
00 10t>0 


AAAOI^RO 
MMHZIOOZ 


Uc 01D11 

ns.yiuii 


ani£norflraoieni c ^Aenepus laevisj nom 


1 09 

l.vC 


0.87 


331378 


A A/MQQQ1 

AA44oool 


nS.4y2oz 


nypotneucal protein FU1 luoo 


1 no 

1.U0 


1 93 

1.40 


331384 AA456001 Hs.93847 NADPH oxidase 4 1.40 1.00
331402 AA505135 Hs.44037 ESTs 1.80 3.93
331422 F10802 Hs.163628 ESTs, Moderately similar to ALU7_HUMAN 1.65 1.89






331490 N32912 Hs.26813 CQA14 2.48 1.73
331531 N51343 gb:yz15a04.s1 Soares multiple sclerosis I.OD
331547 N54811 gb:od74f04.s1 NCI_CGAP_Ov2 Homo sapiens G.OU 5.75
331578 N67960 Hs.249989 ESTs 0.11 0.67
331589 N71027 Hs.152618 ESTs 1.(£ 13
331608 N89861 Hs.112110 PTD007 protein 0.93 0.76
331614 N92293 Hs.240272 EST 0.17 1.34
331668 W69707 Hs.58030 EST 2.24 3.82
331671 W72033 Hs.194695 ras homolog gene family, member I 1.00 1.24
331676 W79834 Hs.58559 ESTs, Weakly similar to rhotekin [M.musc 0.Q8 1.07
331681 W85712 Hs.119571 collagen, type III, alpha 1 (Ehlers-Danl 8.72 4.27
331692 W93592 Hs.152213 wingless-type MMTV integration site fami 0.94 0.54
331717 AA190888 Hs.153881 Homo sapiens NY-REN-62 antigen mRNA, par 1.57 1.34
331718 AA191404 Hs.104072 ESTs 6.80 11.77
331811 AA404500 Hs.301570 ESTs 1.10 1.00
331820 AA405970 Hs.97995 transcription termination factor, mitoc 0.73 0.59
331831 AA412031 Hs.97901 EST 2.77 4.08
331852 AA418988 Hs.98314 Homo sapiens mRNA; cDNA DKFZp586L0120 (f 0.23 0.93
331943 AA453418 Hs.21275 hypothetical protein FLJ11011 0.36 1.88
331989 AA460702 Hs.82772 collagen, type XI, alpha 1 1.00 1.00
331990 AA478102 Hs.139631 ESTs 3.04 3.87
332002 AA482009 Hs.105104 ESTs 1.19 0.78
332027 AA489871 Hs.65641 hypothetical protein FLJ20073 1.27 1.03
332029 AA489697 Hs.145053 ESTs 0.30 1.62
332033 AA489840 Hs.251014 EST 2.30 3.70
332048 AA496019 Hs.201591 ESTs 0.17 0.52
332071 AA598594 Hs.205293 KIAA1211 protein 1.35 1.23
332074 AA599012 gb:ae41e11.s1 Gessler Wilms tumor Homo s 0.19 2.00
332083 AA600200 Hs.155546 KIAA1080 protein; Golgi-associated, gamm 0.31 1.18
332085 AA600353 Hs.173933 nuclear factor I/A 0.30 1.50
332125 AA609861 Hs.312447 ESTs 0.22 0.62
332177 F10812 Hs.101433 ESTs 8.21 18.03
332180 H03348 Hs.7327 claudin 1 2.27 1.57
332185 H10356 Hs.101689 ESTs 0.09 1.18
332203 H49388 Hs.317769 EST 8.05 5.02
332232 N48891 Hs.101915 Stargardt disease 3 (autosomal dominant) 0.78 0.85
332240 N54803 Hs.324267 ESTs, Weakly similar to putative p150 [ 0.96 1.23
332261 N70294 Hs.269137 ESTs 2.40 3.74
332275 R08838 Hs.26530 serum deprivation response (phosphatidyl 0.27 0.75
332280 R38100 Hs.146381 RNA binding motif protein, X chromosome 0.39 1.88
332299 R69250 Hs.21201 nectin 3; DKFZP566B0846 protein 5.24 12.76
332304 R74041 Hs.101539 ESTs 1.44 3.18
332314 T25862 Hs.101774 hypothetical protein FLJ23045 0.68 1.32
332384 M11433 Hs.101850 retinol-binding protein 1, cellular 1.71 0.88
332434 N75542 Hs.289068 Homo sapiens cDNA FLJ11918 fis, clone HE 0.43 0.86
332445 T63781 Hs.11112 ESTs 0.68 1.00
332453 L00205 Hs.111758 keratin 6A 31.54 1.00
332458 M33493 Hs.250700 tryptase beta 1 0.51 1.00
332504 AA053917 Hs.15106 chromosome 14 open reading frame 1 0.79 1.24
332525 M17252 Hs.278430 cytochrome P450, subfamily XXIA (steroid 0.98 1.70
332530 M31682 Hs.1735 inhibin, beta B (activin AB beta polypep 0.88 0.66
332535 N20284 Hs.19280 cysteine-rich motor neuron 1 0.22 1.46
332539 AA412528 Hs.20183 ESTs, Weakly similar to AF164793_1 prote 0.93 1.49
332559 M13955 Hs.166189 cvrnkprjuin / 0.35 1.13
332563 N92924 Hs.274407 protease, serine, 16 (thymus) 1.00 1.00
332565 AA234896 Hs.25272 E1A binding protein p300 0.36 1.05
332594 AA279313 Hs.3239 methyl CpG binding protein 2 (Rett syndr 0.53 0.59
332634 S38953 Hs.283750 tenascin XA 0.38 1.16
332638 AA283034 Hs.50640 JAK binding protein 1.00 1.70
332640 AA417152 Hs.5101 protein regulator of cytokinesis 1 6.15 1.16
332654 AA001296 Hs.288217 hypothetical protein MGC2941 1.50 2.73
332665 AA223335 Hs.63788 propionyl Coenzyme A carboxylase, beta p 1.20 0.91
332692 AA496035 Hs.247926 gap junction protein, alpha 5, 40kD (con 0.17 1.12
332716 L00058 Hs.79070 v-myc avian myelocytomatosis viral oncog 1.00 1.44
332736 L13773 Hs.114765 myeloid/lymphoid or mixed-lineage leukem 1.00 1.81
332758 X93921 Hs.296938 dual specificity phosphatase 7 0.53 0.78
332781 AA23^S8 Hs 24711? hypothetical protein FLJ10902 1.44 1.56


332792  1.70  1.19
332816  1.85  2.47
332858  1.04  1.57
332906  3.48  8.04
332911  1.00  1.00
332912  1.06  4.40
332922  1.00  1.00
332956  0.42  0.88
332959  1.96  6.34
332982  0.56  0.99
332984  0.30  0.78
332998  1.47  2.01
333058  0.47  1.38
333097  2.14  3.19
333121  2.76  3.70
333122  1.92  1.21
333123  1.85  1.39
333138  0.47  0.52






333139 1.88 0.84 

333140 0.21 0.64 
333221 1.51 1.11 
333260 0.75 1.01 
333380 6.68 15.75 
333387 4.56 12.61 
333512 5.05 8.01 
333524 2.28 3.98
333585 2.31 1.53
333603 2.23 1.17
333604 2.51 1.58 
333618 0.52 0.98 

333627 1.44 1.36 

333628 1.90 1.90 
333650 1.85 2.10 
333678 1.85 2.35
333750 2.18 5.67 
333763 1.99 2.60 

333767 1.02 0.96 

333768 1.78 1.65 

333769 2.15 2.13 
333772 1.46 2.53 
333777 1.00 1.42
333846 2.99 4.50 
333884 0.47 0.94 
333887 0.50 1.00 

333891 0.43 0.89 

333892 0.51 0.91 
333904 0.26 1.13 
333906 0.55 0.98 
333948 1.70 2.15
333954 0.37 1.09 
333966 8.10 14.30 
333968 0.63 1.38 
334081 4.24 12.30 
334094 1.30 12.03 
334113 4.55 8.63 
334161 0.82 1.59 
334183 0.47 0.76 
334187 1.36 3.70 
334219 0.69 1.04 

334222 1.88 1.70

334223 4.72 3.14 
334239 0.79 0.62
334255 0.45 1.10 
334333 1.00 3.56 
334378 3.98 5.76 
334382 1.50 1.31 
334492 3.59 4.75 
334562 5.94 15.40 
334588 8.14 19.53 
334616 1.55 1.58 
334633 5.16 8.07 
334648 0.59 2.13
334787 3.70 7.15 
334866 8.13 10.60 
334891 0.32 1.14 

334933 1.00 3.84 

334934 4.01 7.43 
334945 1.04 2.96 
334967 0.29 1.14 
334990 1.50 1.39 
335015 5.88 18.65 
335093 0.55 1.75 
335120 4.31 8.01 
335125 0.38 1.97 
335179 1.24 1.98 
335188 0.46 1.47 
335211 1.61 1.42 

335288 0.73 0.97 

335289 0.20 0.26 
335361 2.18 1.58 
335379 0.50 0.71 
335414 3.64 14.94 
335416 2.93 3.98

335496 0.96 0.91 

335497 1.71 1.92 
335548 1.15 2.40
335551 3.22 10.54
335558 3.42 4.89
335586 5.50 12.75

335619 2.99 3.07 

335620 3.80 8.29 

335621 0.28 0.57 
335682 0.46 1.17 








003000 


2.55 


3.81 




WT7RR 
000/ W 


2?4 


1.07




009/0*1 


0.20 


0.97 




JJD0 !*♦ 


1.13 


1.48 




A3 0 0 1 0 


2.45 


3.51 




0030&9 


1.00 


4.16 




003003 


0 4Q 


1.70




003001 


1.66 


1.39 




000000 


9 OR 


fi43 

0.*rO 


003090 


0.98 


0.99 




003900 


12.10 


21.93 




000940 


l.vw 


1.64




OOOSKm 


1.00 


4.21 




OOCQQC 


0.37 


1.17 


■336091 

OJUUll « 


1.04 


0.84 




OO0UO*} 


11.40 


23.54 




OODUOO 


1.19 


1.21 




OODUOO 


0.54 


1.63 




oooiur 


0.95 


0.70 


OOQcUO 


3.13 


6.29 




0001/3 


3.20 


10.10 




QQCOG1 
000191 


2.34 


3.09 




000001 


1.00 


1.00 




000419 


0.65 


0.79 


19CCV) 
000002 


2.33 


2.16 




000000 


2.55 


2.23 




000004 


2.19 


2.03 




000003 


2.69 


2.48




000000 


2.13 


1.83 


00000/ 


2.43 


2.24 


09CRQD 
000000 


2.31 


2.03 




000009 


0.60 


1.31 




000073 


OH 


1.18 




000004 


1.50 


114 


000094 


4.74 


7.10 


ooD/lb 


4.43 


6.37 




336721 


2.20 


0.74 




000/90 


1.64 


114 




000b UU 


6.14


12.73 


000940 


1.00 


1.00 




oofUZo 


1.30


2.09 




007040 


4.01 


11.53 




00/U40 


1.67 


1.84 




00/U34 


2.78 


7.35 


00/1 10 


7.20 


16.14 




OOf 101 


3.45 


5.34 




OOVlOO 


5.72 


11.41 




OO7 AO A 

00/104 


3.72 


5.90 




•W71Q9 
00/ 19* 


1.27 


1.06 


00/ 


1.88 


1.68 




117900 
00/129 


0.22 


1.03 




017OCQ 

0O72bo 


1.00 


3.31 




1*57000 

00/ *yy 


3.23 


5.14


OoVozO 


2.76 


3.72 


00/009 


5.80 


10.42 




00/490 


2.06 


6.30 




007497 


r .00 


90 29 




oo/ouu 


3.80 


4.48 




337549 


1.66 


2.31 


oo/ouo 


1.27 


8.54 




00 /DUO 


5.76 


7.16 




337671 


U./O 


0Q7 




1777CR 
Oof f 00 


1.54 


0.92 




00/ /OO 


5.07 


973 


00 /OUs 


6.18 


12.87 




117RR9 
00/ 00 £ 


3.78 


12.97 




337871 


£.00 


R 1fi 

0. ID 




117QCQ 
00/900 


0.26 


1.34 




ooouuo 


1.48 


1.12 


OODUOO 


2.38 


14.59 




ijiRftRi 

ooouoo 


0.65 


2.16 




338110 


1 nn 


1 M 

1.0 1 




TiR119 
00011 c 


5.86 


8.25 




'VIM AC 
OOO 143 


1.70 


1.97 


000140 


8.07 


18.19 




11Q4 CO 

000130 


1.30 


4.55 




OOO 101 


2.58 


3.57 




0001/9 


1.00 


1.00 




000 tac 


3.32 


4.63 


11R1QQ 
000109 


1.00 


3.34 




00019/ 


0.99 


169 




HR1QQ 

0001 


4.58 


7.62 




338215  6.01  15.85
338279  0.53  0.95
338316  20.58  38.66
338322  3.23  7.39
338357  4.10  11.39
338359  10.12  21.59
338366  0.69  1.02
338374  0.40  1.18
338414  0.47  1.06
338418  6.12  13.86
338469  3.09  5.11
338501  6.28  10.32
338506  6.97  12.41
338523  3.10  5.64
338549  1.70  2.70
338561  0.79  0.81
338662  1.72  1.46
338571  0.17  0.91
338676  2.10  15.66
338726  1.20  1.09
338779  0.12  0.57
338804  0.99  1.67
338836  1.00  1.00
338871  4.30  9.81
338872  5.02  12.81
338879  0.23  1.12
338937  6.55  12.26
338966  1.76  5.42
338993  1.00  2.40
339047  5.26  10.81
339100  5.10  6.88
339114  1.00  1.70
339121  1.00  3.75
339170  10.35  19.67
339229  4.08  13.48
339264  2.64  3.83
339293  1.73  1.94



TABLE 8B shows the accession numbers for those Pkeys in Table 8A lacking UniGene IDs. For each probeset we have listed the gene cluster number from which the
oligonucleotides were designed. Gene clusters were compiled using sequences derived from Genbank ESTs and mRNAs. These sequences were clustered based on sequence
similarity using Clustering and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers for sequences comprising each cluster are listed in the
"Accession" column.

Pkey: Unique Eos probeset identifier number
CAT number: Gene cluster number
Accession: Genbank accession numbers
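
The three columns defined above can be read programmatically. The following is a minimal sketch only, assuming each table row has been collected onto one whitespace-separated line (Pkey, then CAT number, then one or more accessions). The function name `parse_cluster_row` and this row layout are illustrative assumptions and are not part of the original disclosure.

```python
from typing import Dict, List, Tuple


def parse_cluster_row(row: str) -> Tuple[int, str, List[str]]:
    """Split one Table 8B row into (Pkey, CAT number, accessions).

    Assumes a hypothetical layout: whitespace-separated fields with the
    Pkey first, the CAT number second, and all remaining fields being
    Genbank accession numbers.
    """
    fields = row.split()
    if len(fields) < 3:
        raise ValueError(f"expected Pkey, CAT number and accession(s): {row!r}")
    pkey, cat_number, *accessions = fields
    return int(pkey), cat_number, accessions


# Build a lookup from Pkey to its gene cluster and member accessions.
rows = [
    "322044 187363_1 AW340926 AA249063 N86075",
    "321430 42705_1 X57414 X57415",
]
clusters: Dict[int, Tuple[str, List[str]]] = {}
for line in rows:
    pkey, cat_number, accessions = parse_cluster_row(line)
    clusters[pkey] = (cat_number, accessions)

print(clusters[321430])
```

Rows that list only a Pkey and a single accession (no CAT number) would need separate handling under this sketch.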



Pkey  CAT number  Accessions

322044  187363_1  AW340926 AA249063 N86075
322060  44320_1  AI341937 AW003063 U34725 AA904742
321430  42705_1  X57414 X57415
321467  43034_1  X13075 X13076
322125  46779_1  R93901 AF075073 R93902
322166  46861_1  H69434 AF085958 H69846
322173  46873_1  H52567 H52557 AF085970 H52164
322178  46882_1  H56535 AF085980 H56712
322179  46885_1  H92891 AF085982 H92777
321577  1615102_1  H84849 H84252 H84260 H86664 H85320
321587  1615333_1  H95531 H95521 H84529
313723  111953_1  AA070412 AA102346 AA081885
320997  627492_1  H22544 H46842 AI204929
322278  47271_1  W69304 AF086283 W69200
321687  218439_1  AA625149 AA313030 AA313052 H97463
313883  129439_1  AA665089 AA135130 AA484059 AA102419 AW877765
322320  47422_1  W79150 AF086419
322339  814584_1  AI668646 AI734214 W17348
3U648  293660_1  AW979268 AA878419 AA431342 AA431628
300201  682222_1  AI308300 AI303296
306897  25196_-2  AI093967
323155  979809_1  AL120701 AL135041 AL121524
322527  38927_1  AF147359 T58511 T58560
322585  473768_2  W88919 W89125
300352  1574395_1  Z42308 H23514
322635  82296_1  AA005129 AA679084 AA694399
322664  85042_1  AA011522 AA702841 AA011691 AA330797
315454  380580_1  AI239464 AI239473 AA625812 AI208703
322687  37372_1  AF074666 AI110759 AF090902
314852  32747£_1  AI903735 AA491283 AI694953 AW976903 AA761362
307783  697809_1  AI347274 AW844024
324072  269032_1  AA381722 AA381829 AW963906 AW963902 AA381242
300627  221345_1  AA488472 W27363 AA317053 BE082689 AW967036 BE079872
323505  196389_1  AW970512 AA280251 AI652287 BE466438 AI650725 AA551854 AA281574 AW571481
315791  403558_1  AA678177 AA677034
324303  233842_1  AL118754 AA333202 H38001
316519  442885_1  AA847835 AA768376
300926  333127_1  AA504850 AA504911

302697
302711 
302742 
318499 
310624 
302847 
304122 
303598 
311409 
312094 
319312 
319407 
319425 
320007 
320018 
319484 
318865 
312220 
319546 
312389 
319611 
312437 



311896 
319834 
321102 
321158 
321199 

305528 
321270 
314126 
320714 
306442 
306446 
306458 
306510 
306557 
306572 
306582 



306751 
308011 
306892 
308106 
308154 
306956 
306958 
308213 
308216 
308219 
308588 
308599 
308643 
308673 
308697 
308778 
308808 
308875 
308886 
308898 
308966 
308979 
303011 
303077 
305016 
305034 
305072 
305148 
305190 
303978 
303990 
303998 
303999 
305235 
305312 
305413 
305447 
321244
275087_1



1515978J 

43219.1 

45419.1 

458.39 

364430.1 

34624.4 

458.105 

77271.-5 

270283.1 

837264.1 

797889.1 

1540116.1 

1688823.1 

1689571J 

229683.1 

1815987.1 

1691553J 

1535937.1 

1671607.1 

243305.1 

902067.1 

1566863.1 

291472.1 



579192.1 

112523.1 

80531.1 

410938.1 

212379.1 

28832.-3 

1662057.1 

177666.1 

743644.1 

AA976899 

AA977348

AA978186 

AA988546 

AA994530 

AA995686 

AA996248 

AI004024 

AI015615 

AI032589 

AI439473 

AI092465 

AI476803

AI500600

AI125111 

AI125152 

AI557041 

AI557135 

AI557246 

AI718299 

AI719893

AI745040 

AI760864 

AI767143 

AI811109

AI818289

AI832332 

AI833240 

AI858845 

AI870704 

AI873111 

41689.1 

44060J 

AA626876 

AA630128 

AA641012 

AA654070 

AA665955 

AW513315 

AW515465 

AW516449 

AW516611 

AA670480 

AA700201 

AA724659 

AA737856 

29327.1 



T78054 T79888 AA398185

AI592552 AI393343 AJ8C0510 AI377711 F24263 AA661876

D31010 D30991 D31168 D31166 D31465

AJ001409 AJ001410

L08442 D51348

L12061 

T25451 AA585296 AA585305 

U88895 U88898 AA916056 T03285 AI341594 AI359534 AI634031 U88897

X98941 X98942 X98943 X98953 X98949 

H28966 

AA382814 AA402411 AA412355
AI698839 AI909260 AI909259
Z78390 T97427
Z45481 F12393 T74437
R05329 R01555 R08276
T82930 R02424 T85145

AA336314 T82938 AA327744 AW967388 AA639967 T10753 

T83263 T85731 T85730 

T91772 R07257 R07098 

H10818 F07831 Z43072

N74613 T98756 T98589

R09692 R09414 AA346353

AI863140 W80703 R43474

H14957 R56522 R11908

BE080180 AW827313 AW231970 AA995028 AA428584 AW872716 AW892508 AW854593 AA578441 AW975234 AA664937 AA984131
AA528743 AA552874 AA554758 AW063245 AI267534 AW070190 AW893483 AA770330 AA906928 AA905582 AA758746 AA551717
AW063311 AA429538

AW206447 AI248530 AI084433 AI400976 R16553
AA071267 T65940 T64515 AA071334
AA018306 H38925 AA001221
H79670 H47798 AA700289

N34524 AA305071 AW954803 AA502335 AI433430 AI203597 AW026670 AW265323 AW850787 AA317554 AW993643 AW835572

AW385512 AI334966 W32951 H62656 H53902 R88904 AW835732

AA769156

N59537 N78278 R83560
AA226431 AA226569 AA488748
R91883 AI445591



AF090405 AF090407 AF090406
AF163305 AF163307 AF163303

AF068654 AF068656 AF068655

305614  AA782866
305637  AA806124
305639  AA806138
305650  AA807709
305630  AA813477
305728  AA828209
305759  AA835353
305792  AA845256
307041  AI144243
307091  AI167439
307181  AI189251
305901  AA872968
305910  AA875981
307415  AI242118
307426  AI243364
307517  AI275055
307551  AI281556
307561  AI282207
307608  AI290295
307691  AI318285
307730  AI336092
307760  AI342387
307764  AI342731
307796  AI350556
309045  AI910902
309051  AI911975
307807  AI351799
307808  AI351826
307820  AI355761
307852  AI365541
309122  AI928178
309164  AI937761
309177  AI951118
307902  AI380462
309299  AW003478
309303  AW004823
309476  AW129368
309532  AW151119
309747  AW264889
309769  AW272346
309799  AW276964
309866  AW299916
302679  311853_1  H65022 AA186889
309923  AW340584
309928  AW341418
309931  AW341683
309933  AW341936
302705  31765_1  U09060 U09061
302789  34161_1  AJ245067 AJ245070
304006  AW517947
304024  T03036
304026  T03160
304028  T03266
304046  T54803
304061  T61521
304063  T62536
302802  34487_1  Y08250 Y08245
304114  R78946
304155  H68696
304203  N56929
304234  W81608
304348  AA179868
304430  AA347682
304456  AA411240
304521  AA464716
304526  AA476427
304607  AA513322
304735  AA576453
304760  AA580401
306015  AA897116
306063  AA906316
306065  AA906725
306104  AA910956
306109  AA911861
306242  AA932805
306288  AA936900
306396  AA970223
330568  NOT_FOUND entrez U56244
330599  15323_-12  1)90437
331131  genbank_R54797  R54797
331203  NOT_FOUND entrez T82310
331531  genbank_N51343  N51343


331547  467396_1  AA828597 N54811
332074  genbank_AA599012  AA599012

TABLE 8C shows the genomic position for those Pkeys in Table 8A lacking UniGene IDs and accession numbers. For each predicted exon, we have listed the genomic
sequence source used for prediction. Nucleotide locations of each predicted exon are also listed.

Pkey: Unique number corresponding to an Eos probeset
Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA
sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495.
Strand: Indicates DNA strand from which exons were predicted.
Nt_position: Indicates nucleotide positions of predicted exons.
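
The four columns defined above can be handled as a small record type. The following is a minimal sketch only, assuming each row has already been separated into its four fields and that both listed nucleotide positions are inclusive. The names `PredictedExon` and `parse_exon` are hypothetical and are not part of the original disclosure.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class PredictedExon:
    pkey: int    # Eos probeset identifier
    ref: str     # sequence source (GI number or "Dunham, I. et al.")
    strand: str  # "Plus" or "Minus"
    first: int   # first listed nucleotide position
    last: int    # second listed nucleotide position

    @property
    def length(self) -> int:
        # Assumption: both listed positions are inclusive. abs() is used
        # because some Minus-strand rows list the higher coordinate first.
        return abs(self.last - self.first) + 1


def parse_exon(pkey: str, ref: str, strand: str, nt_position: str) -> PredictedExon:
    """Build a PredictedExon from the four Table 8C fields."""
    strand = strand.strip().capitalize()
    if strand not in ("Plus", "Minus"):
        raise ValueError(f"unrecognized strand: {strand!r}")
    first, sep, last = nt_position.strip().partition("-")
    if not (sep and first.isdigit() and last.isdigit()):
        raise ValueError(f"unrecognized position range: {nt_position!r}")
    return PredictedExon(int(pkey), ref.strip(), strand, int(first), int(last))


plus_exon = parse_exon("332792", "Dunham, I. et al.", "plus", "73381-73768")
minus_exon = parse_exon("332858", "Dunham, I. et al.", "Minus", "1339607-1339397")
print(plus_exon.length, minus_exon.length)
```

Entries whose position field is illegible in the source would raise `ValueError` here rather than be silently guessed.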



Pkey  Ref  Strand  Nt_position

332792  Dunham, I. et al.  Plus  73381-73768
332816  Dunham, I. et al.  Plus  359844-360030
332906  Dunham, I. et al.  Plus  1923101-1923205
332911  Dunham, I. et al.  Plus  1961767-1961858
332912  Dunham, I. et al.  Plus  1962120-1962246
332922  Dunham, I. et al.  Plus  2009520-2009738
332956  Dunham, I. et al.  Plus  2510528-2510858
332959  Dunham, I. et al.  Plus  2518145-2518213
333138  Dunham, I. et al.  Plus  3369205-3369323
333139  Dunham, I. et al.  Plus  3369495-3369571
333221  Dunham, I. et al.  Plus  3978070-3978187
333380  Dunham, I. et al.  Plus  4904775-4904846
333387  Dunham, I. et al.  Plus  4910935-4910997
333512  Dunham, I. et al.  Plus  5560510-5560564
333524  Dunham, I. et al.  Plus  5612620-5612780
333585  Dunham, I. et al.  Plus  6234778-6234804
333618  Dunham, I. et al.  Plus  6562391-6562566
333627  Dunham, I. et al.  Plus  6S20584-6S20903
333628  Dunham, I. et al.  Plus  6629004-6829233
333650  Dunham, I. et al.  Plus  6796852-6797128
333678  Dunham, I. et al.  Plus  7068223-7068288
333750  Dunham, I. et al.  Plus  7608165-7608234
333763  Dunham, I. et al.  Plus  7692491-7692630
333767  Dunham, I. et al.  Plus  7694407-7694623
333768  Dunham, I. et al.  Plus  7695440-7695697
333769  Dunham, I. et al.  Plus  7696625-7696707
333772  Dunham, I. et al.  Plus  7706773-7706902
333777  Dunham, I. et al.  Plus  7746805-7746916
333846  Dunham, I. et al.  Plus  8008623-8008757
333884  Dunham, I. et al.  Plus  81 53960-81 541 fi1
333887  Dunham, I. et al.  Plus  8154882-8155025
333891  Dunham, I. et al.  Plus  81 56417-81 5670Q
333892  Dunham, I. et al.  Plus  8156825-81570(11
333948  Dunham, I. et al.  Plus  8583497-8583627
333954  Dunham, I. et al.  Plus  6563186-6563335
333966  Dunham, I. et al.  Plus  8655643-8655826
333968  Dunham, I. et al.  Plus  8681004-8681241
334061  Dunham, I. et al.  Plus  9686941-9687077
334094  Dunham, I. et al.  Plus  9889953-9890105
334113  Dunham, I. et al.  Plus  10282459-10282597
334161  Dunham, I. et al.  Plus  1059QQ11.105QQ1RO
334219  Dunham, I. et al.  Plus  12716150-12716184
334239  Dunham, I. et al.  Plus  13056569-13056693
334333  Dunham, I. et al.  Plus  1 1601544-1 1601657
334378  Dunham, I. et al.  Plus  1 1907219-1 1907170
334382  Dunham, I. et al.  Plus  1 1Q1 5RRR-1 1Q1 finiR
334562  Dunham, I. et al.  Plus  1 4987847-1 49R7Q40
334588  Dunham, I. et al.  Plus  15019740-15019817
334616  Dunham, I. et al.  Plus  15176121-15176470
334633  Dunham, I. et al.  Plus  15333208-15331305
334866  Dunham, I. et al.  Plus  18872214-18872317
334891  Dunham, I. et al.  Plus  19299770-19299944
334934  Dunham, I. et al.  Plus  20103970-20104058
335015  Dunham, I. et al.  Plus  20682792-20682945
335120  Dunham, I. et al.  Plus  21436286-21436384
335125  Dunham, I. et al.  Plus  21441390-21441471
335179  Dunham, I. et al.  Plus  21634405-21634526
335188  Dunham, I. et al.  Plus  21669118-21669328
335211  Dunham, I. et al.  Plus  21 77461 1-21 77468Q
335361  Dunham, I. et al.  Plus  22807292-22807445
335379  Dunham, I. et al.  Plus  22899306-22899420
335414  Dunham, I. et al.  Plus  23235546-23235684
335416  Dunham, I. et al.  Plus  23237354-23237465
335496  Dunham, I. et al.  Plus  24164386-24164545
335497  Dunham, I. et al.  Plus  24167666-24167869
335558  Dunham, I. et al.  Plus  24740167-24740347
335586  Dunham, I. et al.  Plus  24990333-24990497
335686  Dunham, I. et al.  Plus  25439839-25439920
335784  Dunham, I. et al.  Plus  25942710-25942792
335823  Dunham, I. et al.  Plus  26365925-26366004
335983  Dunham, I. et al.  Plus  27938968-27939070
335995  Dunham, I. et al.  Plus  28009044-28009184


335021  Dunham, I. et al.  Plus  28686482-28686559
336034  Dunham, I. et al.  Plus  29014404-29014590
336038  Dunham, I. et al.  Plus  29022963-29023165
336107  Dunham, I. et al.  Plus  29987731-29987869
336632  Dunham, I. et al.  Plus  983890-985529
336633  Dunham, I. et al.  Plus  985591-986221
336634  Dunham, I. et al.  Plus  986296-986670
336635  Dunham, I. et al.  Plus  987908-988364
336636  Dunham, I. et al.  Plus  988418-989185
336637  Dunham, I. et al.  Plus  989276-990813
336638  Dunham, I. et al.  Plus  991906-993240
336659  Dunham, I. et al.  Plus  1896402-1895478
336694  Dunham, I. et al.  Plus  2420546-2420616
336721  Dunham, I. et al.  Plus  3371522-3371586
336900  Dunham, I. et al.  Plus  10236423-10236523
336948  Dunham, I. et al.  Plus  12692290-12692381
337028  Dunham, I. et al.  Plus  16644817-16644942
337054  Dunham, I. et al.  Plus  17821742-17821922
337162  Dunham, I. et al.  Plus  23478943-23479145
337183  Dunham, I. et al.  Plus  23943606-23943696
337184  Dunham, I. et al.  Plus  23973949-23974016
337268  Dunham, I. et al.  Plus  28011979-28012034
337299  Dunham, I. et al.  Plus  29022656-29022775
337389  Dunham, I. et al.  Plus  31401509-31401579
337493  Dunham, I. et al.  Plus  33330760-33330981
337549  Dunham, I. et al.  Plus  34474472-34474531
337755  Dunham, I. et al.  Plus  3971764-3971900
337809  Dunham, I. et al.  Plus  4449069-4449193
337871  Dunham, I. et al.  Plus  5443027-5443101
337958  Dunham, I. et al.  Plus  6969162-6969270
338008  Dunham, I. et al.  Plus  769706&-7697236
338033  Dunham, I. et al.  Plus  8092128-8092271
338110  Dunham, I. et al.  Plus  10384481-10384621
338112  Dunham, I. et al.  Plus  10391398-10391600
338145  Dunham, I. et al.  Plus  11386529-11386692
338148  Dunham, I. et al.  Plus  11448985-11449085
338179  Dunham, I. et al.  Plus  12808775-12808833
338197  Dunham, I. et al.  Plus  13638107-13638181
338279  Dunham, I. et al.  Plus  16168944-16169091
338316  Dunham, I. et al.  Plus  17089711-17089988
338322  Dunham, I. et al.  Plus  17132477-17132547
338357  Dunham, I. et al.  Plus  18062184-18062402
338359  Dunham, I. et al.  Plus  18074402-18074501
338366  Dunham, I. et al.  Plus  18252026-18252189
338374  Dunham, I. et al.  Plus  18371200-18371282
338414  Dunham, I. et al.  Plus  19345573-19345660
338418  Dunham, I. et al.  Plus  19435506-19435596
338501  Dunham, I. et al.  Plus  21244713-21244828
338506  Dunham, I. et al.  Plus  21221871-21221953
338523  Dunham, I. et al.  Plus  21509763-21509864
338662  Dunham, I. et al.  Plus  24404720-24404899
338804  Dunham, I. et al.  Plus  27236005-27236108
338836  Dunham, I. et al.  Plus  27792166-27792272
338879  Dunham, I. et al.  Plus  28410653-28410734
338937  Dunham, I. et al.  Plus  29160655-29160725
338993  Dunham, I. et al.  Plus  30077787-00078184
339047  Dunham, I. et al.  Plus  30760793-30760968
339100  Dunham, I. et al.  Plus  31141580-31141765
339114  Dunham, I. et al.  Plus  31456454-31456519
339121  Dunham, I. et al.  Plus  31583467-31583536
339170  Dunham, I. et al.  Plus  32216399-32216527
339293  Dunham, I. et al.  Plus  33223671-33223819
332858  Dunham, I. et al.  Minus  1339607-1339397
332982  Dunham, I. et al.  Minus  2628296-2628109
332984  Dunham, I. et al.  Minus  2632606-2632457
332998  Dunham, I. et al.  Minus  2711704-2711565
333058  Dunham, I. et al.  Minus  3028925-3028811
333097  Dunham, I. et al.  Minus  3204124-3204036
333121  Dunham, I. et al.  Minus  3308446-3308358
333122  Dunham, I. et al.  Minus  3309596-3309531
333123  Dunham, I. et al.  Minus  3310817-3310749
333140  Dunham, I. et al.  Minus  3377220-3376309
333260  Dunham, I. et al.  Minus  4308400-4308304
333603  Dunham, I. et al.  Minus  6466335-6465727
333604  Dunham, I. et al.  Minus  6467090-6466768
333904  Dunham, I. et al.  Minus  8217374-8217261
333906  Dunham, I. et al.  Minus  8218238-8218063
334183  Dunham, I. et al.  Minus  11832582-11832508
334187  Dunham, I. et al.  Minus  11921456-11921205
334222  Dunham, I. et al.  Minus  12732417-12732289
334223  Dunham, I. et al.  Minus  12734365-12734269
334255  Dunham, I. et al.  Minus  13200776-13200692
334492  Dunham, I. et al.  Minus  14478333-14478172
334648  Dunham, I. et al.  Minus  15363301-15363222
334787  Dunham, I. et al.  Minus  16299093-16298937


334933  Dunham, I. et al.  Minus  20078117-20077991
334945  Dunham, I. et al.  Minus  20138885-20138637
334957  Dunham, I. et al.  Minus  20173311-20173218
334990  Dunham, I. et al.  Minus  20341159-20341087
335093  Dunham, I. et al.  Minus  21297367-21297214
335288  Dunham, I. et al.  Minus  22304275-22303770
335289  Dunham, I. et al.  Minus  22305950-22305708
335548  Dunham, I. et al.  Minus  24662773-24662673
335551  Dunham, I. et al.  Minus  24679828-24678961
335619  Dunham, I. et al.  Minus  25082677-25082498
335620  Dunham, I. et al.  Minus  25092561-25092434
335621  Dunham, I. et al.  Minus  25098878-25098767
335682  Dunham, I. et al.  Minus  25421215-25421093
335755  Dunham, I. et al.  Minus  25763806-25763747
335814  Dunham, I. et al.  Minus  26320043-26319845
335815  Dunham, I. et al.  Minus  26320518-26320421
335835  Dunham, I. et al.  Minus  26393311-26393245
335851  Dunham, I. et al.  Minus  26604863-26604742
335868  Dunham, I. et al.  Minus  26711437-26711300
335896  Dunham, I. et al.  Minus  26977639-26977558
335936  Dunham, I. et al.  Minus  27360474-27360400
335948  Dunham, I. et al.  Minus  27555924-27555788
336066  Dunham, I. et al.  Minus  29241080-29240842
336205  Dunham, I. et al.  Minus  30477456-30477311
336275  Dunham, I. et al.  Minus  32086675-32086536
336292  Dunham, I. et al.  Minus  32818035-32817927
336331  Dunham, I. et al.  Minus  33594527-33594371
336419  Dunham, I. et al.  Minus  34052568-34052445
336675  Dunham, I. et al.  Minus  2020758-2020664
336684  Dunham, I. et al.  Minus  2158060-2157993
336716  Dunham, I. et al.  Minus  3259952-3259862
336798  Dunham, I. et al.  Minus  5888954-5888757
337043  Dunham, I. et al.  Minus  17407330-17407251
337046  Dunham, I. et al.  Minus  17610892-17610821
337128  Dunham, I. et al.  Minus  22215251-22215034
337192  Dunham, I. et al.  Minus  24591853-24591771
337194  Dunham, I. et al.  Minus  24610510-24610359
337229  Dunham, I. et al.  Minus  26716579-26716481
337325  Dunham, I. et al.  Minus  30015948-30015800
337497  Dunham, I. et al.  Minus  33371317-33371258
337500  Dunham, I. et al.  Minus  33376212-33376158
337603  Dunham, I. et al.  Minus  1299296-1299194
337605  Dunham, I. et al.  Minus  1346555-1346397
337671  Dunham, I. et al.  Minus  3260634-3260547
337788  Dunham, I. et al.  Minus  4133203-4133081
337862  Dunham, I. et al.  Minus  5347658-5347550
338083  Dunham, I. et al.  Minus  9318438-9318301
338158  Dunham, I. et al.  Minus  11794465-11794343
338161  Dunham, I. et al.  Minus  12124716-12124658
338182  Dunham, I. et al.  Minus  12824919-12824827
338189  Dunham, I. et al.  Minus  12878594-12878478
338199  Dunham, I. et al.  Minus  13760865-13760780
338215  Dunham, I. et al.  Minus  14055447-14055355
338469  Dunham, I. et al.  Minus  20520387-20520242
338549  Dunham, I. et al.  Minus  22049171-22049081
338561  Dunham, I. et al.  Minus  22311966-22311858
338671  Dunham, I. et al.  Minus  24508421-24508346
338676  Dunham, I. et al.  Minus  24637427-24637369
338726  Dunham, I. et al.  Minus  25926206-25925618
338779  Dunham, I. et al.  Minus  27030151-27029795
338871  Dunham, I. et al.  Minus  28301708-28301611
338872  Dunham, I. et al.  Minus  28300921-28300790
338966  Dunham, I. et al.  Minus  29614876-29614749
339229  Dunham, I. et al.  Minus  3272233042722199
339264  Dunham, I. et al.  Minus  3297514542975053


325228 


6381940 ■ Plus 


2630-2694 




325235 


6381943 Minus 


162154-162264 


329588 


3962484 Plus 


1169-1619 




329560 


3962491 Plus 


2095-2990 




329541 


3983503 Minus 


2765-3059 




325328 


5866875 Pius 


86780-86854 




325340 


6017033 Minus 


166656-166819 


325373 


5866920 Minus 


1136686-1136777 


325367 


5866920 Minus 


922881-922958 


325389 


5866921 Plus 


239672-239759 


325436 


5866939 Minus 


29778-29907 




325498 


5866967 Plus 


173372-173930 


325471 


6017034 Minus 


289268-289342 


325557 


6056302 Plus 


50921-51050 




325559 


6249595 Minus 


11359ai 19172 


325560 


6249595 Minus 


133794-133981 


325569 


6249599 Plus 


79927-80217 




325587 


6682462 Plus 


126724-126967 


325585 


6682462 Plus 


73476-73574 




325597 


5866992 Plus 


1065020-1065089 


325639 


5867002 Plus 


253525-253608 
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205138-205269 




325739 


5867038 


Minus 




325740 


5867038 


Minus 


207533-207690 




325792 


6469828 


Minus 


1018-1176 




325735 


6552447 


Minus 


269122-269190 


5 


325685 


6582468 


Plus 


117397-117483 




325686 


6682468 


Plus 


118337-118439 




325819 


6682490 


Minus 


130314-130370 




329764 


6048195 


Minus 


109733-109968 


10 


329703 


6065793 


Minus 


139994-140138 


329543 


6448539 


Plus 


53403-53537 




329816 


6624888 


Minus 


7029670423 




329860 


6687260 


Minus 


163474-163605 




325883 


5867087 


Plus 


22498-22663 


15 


325895 


5867097 


Plus 


358317-358476 


325925 


5867124 


Plus 


115749-115962 




325932 


5867127 


Plus 


7369-7441 




325941 


5867133 


Minus 


64228-64402 




325969 


5867153 


Plus 


101911-102081 


20 


325971 


5867153 


Plus 


105841-106035 


329993 


4567166 


Minus 


101307-101434 




330020 


6671887 


Plus 


172397-172491 




326163 


5867168 


Minus 


7831-8035 




326274 


5867171 


Minus 


41028*410404 


25 


326025 


5867176 


Plus 


70854-70915 


326046 


5867182 


Minus 


62668-62825 




326099 


5867186 


Minus 


661381-661510 




326108 


5867187 


Minus 


23784-23903 




326165 


5867208 


Minus 


62787-62929 


30 


326189 


5867212 


Pius 


69288-69413 


326204 


5867218 


Minus 


148088-148200 




326230 


5867230 


Minus 


301868-301972 




330052 


4567182 


Plus 


352560-352963 




330036 


6042048 


Pius 


117120-117216 


35 


326360 


5867293 


Phis 


13627-13844 


326589 


5867320 


Plus 


22760-22919 




326393 


5867341 


Plus 


41702-41841 




326505 


5867435 


Minus 


8818-8949 




326515 


5867439 


Plus 


36683-36809 


40 


326592 


6138928 


Plus 


23689-23828 


330107 


6015249 


Mmu$ 


100091-100282 




330106 


6015249 


Minus 


99443-99778 




330100 


6015253 


Plus 


21166-21301 




330093 


6015278 


Plus 


1043-1199 


45 


330088 


6015293 


Plus 


37517-37638 


330085 


6015302 


Minus 


59613-59770 




330120 


6671864 


Minus 


127553-127656 




330123 


6671869 


Minus 


35311-35406 




326742 


5867611 


Minus 


95187-95248 


50 


326605 


5867637 


Plus 


24656-24749 


326818 


6117831 


Minus 


15199-15309 




326720 


6552456 


Plus 


84525-84677 




326770 


6598307 


Minus 


513603-513668 




326692 


6682502 


Pius 


117697-117899 


55 


326693 


6682502 


Minus 


335002-335095 


326983 


5867657 


Minus 


16023-16581 




326991 


5867660 


Pius 


18147-18339 




326936 


6004446 


Minus 


10217-10357 




326964 


6469836 


Plus 


75340-75456 


60 


327040 


6531965 


Plus 


783670-783817 


327053 


6531965 


Rus 


2247267-2247437 




327075 


6531965 


Plus 


4041318-4041431 




327085 


6531965 


Pius 


4734947-4735069 




327036 


6531965 


Pius 


319951-320040 


65 


327130 


6531976 


Rus 


20247-22343 


327156 


5866841 


Minus 


2462-2620 




327288 


5867481 


Plus 


48583-48773 




327332 


5867516 


Minus 


56351-56532 




327220 


5857525 


Minus 


65701-65781 


70 


327224 


5867534 


PJus 


188468-188544 


327321 


6249562 


Minus 


99745-99836 




327361 


6552412 


Minus 


61013-62130 




327396 


5867743 


Plus 


8702-8820 




327414 


5867750 


Plus 


102461-102586 


75 


327442 


5867759 


Plus 


111483-111618 


327467 


5867772 


Plus 


88030-88151 




327473 


5867775 


Plus 


75101-75181 




327483 


5867783 


Plus 


181573-181662 




327377 


5867793 


Minus 


37610-37676 




327562 


5867804 


Minus 


343989-344474 


327568 


5867811 


Minus 


46152-46287 




327606 


6004463 


Plus 


200262-200495 




327611 


5867868 


Minus 


175063-175392 




327642 


5867891 


Minus 


2513-2743 




327654 


5867910


Minus 


97564-97710 


327734 


5867940 


Minus 


31003-31583 
327775


5867964 


Minus 


130791-130871 


327796 


5867982


Plus 


85267-85405 


327840 


6249578 


Minus 


73055-73206 


330208 


6013599 


Plus


66517-66931 


330263 


6671884 


Minus 


101503-101634 


328004 




Minus 


157407-157887 


328101 


5868020 


Plus 


289920-290014 


328100 


5868020 


Minus 


263545-263635 


328113 


5868024 


Minus 


80378-80491 


328157 


5868064 


Plus 


73326-73615 


328196 


5868080 


Minus 


16551-16729 


328197 


5868081 


Minus 


42133-42438 


327940 


5868197


Minus 


95240-95428 


327984 


5868216 


Plus 


66611-66677 


328021 


5902482 


Plus 


713478-714590 


328068 


6117819 


Plus 


253903-254022 


328264 


6381912 


Plus


55086-55404 


330300 


2905882 


Minus 


3246-3302 


328608 


5868222


Minus 


87770-87953 


328600 


5868229 


Minus 


38889-40010


328616 


5868239 


Plus 


293920-294224 


328623 


5868246 


Minus 


120020-120126


328632 


5868247 


Plus 


76734-76853 


328666 


5868254 


Minus 


778-901 


328698 


5868264 


Minus 


625555-625533 


328700 


5868264 


Plus 


764089-764203 


328708 


5868271 


Minus 


68114-68854 


328735 


5868289 


Plus


89389-89455 


328743 


5868289 


Plus 


274638-274726 


328806 


5868324 


Plus


29408-29684 


328299 


5868366 


Minus 


149708-149889 


328342 


5868383 


Plus 


59955-60094 


328365 


5868387 


Minus 


270724-270798 


328369 


5868388 


Plus


75371-75583 


328381 


5868392 


Plus


662758-662848 


328451 


5868425 


Minus 


217275-217336 


328481 


5868449 


Minus 


8987-9180 


328500 


5868464 


Plus 


59098-59481 


328530 


5868482 


Plus 


334973-335406 


328664 


6004473 


Plus 


1193739-1193866 


328861 


6381928 


Minus 


108317-108403 


328908 


5868493 


Plus


117002-117059 


328933 


5868500 


Plus 


771755-771889 


328934 


5868500 


Plus 


846342-846448 


328949 


6456765 


Minus 


43552-43619 


330313 


6042030 


Minus 


33642-33775 


329005 


5868542 


Plus 


85470-85673 


330366 


2944106 


Plus 


151837-151914 


330372 


6580495 


Minus 


317461-317688 


329033 


5868561 


Minus 


5390-5479 


329037 


5868562 


Minus 


32466-32562 


329067 


5868591 


Minus 


146417-147652 


329134 


5868679 


Plus 


29959-30018 


329157 


5868687 


Minus 


145940-146155 


329178 


5868704 


Plus 


179177-179463 


329192 


5868716


Plus 


166936-167020 


329194 


5868716 


Minus 


304450-304559 


329204 


5868720 


Minus 


3050-3190 


329224 


5868728 


Plus 


27422-27664 


329228 


5868728 


Minus 


50118-50287 


329288 


5868771 


Plus 


25554-26299 


329337 


5868806 


Minus 


467155-467222


329011 


6682532 


Plus 


48658-48741
TABLE 9A: Potential Therapeutic, Diagnostic and Prognostic Targets for Therapy of Lung Cancer

Table 9A shows about 1312 genes up-regulated in lung tumors (including squamous cell carcinomas, adenocarcinomas, small cell carcinomas, granulomatous and carcinoid
tumors) relative to normal body tissues. These genes were selected from about 59680 probesets on the Eos/Affymetrix Hu03 Genechip array.

Table 9B shows the accession numbers for those Pkeys lacking UnigeneIDs in Table 9A. For each probeset we have listed the gene cluster number from which the
oligonucleotides were designed. Gene clusters were compiled using sequences derived from Genbank ESTs and mRNAs. These sequences were clustered based on sequence
similarity using Clustering and Alignment Tools (DoubleTwist, Oakland, California). The Genbank accession numbers for sequences comprising each cluster are listed in the
"Accession" column.

Table 9C shows the genomic positioning for those Pkeys lacking Unigene IDs and accession numbers in Table 9A. For each predicted exon, we have listed the genomic
sequence source used for prediction. Nucleotide locations of each predicted exon are also listed.
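
The genomic positioning rows described above can be read as small records. The following is a minimal sketch, not the applicants' software: the field names (`pkey`, `ref`, `strand`, `start`, `end`) and the `parse_exon` helper are hypothetical, and it assumes that each nucleotide range is written as `start-end` with both ends inclusive.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class PredictedExon:
    pkey: int    # Eos probeset identifier
    ref: int     # genomic sequence source identifier
    strand: str  # "Plus" or "Minus"
    start: int   # first nucleotide of the predicted exon (assumed inclusive)
    end: int     # last nucleotide of the predicted exon (assumed inclusive)

    @property
    def length(self) -> int:
        # Inclusive coordinates are an assumption of this sketch.
        return self.end - self.start + 1


def parse_exon(pkey: str, ref: str, strand: str, nt_position: str) -> PredictedExon:
    """Build a PredictedExon from the four cell values of one table row."""
    strand = strand.strip()
    if strand not in ("Plus", "Minus"):
        raise ValueError(f"unexpected strand: {strand!r}")
    start_text, sep, end_text = nt_position.strip().partition("-")
    if not sep:
        raise ValueError(f"nucleotide range lacks a hyphen: {nt_position!r}")
    start, end = int(start_text), int(end_text)
    if start > end:
        raise ValueError(f"range start exceeds end: {nt_position!r}")
    return PredictedExon(int(pkey), int(ref), strand, start, end)


# One row as it appears in the genomic positioning listing.
exon = parse_exon("326589", "5867320", "Plus", "22760-22919")
print(exon.strand, exon.length)
```

Rejecting malformed strands and ranges, rather than guessing, keeps rows that were damaged in transcription from silently entering downstream analysis.
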
Pkey: Unique Eos probeset identifier number

ExAccn: Exemplar Accession number, Genbank accession number 

UnigeneID: Unigene number

Unigene Title: Unigene gene title 

R1: Average of lung tumors (including squamous cell carcinomas, adenocarcinomas, small cell carcinomas, granulomatous and carcinoid tumors) divided by the
average of normal lung samples

R2: Average of non-malignant lung disease samples (including bronchitis, emphysema, fibrosis, atelectasis, asthma) divided by the average of normal lung samples
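
The R1 and R2 columns defined above are ratios of group averages. The following is a minimal sketch of that arithmetic, assuming per-sample expression values are available as plain numbers. The function name `expression_ratio` and the sample values are hypothetical and are not taken from the tables.

```python
from statistics import mean


def expression_ratio(sample_values, normal_lung_values):
    """Average of one sample group divided by the average of normal lung samples."""
    normal_average = mean(normal_lung_values)
    if normal_average == 0:
        raise ValueError("average of normal lung samples is zero")
    return mean(sample_values) / normal_average


# Hypothetical expression values for a single probeset.
tumor = [120.0, 95.0, 145.0]
non_malignant = [40.0, 50.0]
normal_lung = [30.0, 30.0, 30.0]

r1 = expression_ratio(tumor, normal_lung)          # lung tumors vs. normal lung
r2 = expression_ratio(non_malignant, normal_lung)  # non-malignant disease vs. normal lung
print(f"R1={r1:.2f} R2={r2:.2f}")
```

Under these definitions a value of 1.00 means that the group average equals the normal lung average.
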
57.00


411027 


AF072099


Hs.67846 


leukocyte immunoglobulin-like receptor,


1.62 


3.78 


411074 


X60435 


Hs.68137 


adenylate cyclase activating polypeptide 


1.00 


1.15 


411089 


AA456454 




cell division cycle 2-like 1 (PITSLRE pr


1.56 


1.58 


411152 


BE069199 




gb:QV3-BT0379-O1030O-105-g03 BT0379 Homo 


1.00 


84.00 


411248 


AAS51538 


Hs.334605 


Homo sapiens cDNA FLJ14408 fis, clone HE


1.82 


1.45 


411252 


AB018549 


Hs.69328 


MD-2 protein 


7.32 


12.74 


411263 


BE297802 


Hs.69360 


kinesin-like 6 (mitotic centromere-assoc


3.44 


2.55 


411365 


M76477 


Hs.289082 


GM2 ganglioside activator protein


1.35 


2.02


411402 


BE297855 


Hs.69855 


NRAS-related gene 


1.00 


46.00 


411573 


AB029000 


Hs.70823


KIAA1077 protein


11.40 


11.35 


411579 


AC005258 


Hs.70830 


U6 snRNA-associated Sm-like protein LSm7


1.08 


1.90 


411617 


AA247994 


Hs.90063 


neurocalcin delta


1.74


2.57


411732 


AA059325 


Hs.71642 


guanine nucleotide binding protein (G pr 


1.02 


1.00 


411773 


NM_006799


Hs.72026 


protease, serine, 21 (testisin)


1.34 


2.19


411789 


AF245505 


Hs.72157


Adlican


2.19 


2.79


411800 


N39342 


Hs.103042 


microtubule-associated protein 1B 


23.34 


34.00 


411945 


AL033527 


Hs.92137 


v-myc avian myelocytomatosis viral oncog


1.00 


8.00 


412115 


AK001763 


Hs.73239 


hypothetical protein FLJ10901


2.07


1.64 


412140 


AA219691 


Hs.73625 


RAB6 interacting, kinesin-like (rabkines


118.48 


92.00


412276 


BE262621 


Hs.73798


macrophage migration inhibitory factor ( 


1.98 


1.49 


412464 


T78141 


Hs.22826 


ESTs, Weakly similar to I55214 salivary


1.16 


1.34 


412530 


AA766268 


Hs.266273 


hypothetical protein FLJ13346


41.52 


84.00 


412537 


AL031778 




nuclear transcription factor Y, alpha 


17.90 


55.00 
412659


AW753865 


Hs.74376 


olfactomedin related ER localized protei


14.65 


47.00 


412719 


AW016610 


Hs.816 


ESTs 


382.46


128.00 


412723 


AA648459 


Hs.335951 


hypothetical protein AF301222 


54.90 


1.00 


412811 


K06382 




ESTs 


1.00 


11.00 


412817 


AL037159 


Hs.74619 


proteasome (prosome, macropain) 26S subu 


1.63 


1.42 


412863 


AA121673 


Hs.59757 


zinc finger protein 281 


17.63 


56.00 


412924 


BE018422 


Hs.75258 


H2A histone family, member Y 


1.00 


22.00


413004 


T35901 


Hs.75117 


interleukin enhancer binding factor 2, 4


2.19


2.05


413011 


AW068115 


Hs.821 


biglycan


1.22 


1.88 


413048 


M93221


Hs.75182 


mannose receptor, C type 1 


0.30 


6.23 


413083 


AL035737 


Hs.75184 


chitinase 3-like 1 (cartilage glycoprote


3.43 


8.71 


413129 


AF292100 


Hs.104613


RP42 homolog 


4.67 


4.77 


413142 


M81740 


Hs.75212 


ornithine decarboxylase 1 


1.92 


2.59


413223 


AI732182


Hs.191866 


ESTs 


5.73 


27.00 


413248 


T64858 


Hs.21433 


hypothetical protein DKFZp547J036 


0.99 


1.06 


413273 


U75679 


Hs.75257 


stem-loop (histone) binding protein 


1.00 


18.00 


413278 


BE563085 


Hs.833 


interferon-stimulated protein, 15 kDa 


1.10 


1.09 


413281 


AA861271 


Hs.222024 


transcription factor SMAL2 


95.94 


69.00 


413364 


BE536218 


Hs.137516 


fidgetin-like 1


1.00 


1.00 


413385 


M34455 


Hs.840 


indoleamine-pyrrole 2,3 dioxygenase


0.95 


2.09


413409 


AI638418


Hs.1440 


DEAD/H (Asp-Glu-Ala-Asp/His) box polypep


1.00 


1.00 


413453 


AA129640 


Hs.128065


ESTs 


1.00 


31.00 


413527 


BE250788 


Hs.179882 


hypothetical protein FLJ12443


1.08 


1.46 


413554 


AA319146 


Hs.75426 


secretogranin II (chromogranin C) 


79.15 


114.00 


413573 


AI733859 


Hs.149089 


ESTs 


1.00 


1.00 


413582 


AW295647 


Hs.71331 


hypothetical protein MGC5350 


8.80 


10.00 


413597 


AW302885 


Hs.117183 


ESTs 


1.00 


1.00 


413690 


BE157489 




gb:RC1-HT0375-12020T>011-e06 HT0375 Homo 


1.00 


1.00 


413691 


AB023173 


Hs.75478 


ATPase, Class VI, type 11B


3.16 


2.32


413719 


BE439580 


Hs.75498 


small inducible cytokine subfamily A (Cy 


2.88


9.52 


413753 


U17760 


Hs.75517 


laminin, beta 3 (nicein (125kD), kalinin


144.10 


108.00 


413801 


M62246 


Hs.35406 


ESTs, Highly similar to unnamed protein 


1.00 


17.00 


413833 


Z15005 


Hs.75573 


centromere protein E (312kD) 


1.00 


1.00 


413882 


AA132973 


Hs.184492 


ESTs 


64.24 


148.00 


413926 


AA133338 


Hs.54310 


ESTs 


1.00 


67.00 


413943 


AW294416 


Hs.144687 


Homo sapiens cDNA FLJ12981 fis, clone NT


43.42 


42.00


413995 


BE048146 


Hs.75671 


syntaxin 1A (brain) 


1.23 


1.11 


414035 


Y00630 


Hs.75716 


serine (or cysteine) proteinase inhibito


2.02


2.51


414142 


AW368397 


Hs.334485 


Homo sapiens cDNA FLJ14438 fis, clone HE


1.00 


102.00


414180 


AI863304 


Hs.120905 


Homo sapiens cDNA FLJ11448 fis, clone HE


6.92 


77.00 


414245 


BE148072 


Hs.75850 


WAS protein family, member 1 


1.00


1.00 


414275 


AW970254 


Hs.889 


Charcot-Leyden crystal protein


1.00 


59.00 


414317 


BE263280 


Hs.75888 


phosphogluconate dehydrogenase


1.52 


1.73 


414334 


M824298 


Hs.21331 


hypothetical protein FLJ10036


1.78 


1.72 


414341 


D80004 


Hs.75909 


KIAA0182 protein


33.90 


151.00 


414368 


W70171 


Hs.75939 


uridine monophosphate kinase 


171.60 


97.00 


414416 


AW409985 


Hs.76084 


hypothetical protein MGC2721 


2.32


1.85 


414430 


AI346201


Hs.76118


ubiquitin carboxyl-terminal esterase L1


226.15 


66.00 


414570 


Y00285 


Hs.76473 


insulin-like growth factor 2 receptor 


1.64 


1.98 


414618 


AI204600


Hs.96978 


hypothetical protein MGC10764 


1.87 


72.00 


414675 


R79015 


Hs.296281 


interleukin enhancer binding factor 1


1.51 


1.39 


414683 


S78296 


Hs.76888 


hypothetical protein MGC12702


43.61 


64.00 


414696 


AF002020 


Hs.76918 


Niemann-Pick disease, type C1


28.63 


71.00 


414711 


AI310440 


Hs.288735 


Homo sapiens cDNA FLJ13522 fis, clone PL


14.86 


42.00


414718 


H95348 


Hs.107987 


ESTs 


1.00


5.00 


414732 


AW410976 


Hs.77152 


minichromosome maintenance deficient (S. 


1.64 


1.44 


414747 


U30872 


Hs.77204 


centromere protein F (350/400kD, mitosin 


65.01 


74.00 


414761 


AU077228 


Hs.77256 


enhancer of zeste (Drosophila) homolog 2


130.35 


121.00 


414774 


X02419 


Hs.77274 


plasminogen activator, urokinase 


2.24 


2.19


414806 


D14694 


Hs.77329 


phosphatidylserine synthase 1


1.63 


1.53 


414809 


AJ434699 


Hs.77356 


transferrin receptor (p90, CD71) 


1.97 


2.60


414812 


X72755 


Hs.77367 


monokine induced by gamma interferon


3.48 


10.60 


414825 


X06370 


Hs.77432 


epidermal growth factor receptor (avian 


103.22 


143.00 


414839 


X63692 


Hs.77462 


DNA (cytosine-5-)-methyltransferase 1


1.80 


1.69 


414883 


AA926960 




CDC28 protein kinase 1 


14.29 


10.06 


414907 


X90725 


Hs.77597 


polo (Drosophila)-like kinase


1.95 


2.20


414914 


U49844 


Hs.77613 


ataxia telangiectasia and Rad3 related 


3.00 


2.90


414945 


BE076358 


Hs.77667 


lymphocyte antigen 6 complex, locus E 


1.02 


1.21 


414972 


BE263782 


Hs.77595 


KIAA0003 gene product


1.00 


1.00 


415014 


AW954064 


Hs.24951 


ESTs 


1.42 


2.84


415091 


AL044872 


Hs.77910 


3-hydroxy-3-methylglutaryl-Coenzyme A sy


1.00 


30.00 


415138 


C18356 


Hs.295944 


tissue factor pathway inhibitor 2 


34.72 


107.00 


415227 


AW821113 


Hs.72402 


ESTs 


1.87 


49.00 


415238 


R37780 


Hs.21422 


ESTs 


1.00 


1.00 


415263 


AA948033 


Hs.130853


ESTs 


1.00 


1.00 


415295 


R41450 


Hs.6546 


ESTs 


1.00 


1.00 


415339 


NM_015156


Hs.78398 


KIAA0071 protein


51.18 


166.00 


415669 


NM_005025


Hs.78589 


serine (or cysteine) proteinase inhibito


30.84 


63.00 


415674 


BE394784 


Hs.78596 


proteasome (prosome, macropain) subunit,


1.48 


1.39 


415709 


AA649850 


Hs.278558 


ESTs 


1.00 


1.00 


415735 


AA704162 


Hs.120811


ESTs, Weakly similar to I38022 hypotheti


1.00 


72.00 


415799 


AA653718 


Hs.225841 


DKFZP434D193 protein 


6.23 


31.00 


415817 


U88967 


Hs.78867 


protein tyrosine phosphatase, receptor-t 


24.30 


1.00 


415857 


AA866115 


Hs.127797 


Homo sapiens cDNA FLJ11381 fis, clone HE


32.51


35.00 


415989 


AI267700




ESTs 


78.89 


1.00 
416018


AW138239 


Hs.78977 


proprotein convertase subtilisin/kexin t


1.00 


1.00 


416065 


BE267931 


Hs.78996 


proliferating cell nuclear antigen


3.35 


2.32


416111 


AA033813 


Hs.79018


chromatin assembly factor 1, subunit A (


39.03 


3.00 


416177 


AA174069 


Hs.187607 


ESTs 


1.00 


9.00 


416178 


AI808527 


Hs.192822 


serologically defined breast cancer anti 


1.83


1.76


416208 


AW291168 


Hs.41295 


ESTs, Weakly similar to MUC2_HUMAN MUCIN


3.67 


1.00 


416209 


AA236776 


Hs.79078 


MAD2 (mitotic arrest deficient, yeast, h 


9.70 


1.00 


416239 


AL038450 


Hs.48948 


ESTs 


83.87 


129.00


416250 


AA581386 


Hs.73452 


hypothetical protein MGC10791 


1.96 


2.12


416322 


BE019494 


Hs.79217 


pyrroline-5-carboxylate reductase 1


2.08 


1.73 


416423 


H54375 


Hs.268921 


ESTs 


1.00 


89.00 


416448 


L13210 


Hs.79339 


lectin, galactoside-binding, soluble, 3 


1.28 


1.54 


416498 


U33532 


Hs.79351 


potassium channel, subfamily K, member 1 


27.29


67.00 


416658 


U03272 


Hs.79432 


fibrillin 2 (congenital contractural ara 


53.29 


51.00 


416661 


AA634543 


Hs.79440


IGF-II mRNA-binding protein 3


9.98 


5.00 


416722 


AA354604 


Hs.122546


hypothetical protein FLJ23017


3.68 


33.00 


416819 


U77735 


Hs.80205


pim-2 oncogene 


1.59 


1.84 


416936 


N21352 


Hs.42987 


ESTs, Weakly similar to S21348 probable


1.00 


1.00 


417034 


NM_006183


Hs.80962 


neurotensin 


1.00 


1.00 


417061 


AI675944


Hs.188691 


Homo sapiens cDNA FLJ12033 fis, clone HE


32.95 


156.00 


417079 


U65590 


Hs.81134 


interleukin 1 receptor antagonist 


3.91 


4.93 


417218 


AA129547 


Hs.285754 


met proto-oncogene (hepatocyte growth fa 


1.00 


51.00 


417233 


W25005 


Hs.24395 


small inducible cytokine subfamily B (Cy 


3.38 


2.05


417308 


H60720 


Hs.81892 


KIAA0101 gene product


82.94 


25.36 


417315 


AI080042 


Hs.180450 


ribosomal protein S24 


106.61 


121.00 


417324 


AW265494 




ESTs 


1.20 


1.28 


417366 


BE185289 


Hs.1076 


small proline-rich protein 1B (cornifin)


8.97 


3.27 


417389 


BE260964 


Hs.82045 


midkine (neurite growth-promoting factor


2.59


1.82 


417428


N87579 


Hs.278871 


gb:LL2030F Human fetal heart Lambda ZAP 


1.00 


52.00 


417433 


BE270266 


Hs.82128 


5T4 oncofetal trophoblast glycoprotein 


304.75 


173.00 


417466 


AI681547 


Hs.59457 


hypothetical protein FLJ22127


1.24 


1.34 


417512 


AI979168


Hs.344096 


glycoprotein (transmembrane) nmb 


2.14 


5.50 


417515 


L24203 


Hs.82237 


ataxia-telangiectasia group D-associated


2.66 


1.68


417542 


J04129 


Hs.82269 


progestagen-associated endometrial prote


1.28 


1.35 


417576 


AA339449 


Hs.82285 


phosphoribosylglycinamide formyltransfer


42.76 


51.00 


417715 


AW969587 


Hs.86366 


ESTs 


6.35 


2.75


417720 


AA205625 


Hs.208067 


ESTs 


113.31 


56.00 


417791 


AW965339 


Hs.111471


ESTs 


39.98 


16.00 


417830 


AW504786 


Hs.122579 


hypothetical protein FLJ10461


2.61


31.00 


417866 


AW067903 


Hs.82772 


collagen, type XI, alpha 1


2.35 


2.44


417900 


BE250127 


Hs.82906 


CDC20 (cell division cycle 20, S. cerevi


1.52 


1.11 


417933 


X02308 


Hs.82962 


thymidylate synthetase 


4.74 


2.55


417944 


AU077196 


Hs.82985 


collagen, type V, alpha 2


3.61 


5.21 


417975 


AA641836 


Hs.30085 


hypothetical protein FLJ23186


12.49 


38.00 


417991 


AA731452 


Hs.190008 


ESTs 


1.00 


26.00 


418004 


U37519 


Hs.87539 


aldehyde dehydrogenase 3 family, member 


3.02 


2.12


418007 


M13509 


Hs.83169 


matrix metalloproteinase 1 (interstitial


187.59 


1.00 


418054 


NM_002318 


Hs.83354 


lysyl oxidase-like 2 


2.85 


2.63


418057 


NM_012151


Hs.83363 


coagulation factor VIII-associated (intr


1.54 


1.69 


418113 


AI272141 


Hs.83484 


SRY (sex determining region Y)-box 4 


6.82 


5.22 


418140 


BE613836


Hs.83551 


microfibrillar-associated protein 2


1.26 


1.46 


418203 


X54942 


Hs.83758 


CDC28 protein kinase 2 


134.19 


144.00 


418207 


C14685 


Hs.34772 


ESTs 


1.00 


1.00 


418216 


AA662240 


Hs.283099 


AF15q14 protein 


64.66 


61.00 


418236 


AW994005 


Hs.337534 


ESTs 


18.53 


147.00 


418249 


H89226 


Hs.34892 


KIAA1323 protein


30.53 


106.00 


418281 


U09550 


Hs.1154 


oviductal glycoprotein 1, 120kD (mucin 9 


1.00 


3.00


418283 


S79895 


Hs.83942 


cathepsin K (pycnodysostosis)


3.96 


5.16 


418300 


AJ433074 


Hs.86682 


Homo sapiens cDNA: FLJ21578 fis, clone C 


3.18 


2.91


418322 


AA284166 


Hs.84113 


cyclin-dependent kinase inhibitor 3 (CDK


11.96 


6.68 


418327 


U70370 


Hs.84136 


paired-like homeodomain transcription fa


9.23 


2.22


418345 


AJ001696


Hs.241407


serine (or cysteine) proteinase inhibito


1.00


1.00 


418379 


AA218940 


Hs.137516 


fidgetin-like 1


21.68 


44.00 


418397 


NM_001269 


Hs.84746 


chromosome condensation 1 


1.00 


8.00 


418403 


D86978 


Hs.84790 


KIAA0225 protein


16.91 


18.98 


418462 


BE001596 


Hs.85266 


integrin, beta 4 


1.56 


1.16 


418478 


U38945 


Hs.1174 


cyclin-dependent kinase inhibitor 2A (me 


3.22 


2.38


418506 


AA084248 


Hs.85339 


G protein-coupled receptor 39 


2.66


2.22


418526 


BE019020


Hs.85838 


solute carrier family 16 (monocarboxylic 


2.04


2.21


418538 


BE244323 


Hs.85951 


exportin, tRNA (nuclear export receptor 


1.33 


37.00 


418543 


NM_005329


Hs.85962 


hyaluronan synthase 3 


1.04 


1.23 


418574 


N28754 




M-phase phosphoprotein 9


48.60 


85.00 


418592 


X99226 


Hs.284153 


Fanconi anemia, complementation group A 


18.24 


26.00 


418641 


BE243136 


Hs.86947 


a disintegrin and metalloproteinase doma 


1.19 


1.41 


418661 


NM_001949


Hs.1189 


E2F transcription factor 3 


29.05 


43.00 


418663 


AK001100 


Hs.41690 


desmocollin 3


112.17


19.00 


418678 


NM_001327


Hs.87225 


cancer/testis antigen


1.18 


1.10 


418686 


Z36830 


Hs.87268 


annexin A8 


1.54 


1.98 


418589 


AI360883 


Hs.274448 


hypothetical protein FLJ11029


1.19 


1.04 


418712 


Z42183 




gb:HSC0BF041 normalized infant brain cDN 


1.00 


12.00


418727 


AA227609 


Hs.94834 


ESTs 


1.00 


49.00 


418738 


AW388633 


Hs.6682 


solute carrier family 7, (cationic amino 


49.85 


1.00 


418819 


AA228776 


Hs.191721


ESTs 


1.00 


140.00 


416830 


BE513731 


Hs.88959


hypothetical protein MGC4S16


20.97 


23.00 


418882 


NM_004996


Hs.89433 


ATP-binding cassette, sub-family C (CFTR 


57.09 


35.00 
418971


AA360392 


Hs.87113 


ESTs 


1.00 


12.00 


418373 


AA233056 


Hs.191518 


ESTs 


4.89 


28.00 


419078 


M93119 


Hs.89584


insulinoma-associated 1


1.00 


10.00 


419079 


AW014836 


Hs.18844 


ESTs 


1.09 


1.98 


419080 


AW150835 


Hs.18876 


hypothetical protein FU21620 


2.06 


1.68 


419088 


AJ538323


Hs.52620 


integrin, beta 8 


15.60 


51.00 


419092 


J05581 


Hs.89603 


mucin 1, transmembrane 


1.11 


1.83 


419121 


AA374372 


Hs.89626 


parathyroid hormone-like hormone


1.00 


1.00 


419171 


NM_002846


Hs.89655 


protein tyrosine phosphatase, receptor t 


1.10 


1.14 


419183 


U60669 


Hs.89663 


cytochrome P450, subfamily XXIV (vitamin 


1.00 


1.00 


419216 


AU076718 


Hs.164021 


small inducible cytokine subfamily B (Cy 


3.18 


2.43


419288 


AA256106 


Hs.87507 


ESTs 


1.00 


34.00 


419335 


AW960146 


Hs.284137 


hypothetical protein FLJ12888


1.00 


aoo 


419354 


M62839 


Hs.1252 


apolipoprotein H (beta-2-glycoprotein I)


22.63 


54.00 


419359 


AL043202 


Hs.90073 


chromosome segregation 1 (yeast homolog) 


2.50 


1.98 


419423 


D26488 


Hs.90315


KIAA0007 protein


1.00 


7.00 


419443 


D62703 




gb:HUM316G1QB Clontech human aorta polyA


1.00 


12.00 


419452 


U33635


Hs.90572 


PTK7 protein tyrosine kinase 7 


1.64 


1.84 


419474 


AW968619 


Hs.155849


ESTs 


13.63 


62.00 


419485 


AA489023 


Hs.99807 


ESTs, Weakly similar to unnamed protein 


4.27 


2.26 


419488 


AA316241 


Hs.90691 


nucleophosmin/nucleoplasmin 3


3.66 


3.63 


419502 


AU076704 




fibrinogen, A alpha polypeptide 


13.05 


115.00 


419539 


AF070590 


Hs.90869 


Homo sapiens clones 24622 and 24623 mRNA 


74.60 


117.00 


419556 


U29615 


Hs.91093 


chitinase 1 (chitotriosidase) 


1.47 


4.98 


419569 


AI971651 


Hs.91143 


jagged 1 (Alagille syndrome)


1.00 


4.00 


419594 


AA013051 


Hs.91417 


topoisomerase (DNA) II binding protein


94.30 


94.00 


419703 


AI793257 


Hs.128151 


ESTs 


15.26 


50.00 


419721 


NM_001650


Hs.288650 


aquaporin 4 


1.00 


191.00 


419729 


AA586442 


Hs.21411


gb:no53a03.s1 NCI_CGAP_SS1 Homo sapiens


1.00 


59.00 


419741 


NM_007019 


Hs.93002 


ubiquitin carrier protein E2-C


2.02 


1.08 


419745 


AF042001


Hs.93005 


slug (chicken homolog), zinc finger prot


1.00 


1.00 


419752 


AA249573 


Hs.152618


ESTs, Moderately similar to ZN91_HUMAN 2


29.87 


77.00 


419839 


U24577 


Hs.93304 


phospholipase A2, group VII (platelet-ac


50.99 


214.00 


419936 


AI792788 




gb:o!91d05.y5 NCI_CGAP_Kid5 Homo sapiens


1.00 


1.00 


419937 


AB040959 


Hs.93836 


DKFZP434N014 protein 


1.64 


2.47 


419983 


W55956 


Hs.94030 


Homo sapiens mRNA; cDNA DKFZp586E1624 (f 


15.72 


94.00 


420005 


AW271106 


Hs.133294 


ESTs 


3.15 


1.43 


420047 


AI478658 


Hs.94631 


brefeldin A-inhibited guanine nucleotide 


12.45 


39.00 


420056 


AK001423 


Hs.94694 


Homo sapiens cDNA FLJ10561 fis, clone NT


1.00 


117.00 


420162 


BE378432 


Hs.95577 


cyclin-dependent kinase 4


1.43 


1.21 


420251 


AW374968 


Hs.348112 


Human DNA sequence from clone RP5-1 10367


2.35 


3.23 


420259 


AF004884 


Hs.95253 


calcium channel, voltage-dependent, P/Q 


0.77 


1.15 


420281 


AI623693 


Hs.323494 


ESTs 


45.04 


54.00 


420309 


AW043837 


Hs.21766 


ESTs, Weakly similar to ALU5_HUMAN ALU S 


49.22 


31.00 


420332 


NM_001756 


Hs.1305 


serine (or cysteine) proteinase inhibito 


0.05 


2.82 


420380 


AA640891 


Hs.102406 


ESTs 


0.99 


2.74 


420462 


AF050147


Hs.97932 


chondromodulin I precursor 


1.00 


1.00 


420520 


AK001978 


Hs.98510


similar to rab11-binding protein


49.74


133.00


420552


AK000492


Hs.98806


hypothetical protein


94.65


88.00 


420560 


AW207748 


Hs.59115 


ESTs 


1.00 


17.00 


420610 


AI683183


Hs.99348 


distal-less homeo box 5


1.00 


13.00 


420689 


H79979 


Hs.88678 


ESTs 


50.09 


95.00 


420721 


AA927802 


Hs.159471 


ZAP3 protein 


1.00 


31.00 


420759 


T11832 


Hs.127797


Homo sapiens cDNA FLJ11381 fis, clone HE


1.00 


48.00 


420783 


AI659838 


Hs.99923 


lectin, galactoside-binding, soluble, 7


3.04 


1.25 


420900 


AL045633 


Hs.44269 


ESTs 


2.24


7.00 


420931 


AF044197 


Hs.100431 


small inducible cytokine B subfamily (Cy 


1.00 


8.00 


421002 


AF116030


Hs.100932


transcription factor 17 


1.00 


27.00 


421027 


AA761198 


Hs.55254 


ESTs 


2.87 


38.00 


421037 


AI684808


Hs.197653 


ESTs 


1.00 


46.00 


421041 


N36914 


Hs.14691 


ESTs, Moderately similar to I38022 hypot 


1.00 


98.00 


421073 


NM_004689 


Hs.101446


metastasis associated 1 


1.34 


1.46 


421110 


AJ250717 


Hs.1355 


cathepsin E 


119.47 


427.00 


421133 


AA401369 


Hs.190721


ESTs 


1.10 


17.00 


421150 


AI913562 


Hs.189902 


ESTs 


1.45 


1.63 


421155 


H87879 


Hs.102267


lysyl oxidase 


1.00 


15.00 


421307 


BE539976 


Hs.103305


Homo sapiens mRNA; cDNA DKFZp434B0425 (f


1.37 


1.10 


421316 


AA287203 


Hs.324728 


SMA5 


1.00 


21.00 


421379 


Y15221 


Hs.103982 


small inducible cytokine subfamily B (Cy 


1.92 


3.94


421451 


AA291377 


Hs.50831 


ESTs 


5.89 


14.00 


421474 


U76362 


Hs.104637


solute carrier family 1 (glutamate trans 


1.46 


1.76 


421505 


BE302796 


Hs.105097


thymidine kinase 1, soluble


1.56 


1.08 


421508 


NM_004833


Hs.105115 


absent in melanoma 2 


5.11 


5.23 


421515 


Y11339 


Hs.105352 


GalNAc alpha-2, 6-sialyltransferase 1, 1


1.00 


3.00 


421524 


AA3120S2 


Hs.105445 


GDNF family receptor alpha 1 


2.63 


10.58 


421526 


AL080121 


Hs.105460


DKFZP564O0823 protein 


1.46 


1.88 


421552 


AF026692 


Hs.105700 


secreted frizzled-related protein 4 


30.21 


50.32 


421574 


AJ000152 


Hs.105924


defensin, beta 2 


1.67 


1.74 


421582 


AI910275 




trefoil factor 1 (breast cancer, estroge 


1.23 


1.00 


421633 


AF121860


Hs.106260 


sorting nexin 10 


1.00 


116.00 


421659 


NM_014459


Hs.106511 


protocadherin 17 


0.05 


6.33 


421677 


H64092 


Hs.38282 


ESTs 


1.31 


1.42 


421753 


BE314828 


Hs.107911 


ATP-binding cassette, sub-family B (MDR/


1.41 


1.20 


421773 


W69233 


Hs.112457 


ESTs 


1.12 


1.14 


421777 


BE562088 


Hs.108196 


HSPC037 protein 


1.97 


1.29 
421800


AA298151 


Hs.222969 


ESTs 


1.03 


1.30 


421817 


AF146074 


Hs.108660 


ATP-binding cassette, sub-family C (CFTR 


1.88 


1.59 


421896 


N62293 


Hs.45107 


ESTs 


11.84 


22.80 


421928 


AF013758 


Hs.109643 


polyadenylate binding protein-interactin


45.89 


90.00 


421931 


NM_000814


Hs.1440 


gamma-aminobutyric acid (GABA) A recepto


1.13 


1.49 


421948 


L42583 


Hs.334309 


keratin 6A 


51.83 


20.25 


421975 


AW961017 


Hs.6459 


hypothetical protein FLJ11856


1.17 


1.15 


422026 


U80736 


Hs.110826


trinucleotide repeat containing 9 


1.00 


52.00 


422094 


AF129535 


Hs.272027 


F-box only protein 5 


67.61 


62.00 


422095 


AI868872 


Hs.282804 


hypothetical protein FLJ22704


4.37 


2.34


422109 


S73265 


Hs.1473 


gastrin-releasing peptide


4.18 


95.50 


422128 


AW861145 




gb:GV0-OTG033410400-182-a07 OT0033 Homo


40.89 


71.00 


422129 


AU076635 


Hs.1478 


serine (or cysteine) proteinase inhibito


1.13 


1.38 


422134 


AW179019 


Hs.112110 


mitochondrial ribosomal protein L42


41.59 


96.00 


422158 


L10343 


Hs.112341


protease inhibitor 3, skin-derived (SKAL 


2.37 


1.10 


422168 


AA586894 


Hs.112408 


S100 calcium-binding protein A7 (psorias 


3.29 


1.68 


422278 


AF072873 


Hs.114218


frizzled (Drosophila) homolog 6


4.93 


5.73 


422282 


AF019225 


Hs.114309


apolipoprotein L 


1.49 


1.71 


422283 


AW411307 


Hs.114311 


CDC45 (cell division cycle 45, S.cerevis


25.99 


10.91 


422310 


AA316622 


Hs.98370 


cytochrome P450, subfamily IIS, polypept


1.54 


1.41 


422311 


AF073515 


Hs.114948


cytokine receptor-like factor 1


1.15 


1.78 


422330 


D30783 


Hs.115263


epiregulin


1.00 


112.00


422364 


AF067800 


Hs.115515 


C-type (calcium dependent carbohydrate- 


9.39 


60.00 


422406 


AF025441 


Hs.116206 


Opa-interacting protein 5 


18.33 


53.00 


422424 


AI186431 


Hs.296638 


prostate differentiation factor 


1.71


3.21 


422440 


NM_004812


Hs.116724 


aldo-keto reductase family 1, member B10


47.53 


32.00 


422487


AJ010901 


Hs.198267 


mucin 4, tracheobronchial 


73.68 


35.54 


422511 


AU076442 


Hs.117938 


collagen, type XVII, alpha 1


173.97 


26.00 


422515 


AW500470 


Hs.117950


multifunctional polypeptide similar to S


4.68 


2.92


422656 


AJ870435 


Hs.1569 


LIM homeobox protein 2


1.00 


1.00 


422737 


M26939 


Hs.119571 


collagen, type III, alpha 1 (Ehlers-Dani 


3.89 


4.55 


422756 


AA441787 


Hs.119689 


glycoprotein hormones, alpha polypeptide 


1.05 


1.46 


422765 


AW409701 


Hs.1578 


baculoviral IAP repeat-containing 5 (sur


3.88 


1.53 


422809 


AK001379 


Hs.121028 


hypothetical protein FLJ10549 


99.56 


53.00 


422867 


L32137 


Hs.1584 


cartilage oligomeric matrix protein (pse


1.69 


3.17 


422938 


NM_001809


Hs.1594 


centromere protein A (17kD) 


70.46 


61.00 


422956 


BE545072 


Hs.122579 


ECT2 protein (Epithelial cell transformi


77.74 


3.00 


422960 


AW890487 


Hs.63984 


cadherin 13, H-cadherin (heart) 


5.88 


8.55 


422963 


AA401369 


Hs.190721 


ESTs 


171.41 


17.00 


422976 


AU076657 


Hs.1600 


chaperonin containing TCP1, subunit 5 (e 


2.12


1.62 


422981 


AF026445 


Hs.122752 


TATA box binding protein (TBP)-associate 


10.49 


35.00 


422986 


AA319777 


Hs.221974 


ESTs 


12.40 


32.47 


423034 


AL119930




gb:DKFZp761A092_r1 761 (synonym: hamy2) 


16.41 


60.00 


423049 


X59373 


Hs.188023 


ESTs, Moderately similar to HXDA_HUMAN H


1.00 


1.00 


423081 


AF262992 


Hs.123159 


sperm associated antigen 4 


1.82 


2.96 


423184 


NM_004428


Hs.1624 


ephrin-A1 


1.14 


1.53 


423217 


NM_000094


Hs.1640 


collagen, type VII, alpha 1 (epidermolys


2.14


1.69 


423248 


AA380177 


Hs.125845 


ribulose-5-phosphate-3-epimerase 


7.18


14.00 


423309 


BE006775 


Hs.126782 


sushi-repeat protein 


21.90 


64.00 


423361 


AW170055 


Hs.47628 


ESTs 


1.00 


1.00 


423453 


AW450737 


Hs.128791 


CGI-09 protein 


55.52 


66.00 


423511 


AF036329 


Hs.129715 


gonadotropin-releasing hormone 2


0.88 


1.17 


423516 


AB007933 


Hs.129729 


ligand of neuronal nitric oxide synthase 


1.76 


5.40 


423551 


AA327598 


Hs.233785 


ESTs 


3.54 


4.33 


423554 


M90516 


Hs.1674 


glutamine-fructose-6-phosphate transamin 


1.00 


50.00 


423575 


C18863 


Hs.163443


Homo sapiens cDNA FLJ11576 fis, clone HE


38.88 


70.00 


423624 


AI807408


Hs.166368 


ESTs 


1.00 


67.00 


423634 


AW959908 


Hs.1690 


heparin-binding growth factor binding pr 


76.02 


1.00 


423642 


AW452650 


Hs.157148 


hypothetical protein MGC13204 


19.14 


58.00 


423662 


AA642452 


Hs.130881 


B-cell CLL/lymphoma 11A (zinc finger pro


3.61 


13.57 


423673 


BE003054 


Hs.1695 


matrix metalloproteinase 12 (macrophage


240.73 


40.00 


423698 


AA329796 


Hs.1098 


DKFZp434J1813 protein 


1.00 


59.00 


423725 


AJ403108


Hs.132127 


hypothetical protein LOC57822 


4.20 


1.00 


423761 


NM_006194 


Hs.132576 


paired box gene 9 


1.00 


1.00 


423787 


AJ295745 


Hs.236204 


nuclear pore complex protein 


7.18


6.64 


423816 


AF151064 




hypothetical protein 


1.00 


44.00 


423826 


U20325 


Hs.1707 


cocaine- and amphetamine-regulated trans 


1.00 


1.00 


423849 


AL157425 


Hs.133315 


Homo sapiens mRNA; cDNA DKFZp761J1324 (f


1.00 


1.00 


423887 


AL080207 


Hs.134585 


DKFZP434G232 protein 


1.00 


1.00 


423934 


U89995 


Hs.159234 


forkhead box E1 (thyroid transcription f


31.33 


31.00 


423954 


AW753164 


Hs.288604 


KIAA1632 protein 


5.81 


10.87 


423961 


D13666 


Hs.136348 


osteoblast specific factor 2 (fasciclin 


3.55 


3.30 


424012 


AW368377 


Hs.137569 


tumor protein 63 kDa with strong homolog


233.42 


68.00 


424016 


AW163729 


Hs.6140 


hypothetical protein MGC15730 


0.93 


1.01 


424028 


AF055084 


Hs.153692 


Homo sapiens cDNA FLJ14354 fis, clone Y7


21.30


5200


424046


AF027866


Hs.138202


serine (or cysteine) proteinase inhibito


1.00


1.00 


424086 


AI351010 


Hs.102267 


lysyl oxidase


21.91 


70.00 


424098 


AF077374 


Hs.139322 


small proline-rich protein 3 


137.82 


54.00 


424120 


T80579 


Hs.290270 


ESTs 


1.00 


1.00 


424165 


AW582904 


Hs.142255 


islet amyloid polypeptide 


1.00 


34.00 


424200 


AA337221 




gb:EST41944 Endometrial tumor Homo sapie 


13.06 


48.00 


424279 


L29306 


Hs.171814 


tryptophan hydroxylase (tryptophan 5-mon 


1.00 


1.00 


424308 


AW975531 


Hs.154443 


minichromosome maintenance deficient (S. 


164.58 


87.00 


424326 


NM_014479


Hs.145296 


disintegrin protease 


53.72 


30200 


424340 


AA339036 


Hs.7033 


ESTs 


0.88 


1.15 
424351


BE622117 


Hs.145567 


hypothetical protein 


0.93 


1.03 


424364 


AW383226 


Hs.201189 


ESTs, Weakly similar to G01763 atrophin-


7.02 


3.24 


424381 


AA285249 


Hs.146329 


protein kinase Chk2 


95.55 


92.00 


424411 


NM_005209


Hs.146549 


crystallin, betaA2 


1.63 


3.25 


424420 


BE614743


Hs.146588 


prostaglandin E synthase 


1.63 


1.33 


424441 


X14850 


Hs.147097 


H2A histone family, member X 


1.82 


1.29 


424502 


AF242388 


Hs.149585 


lengsin 


1.00 


1.00 


424503 


X06256 


Hs.149509 


integrin, alpha 5 (fibronectin receptor, 


1.02 


224 


424513 


BE385864 


Hs.149894 


mitochondrial translational initiation f


1.00 


17.00 


424539 


L02911 


Hs.150402 


Activin A receptor, type I (ACVR1)(ALK


32.46 


108.00 


424558


AF005418 


Hs.150595 


cytochrome P450, subfamily XXVIA, polype


3.40 


258 


424602 


AK002055 


Hs.151046 


hypothetical protein FLJ11193


31.87 


25.00 


424629 


M90656 


Hs.151393 


glutamate-cysteine ligase, catalytic sub


3.58 


237 


424645 


NM_014682


Hs.151449 


KIAA0535 gene product


1.00 


1.00 


424687 


J05070 


Hs.151738 


matrix metalloproteinase 9 (gelatinase B


2.12 


223 


424717 


AW992292 


Hs.152213 


wingless-type MMTV integration site fami


1.00 


1.00 


424834 


AK001432 


Hs.153408 


Homo sapiens cDNA FLJ10570 fis, clone NT


56.19 


1200 


424840 


D79987


Hs.153479 


extra spindle poles, S. cerevisiae, homo 


2.65 


1.30 


424867 


AJ024850 


Hs.153591 


Not56 (D. melanogaster)-like protein


1.23 


1.05 


424905 


NM_002497


Hs.153704 


NIMA (never in mitosis gene a)-related k


21.35 


1.00 


424979 


D87989 


Hs.154073 


UDP-galactose transporter related 


1.36 


1.35 


424999 


AW953120 




gb:EST365190 MAGE resequences, MAGB Homo


1.24 


1.41 


425048 


H05468 


Hs.164502 


ESTs 


1.00 


11.00 


425057 


AA826434 


Hs.1619 


achaete-scute complex (Drosophila) homol


7.46 


87.00 


425081 


X74794 


Hs.154443 


minichromosome maintenance deficient (S. 


2.52 


3.82 


425118 


AU076611 


Hs.154672 


methylene tetrahydrofolate dehydrogenase


4.84 


4.03 


425159 


NM_004341


Hs.154868 


carbamoyl-phosphate synthetase 2, aspart


3.62 


273 


425202 


AW962282 


Hs.152049 


ESTs, Weakly similar to 138022 hypotheti 


1.00 


53.00 


425234 


AW152225 


Hs.165909 


ESTs, Weakly similar to 138022 hypotheti 


100.77 


44.00 


425236 


AW067800 


Hs.155223 


stanniocalcin 2 


3.30 


290 


425245 


AI751768 


Hs.155314 


KIAA0095 gene product 


1.91 


232 


425247 


NM_005940


Hs.155324 


matrix metalloproteinase 11 (stromelysin 


1.41 


1.49 


425266 


J00077 


Hs.155421 


alpha-fetoprotein


1.00 


68.00 


425274 


BE281191


Hs.155462 


minichromosome maintenance deficient (mi 


1.97 


1.63 


425322 


US3630 


Hs.155637 


protein kinase, DNA-activated, catalytic


141.49 


12100 


425349 


AA425234 


Hs.79886 


ribose 5-phosphate isomerase A (ribose 5 


1.00 


84.00 


425371 


D49441 


Hs.155981


mesothelin 


0.87 


1.59 


425397 


J04088 


Hs.156346 


topoisomerase (DNA) II alpha (170kD)


14.90 


5.76 


425420 


BE536911 


Hs.234545 


hypothetical protein NUF2R 


1.00 


1.00 


425424 


NM_004954


Hs.157199


ELKL motif kinase 


10.58 


9.74 


425483 


AF231022 


Hs.158159 


FAT tumor suppressor (Drosophila) homolo 


1.74 


1.40 


425566 


AW162943 


Hs.250618 


UL16 binding protein 2 


1.49 


1.14 


425580 


L11144 


Hs.1907 


galanin 


53.29 


233.00 


425650 


NM_001944


Hs.1925 


desmoglein 3 (pemphigus vulgaris antigen 


33.45 


1.00 


425692 


D90041 


Hs.155956 


N-acetyltransferase 1 (arylamine N-acety


1.00 


55.00 


425695 


NM_005401


Hs.159238 


protein tyrosine phosphatase, non-recept 


1.00 


10.00 


425734 


AF056209 


Hs.159396 


peptidylglycine alpha-amidating monooxyg


1.00 


41.00 


425776 


U25128 


Hs.159499


parathyroid hormone receptor 2 


1.00 


48.00 


425810 


AI923627 


Hs.31903 


ESTs 


27.39 


98.00 


425811 


AL039104 


Hs.159557 


karyopherin alpha 2 (RAG cohort 1, impor 


1.99 


1.58 


425849 


AI077288 


Hs.296323 


serum/glucocorticoid regulated kinase


71.16 


3.42 


425852 


AK001504 


Hs.159651 


death receptor 6, TNF superfamily member 


1.35 


1.34 


426067 


AA401369 


Hs.190721 


ESTs 


1.01 


17.00 


426088 


AF038007


Hs.166196 


ATPase, Class I, type 8B, member 1 


26.26 


47.00 


426215 


AW067800 


Hs.155223 


stanniocalcin 2 


191 


290 


426227 


U67058 


Hs.154299


Human proteinase activated receptor-2 mR 


22.40 


25.00 


426269 


H15302 


Hs.168950 


Homo sapiens mRNA; cDNA DKFZp566A1046 (f 


1.00 


1.00 


426283 


NM_003937


Hs.169139 


kynureninase (L-kynurenine hydrolase)


91.39 


229.00 


426329 


AL389951 


Hs.271623 


nucleoporin 50kD


4.34 


4.08 


426427 


M86699 


Hs.169840 


TTK protein kinase 


7.02 


1.00 


426432 


AF001601 


Hs.169857 


paraoxonase 2 


1.16 


1.68 


426440 


BE382756 


Hs.169902 


solute carrier family 2 (facilitated glu 


259 


1.71 


426459 


AF151812 


Hs.169992 


hypothetical 43.2 Kd protein 


1.56 


1.66 


426471 


M22440 


Hs.170009 


transforming growth factor, alpha 


20.60 


26.00 


426496 


D31765


Hs.170114 


KIAA0061 protein 


9.81 


22.00 


426501 


AA401369 


Hs.190721 


ESTs 


19.23 


17.00 


426514 


BE616633 


Hs.170195 


bone morphogenetic protein 7 (osteogenic 


103.74 


41.00 


426536 


AI949749 


Hs.44441 


ESTs 


4.65 


23.00 


426572 


AB037783 


Hs.170623 


hypothetical protein FLJ11183


1.00 


43.00


426682 


AV660038 


Hs.2056 


UDP glycosyltransferase 1 family, polype


160.06 


8.00 


426691 


NM_006201


Hs.171834 


PCTAIRE protein kinase 1 


1.51 


1.35 


426746 


J03626 


Hs.2057 


uridine monophosphate synthetase (orotat 


2.13 


1.68 


426752 


X69490 


Hs.172004 


titin 


0.02 


5.14 


426784 


U03749 


Hs.172216 


chromogranin A (parathyroid secretory pr 


1.72 


1.71 


426807 


AA385315 


Hs.156682 


ESTs 


1.30 


1.64 


426812 


AF105365 


Hs.172613 


solute carrier family 12 (potassium/chlo 


1.47 


1.53 


426814 


AF036943 


Hs.172619 


myelin transcription factor 1-like


1.00 


1.00 


426831 


BE296216 


Hs.172673 


S-adenosylhomocysteine hydrolase 


1.51 


1.25 


426897 


AA401369 


Hs.190721 


ESTs 


141.56 


17.00 


426925 


NM_001196


Hs.315689 


Homo sapiens cDNA: FLJ22373 fis, clone H


32.61 


38.00 


426935 


NM_000088 


Hs.172928 


collagen, type I, alpha 1


2.65 


3.16 


426964 


AA393739 


Hs.287416 


Homo sapiens cDNA FLJ11439 fis, clone HE


1.97


3.49


426966


AI493134




sclerostin


1.00


1.00 


426991 


AK001536 




Homo sapiens cDNA FLJ10574 fis, clone NT


3.39 


228 


427099 


AB032953 


Hs.173560 


odd Oz/ten-m homolog 2 (Drosophila, mous


4.24 


17.00
427239


BE270447 


Hs.174070 


ubiquitin carrier protein 


1.58 


1.05 


427260 


AA653848




gb:ae70b06.s1 Stratagene schizo brain S1 


1.34 


1.60 


427281 


AA906147 


Hs.102869 


ESTs 


1.00 


66.00 


427335 


AA448542


Hs.251677 


G antigen 7B 


51.83 


4.00 


427354 


T57898 


Hs.191095 


ESTs 


1.17 


1.95 


427355 


AW023482


Hs.97849 


ESTs 


7.31 


41.00 


427376 


AA401533


Hs.19440


ESTs 


1.00 


57.00 


427383 


NM_005411


Hs.177582 


surfactant, pulmonary-associated protein 


0.42 


1.32 


427427 


AF077345 


Hs.177938 


lectin, superfamily member 1 (cartilage-


1.00 


20.00 


427441 


AA412605


Hs.343879 


SPANX family, member C 


1.00 


1.00 


427445 


X80818 


Hs.178078 


glutamate receptor, metabotropic 4


0.97 


1.03 


427505 


AA361562 


Hs.178761 


26S proteasome-associated pad1 homolog


4.60 


4.04 


427510 


Z47542


Hs.179312 


small nuclear RNA activating complex, po 


22.00 


45.00 


427528 


AU077143 


Hs.179565 


minichromosome maintenance deficient (S.


97.45 


9200 


427546 


AA188763


Hs.36793 


hypothetical protein FLJ23188


1.50 


3.24 


427562 


R56424 


Hs.26534 


ESTs 


6.81 


40.00 


427585 


D31152 


Hs.179729 


collagen, type X, alpha 1 (Schmid metaph


69.91 


62.00 


427660 


AI741320 


Hs.114121 


Homo sapiens cDNA: FLJ23228 fis, clone C


2.70 


49.00 


427666 


AI791495 


Hs.180142 


calmodulin-like skin protein


1.37 


1.88 


427668 


AA298760 


Hs.180191 


hypothetical protein FLJ14904


29.55 


67.00 


427677 


NM_007045


Hs.180296


FGFR1 oncogene partner 


3.52 


2.63 


427701 


AA411101 


Hs.243886 


nuclear autoantigenic sperm protein (his 


7.41 


34.00 


427711 


M31659 


Hs.180408 


solute carrier family 25 (mitochondrial 


15.84 


70.00 


427719 


AI393122 


Hs.134726 


ESTs 


7.03 


4.52 


427722 


AK000123 


Hs.180479 


hypothetical protein FLJ20116


2.92 


1.74 


427747 


AW411425 


Hs.180655 


serine/threonine kinase 12 


1.76 


1.26 


427912 


AL022310 


Hs.181097 


tumor necrosis factor (ligand) superfami 


9.63 


59.00 


427961 


AW293165 


Hs.143134 


ESTs 


41.97 


11R00 


428004 


AA449563 


Hs.151393 


glutamate-cysteine ligase, catalytic sub


23.82 


1.00 


428023 


AL038843 




Homo sapiens cDNA: FLJ23602 fis, clone I


1.40 


1.33 


428046 


AW812795 


Hs.337534 


ESTs, Moderately similar to 138022 hypot 


96.28 


167.00 


428093 


AW594506 


Hs.104830 


ESTs 


1.25 


1.29 


428098 


AU077258 


Hs.182429 


protein disulfide isomerase-related prot 


1.86 


1.60 


428129 


AI244311 


Hs.26912 


ESTs 


1.00 


42.00 


428169 


AI928984


Hs.182793 


golgi phosphoprotein 2


£76 


211 


428182 


BE386042 


Hs.293317 


ESTs, Weakly similar to GGC1.HUMAN G ANT 


1.00 


1.00 


428227 


AA321849 


Hs.2248 


small inducible cytokine subfamily B (Cy 


85.59 


181.00 


428242


H55709 


Hs.2250 


leukemia inhibitory factor (cholinergic 


8.57 


21.64 


428330 


L22524 


Hs.2256 


matrix metalloproteinase 7 (matrilysin,


7.77 


15.90 


428434 


AI909935 


Hs.65551 


Homo sapiens, Similar to DNA segment, Ch 


0.58 


1.43 


428450 


NM_014791


Hs.184339 


KIAA0175 gene product


237.53 


204.00 


428471 


X57348 


Hs.184510 


stratifin


6.00 


4.60 


428479 


Y00272 


Hs.334562 


cell division cycle 2, G1 to S and G2 to 


56.54 


16.00 


428484 


AF104032 


Hs.184601 


solute carrier family 7 (cationic amino


3.53 


2.15 


428505 


AL035461 


Hs.2281 


chromogranin B (secretogranin 1) 


1.00 


1.00 


428532 


AF157326 


Hs.184786 


TBP-interacting protein


1.00 


58.00 


428645 


AA431400 


Hs.98729 


ESTs, Weakly similar to 2017205A dihydro


1.00 


16.00 


428664 


AK001666 


Hs.189095 


similar to SALL1 (sal (Drosophila)-like


1.00 


1.00 


428698 


AA852773 


Hs.334838 


KIAA1866 protein 


187.37 


255.00 


428728 


NM_016625


Hs.191381 


hypothetical protein 


47.24 


80.00 


428748 


AW593206 


Hs.98785 


Ksp37 protein 


1.00 


87.00 


428758 


AA433988 


Hs.98502 


hypothetical protein FLJ14303


1.06 


1.13 


428771 


AB028992 


Hs.193143 


KIAA1069 protein 


1.98 


92.00 


428801 


AW277121 


Hs.254881 


ESTs 


1.67 


6.15 


428810


AF068236


Hs.193788 


nitric oxide synthase 2A (inducible, hep 


1.03 


1.27 


428839 


AI767756 


Hs.82302 


Homo sapiens cDNA FLJ14814 fis, clone NT


124.17 


43.00 


428845 


AL157579 


Hs.153610 


KIAA0751 gene product 


1.00 


1.00 


428959 


AF100779 


Hs.194680 


WNT1 inducible signaling pathway protein


15.16 


27.00 


428969 


AF120274 


Hs.194689 


artemin 


1.36


1.24 


429038 


AL023513 


Hs.194766 


seizure related gene 6 (mouse)-like


0.97 


3.31 


429065 


AI753247 


Hs.29643 


Homo sapiens cDNA FLJ13103 fis, clone NT


6.82 


16.47 


429164 


AI688663 


Hs.116586 


ESTs 


19.08 


67.00 


429170 


NM_001394


Hs.2359 


dual specificity phosphatase 4 


16.18 


105.00 


429183 


AB014604 


Hs.197955


KIAA0704 protein 


79.72 


104.00 


429201 


X03178 


Hs.198246 


group-specific component (vitamin D bind 


1.00 


1.00 


429211 


AF052693 


Hs.198249 


gap junction protein, beta 5 (connexin 3 


1.33 


1.09 


429220 


AW207206 




ESTs 


1.00 


7.00 


429228 


AI553633 


Hs.326447 


ESTs 


39.47 


29.25 


429259 


AA420450 


Hs.292911 


ESTs, Highly similar to S60712 band-6-pr 


2.01 


1.18 


429263 


AA019004 


Hs.198396 


ATP-binding cassette, sub-family A (ABC1


1.07 


1.00 


429276 


AF056085 


Hs.198612 


G protein-coupled receptor 51 


3.70 


142.00 


429359 


W00482 


Hs.2399 


matrix metalloproteinase 14 (membrane-in 


1.30 


1.94 


429412 


NM_006235


Hs.2407 


POU domain, class 2, associating factor 


94.09 


85.00 


429413 


NM_014058


Hs.201877 


DESC1 protein 


41.91 


10.00 


429486 


AF155827 


Hs.203963 


hypothetical protein FLJ10339


12.19 


1.00 


429504 


X99133 


Hs.204238 


lipocalin 2 (oncogene 24p3)


1.61 


1.08 


429538 


BE182592 


Hs.11261 


small proline-rich protein 2A 


4.43 


290 


429547 


AA401369


Hs.190721


ESTs 


1.06 


17.00 


429551 


AW450624 


Hs.220931 


ESTs 


2.89 


65.00 


429563 


BE619413 


Hs.2437 


eukaryotic translation initiation factor 


1.49 


1.37 


429597 


NM_003816


Hs.2442 


a disintegrin and metalloproteinase doma


61.66 


100.00 


429610 


AB024937 


Hs.211092 


LUNX protein; PLUNC (palate lung and nas 


1.59 


1.69 


429612 


AF082649 


Hs.252587 


pituitary tumor-transforming 1 


278 


1.74 


429616 


AI982722 


Hs.120845 


ESTs 


1.00 


1.00 


429656 


X05608 


Hs.211584 


neurofilament, light polypeptide (68kD) 


1.00 


4.00 
429863


M58874 


Hs.211587 


phospholipase A2, group IVA (cytosolic,


69.95 


104.00 


429736 


AF125304 


Hs.212680 


tumor necrosis factor receptor superfami


1.25 


121 


429782 


NM_005754


Hs.220689 


Ras-GTPase-activating protein SH3-domain


1.00 


7.00 


429903 


AL134197 


Hs.93597 


cyclin-dependent kinase 5, regulatory su


11.80


100 


429918 


AW873985 


Hs.119383 


ESTs 


100 


78.00 


429978 


AA249027 




ribosomal protein S6 


1.98 


3.09 


429986 


AF092047 


Hs.227277 


sine oculis homeobox (Drosophila) homolo


100 


48.00 


430044 


AA464510


Hs.152812 


ESTs 


69.27 


59.00 


430114 


AA847744 


Hs.99640 


ESTs 


100 


100 


430134 


BE380149


Hs.105223 


ESTs, Weakly similar to T33188 hypotheti


100 


5100 


430147 


R60704 


Hs.234434 


hairy/enhancer-of-split related with YRP


110 


222 


430287 


AW182459 


Hs.125759 


ESTs, Weakly similar to LEU5.HUMAN LEUKE 


100 


127.00 


430294 


AI538226 


Hs.32976 


guanine nucleotide binding protein 4 


3.80 


147 


430300 


U60805 


Hs.238648 


oncostatin M receptor


1.00 


35.00 


430315 


NM_004293


Hs.239147 


guanine deaminase 


92.31 


28.00 


430337 


M36707 


Hs.239600 


calmodulin-like 3


118 


108 


430378 


Z29572 


Hs.2556 


tumor necrosis factor receptor superfami 


5.28 


66.00 


430388 


AA356923 


Hs.240770 


nuclear cap binding protein subunit 2, 2 


16.76 


38.00 


430393 


BE185030 


Hs.241305 


estrogen-responsive B box protein 


163 


150 


430439 


AL133561 




DKFZP434B061 protein 


100 


100 


430451


AA836472 


Hs.297939 


cathepsin B 


164 


212 


430454 


AW469011 


Hs.105635 


ESTs 


63.35 


44.00


430466 


AF052573 


Hs.241517 


polymerase (DNA directed), theta


247 


1.91 


430481 


AA479678 


Hs.203269 


ESTs, Moderately similar to ALU8_HUMAN A 


1.00 


3100 


430486


BE062109 


Hs.241551 


chloride channel calcium activated, fam 


12.28 


4100 


430508 


AI015435


Hs.104637 


ESTs 


4.75 


7.27 


430533 


AA480895 


Hs.57749 


ESTs, Weakly similar to T17288 hypotheti 


100 


100 


430563 


AF146074 


Hs.108660 


ATP-binding cassette, sub-family C (CFTR 


100 


159 


430677 


Z26317 


Hs.94560 


desmoglein 2


172 


1.30 


430678 


AA401369 


Hs.190721 


ESTs 


0.90 


17.00 


430686 


NM_001942


Hs.2633 


desmoglein 1 


100 


1.00 


430788 


AI742925


Hs.7179 


ESTs, Weakly similar to 2004399A chromos 


162 


184 


430890 


X54232 


Hs.2699 


glypican 1 


158 


140 


430935 


AW072916 




zinc finger protein 131 (clone pHZ-10) 


90.28 


13200 


430985 


AA490232 


Hs.27323 


ESTs, Weakly similar to 178885 serineAh 


0.94 


128 


431009 


BE149762 


Hs.48956 


gap junction protein, beta 6 (connexin 3 


60.25 


28.00 


431089 


BE041395 




ESTs, Weakly similar to unknown protein 


23.32 


941.00 


431092 


AI332764


Hs.125757 


ESTs 


13.46 


63.00 


431124 


AF284221 


Hs.59506 


doublesex and mab-3 related transcriptio 


49.43 


6200 


431164 


AA493650 


Hs.94367 


Homo sapiens cDNA: FLJ23494 fis, clone L


0.44 


220 


431211 


M86849 


Hs.323733 


gap junction protein, beta 2, 26kD (conn 


182.26 


10100 


431221 


AW207837 


Hs.286145 


SRB7 (suppressor of RNA polymerase B, ye 


4.15 


13.97 


431277 


AA501806 


Hs.345824 


ESTs 


100 


86.00 


431322 


AW970622 




gb:EST382704 MAGE resequences, MAGK Homo 


40.55 


200.00 


431342 


AW971018 


Hs.21659 


ESTs 


1.00 


53.00 


431384 


BE158000 


Hs.285026


gb:MR2-HT0377-150200-202^03HT0377 Homo 


0.94 


114 


431462 


AW583672 


Hs.256311 


granin-like neuroendocrine peptide precu 


130 


1.25 


431494 


AA991355 


Hs.298312 


hypothetical protein DKFZp434A1315


3.90 


26.00 


431515 


NM_012152


Hs.258583 


endothelial differentiation, lysophospha 


141 


187 


431548 


AI834273 


Hs.9711 


novel protein 


5.66 


15.00 


431630 


NM_002204


Hs.265829 


integrin, alpha 3 (antigen CD49C, alpha 


0.99 


144 


431745 


AW972448 


Hs.163425 


ESTs 


0.99 


3.51 


431770 


BE221880 


Hs.268555 


5'-3' exoribonuclease 2


67.12 


91.00 


431830 


Y16645 


Hs.271387 


small inducible cytokine subfamily A (Cy 


3.36 


4.71 


431846 


BE019924 


Hs.271580 


uroplakin 1B


4.49 


2.51 


431890 


X17033 


Hs.271986 


integrin, alpha 2 (CD49B, alpha 2 subuni 


2.20 


3.32 


431934 


AB031481 


Hs.272214 


STG protein 


101 


104 


431958 


X63629 


Hs.2877 


cadherin 3, type 1, P-cadherin (placenta 


5117 


46.35 


432006 


AL137382 


Hs.272320 


Homo sapiens mRNA; cDNA DKFZp434L1226 (f 


0.94 


165 


432023 


R43020 


Hs.236223 


EST 


0.94 


47.00 


432201 


AJ538613 


Hs.298241 


Transmembrane protease, serine 3 


110 


2.24 


432210 


AI567421 


Hs.273330 


Homo sapiens, clone IMAGE:3544662, mRNA,


142 


145 


432226 


AW182766 


Hs.273558 


phosphate cytidylyltransferase 1, cholin


100 


1.00 


432239 


X81334 


Hs.2936 


matrix metalloproteinase 13 (collagenase


18.67 


1.00 


432265 


BE382679 


Hs.285753 


SCG10-like-protein


1.09 


121 


432281 


AK001239 


Hs.274263 


hypothetical protein FLJ10377


40.98 


58.00 


432365 


AK001106 


Hs.274419 


hypothetical protein FLJ10244


100 


214.00 


432374 


W68815 


Hs.301885 


Homo sapiens cDNA FLJ11346 fis, clone PL


157.34 


37.00 


432375 


BE536069 


Hs.2962 


S100 calcium-binding protein P 


165 


106 


432407 


AA221036 




gb:zr03f12.r1 Stratagene NT2 neuronal pr


73.71 


75.00 


432441 


AW292425 


Hs.163484 


ESTs 


56.35 


72.00 


432489 


AJ604855 


Hs.207530 


ESTs 


100 


24.00 


432543 


AA552690 


Hs.152423 


Homo sapiens cDNA: FLJ21274 fis, clone C


137.72 


98.00 


432552 


AI537170 


Hs.173725 


ESTs, Weakly similar to ALU8_HUMAN ALU S 


100 


31.00


432583 


AW023624 


Hs.162282 


potassium channel TASK4; potassium chan 


0.27 


35.18 


432606 


NM_002104


Hs.3066 


granzyme K (serine protease, granzyme 3; 


2.87 


6.22 


432625 


AJ243596 


Hs.94830 


ESTs, Moderately similar to T03094 A-kin 


26.63 


56.00 


432653 


N62096 


Hs.293185 


ESTs, Weakly similar to JC7328 amino aci 


192 


5.29 


432677 


NM_004482


Hs.278611 


UDP-N-acetyl-alpha-D-galactosamine:polyp


100 


48.00 


432715 


AA247152 


Hs.200483 


ESTs, Weakly similar to KIAA1074 protein 


45.13 


31.00 


432753 


NM_014075


Hs.336938 


Homo sapiens PRO0593 mRNA, complete cds 


100 


68.00 


432788 


AA521091 


Hs.178499 


Homo sapiens cDNA: FLJ23117 fis, clone L


269 


3.67 


432842 


AW674093 


Hs.334822 


hypothetical protein MGC4485 


122 


134 


432867 


AW016936 


Hs.233364 


ESTs 


1.00 


1.00 


432917 


NM_014125


Hs.241517 


PRO0327 protein 


10.25 


6.62 
432920


U37689 


Hs.3128 


polymerase (RNA) II (DNA directed) polyp


1.44 


1.30 


433001 


AF217513 


Hs.279905 


clone HQ0310PRO0310p1 


154.79 


85.64 


433023 


AW864793 


Hs.87409 


thrombospondin 1


20.96 


100.00 


433042 


AW193534 


Hs.281895 


Homo sapiens cDNA FLJ11660 fis, clone HE


1.00 


10.00 


433091 


Y12642 


Hs.3185 


lymphocyte antigen 6 complex, locus D 


1.20 


1.09 


433159 


AB035898 


Hs.150587 


kinesin-like protein 2


13.82 


39.00 


433183 


AF231338 


Hs.222024 


transcription factor BMAL2 


1.00 


69.00 


433258 


AA622788 


Hs.203613 


ESTs, Weakly similar to ALUB.HUMAN fill 


1.00 


1.25 


433409 


AI278802 


Hs.25661 


ESTs 


44.81 


117.00 


433437 


U20536


Hs.3280 


caspase 6, apoptosis-related cysteine pr


70.39 


105.00 


433485 


AJ493076 


Hs.201967 


aldo-keto reductase family 1, member C2


11.55 


£00 


433537 


AJ733692 


Hs.112488


ESTs 


&66 


55.00 


433547 


W04978 


Hs.303023 


beta tubulin 1, class VI


25.16 


83.00 


433555 


W56321 


Hs.111460


calcium/calmodulin-dependent protein kin


1.00 


19.00 


433647 


AA603367


Hs.222294 


ESTs 


20.30 


49.00 


433658 


L03678 


Hs.156110 


immunoglobulin kappa constant


5.92 


10.03 


433800 


AI094221 


Hs.135150 


lung type-I cell membrane-associated gly


£29 


£22 


433819 


AW511097 


Hs.112765 


ESTs 


3.71 


8.00 


433862 


D86960 


Hs.3610 


KIAA0205 gene product


62.08


104.00


433980


AA137152


Hs.286049


phosphoserine aminotransferase


108.91


47.00 


434088 


AF116677 


Hs.249270 


hypothetical protein PRO1966


1.00 


1.00 


434094 


AA305599 


Hs.238205 


hypothetical protein PRO2013 


121.27 


87.00 


434105 


AW952124 


Hs.13094 


presenilins associated rhomboid-like pro


1.22 


1.23 


434217 


AW014795 


Hs.23349 


ESTs 


14.11 


57.00 


434340 


AI193043 


Hs.128685 


ESTs, Weakly similar to T17226 hypotheti


2.10 


£56 


434360 


AA401369 


Hs.190721 


ESTs 


40.98 


17.00 


434414 


AI798376




gb:tr34b07.x1 NCI_CGAP_Ov23 Homo sapiens


1.48 


1.56 


434424 


AI811202


Hs.325335 


Homo sapiens cDNA: FLJ23523 fis, clone L


1.00 


64.00 


434467 


BE552368 


Hs.231853 


Homo sapiens cDNA FLJ13445 fis, clone PL


54.91 


85.00 


434551 


BE387162 


Hs.280858 


ESTs, Highly similar to A35661 DNA excis


2.46 


£00 


434627 


AI221894 


Hs.39311 


ESTs 


1.00 


1.00 


434699 


AA643687 


Hs.149425 


Homo sapiens cDNA FLJ11980 fis, clone HE


1.00 


23.00 


434769 


AA648884 


Hs.134278 


Homo sapiens cDNA FLJ12676 fis, clone NT


7.08 


56.00 


434792 


AA649253


Hs.132458 


ESTs 


852 


44.00 


434808 


AF155108 


Hs.256150 


Homo sapiens, Similar to RIKEN cDNA 2810


11.33 


1.00 


434828 


D90070


Hs.96 


phorbol-12-myristate-13-acetate-induced


1.00 


1.00 


434876 


AF160477 


Hs.61460 


Ig superfamily receptor LNIR


1.25 


1.29 


434891 


AA814309 


Hs.123583 


ESTs 


1.00 


6.00 


434928 


AW015595 


Hs.4267 


Homo sapiens clones 24714 and 24715 mRNA 


1.00 


1.00 


435013 


H91923 


Hs.110024


Target CAT 


1.26 


1.10 


435066 


BE251750


Hs.4747 


dyskeratosis congenita 1, dyskerin 


1.69 


1.37 


435087 


AW975241 


Hs.23567 


ESTs 


1.00 


1.00 


435099 


AC004770 


Hs.4756 


flap structure-specific endonuclease 1


2.90 


1.93 


435159 


AA668879 


Hs.116649 


ESTs 


1.00 


1.00 


435205 


X54136 


Hs.181125 


immunoglobulin lambda locus


1.02 


1.46 


435232 


NM_001262


Hs.4854 


cyclin-dependent kinase inhibitor 2C (p1


2.04 


2.70 


435304 


H10709 


Hs.269524 


ESTs 


27.58 


139.00 


435313 


AI769400 


Hs.189729 


ESTs 


1.00 


14.00 


435505 


AF200492 


Hs.211238 


interleukin-1 homolog 1


1.00 


38.00 


435509 


AI458679 


Hs.181915 


ESTs 


1.00 


1.00 


435525 


AI831297 


Hs.123310 


ESTs 


1.00 


56.00 


435532 


AW291488 


Hs.117305


Homo sapiens, clone IMAGE:3682908, mRNA


1.00 


£00 


435550 


AJ224456 


Hs.324507 


H.sapiens polyA site DNA


3.42 


&92 


435602 


AF217515 


Hs.283532 


uncharacterized bone marrow protein BM03 


3.95 


1.80 


435766 


R11673 


Hs.186498 


ESTs 


1.00 


28.00 


435793 


AB037734 


Hs.4993 


KIAA1313 protein 


23.66 


42.00 


436069 


AI056879 


Hs.263209 


ESTs 


1.00 


58.00 


436170 


AW450381 


Hs.14529 


ESTs 


1.00 


18.00 


436211 


AK001581 


Hs.334828 


hypothetical protein FLJ10719; KIAA1794


5.84 


22.00 


436213 


AA325512 


Hs.71472 


hypothetical protein FLJ10774; KIAA1709


1.42 


1.27 


436217 


T53925 


Hs.107 


fibrinogen-like 1


57.97 


31.00 


436238 


AK002163 


Hs.301724 


hypothetical protein FLJ11301


2.51 


1.71 


436251 


BE515065 


Hs.296585 


nucleolar protein (KKE/D repeat) 


2.33 


1.64 


436291 


BE568452 


Hs.344037 


protein regulator of cytokinesis 1 


108.99 


52.00 


436302 


AL355841 


Hs.99330 


hypothetical protein FLJ23588


0.75 


£81 


436396 


AW992292 


Hs.152213


wingless-type MMTV integration site fami


60.01


1.00 


436414 


BE264633 


Hs.143638 


WD repeat domain 4 


2.50 


£19 


436419 


AI948626 


Hs.171356 


ESTs 


0.95 


1.33 


436443 


AW138211 


Hs.128746 


ESTs 


1.12 


9.26 


436474 


AJ270693 


Hs.199887 


ESTs 


1.00 


1.00 


436481 


AA379597 


Hs.5199 


HSPC150 protein similar to ubiquitin-con 


3.28 


1.56 


436486 


AA742221 


Hs.120633 


ESTs 


1.00 


19.00 


436511 


AA721252 


Hs.291502 


ESTs 


16.76 


14.00 


436553 


X57809 


Hs.181125 


immunoglobulin lambda locus


1.08 


1.74 


436557 


W15573 


Hs.5027 


ESTs, Weakly similar to A47582 B-cell gr


19.20 


9.75 


436608 


AA628980 




down syndrome critical region protein DS 


33.92 


25.00 


436667 


AW025183 


Hs.127680 


ESTs 


0.89 


1.19 


436771 


AW975687 


Hs.292979 


ESTs 


1.00 


10.00 


436839 


AA401369 


Hs.190721 


ESTs 


1.00 


17.00 


436887 


AW953157 


Hs.193235 


hypothetical protein DKFZp547D155 


1.08 


1.15 


436944 


AW268614 


Hs.5840 


ESTs 


1.00 


1.00 


436961 


AW375974 


Hs.156704 


ESTs 


25.13 


25.00 


436972 


AA284679 


Hs.25640 


claudin 3 


1.59 


1.46 


437016 


AU076916 


Hs.5398 


guanine monophosphate synthetase


£35 


1.78 


437044 


AL035864 


Hs.69517 


cDNA for differentially expressed C016 g 


1.34 


1.10 
437181


AI306615


Hs.125343 


ESTs, Weakly similar to KIAA0758 protein


1.00 


17.00 


437204 


AL110216 


Hs.22826 


ESTs, Weakly similar to I55214 salivary


40.55 


82.00 


437205 


AL110232


Hs.279243 


Homo sapiens mRNA; cDNA DKFZp564D207t (f 


1.00 


11Z00 


437259 


AI377755 


Hs.120695 


ESTs 


1.00 


205.00 


437270 


R18087 


Hs.323769 


cisplatin resistance related protein CRR 


1.56 


1.54 


437271 


AL137445 


Hs.28846 


Homo sapiens mRNA; cDNA DKFZp566O134 (fr


113.25 


125.00 


437370 


AL359567 


Hs.161962 


Homo sapiens mRNA; cDNA DKFZp547D023 (fr


1.82 


4.57 


437390 


AI125859


Hs.112607


ESTs 


1.35 


1.75 


437412 


BE069288 


Hs.34744 


Homo sapiens mRNA; cDNA DKFZp547C136 (fr 


3.56 


3.20 


437435 


AI306152 


Hs.27027 


hypothetical protein DKFZp762H1311


3.G3 


1.08 


437444 


H46008 


Hs.31518 


ESTs 


1.00 


39.00 


437568 


AI954795


Hs.156135 


ESTs 


1.00 


19.00 


437623 


D63880


Hs.5719 


chromosome condensation-related SMC-asso 


1.95 


1.57 


437789 


AI581344 


Hs.127812 


ESTs, Weakly similar to T17330 hypotheti


1.00 


3.00 


437814 


AJ088192 


Hs.135474 


ESTs, Weakly similar to DDX9_HUMAN ATP-D 


1.00 


45.00 


437840 


AA884835 


Hs.292014 


ESTs 


1.07 


1.78 


437852 


BE001836 


Hs.256897 


ESTs, Weakly similar to dJ36501£1 JRsa 


1.68 


3.26 


437879 


BE262082 


Hs.5894 


hypothetical protein FLJ10305


1.87 


2.52 


437915 


AI637993 


Hs.202312 


Homo sapiens clone N11 NTera2D1 teratoca


74.05 


35.00 


437916 


BE566249 


Hs.20999 


hypothetical protein FLJ23142


23.15 


89.00 


437937 


AI917222 


Hs.121655


ESTs 


1.00 


1.00 


437942 


AI888256


Hs.307526 


ESTs 


12.28 


31.00 


438091


AW373062 




nuclear receptor subfamily 1, group I, m


1.53 


10.85 


438113 


AI467908 


Hs.8882 


ESTs 


1.80 


2.39


438119 


AW983217 


Hs.203961 


ESTs, Moderately similar to AF116721 89


22.67 


36.90 


438274 


AI918906


Hs.55080 


ESTs 


1.00 


1.00 


438378 


AW970529 


Hs.86434 


hypothetical protein FLJ21816


38.92 


38.00 


438403 


AA806607 


Hs.292206 


ESTs 


1.00 


1.00 


438494 


AA908678 


Hs.130183 


ESTs 


2.05 


80.00 


438546 


AW297204 


Hs.125811 


ESTs 


1.00 


131.00 


438552 


AJ245820 


Hs.6314 


type 1 transmembrane receptor (seizure-r 


1.43 


1.45 


438702 


AI879064


Hs.54618 


ESTs 


1.00 


34.00 


438724 


AW612553 


Hs.114670


Human DNA sequence from clone RP11-16L21


1.33 


1.10 


438746 


AI685815 


Hs.184727 


Human melanoma-associated antigen p97 (m 


2.42 


1.59 


438779 


NM_003787


Hs.6414 


nucleolar protein 4 


1.00 


18.00 


438821 


AA826425


Hs.192375 


ESTs 


2.03


2.57


438885 


AI886558 


Hs.184987 


ESTs 


6.42 


88.00 


438898 


AA401369 


Hs.190721 


ESTs 


22.41 


17.00 


438915 


AA280174 


Hs.285681 


Williams-Beuren syndrome chromosome regi


1.00 


1.00 


438956 


W00847 


Hs.135056


Human DNA sequence from clone RP5-850E9


2.20 


1.88 


439000 


AW979121 




gb:EST391231 MAGE resequences, MAGP Homo


2.78 


4.81 


439023 


AA745978 


Hs.28273 


ESTs 


1.17 


1.31 


439024 


R98696 


Hs.35598 


ESTs 


1.00 


28.00 


439128 


AI949371 


Hs.153089 


ESTs 


1.00 


67.00 


439146 


AW138909 


Hs.156110 


immunoglobulin kappa constant 


1.38 


1.41 


439223 


AW238299 


Hs.250618 


UL16 binding protein 2 


1.93 


1.64 


439285 


AL133916




hypothetical protein FLJ20093


46.23 


139.00 


439318 


AW837046 


Hs.6527 


G protein-coupled receptor 56 


2.00


2.20 


439343 


AF086161


Hs.114611 


hypothetical protein FLJ11808


6.10 


7.37 


439394 


AA401369 


Hs.190721 


ESTs 


3.39 


17.00 


439410 


AA632012 


Hs.188746 


ESTs 


1.83 


3.07 


439451 


AF086270 


Hs.278554 


heterochromatin-like protein 1


23.28 


52.00 


439452 


AA918317 


Hs.57987 


B-cell CLL/lymphoma 11B (zinc finger pro


18.76 


122.00


439453 


BE264974 


Hs.6566 


thyroid hormone receptor interactor 13 


m 


1.58 


439477 


W69813 


Hs.58042 


ESTs, Moderately similar to GFR3_HUMAN G 


1.22 


1.44 


439492 


AF086310 


Hs.103159 


ESTs 


7.46 


39.00 


439523 


W72348 


Hs.185029 


ESTs 


1.00 


1.19 


439592 


AF086413 


Hs.58399 


ESTs 


1.00 


1.00 


439606 


W79123 


Hs.58561 


G protein-coupled receptor 87 


33.61 


1.00 


439670 


AF088076 


Hs.59507 


ESTs, Weakly similar to AC004858 3 U1 sm 


1.00 


1.00 


439702 


AW085525 


Hs.134182 


ESTs 


4.30 


10.00 


439706 


AW872527 


Hs.59761


ESTs, Weakly similar to DAP1_HUMAN DEATH


86.55 


11.00 


439738 


BE246502 


Hs.9598 


sema domain, immunoglobulin domain (Ig), 


2.36 


1.88 


439750 


AL359053 


Hs.57664 


Homo sapiens mRNA full length insert cDN


2.02 


6.08 


439759 


AL359055 


Hs.67709 


Homo sapiens mRNA full length insert cDN


1.00 


21.00 


439780 


AL109688 




gb:Homo sapiens mRNA full length insert 


7.27 


25.00 


439840 


AW449211 


Hs.105445 


GDNF family receptor alpha 1 


1.00 


1.00 


439926 


AW014875


Hs.137007 


ESTs 


32.58 


71.00 


439963 


AW247529 


Hs.6793 


platelet-activating factor acetylhydrola 


21.28 


9.55 


439979 


AW600291 


Hs.6823 


hypothetical protein FLJ10430


68.83 


61.00 


440008 


AK000517 


Hs.6844 


hypothetical protein FLJ20510


1.63


4.02 


440028 


AW473675 


Hs.125843 


ESTs, Weakly similar to T17227 hypotheti


1.42 


2.54


440106 


AA864968 


Hs.127699 


KIAA1603 protein


1.00 


54.00 


440138 


AB033023 


Hs.318127 


hypothetical protein FLJ10201


24.18 


52.00 


440273 


A3805392 


Hs.325335 


Homo sapiens cDNA: FLJ23523 fis, clone L


3.21 


4.72 


440289 


AW450991 


Hs.192071 


ESTs 


38.63 


113.00 


440325 


NM_003812




a disintegrin and metalloproteinase doma


62.88 


147.00 


440492 


R39127 


Hs.21433 


hypothetical protein DKFZp547J036


2.35 


3.62 


440527 


AV657117 


Hs.184164 


ESTs, Moderately similar to S65657 alpha 


10.84 


57.00 


440659 


AF134160 


Hs.7327 


claudin 1


3.18 


2.37


440704 


M69241 


Hs.162 


insulin-like growth factor binding prote 


2.89 


2.09


440943 


AW082293 


Hs.146161 


hypothetical protein MGC240S 


2.02 


1.41 


440994 


AI160011 


Hs.272068 


ESTs 


1.29 


1.14 


441020 


AA401369 


Hs.190721 


ESTs 


142.99 


17.00 


441031 


AI110684


Hs.7645 


fibrinogen, B beta polypeptide 


1.41 


99.00 
441128


AA570256 




ESTs, Weakly similar to T23273 hypotheti 


4.13 


3.50 


441290 


W27501 


Hs.89605 


cholinergic receptor, nicotinic, alpha p 


1.00 


1.00 


441362 


BE61441G 


Hs.23044


RAD51 (S. cerevisiae) homolog (E coli Re


130.23 


43.00 


441377 


BE218239 


Hs.202656 


ESTs 


22.03 


1.00 


441390 


AI692560 


Hs.131175


ESTs 


3.65 


7.70 


441497 


R51064 


Hs.23172 


ESTs 


1.00 


1.00 


441525 


AW241867 


Hs.127728 


ESTs 


1.53 


1.42 


441553 


AA281219 


Hs.121296 


ESTs 


1.89 


1.57 


441607 


NM_005010


Hs.7912 


neuronal cell adhesion molecule


1.47 


2.11


441633 


AW958544 


Hs.112242


normal mucosa of esophagus specific 1 


21R22 


363.00 


441636 


AA081846 


Hs.7921 


Homo sapiens mRNA; cDNA DKFZp566E183 (fr 


2.31 


2.05


441737 


X79449 


Hs.7957 


adenosine deaminase, RNA-specific 


1.30 


1.49 


441790 


AA401369 


Hs.190721 


ESTs 


44.15 


17.00 


441801 


AW242799 


Hs.86366 


ESTs 


1.00 


1.00 


441919 


AI553802


Hs.128121 


ESTs 


1.00 


122.00


441937 


R41782 


Hs.22279 


ESTs 


0.66 


1.37 


441954 


AI744935 


Hs.8047 


Fanconi anemia, complementation group G


1.48 


1.39 


442025 


AW887434 


Hs.11810 


CDA11 protein 


1.00 


46.00 


442029 


AW955698 


Hs.14456 


neural precursor cell expressed, develop


9.92 


45.00 


442072 


AI740832 


Hs.12311 


Homo sapiens clone 23570 mRNA sequence 


25.05 


77.00 


442108 


AW452649 


Hs.166314 


ESTs 


3.61 


3.14 


442117 


AW664964 


Hs.128899 


ESTs 


3.00 


5.49 


442137 


AA977235 


Hs.128830 


ESTs, Weakly similar to Z19^HUMAN ZINC


1.00 


1.00 


442159 


AW163390 


Hs.278554 


heterochromatin-like protein 1


1.92 


1.66 


442179 


AA983842 


Hs.333555 


chromosome 2 open reading frame 2 


27.22 


50.00 


442328 


AI952430


Hs.150614 


ESTs, Weakly similar to ALU4_HUMAN ALU S 


5.00 


3.42 


442432 


BE093589 


Hs.38178 


hypothetical protein FLJ23468


181.59 


76.00 


442530 


AI560830


Hs.176508 


Homo sapiens cDNA FLJ14712 fis, clone NT


10.59 


144.00 


442547 


AA306997 


Hs.217484 


ESTs, Weakly similar to ALU1_HUMAN ALU S


109.23 


98.00 


442556 


AL137761 


Hs.8379 


Homo sapiens mRNA; cDNA DKFZp586L2424 (f 


1.00 


53.00 


442619 


AA447492 


Hs.20183 


ESTs, Weakly similar to AF164793 1 prote


29.02 


50.00 


442710 


AI015631


Hs.23210


ESTs 


1.00 


19.00 


442717 


R88362 


Hs.180591 


ESTs, Weakly similar to T23976 hypotheti


1.00 


5.00 


442875 


BE623003 


Hs.23625 


Homo sapiens clone TCCCTA00142 mRNA sequ 


22.85 


50.00 


442914 


AW188551 


Hs.99519 


hypothetical protein FLJ14007


25.33 


82.00 


442932 


AA457211 


Hs.8858 


bromodomain adjacent to zinc finger doma 


3.18 


4.41 


442942 


AW167087 


Hs.131562 


ESTs 


8.45 


64.00 


443068 


AI188710 




ESTs 


1.00 


27.00 


443204 


AW205878


Hs.29643 


Homo sapiens cDNA FLJ13103 fis, clone MT


1.00 


24.00 


443211 


AI128388 


Hs.143655 


ESTs 


12.42 


2.00 


443247 


BE614387 


Hs.333893 


c-Myc target JP01 


128.84 


96.00 


443324 


R44013 


Hs.164225 


ESTs 


0.02 


4.59 


443383 


AI792453


Hs.166507 


ESTs 


1.00 


47.00 


443400 


R28424 


Hs.250648 


ESTs 


18.52 


61.00 


443426 


AF098158 


Hs.9329 


chromosome 20 open reading frame 1 


4.02 


1.75 


443572 


AA025610 


Hs.9605 


cleavage and polyadenylation specific fa 


1.98


2.57 


443575 


AI078022 


Hs.269636 


ESTs, Weakly similar to ALULHUMAN ALU S


1.00 


29.00 


443614 


AV655386 


Hs.7645 


fibrinogen, B beta polypeptide 


1.00 


16.00 


443633 


AL031290 


Hs.9654 


similar to pregnancy-associated plasma p 


1.00 


39.00 


443648 


AI085377


Hs.143610 


ESTs 


39.81 


70.00 


443715 


AI583187


Hs.9700 


cyclin E1 


48.74 


7.00 


443723 


AI144442 


Hs.157144 


syntaxin 6


1.29 


1.30 


443802 


AW504924 


Hs.9805 


KIAA1291 protein


1.75


1.61


443859


NM_013409


Hs.9914


follistatin


1.35 


1.13 


443892 


AA401369 


Hs.190721 


ESTs 


1.00 


17.00 


443947 


W24187 




gb:zb47f09.r1 Soares_fetal_lung_NbHL19W


1.33 


1.64 


443991 


NM_002250


Hs.10082 


potassium intermediate/small conductance 


5.71 


6.87 


444006 


BE395085 


Hs.10086 


type I transmembrane protein Fn14 


1.47 


1.92 


444009 


AI380792 


Hs.135104 


ESTs


1.00 


77.00 


444017 


U04840 


Hs.214 


neuro-oncological ventral antigen 1 


1.00 


1.00 


444127 


N63620 


Hs.13281 


ESTs 


1.00 


29.00 


444129 


AW294292 


Hs.256212 


ESTs 


1.00 


1.00 


444279 


U62432 


Hs.89605 


cholinergic receptor, nicotinic, alpha p 


0.60 


7.80 


444371 


BE540274 


Hs.239 


forkhead box M1


2.91


1.14 


444378 


R41339 


Hs.12569 


ESTs 


1.00 


1.00 


444381 


BE387335 


Hs.283713 


ESTs, Weakly similar to S64054 hypotheti 


469.00 


556.00 


444461 


R53734 


Hs.25978 


ESTs, Weakly similar to 2109260A B cell


12.88 


105.00 


444471 


AB020684 


Hs.11217 


KIAA0877 protein


24.91 


90.00 


444489 


AI151010 


Hs.157774 


ESTs 


1.00 


111.00 


444619 


BE538082 


Hs.8172 


ESTs, Moderately similar to A46010 X-lin


1.00 


70.00 


444665 


BE613126 


Hs.47783 


B aggressive lymphoma gene 


30.56 


139.00 


444707 


AH88613 


Hs.41690 


desmocollin 3 


1.00 


1.00


444735 


BE019923 


Hs.243122 


hypothetical protein FLJ13057 similar to


77.02 


90.00 


444781 


NM_014400


Hs.11950 


GPI-anchored metastasis-associated prote


1.57 


1.31 


444783 


AK001468 


Hs.62180 


anillin (Drosophila Scraps homolog), act


77.55 


2.00


445236 


AK001676 


Hs.12457 


hypothetical protein FLJ10814 


1.00 


27.00 


445258 


AI635931


Hs.147613 


ESTs 


1.00 


73.00 


445413 


AA151342 


Hs.12677 


CGH47 protein 


28.14


50.00 


445417 


AK001058 


Hs.12680 


Homo sapiens cDNA FLJ10196 fis, clone HE


1.81 


2.62


445443 


AV653838 


Hs.322971 


ESTs 


1.00 


1.00 


445462 


AA378776 


Hs.288649 


hypothetical protein MGC3077 


2.09


1.70 


445517 


AF208855 


Hs.12830 


hypothetical protein 


1.67 


70.00 


445537 


AJ245671 


Hs.12844 


EGF-like-domain, multiple 6 


1.71 


2.72


445580 


AF167572 


Hs.12912 


skb1 (S. pombe) homolog


1.52 


1.34 


445554 


X91247 


Hs.13046 


thioredoxin reductase 1 


1.51 


1.52 
445659


AI570830


Hs.174870 


ESTs 


10.95 


11.45 


445818 


BE045321 


Hs.136017 


ESTs 


1.00 


1.00 


445873 


AA250970 


Hs.251946 


poly(A)-binding protein, cytoplasmic 14


49.42 


54.00 


445885 


AI734009 


Hs.127699 


KIAA1603 protein


1.00 


132.00


445898 


AF070523


Hs.13423 


Homo sapiens clone 24468 mRNA sequence 


1.00 


1.00 


445903 


AI347487


Hs.132781 


class 1 cytokine receptor 


1.00 


36.00 


445932 


BE046441 


Hs.333555 


Homo sapiens clone 24859 mRNA sequence 


2.41 


2.83


445982 


BE410233 


Hs.13501


pescadillo (zebrafish) homolog 1, contai


1.60 


1.35 


446078 


AI339982


Hs.156061 


ESTs 


1.00 


42.00


446102


AW168067 


Hs.317694 


ESTs 


1.00 


1.00 


446157 


BE270828 


Hs.131740 


Homo sapiens cDNA: FLJ22562 fis, clone H


1.70 


1.53 


446269 


AW263155 


Hs.14559 


hypothetical protein FLJ10540


73.01 


48.00 


446292 


AF081497 


Hs.279682 


Rh type C glycoprotein 


1.55 


1.26 


446293 


AI420213 


Hs.149722 


ESTs 


1.00 


2.00 


446423 


AW139655 


Hs.150120 


ESTs 


1.10 


4.19 


446428 


AW082270 


Hs.12498 


ESTs, Weakly similar to ALU4_HUMAN ALU S


0.53 


3.26 


446432 


AI377320 


Hs.150058 


ESTs 


1.00 


5.00 


446528 


AU076640 


Hs.15243 


nucleolar protein 1 (120kD)


1.36 


1.31 


446574 


AI310135


Hs.335933 


ESTs 


3.89


72.00


446619 


AU076643 


Hs.313 


secreted phosphoprotein 1 (osteopontin, 


32.03 


20.23 


446636 


AC002563 


Hs.15767 


citron (rho-interacting, serine/threonin


4.19 


5.07 


446783 


AW138343 


Hs.141867 


ESTs 


2.82 


9.47 


446839 


BE091926 


Hs.16244 


mitotic spindle coiled-coil related prot


110.26 


28.00 


446849 


AU076617 


Hs.16251 


cleavage and polyadenylation specific fa


3.26 


2.94


446856 


AI814373 


Hs.164175 


ESTs 


6.38 


11.30 


446872 


X97058 


Hs.16362


pyrimidinergic receptor P2Y, G-protein c


1.98 


2.03


446880 


AI811807


Hs.108646 


Homo sapiens cDNA FLJ14934 fis, clone PL


94.90 


113.00 


446921 


AB012113 


Hs.16530 


small inducible cytokine subfamily A (Cy 


1.67 


3.90 


446989 


AK001898 


Hs.16740 


hypothetical protein FLJ11036


2.82 


3.12 


447022 


AW291223 


Hs.157573 


ESTs 


1.00 


170.00 


447033 


AI357412 


Hs.157601 


ESTs 


7.15 


107.00 


447078 


AW885727 


Hs.9914 


ESTs 


47.24 


24.00 


447081 


Y13896 


Hs.17287 


potassium inwardly-rectifying channel, s


0.12 


17.88 


447131 


NM_004585


Hs.17466 


retinoic acid receptor responder (tazaro 


0.97 


1.48 


447149 


BE299857 


Hs.326 


TAR (HIV) RNA-binding protein 2


1.24


1.26 


447153 


AA805202 


Hs.315562 


ESTs 


1.00 


54.00 


447164 


AF026941 


Hs.17518 


Homo sapiens cig5 mRNA, partial sequence 


1.00


67.00 


447178 


AW594641 


Hs.192417 


ESTs 


3.42 


50.00 


447250 


AJ878909 


Hs.17883 


protein phosphatase 1G (formerly 2C), ma 


1.60 


1.52 


447289 


AW247017 


Hs.36978 


melanoma antigen, family A, 3 


1.00 


1.00 


447342 


AI199268 


Hs.19322 


Homo sapiens, Similar to RIKEN cDNA 2010


28.63 


1.00 


447343 


AA256641 


Hs.236894 


ESTs, Highly similar to S02392 alpha-2-m 


146.62 


51.00 


447350 


AI375572


Hs.172634 


ESTs 


1.00 


12.00 


447377 


N27687 


Hs.334334 


transcription factor AP-2 alpha (activat 


2.55 


63.00 


447415 


AW937335 


Hs.28149 


ESTs, Weakly similar to KF3B_HUMAN KINES


0.91 


1.13 


447425 


AI963747


Hs.18573 


acylphosphatase 1, erythrocyte (common) 


1.00 


35.00 


447519 


U46258 


Hs.339665 


ESTs 


59.89 


49.00 


447532 


AK000614 


Hs.18791 


hypothetical protein FLJ20607


1.23 


1.63 


447534 


AA401369 


Hs.190721 


ESTs 


1.00 


17.00 


447636 


Y10043 




high-mobility group (nonhistone chromoso


1.41 


1.11 


447688 


N87079 


Hs.19236 


Target CAT 


1.00 


39.00 


447733 


AF157482 


Hs.19400 


MAD2 (mitotic arrest deficient, yeast, h 


1.17 


1.12 


447769 


AW873704 


Hs.320831 


Homo sapiens cDNA FLJ14597 fis, clone NT


6.47 


5.95 


447802 


AW593432 


Hs.161455 


ESTs 


0.73 


2.34


447850 


AB018298 


Hs.19822 


SEC24 (S. cerevisiae) related gene famil


86.45 


116.00 


447924 


AI817226 


Hs.313413 


ESTs, Weakly similar to T23110 hypotheti


1.00 


1.00


447973 


AB011169 


Hs.20141


similar to S. cerevisiae SSM4 


3.50 


4.27 


448030 


N30714 


Hs.325960 


membrane-spanning 4-domains, subfamily A


4.13 


142.00


448105 


AI538613


Hs.298241 


Transmembrane protease, serine 3 


1.15 


2.24


448243 


AW369771 


Hs.52620 


integrin, beta 6


15.84 


1.00 


448278 


W07369


Hs.11782 


ESTs 


0.97 


1.90 


448290 


AK002107 


Hs.20843 


Homo sapiens cDNA FLJ11245 fis, clone PL


1.00 


1.00 


448296 


BE622756 


Hs.10949 


Homo sapiens cDNA FLJ14162 fis, clone NT 


2.42 


2.17


448357 


BE274396 


Hs.108923 


RAB38, member RAS oncogene family 


1.44 


1.08 


448390 


AL035414 


Hs.21068 


hypothetical protein 


1.00 


43.00 


448469 


AW504732 


Hs.21275 


hypothetical protein FLJ11011


2.63


2.49


448569 


BE382657 


Hs.21485 


signal transducer and activator of trans 


1.84 


2.53


448663 


BE614599 


Hs.106823 


hypothetical protein MGC14797 


3.29 


46.00 


448672 


AI955511


Hs.225106 


ESTs 


1.00 


21.00 


448733 


NM_005629


Hs.187958 


solute carrier family 6 (neurotransmitte


1.82 


1.08 


448741 


BE614567 


Hs.19574 


hypothetical protein MGC5469 


2.48 


1.92 


448757 


AI366784 


Hs.48820 


TATA box binding protein (TBP)-associate


23.53


20.00 


448775 


AB025237 


Hs.388 


nudix (nucleoside diphosphate linked moi 


2.34 


1.97 


448826 


AI580252 


Hs.293246 


ESTs, Weakly similar to putative p150 [H 


74.07 


62.67


448830 


AL031656 


Hs.22181 


hypothetical protein dJ310O13.3 


1.37 


1.31 


448844 


AI581519 


Hs.177164 


ESTs 


1.00 


31.00 


448988


Y09763 


Hs.22785 


gamma-aminobutyric acid (GABA) A recepto


1.84 


1.95 


448993 


AI471630




KIAA0144 gene product 


1.63 


1.49 


449003 


X76342 


Hs.389 


alcohol dehydrogenase 7 (class IV), mu o


1.00 


1.00 


449029 


N28989 


Hs.22891 


solute carrier family 7 (cationic amino


1.97 


2.26


449040 


AF040704 


Hs.149443 


putative tumor suppressor 


0.97 


1.56 


449048 


Z45051 


Hs.22920 


similar to S68401 (cattle) glucose induc


27.13 


90.00 


449053 


AI625777 


Hs.344766 


ESTs 


8.33 


44.00 


449054 


AF148848 


Hs.22934 


myoneurin 


73.85 


104.00 


449101 


AA205847 


Hs.23016 


G protein-coupled receptor 


2.58 


27.00 
449167


T05095 


Hs.19597 


KIAA1694 protein


1.61 


2.35 


449207 


AI044222


Hs.23255 


nucleoporin 155kD


2.36


1.56 


449228 


AJ403107 


Hs.148590 


protein related with psoriasis 


1.15 


1.15 


449230


BE613348 


Hs.211579 


melanoma cell adhesion molecule


206.65 


151.00 


449305 


AI638293




gb:ft09b07jc1 NCI_CGAP_GC6 Homo sapiens


17.58


45.00 


449318 


AW236021 


Hs.78531 


Homo sapiens, Similar to RIKEN cDNA 5730


26.39 


35.00 


449448 


060730 


Hs.57471 


ESTs 


1.00 


1.00 


449467 


AW205006 


Hs.197042 


ESTs 


1.00 


1.00 


449523 


NM_000579


Hs.54443 


chemokine (C-C motif) receptor 5 


56.80 


216.86 


449722 


BE280074


Hs.23960 


cyclin B1


150.03 


1.00 


449976 


H06350 


Hs.135055 


Human DNA sequence from clone RP5-850E9 


1.16


2.85


450001 


NM_001044


Hs.406 


solute carrier family 6 (neurotransmitte 


1.17 


1.45 


450098 


W27249 


Hs.8109 


hypothetical protein FLJ21080


1.79 


2.38 


450101 


AV649989 


Hs.24385 


Human hbc647 mRNA sequence


1.00 


69.00 


450149 


AW969781 


Hs.132863 


Zic family member 2 (odd-paired Drosophi


1.00 


1.00 


450193 


AI916071 


Hs.15607 


Homo sapiens Fanconi anemia complementat


29.85 


34.00 


450221 


AA328102 


Hs.24641 


cytoskeleton associated protein 2 


1.00 


1.00 


450372 


BE218107 


Hs.202436 


ESTs 


1.00 


1.00 


450375 


AA009647 


Hs.8850 


a disintegrin and metalloproteinase doma


51.26 


93.00 


450447 


AF212223 


Hs.25010 


hypothetical protein P15-2 


123.20


181.00 


450568 


AL050078 


Hs.25159 


Homo sapiens cDNA FLJ10784 fis, clone NT


1.00 


19.00 


450589 


AJ7015Q5 


Hs.202526


ESTs 


1.00 


23.00 


450684 


AA872605 


Hs.25333 


interleukin 1 receptor, type II


1.00 


100.00 


450701 


H39960 


Hs.288467 


Homo sapiens cDNA FLJ12280 fis, clone MA


1.89 


1.55 


450705 


U90304 


Hs.25351 


Iroquois homeobox protein 2A (IRX-2A) ( 


1.00 


45.00 


450832 


AA401369 


Hs.190721 


ESTs 


25.17 


17.00 


450937 


R49131 


Hs.26267 


ATP-dependant interferon response protei 


90.92 


90.00 


450983 


AA305384 


Hs.25740 


ERO1 (S. cerevisiae)-like


1.33


1.70


451105 


AI761324 




gb:wi60b11.x1 NCI_CGAP_Co16 Homo sapiens


15.02 


124.00 


451110 


AI955040 


Hs.265398 


ESTs, Weakly similar to transformation-f


1.00 


143.00 


451253 


H48299 


Hs.26126 


claudin 10


3.02 


2.29


451291 


R39288 


Hs.6702 


ESTs 


1.00 


1.00 


451320 


AW498974 




diacylglycerol kinase, zeta (104kD)


2.92 


18.00 


451380 


H09280 


Hs.13234 


ESTs 


6.90 


6.67 


451386 


AB029008 


Hs.26334 


spastic paraplegia 4 (autosomal dominant 


35.75 


72.00 


451437 


H24143 


Hs.31945 


hypothetical protein FLJ11071


1.00 


69.00 


451462 


AK000367 


Hs.26434 


hypothetical protein FLJ20360


1.83 


2.10


451524 


AK001466 


Hs.26516 


hypothetical protein FLJ10604


1.13


1.07


451541


BE279383


Hs.26557


plakophilin 3


1.88 


1.33 


451592 


AI805416


Hs.213897 


ESTs 


1.00 


1.00 


451635 


AA018899 


Hs.127179 


cryptic gene 


1.52 


1.92 


451743 


AA401369 


Hs.190721 


ESTs 


4.95 


17.00 


451806 


NM_003729


Hs.27076 


RNA 3'-terminal phosphate cyclase


13.55 


31.00 


451807 


W52854 




hypothetical protein FLJ23293 similar to


1.55 


35.00 


451871 


AI821005 


Hs.118599


ESTs 


1.81 


2.53


451952 


AL120173 


Hs.301663 


ESTs 


1.00 


22.00 


452012 


AA307703 


Hs.279766 


kinesin family member 4A


3.43 


2.26


452046 


AB018345 


Hs.27657 


KIAA0802 protein


56.59 


19.00 


452194 


AI694413 


Hs.332649 


olfactory receptor, family 2, subfamily


1.67 


4.09 


452206 


AW340281 


Hs.33074 


Homo sapiens, clone IMAGE:3606519, mRNA, 


9.31 


53.00 


452240 


AA401369 


Hs.190721 


ESTs 


13.42 


17.00 


452256 


AK000933 


Hs.28661 


Homo sapiens cDNA FLJ10071 fis, clone HE


39.03 


94.00 


452281 


T93500 


Hs.28792 


Homo sapiens cDNA FLJ11041 fis, clone PL


153.01 


340.00 


452291 


AF015592 


Hs.28853 


CDC7 (cell division cycle 7, S. cerevisi 


1.95 


23.00 


452295 


BE379936 


Hs.28866 


programmed cell death 10


42.33 


61.00 


452304 


AA025386 


Hs.61311 


ESTs, Weakly similar to S10590 cysteine


1.17 


2.14


452340 


NM_002202


Hs.505 


ISL1 transcription factor, LIM/homeodoma


1.00 


13.00 


452349 


AB028944 


Hs.29189 


ATPase, Class VI, type 11A 


1.09 


1.42 


452367 


U71207 


Hs.29279 


eyes absent (Drosophila) homolog 2


54.49 


53.00 


452401 


NM_007115


Hs.29352


tumor necrosis factor, alpha-induced pro 


1.00 


32.00 


452410 


AI133619 




Homo sapiens mRNA; cDNA DKFZp434E2321 (f


1.26 


1.99 


452461 


N78223 


Hs.108106 


transcription factor 


24.47 


35.00 


452571 


W31518 


Hs.34665 


ESTs 


54.61 


102.00 


452613 


AA461599 


Hs.23459 


ESTs 


1.39 


1.32 


452699 


AW295390 


Hs.213052 


ESTs 


1.00 


26.00 


452705 


H49805 


Hs.246005 


ESTs 


1.00 


1.00 


452747 


AF160477 


Hs.61460 


Ig superfamily receptor LNIR


112.87 


1.29 


452787 


AW294022 


Hs.222707 


KIAA1718 protein


1.00 


1.00 


452795 


AW392555 


Hs.18878 


hypothetical protein FLJ21620


1.00 


1.00 


452823 


AB012124 


Hs.30696 


transcription factor-like 5 (basic helix 


7.91 


75.00 


452833


BE559681 


Hs.30736 


KIAA0124 protein


3.16


1.92


452838


U65011


Hs.30743


preferentially expressed antigen in mela


174.35 


1.00 


452862 


AA401369


Hs.190721 


ESTs 


98.26 


17.00 


452865 


AW173720 


Hs.345805 


ESTs, Weakly similar to A47582 B-cell gr 


1.55 


1.00 


452934 


AA581322 


Hs.4213 


hypothetical protein MGC16207 


1.73 


1.19 


452946 


X95425 


Hs.31092 


EphA5


1.00 


1.00 


452976 


R44214 


Hs.101189 


ESTs 


1.58 


1.98 


453028 


AB006532


Hs.31442 


RecQ protein-like 4 


1.80 


1.60 


453095 


AW295660 


Hs.252756 


ESTs 


0.77 


1.50 


453102 


NM_007197


Hs.31664 


frizzled (Drosophila) homolog 10 


1.00 


1.00 


453103 


AI301052 


Hs.153444 


ESTs 


1.00 


1.00 


453120 


AA292891 


Hs.31773 


pregnancy-induced growth inhibitor


1.23 


1.20 


453153 


N53893 


Hs.24360 


ESTs 


1.00 


83.00 


453160 


AI263307


Hs.239884 


H2B histone family, member L 


1.00 


30.00 


453197 


AI916269 


Hs.109057 


ESTs, Weakly similar to ALU5_HUMAN ALU S


1.00 


134.00 
453210


AL133161 


Hs.32360 


hypothetical protein FLJ10867


1.69 


1.93 


453240 


AI969564 


Hs.166254 


hypothetical protein DKFZp566i133 


1.00 


1.00 


453317 


NM_002277


Hs.41696 


keratin, hair, acidic, 1 


1.19 


1.27 


453323 


AF034102 


Hs.32951 


solute carrier family 29 (nucleoside tra


4.90 


4.11 


453331 


AI240665 


Hs.8850 


ESTs 


199.42 


340.00 


453392 


U23752 


Hs.32964 


SRY (sex determining region Y)-box 11


1.00 


16.00 


453431 


AF094754 


Hs.32973 


glycine receptor, beta 


1.00 


1.00 


453439 


AI572438 


Hs.32976 


guanine nucleotide binding protein 4


3.44 


5.17 


453459 


BE047032 


Hs.257789 


ESTs 


2.84


5.58 


453563 


AW608906.comp 


Hs.181163 


hypothetical protein MGC5629 


453633 


AA357001 


Hs.34045 


hypothetical protein FU20764 


1.74 


1.60 


453775 


NM_002916


Hs.35120 


replication factor C (activator 1) 4 (37 


19.49 


1.00 


453830


AA534296 


Hs.20953 


ESTs 


24.92 


25.00 


453857 


AL080235 


Hs.35861 


DKFZP586E1621 protein 


167.59 


66.00 


453867 


AI929383 


Hs.33032 


hypothetical protein DKFZp434N185 


1.00 


39.00 


453883 


AI538516


Hs.347524 


cofactor required for Sp1 transcriptiona


1.97 


1.58 


453884 


AA355925 


Hs.36232 


KIAA0186 gene product


63.89 


20.00 


453900 


AW003582 


Hs.226414


ESTs, Weakly similar to ALU8_HUMAN ALU S


20.41 


16.00 


453922 


AF053306 


Hs.36708 


budding uninhibited by benzimidazoles 1


7.09 


22.00


453941 


U39817 


Hs.36820


Bloom syndrome 


29.75 


19.00 


453964 




Hs.12744


ESTs 


1.00 


1.00 


453968 


AA847843 


Hs.62711 


Homo sapiens, clone IMAGE:3351295, mRNA


2.06


i!bi 


453976 


BE463830


Hs.163714 



ESTs 


3.02 


131.00 


454024 


AA993527 


Hs.293907


hypothetical protein FLJ23403


1.00 


131.00 


454034 


NM_000691


Hs.575 


aldehyde dehydrogenase 3 family, member 


1.23


1.02 


454042 


T19228 


Hs.172572 


hypothetical protein FLJ20093


30.63 


171.00 


454059 


NM_003154 


Hs.37048


statherin


1.00 


1.00 


454066 


X00356


Hs.37058


calcitonin/calcitonin-related polypeptid


1.01 


1.45 


454098 


W27953 


Hs.292911 


ESTs, Highly similar to S60712 band-6-pr 


1.26 


1.11 


454241 






gb:CM2-HT017fr4M1099-017-c02 HT0176 Homo 


6.33 


5.04 


454417 


AI244459 


Hs.110826


trinucleotide repeat containing 9 


4.30 


7.82 


454439 


AW819152 


Hs.154320


DKFZP56601646 protein 


1.00 


1.00 


455175 


AW993247 




gb:RC2-BN0033-18020(WJ14.h09 BN0033 Homo 


13.75 


103.00 


455601 


AI368680 


Hs.816 


SRY (sex determining region Y)-box 2


206.11 


1.00 


456237 






gb:zx52e07.r1 Soares_fetal_liver_spleen_


1.00 


1.00 


456321 


NM_001327


Hs.87225 


cancer/testis antigen


1.14 


1.10 


456475 


NM_000144


Hs.95998


Friedreich ataxia 


1.00 


48.00 


456508 


AA502764 


Hs.123469


ESTs, Weakly similar to AF208855 1 BM41


162.25 


189.00 


456534 


X91195 


Hs.100623


phospholipase C, beta 3, neighbor pseudo


2.12 


1.80 


456736 


AW248217 


Hs.1619 


achaete-scute complex (Drosophila) homol


1.15 


1.94 


456759 


BE259150 


Hs.127792


delta (Drosophila)-like 3


1.00 


1.00 


456990 


NM_004504


Hs.171545


HIV-1 Rev binding protein 


16.42 


84.00 


457200 


U33749 


Hs.197764


thyroid transcription factor 1 


0.57 


1.76 


457234 


AW9SA360 



Hs.14355


Homo sapiens cDNA FLJ13207 fis, clone NT


2.71 


4.15 


457465 


AW301344 


Hs.122908


DNA replication factor 


46.37 


47.00 


457489 


MiUJUO J 3 


Hs 19717Q 


cryptic gene 


1.12 


1.35


457646 


MM/ £OO0U 


He 1 1 9Q4A 


ESTs 


1.55 


c.ui 


457733 


AW974812 


Hs.291971 


ESTs 


1.00 


55.00 


457819 


AA057484 


Hs.35406 


ESTs, Highly similar to unnamed protein 


4.36 


3.18 


458092


BE545684 


Hs.343566 


KIAA0251 protein


1.00 


1.32 


458098 


BE550224 




metallothionein 1E (functional)


1.00 


22.00 


458207 


T28472 


Hs.7655 


U2 small nuclear ribonucleoprotein auxil


2.06 


1.88 


458242 


BE299588 


Hs.28465 


Homo sapiens cDNA: FLJ21869 fis, clone H


1.00 


1.00 


458247 


R14439 


Hs.209194 


ESTs 


7.00


9.85 


458679 


AW975460 


Hs.142913 


ESTs 


1.00 


3.00 


458778


AW451034 


Hs.326525 


arylsulfatase D


1.31 


2.01 


458933 


AI638429


Hs.24763 


RAN binding protein 1 


1.98 


1.71 


459352 


AW810383 


Hs.206828 


ESTs 


12.60 


63.00 


459670 


F01020 


Hs.172004 


m 


1.00 


1.00 


459702 


AI204995 




gb:an03c03j(1 Stratagene schizo brain S1 


1.00 


237.00 



TABLE 9B 

Pkey: Unique Eos probeset identifier number 
CAT number: Gene cluster number
Accession: GenBank accession numbers



Pkey 
407746 



408070 
408660 
409522 
409866 
410032 
411089 

411152 

412537 



CAT Number 
10125J 



1036588J 

107294J 

113735J 

1156522J 

1170435J

123172J 

1234028.1 

1304J 



Accession 

AK001962 R69415 BE464605 AA418699 AA053293 AA149075 AA058396 AW338226 AW272659 AA454607 AI139535 AW469852 AJ275461
AW271982 AA730033 AA576507 AA991217 AA782067 AI985851 AA805864 AA505598 AW469857 R69546 AA988279 AW001647 N63320
D82661 T27343 AA306950 AA360989 R58778
AW148852 BE350895

AA525775 AA056342 AI538978 AW975281 AA664986 

AA075382 AA075431

AW502152 H41202 H29772

BE065985 BE065944 BE066008 BE066083 BE066093 

AA456454 AA713730 AA091294 AA584921 N86077 AW836781 AA601031 AA579876 AA551106 AA633188 AW905577 AI955808 AI679386
AI679895 AA514764 AA454562 AI082382 AA595822 AA551351 AA586369 AA666384 AA188934 AA666398 AA551297 AA565188
BE069199 AW936012 AW877466 AW819782 AW935798 AW835546 AW936042 BE069121 AW835625 AW877536 AW935885 BE069202 
AW820019 AW935937 BE160180 AW935946 BE069101 BE069125 AW877527 BE160316 BE160398 AW935794 AW835701 AW935784 
AL031778X59711 NMJ025051\/^9AI870439AI494259AW664010^ 

T29403 BE079412 BE079428 N90322 AI631202 AM1758 AI016793 AI167566 AI862075 AI375230 AI208445 AW235763 AL044113 AA382556
AW953918 AA927051 AA889823 BE003094 AW390155 AW360805 AW360823 AW360810 AA425472 AI694282 AL044114 AI684577 AI809865






412811 132943_1



413690 1383256_1
414883 15024_1






415989 155454_1

417324 166714_1

418574 17690_1

418712 1784125_1

419443 184788_1

419502 18535_1



419936 
421582 



422128 
423034 
423816 
424200 
424999 
426966 
426991 
427260 
428023 



429220 
429978 



430439 
430935 
431089 
431322 
432407 



189181_1
2041_1



211994_1

224122_1

23234_1

236595_1

245835_1

273896_1

27415_1

276598_1

28589_2



301384_1
31150_1



31808_1

325772_1

327825_1

331543_1

34624_1



434414 38585_1



436608 42361_3




AI478773 AI160445 AI674630 N69088 AW665529 N49278 AI129239 AI457890 AI621264 AW297152 AI268215 AA907787 AI286170 AI017982
AI963541 AM69807 AI969353 BE552356 N66509 AA736741 AA382555 AW075811 AW292026

H06382 AW957730 AA352014 R13591 AA121201 D60420 BE263253 BE047862 Z41952 A1424991 AI693507 AI863108 AA599060 AI091 148 
AA598589 R39887 AA813482 AW0 16452 H06383 R41807 AJ364268 AA620528 AI241 940 AW089149 AW090733 AW088B75 Z38240 
AA121202 R17734
BE157489 BE157560

AA926960 AA926959 W76521 W24270 W21526 AA037172 BE267636 H83186 AA469909 N86396 AA001348 BE535736 AA081745 BE566245 
AA082436 H72525 H77575 N49786 W80565 H78746 BE569085 W04339 R98127 T55938 BE279271 AW960304 T29812 M476873 BE297387 
AA292753 AA177048 NM.001826 X54941 BE314366 AA908783 AI719075 BE270172 BE269819 AA889955 A1204630 W25243 AI935150 
AA872039 W72395 T99530 A1422691 H98460 N31428 BE255916 H03265A1857576 AA776920 AA91 0644 AA459522 AA2931 40 AW51 4667 
R75953 AW662396 AA662522 A1865147 AI423153 AW262230 AA584410 AA583187 AW024595 AW069734 AI828995 AA282997 AA876046 
AW613002 AA527373 AW972459 AI831360 AA621337 AA100926 AA772418 AA594628 AI033892 W95096 AI034317 AA398727 AI085O31 
N95210 AI459432 A1041437 AA932124 AA627684 AA935829 A1004827 A1423513 AI094597 H42079 R54703 A163Q359 AA617681 AA978045 
AA643280 W44561 AI991988 AI537692 A1090262 AA740817 AI312104 AI91 1822 AA416871 A1185409 AA129784 AA701623 AI075239 
All 39549 AA633648 AI339996 AI336880 AA399239 A1078708 AI0B5351 AI362835 A1346618 A11 46955 A1989380 AI348243 N92892 M765850 
AI494230 AI278887 AA962596 AI492600 WB0435 AA001979 R97424 AI129015 N24127 AA157451 AA235549 AA459292 AA037114 AA129785 
AI49421 1 AW059601 AW88671 0 R92790 N59755 AI361 1 28 AW589407 H47725 H97534 H48076 H48450 T99631 AW300758 H03431 R76789 
AA954344 H77576 R96823 AI457100 N92845 N49682 H42038 BE220698 BE220715 H99552 AA701624 N74173 R54704 H79520 H72923 
H03266 BE261 919 AA769633 AA480310 AA507454 AA910586 A1203723 AW104725 W2561 1 W25071 T88980 H03513 T77589 R99156 
W95095 R97470 AA702275 T77551 AA91 1952 H82956 N83673 AA283872 

AI267700 AI720344 AA191424 AI023543 AI469633 AA172056 AW958465 AA172236 AW953397 AA355086 
AW265494 AA455904 AA195677 AW265432 AW991605 AA456370 

N2B754 N28747 AI5S8146 AI979339 AA322671 AA322672 AW955043 AI990326 AA776406 AI01 6250 AA843678 AW451882 N23137 N231 29 
W70051 AI038748 AA831327 AI925B45 AW945895 
Z42183 T31621 T97478 
D62703AA242966 D79798 

AU076704 T74854 T74860 T72098 T73265 T73873 T69180 T74658 T58786 T60385 T73410 T68781 T67845 T67593 T73952 T67864 T60630 
T68367 T68401 T53959 172360 172099 T60377 T58961 T71712 U282i 164728 T74645 T72037 T68688 T72063 T73258 T72826 T64242 
T68220 T74673 T71800 T68355 T61227 T62738 T69317 T53850 T64692 T73768 T73962 T73382 T68914 T70975 T73400 T60631 T73277 
T73203 770498 T61409 T58925 NM.000508 M64982 T68301 T73729 T69445 T60424 T67922 T67736 T68716 T67755 T74765 T73819 T58719 
T74756 T60477 T74853 T61 109 T68329 T5B850 T71857 T73425 T53736 T68607 T5B898 T64309 T72031 T72079 T64305 T71908 T68107 
T71 916 T73787 T56035 T64425 T71870 T60476 T61376 T67820 T71895 T41006 T69441 T68170 T74617 T71958 T69440 T61875 R06796 
H48353 T71914 T53939 T64121 M693996 T72525 T67779 T6B078 AA01 1465 AA345378 AV654847 AV654272 AV656001 AI064740 T82897 
N33594 AA344542 AW805054 AI207457 T61743 AA026737 H94389 AA382695 AA918409 T68044 S82092 T39959 A1017721 AA312395 
AA312919T40156 H66239 AV652989 H38728 R98521 AV655200 R95790 W03250 W00913 AA344136 AV660126 R97923 AA343596 
AW470774 AV651256 N54417 AA812862 AW182929 AH 1 1 192 H61463 H72060 AA344503 H38639 AI27751 1 AV661 1 08 A1207625 T4781 0 
AA235252 T27853T47778 R95746 H70620 AA701463AW827166 R98475 C20925 AV657287 T71959 T71313T73920 T73333T61618 T69293 
T69283 T73931 T72178 T72456 AV645639 AV653476 T72957 T72300 T58906 T71457 T70494 T72956 T70495 T68267 T74407 T85778 
AA344726 T27854 T74485 T74101 T73868 T71518 T72304 AA343853 T73909 T68070 T72065 H72149 T73493 T73495 AV645993 R02293 
T70475 T64751 AA344441 AA343657 M345732 M344328 All 10639 AA344603 AF063513 T64696 T68516 T72223 T60507 T67633 R29500 
T72517 R02292 T60599 T69206T70452T74677 R29366 T61277 T74914 T50352 R29675T74843 AV645792 AA344408T69197 T72057 
T69368 T69358 T6S258 AV650429 T73341 T61702 T74598 T40095 K02272 T401 06 AA343045 AA341908 AA341907 AA342807 AA341 964 
T53747 T72042 T62764 AI064899 AA343060 T67832 T72440 T71770 T68091 T69108 T72449 T691 67 T71289 T68251 AV654844 T64375 
AA345234 T67598 AA011414 T68036 H48262 A1207557 T68219 W86031 T69081 T64232 R93196 T62136 AV650539 H67459 T72978 
AA344583 T60362 H581 21 T9571 1 T72803 T68055 T7171 5 R29036 T72793 T69122 T64595 T62888 T691 39 T68291 T64652 T67971 T46862 
AA693592 A1248502 R29454 T64764 T57001 T73052 T71429 T51 176 T5B866 AV655414 H90426 AA342489 T73666 T67848 T72512 T53835 
T67837 T73317 T74273 T69420 T68245 T74380 T67862 T74474 T56068 
AI792788 BE142230AA252019 

AI910275 X00474 X52003 X05030 NM.003225 AA314326 AA308400 AA506787 AA314825 AI571948 AA507595 AA614579 AA587613 R83818 
AA568312 M614409AA307578A1925552AW950155AI910083M12075BE0740 

BE074140 AA514776 AA588034 BE074051 BE074068 AW009769 AW050690 M858276 R55389 AI001051 AW050700 AW750216 AA614539 

BE074045 AI307407 AW602303 BE073575 AI202532 AA524242 A1970839 AI909751 BE076078 A1909749 R55292 

AW881145AA490718M85637AA304575 T06067AA331991 

AL1 1 9930 AA320696 AW752565 

AL031985 AL137241 Af 792386 AI733664 AI857654 AIQ49911 

AA337221 AA336756AW966196 

AW953120 R56325AA349562 

AI493134 AI498691 AW771508 A1498457 A1768408 A1783524 AI383985 A! 580267 D79813 AA393768 
AK001536 AA191 092 AW510354 AI554256 AL353968 AA134266 
AA663848 AA400100 AA401424 

AL038843 AA161338 BE268213 AA425597 N87306 AA092969 BE566038 AA247451 N47392 A1928802 AW182584 AW027872 AI819831 

AI936994 W56258 A1653448 A127861 1 AI283557 Al 824306 AW338658 AW150899 AA687514 N47393 N29885 AA973469 AI038904 AI292064 ' 

A1034339AVV674593 N72156A1079733AI038683A1291616M491599M993675 M837380BE006554BE006473W 

AA652043 AI203503 AA583959 W352B3 AI1 29926 Z41844 AW020925 AW575848 AI684603 AA493297 A1140689 AI277175 AA425444 

AI932767 W02632 BE396786 R37261 

AW207206AW341473M448195A1951341 

AA249027 AL038984 AK001993 AL0B0066 AV652725 BE566226 AA345557 AA315222 AA090585 AA375688 AA301092 AA298454 W05762 
AW607939 H51658 D83880 N84323 BE296821 AW947007 061461 AW079261 AA329482 AW901780 AI354442 AA772275 R31663 A1354441 
AI767525 H92431 AI91 6735 H93575 AJ394255 AW014741 AI573090 C06195 AW612857 AW2651 95 AI339558 AI377532 AI308821 A191 9424 
AI589705 AW055215 AI336532 A1338051 AA808547 C75509 C00618 AW071 172 AW769904 AA630381 A1678018 A1853985 D79662 BE221049 
AW265018 A1589700 AW196655 N76573 AI370908 BE042393 N75017 AI698870 AW9601 15 
AL133561 AL041090AL117481 AL1 22069 AW439292 A1968826 

AW0729 1 6 AJ 18491 3 AA4891 95 AW466994 AW469044 N59350 A1819642 A12S0239 A1220572 AA789302 AI47361 1 AW841126 D60937 
BE04 1 395 AA491 826 AA621 946 AA71 5980 AA666 1 02 
AW970622 AA503009 AA502998 AA502989 AA502805 T92188 

AA221038 R87170 BE537068 BE544757 C18935 AW812058T92565AA227415 AA233942 AA223237 AA668403AA601627 AW889639 
BE061833 BE000620AW961170 AW847519 AA308542 AW821833 AW945688 C04699 AA205504 AA377241 AW821667AA055720 
AW817981 AW856468 AA155719 AA1 79928 T03007 AW754298 AA227407 M113928 AA307904 C16859 

A/798376 S46400 AW81 1617 AW81 1616 W00557 BE142245 AW858232 AW861851 AW858362 AA232351 AA218567 AA055556 AW858231 
AW857541 AWB14172 H66214 AW814398 AF134164 AA243093 M173345 AA199942 AA223384 AA227092 AA227080 T12379 AA092174 
T61 139 AA149776 AA699829 AW879188 AW813567 AW813538 AI267168 AA157718 AA157719 AA100472 AA1 00774 AA 1 30756 AA1 57705 
AA157730 AA157715 AA053524 AW849581 AW854566 C05254 AW882836 T92637 AW812621 AA206583 AA209204 BE1 56909 AA226824 
AI829309 AW991 957 N66951 AA527374 H66215 AA045554 AI694265 H60808 AA149726 AW195620 BE081333 BE073424 AW817662 
AW817705AW817703AW817659BE081531 H59570 
AA628980AI126603BE504035 



438091 44984_1



439000 467716_1

439285 47055_1

439780 47673_1

441128 51021_2

443068 558874_1

443947 586160_1

447636 7301_1



448993 79225_1



449305 804424_1
451105 859083_1
451320 86576_1



451807 8865_1
452410 9163_1



454241 
455175 
456237 



1067807_1
1257335_1
168730_1
47395_1




AW373062 T55662 A1299190 BE1 74210 AW579001 H01811 W40186 R67100 AI923886 AW952164 AA628440 AW898607 AW898616 
AA709126 AW898628 AW898544 AA947932 AW898625 AW898622 AI2761 25 Al 185720 AW510698 AA987230 T52522 BE467708 AW243400 
AW043642 AI288245 AI186932 D52654 D55017 D52715 D52477 D53933 D54679 AI298739 Al 146984 A1922204 N98343 BE174213 AA845571 
AI813854 AI214518 AI635262 AI139455 AI707807 AI698085 AW884528 AI024768 AI004723 AW087420 AI565133 N94964 A1268939 
AW513280 AI061 126 A1435818 A1859106 A1350506 AI024767 AA513019 AA757598 X56196 AA902959 A1334784 AI860794 AA010207 
AW890091 AW513771 AI951391 A1337671 T52499 AA890205 AI640908 H75966 AA463487 AA358888 A1961767 A1866295 AA780994 
AI985913 BE174196 AAO29094 AW592159 T55581 N79072 AI61 1201 AA910812 A1220713 AW149306 AI758412 AA045713 R79750 N76096 
AW979121 AA847986 AA829098 

AL133916 N79113 AF086101 N76721 AW950828 AA364013 AW955584 AI346341 A1867454 N54784 AJ655270 AI421279 AW014882 
AA775552 N62351 N59253 AA626243 AI341407 BE175639 AA456968 AI358918 AA457077 
AL109688 R23665R26578 

AA570256 AW014761 AA573721 AI473237 AI022165 AA554071 AA127551 N90525 AW973623 AA447991 AA243852 BE328850 AI148171 
AI359627AI005068 AI356567AA232991 AW016855AA906902AA233101 AA1 27550 BE512923 
A!188710A1032142AW078833 N30308AW675632A1219028A!341201 N22181 H95390 
W24187 W24194R17789 

Y10043 NMJJ05342 L05085 AL034450 BE614226 AW749053 AA379173 AA248230 BE514634 AA334622 R70656 AA367593 AA214649 
AA369318 AW957081 R05760 AA039903 AI886597 AW630122 AA906264 AA041527 R01145 AI088688 BE463637 AA398795 AI354883 
AI768938 AI569996 AI452952 AI168582 All 89869 AI086670 AW262560 AW61 3854 AA862839 AA435840 AA670197 AI024032 A1990659 
AI990089 N81095 AA847919 AW960150 AA21 1075 AA044704 AA367594 AW582587 AW858854 AW818630 AW818281 AW818433 AW582595 
AA096002N83992 

A1471630 0E54O837 BE265481 AW407710 BE513882BE546739AA053597 BE1405O3 BE218514AW956702AI656234 A1636283 AI567265 

AW340858 BE207794 AA053085 R69173 AA292343 AA454908 AA293504 AI659741 AI927478 AA399460 A1760441 AA346416 BE047245 

AA730380 AA394063AA454833 AI982791 AI567270 AI813332 AI767858 AA427705 D20284 AI221458 BE048537 AI263048AA346417 

AA911497BE537702 

AI638293AW813561 

A1761324 AW880941 AW880937 

AW1 18072 AI631982 T15734 AA224195 A1701458 W20198 F26326 AA890570 N90552 AW071907 AI671352 A1375892 T03517 R88265 
A1124088 AA224388 AI084316 AI354686T33652 A1140719 AI720211 T03490 AI372637 T15415 AW205836 AA630384 T03515 T33230 
AA017131 AA443303 T33623 AI222556 T33511 T33785 AI419606 D55612 

W52854 AL117600 BE208116 BE208432 BE206239 BE082291 AW953423 AA351619 BE180648 BE140560 W60080 AA865478 N90291 
AW450652 AW449519 AA993634 AI806539 AA351618 AW449522 A1827626 AA904788AA380381 AA886045AA774409 BE003229 Z41756 
AL133619 AA4681 18 AA383064 AI476447 T09430 AI673758 AA524895 AJ581345 AI300820 AW498812 AA256162 AI559724 AI685732 
AA602400 AA905453 AI204595 AW1 66541 AA1 57456 AA156269 AA383652 AA431 072 AW592707 AI435410 AW272464 AI21 5594 AA622747 
R74039 N35031 AI804128 AW513621 AA868351 AI026826 AI493388AA614641 W81604AI567080 AI214351 M730140 AI125754 AI200813 
AI269603 AI565082 A1807095 AI476629 AA505909 AI368449 AI686077 A1582930 AW085038 AA757863 AA7301 54 A1767072 AA46831 6 
AI734130 AI734138 M426284 AA433997 A1741241 AW043563 AI732741 AI732734 AA437369 AA425B20 M664048 R74130 
BE144666 BE184942 AW238414 BE184946 
AW993247AW861464 
AA203682R11958 

BE550224 AA832519 N45402 AW885857 N29245 BE465409 W07677 AW970089 AI299731 AA482971 BE503548 H18151 W79223 AF086393 
AA461 301 W74510 R34182 A1090689 N46003 BE071550 R26075 AW134982 AI240204 AI138906 AW026179 AI572316 BE466182 AI206395 
AJ276154 A1273269 AI422817 A1371014 AI421274 AI188525AA939164 BE549810AW1 37865 A1694996 8E503841 AA459718 BE327407 
BE467534 BE218421 BE467767 AA989054 BE467063 AI797130 BE327781 



TABLE 9C

Pkey: Unique number corresponding to an Eos probeset

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA
sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495.
Strand: Indicates DNA strand from which exons were predicted.
Nt_position: Indicates nucleotide positions of predicted exons.
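
The exon position column can be read programmatically. The following is a minimal sketch, assuming each entry is a comma-separated list of `start-end` pairs and that both endpoints are included in the exon (the inclusive convention is an assumption, not stated in the table). The function names are hypothetical.

```python
def parse_nt_position(text):
    """Parse '13558-13721,13942-14090' into [(13558, 13721), (13942, 14090)]."""
    exons = []
    for part in text.split(","):
        start_str, end_str = part.strip().split("-")
        start, end = int(start_str), int(end_str)
        if start > end:
            raise ValueError(f"start after end in {part!r}")
        exons.append((start, end))
    return exons


def exon_lengths(exons):
    """Lengths under the assumption that both endpoints are included."""
    return [end - start + 1 for start, end in exons]


exons = parse_nt_position("13558-13721,13942-14090,14554-14679")
print(exon_lengths(exons))  # [164, 149, 126]
```

Entries damaged in scanning (a missing hyphen, or an end coordinate smaller than the start) raise `ValueError`, which makes them easy to find.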



Pkey    Ref      Strand  Nt_position
400512  9796593  Minus   1439-1615
400517  9796686  Minus   49996-50346
400560  9843598  Plus    94182-94323,97056-97243,101095-101236,102824-103005
400664  8118496  Plus    13558-13721,13942-14090,14554-14679
400665  8118496  Plus    16879-17023
400666  8118496  Plus    17982-18115,20297-20456
400749  7331445  Minus   9162-9293
400763  8131616  Minus   35537-35784
401027  7230983  Minus   70407-70554,71060-71160
401093  8516137  Minus   22335-23166
401203  9743387  Minus   172961-173056,173868-173928
401212  9858408  Plus    87839-88028
401411  7799787  Minus   144144-144329
401435  8217934  Minus   54508-55233
401464  6682291  Minus   170588-170834
401714  6715702  Plus    96484-96681
401747  9789672  Minus   118596-118816,119119-1192H119609-119761,12(M22-^ 131932,132451-132575,133580-134011
401760  9929699  Plus    83126-83250,85320-85540,94719-95287
401780  7249190  Minus   28397-28617,28920-29045,29135-29296,29411-29567,29705-29787,30224-30573
401781  7249190  Minus   83215-83435,83531-83656,83740-83901,84237-84393,84955-85037,86290-86814
401785  7249190  Minus   165776-165996,166189-166314,166408-166569,167112-167268,167387-167469,168634-168942
401797  6730720  Plus    6973-7118
401961  4581193  Minus   124054-124209
401985  2580474  Plus    61542-61750
401994  4153858  Minus   42904-43124,43211-43336,44607-44763,45199-45281,46337-46732
402075  8117407  Plus    121907-122035,122804-122921,124019-124161,124455-124510,125672-126076
402260  3399665  Minus   113765-113910,115653-115765,116808-116940
402265  3287673  Plus    21059-21168
402297  6598824  Plus    35279-35405,35573-35659
402408  9795239  Minus   110326-110491
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405676 
405770 
405932 
406137 
406360 
406399 
405467 



Minus 

9211494 Minus 

8099945 Plus 

8516120 Plus

9438267 Minus 

9958258 Plus

9966528 Plus 



Minus 
7239669 Plus 
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40242) 9796339 Plus 
402674 8077108 Minus 
402802 3287155 Minus 
402994 
403137 
403306 
403329 
403381 
403478 
403485 
403627 
403715 
404044 
404076 
404101 
404140 
404165 
404185 
404210 
404253 
404287 
404298 
404347 
404440 
404721 
404794 
404854 
404877 
404927 
404996 
405449 
405568 
405572 
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Minus 
9931752 Minus 
8076925 Minus 
9843520 Plus 



9926489 

4572584 Minus 

5006246 Plus

9367202 Minus 

2326514 Plus 

9944263 Minus



Plus 

7528051 Plus

9856648 Minus

4826439 Plus 

7143420 Plus 

1519284 Plus 

7342002 Plus 

6007890 Plus

7622497 Plus 

6006906 Plus 

3800891 Plus 

4914350 Plus 

4557087 Plus 

2735037 Plus 

7767812 Minus 

9166422 Minus 

9256107 Minus 

9256288 Minus 

9795551 Plus



129750-129919 
39290-39502 
53242-53432 
4727-4969 

92349-92572,92958-93084,93579-93712,93949-94072,94591-94748,95214-95337

127100-127251 

96450-96598 

260)9-26178 

116458-116554 

2888-3001,3198-3532,3855-4117

23868-24342 

85128-85292

225757-225939 

3848-3967 

125742-125997 

37761-38147 

69025-69128 

129171-129327 

169926-170121 

55675-56055 

53134-53281 

73591-73723 

74493-74829 

80430-81581 

173763-174294 

101619-101898 

14260-14537 

1095-2107 

68690-69563

37999-38145,38652-38998,39727-39872,40557-40674,42351-42450

42236-42570

35912-36065 

85230-85938 

741-969 

73195-73917 

61057-62075 

123525-123713 

30487-31058 

7513-7673 

63448-63554 

182212-182958 



TABLE 10A: Potential Therapeutic, Diagnostic and Prognostic targets for Therapy of Lung Cancer and Non-malignant Lung Disease

Table 2A shows about 307 genes up-regulated in non-malignant lung disease relative to lung tumors and normal body tissues and/or down-regulated in lung tumors relative to
normal lung and non-malignant lung disease. These genes were selected from about 59680 probesets on the Eos/Affymetrix Hu03 Genechip array.
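
The selection described above can be sketched as a filter over the two ratios reported in the table (R1 for tumor versus normal lung, R2 for non-malignant disease versus normal lung). The source does not state the cutoffs that were used, so the thresholds below are hypothetical placeholders, as are the function name and the second sample row.

```python
def select_targets(rows, r1_max=0.5, r2_min=2.0):
    """Keep rows that look down in tumors (R1) and/or up in disease (R2).

    The cutoffs are hypothetical placeholders, not values from the source.
    """
    return [r for r in rows if r["R1"] <= r1_max or r["R2"] >= r2_min]


rows = [
    {"title": "hemoglobin, beta", "R1": 0.47, "R2": 2.33},
    {"title": "example unchanged gene", "R1": 1.00, "R2": 1.00},  # made-up row
]
print([r["title"] for r in select_targets(rows)])  # ['hemoglobin, beta']
```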

Table 10B shows the accession numbers for those Pkeys lacking UnigeneIDs for Table 10A. For each probeset we have listed the gene cluster number from which the
oligonucleotides were designed. Gene clusters were compiled using sequences derived from Genbank ESTs and mRNAs. These sequences were clustered based on sequence
similarity using Clustering and Alignment Tools (DoubleTwist, Oakland, California). The Genbank accession numbers for sequences comprising each cluster are listed in the
"Accession" column.



Table 10C shows the genomic positioning for those Pkeys lacking Unigene IDs and accession numbers in Table 10A. For each predicted exon, we have listed the genomic
sequence source used for prediction. Nucleotide locations of each predicted exon are also listed.



Pkey: Unique Eos probeset identifier number
ExAccn: Exemplar Accession number, Genbank accession number
UnigeneID: Unigene number
Unigene Title: Unigene gene title
R1: Average of lung tumors (including squamous cell carcinomas, adenocarcinomas, small cell carcinomas, granulomatous and carcinoid tumors) divided by the
average of normal lung samples
R2: Average of non-malignant lung disease samples (including bronchitis, emphysema, fibrosis, atelectasis, asthma) divided by the average of normal lung samples
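
As a worked illustration of the R1 and R2 columns, the sketch below computes both ratios for a single probeset. It assumes simple arithmetic means over per-sample intensities. The function name and the intensity values are made up for the example and do not come from the disclosure.

```python
from statistics import mean


def expression_ratios(tumor, disease, normal):
    """Return (R1, R2): group averages divided by the normal-lung average."""
    normal_avg = mean(normal)
    if normal_avg == 0:
        raise ValueError("normal-lung average is zero; ratios are undefined")
    return mean(tumor) / normal_avg, mean(disease) / normal_avg


# Made-up intensities for a single probeset.
r1, r2 = expression_ratios(
    tumor=[5.0, 3.0], disease=[20.0, 30.0], normal=[8.0, 12.0]
)
print(r1, r2)  # 0.4 2.5
```

Under this reading, an R1 below 1 means lower average expression in tumors than in normal lung, and an R2 above 1 means higher average expression in non-malignant lung disease than in normal lung.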



Pkey    ExAccn     UnigeneID  Unigene Title                             R1     R2
404394                        ENSP00000241075:TRRAP PROTEIN.            0.79   3.10
404916                        Target Exon                               1.00   159.00
405257                        Target Exon                               1.00   422.00
407228  M25079     Hs.155376  hemoglobin, beta                          0.47   2.33
407568  AA740964   Hs.62699   ESTs                                      1.00   123.00
408562  AI436323   Hs.31141   Homo sapiens mRNA for KIAA1568 protein,   1.00   230.00
409031  AA376836   Hs.76728   ESTs                                      1.00   128.00
410434  AF051152   Hs.63668   toll-like receptor 2                      39.65  149.00
410467  AF102546   Hs.63931   dachshund (Drosophila) homolog            1.00   109.00
410808  T40326     Hs.167793  ESTs                                      1.14   13.14
412351  AL135960   Hs.73828   T-cell acute lymphocytic leukemia 1       0.37   2.27
412372  R65998     Hs.285243  hypothetical protein FLJ22029             1.00   173.00
413795  AL040178   Hs.142003  ESTs                                      0.10   11.90
414154  AW205314   Hs.323060  ESTs                                      0.62   2.09
414214  D49958     Hs.75819   glycoprotein M6A                          0.03   4.55
414998  NM_002543  Hs.77729   oxidised low density lipoprotein (lectin  0.64   2.97
415122  D60708     Hs.22245   ESTs                                      0.07   8.97
415765  NM_005424  Hs.78824   tyrosine kinase with immunoglobulin and   0.67   1.65
415775  H00747     Hs.29792   ESTs, Weakly similar to I38022 hypotheti  0.29   2.64
415910  U20350     Hs.78913   chemokine (C-X3-C) receptor 1             1.00   145.00
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415319 


AI815601 


Hs.79197


415402 


NM_000715


Hs.1012 


417355 


D13168 


Hs.82002 


417421 


AL138201 


Hs.62120 


417511 


AL049176 


Hs.82223 


418489 


U76421 


Hs.85302 


418726 


BE241812 


Hs.87850 


418741 


H83265 


Hs.8881 


418883 


BE387036 


Hs.1211 


419086 


NM_000216 


Hs.89591 


419150 


T29618 


Hs.89540 


419235 


AW470411 


Hs.288433 


419407 


AW410377 


Hs.41502 


420556 


AA278300 


Hs.124292 


420556 


AA279098 


Hs.187636 


420729 


AW964897 


Hs.290825 


421177 


AW070211 


Hs.102415 


422060 


R20893 


Hs.325823 


422426 


W79117 


Hs.58559 


422652 


AW967969 


Hs.118958 


423099 


NM_002837


Hs.123641 


424433 


H04607 


Hs.9218 


424585 


AA464840 


Hs.131987 


424711 


NM_005795


Hs.152175 


424973 


X92521 


Hs.154057 


425023 


AW956889 


Hs.154210 


425664 


AJ006276


Hs.159003 


425998 


AU076629 


Hs.165950 


426657 


NM_015865 


Hs.171731 


426753 


T89832 


Hs.170278 


427558 


D49493 


Hs.2171 


427983 


M17706 


Hs.2233 


428467 


AK002121 


Hs.184465 


428927 


AA441837 


Hs.90250


429496 


AA453800 


Hs.192793 


430468 


NM_004673


Hs.241519 


431385 


BE178536 


Hs.11090 


431728 


NM_007351


Hs.268107 


431848 


AI378857 


Hs.126758 


432128 


AA127221 


Hs.117037 


432519 


AI221311 


Hs.130704


433043 


W57554 


Hs.125019 


433803 


AI823593 


Hs.27688 


434730 


AA644669 


Hs.193042 


435472 


AW972330 


Hs.283022 


436532 


AA721522 




437119 


AI379921


Hs.117043


437140 


AA312799 


Hs.283689 


437211 


AA382207 


Hs.5509 


437960 


AI669586 


Hs.222194 


438202 


AW169287 


Hs.22588 


438873 


AI3G2471 


Hs.124292 


438875 


AA827640 


Hs.189059 


441048 


AA913488


Hs.192102 


441188 


AW292830 


Hs.255609 


441499 


AW298235 


Hs.101689 


444513 


AL120214 


Hs.7117 


444527 


NM_005408


Hs.11383 


444561 


NM_004469


Hs.11392 


445279 


R41900 


Hs.22245 


446017 


N98238 


Hs.55185 


446984 


AB020722 


Hs.16714 


446998 


N99013 


Hs.16762 


447357 


AI375922 


Hs.159367 


448105 


AI800470 


Hs.171941 


448253 


H25899 


Hs.201591 


449275 


AW450848 


Hs.205457 


450400 


AI694722 


Hs.279744 


450696 


AI654223


Hs.16026 


450726 


AW204600 


Hs.250505


451497 


H83294 


Hs.284122 


451533 


NM_004657


Hs.26530 


453636 


R67837 


Hs.169872 


458332 


AI000341 


Hs.220491 


459580 


AA022888 


Hs.176065 


400269 






403421 






407570 


Z19002 


Hs.37096 


412295 


AW088826 


Hs.117176 


414517 


M24461 


Hs.76305 


417204 


N81037 


Hs.1074 


418307 


U70867 


Hs.83974 


418935 


T28499 


Hs.89485 


421502 


AF111856 


Hs.105039 


421798 


N74880 


Hs.29877 



CD83 antigen (activated B lymphocytes, i 15.32 237.00

complement component 4-binding protein, 0.64 4.00

endothelin receptor type B 0.01 3.90

nuclear receptor subfamily 4, group A, m 35.30 357.00

chordin-like 1.00 179.00

adenosine deaminase, RNA-specific B1 (h 0.02 6.00

protein tyrosine phosphatase, non-recept 1.00 113.00

ESTs, Weakly similar to S41044 chromosom 0.44 1.90

acid phosphatase 5, tartrate resistant 0.96 2.04

Kallmann syndrome 1 sequence 0.62 2.74

TEK tyrosine kinase, endothelial (venous 0.03 6.90

neurotrimin 1.48 5.13

hypothetical protein FLJ21276 37.55 336.00

Homo sapiens cDNA: FLJ23123 fis, clone L 0.80 3.65

ESTs 1.65 8.07

ESTs 2.99 25.82

Homo sapiens mRNA; cDNA DKFZp586N0121 (f 0.46 1.95

ESTs, Moderately similar to ALU5_HUMAN A 1.00 156.00

ESTs 0.03 7.44

syntaxin 11 0.14 3.62

protein tyrosine phosphatase, receptor t 0.01 3.16

ESTs 0.75 141.75

ESTs 1.00 167.00

calcitonin receptor-like 0.43 3.01

matrix metalloproteinase 19 0.37 19.45

endothelial differentiation, sphingolipi 0.14 3.35

transient receptor potential channel 6 1.00 94.00

fibroblast growth factor receptor 4 0.68 1.42

solute carrier family 14 (urea transport 0.03 3.74

ESTs 1.00 141.00

growth differentiation factor 10 1.00 117.00

colony stimulating factor 3 (granulocyte 0.75 2.20

hypothetical protein FLJ11259 0.76 2.25

ESTs 0.01 3.62

ESTs 1.00 138.00

angiopoietin-like 1 1.00 132.00

membrane-spanning 4-domains, subfamily A 1.00 157.00

multimerin 1.00 157.00

ESTs, Highly similar to AF175283 1 zinc 0.34 2.24

ESTs 0.00 1.15

ESTs, Weakly similar to BCHUIA S-100 pro 0.01 2.06

lymphoid nuclear protein (LAF-4) mRNA 1.00 267.00

ESTs 1.00 105.00

ESTs 1.05 3.15

triggering receptor expressed on myeloid 0.83 1.94

gb:nv54h12.r1 NCI CGAP_Ew1 Homo sapiens 1.00 218.00

ESTs 1.00 133.00

activator of CREM in testis 0.67 122.67

ecotropic viral integration site 2B 1.00 142.00

ESTs 1.00 147.00

ESTs 1.00 141.00

Homo sapiens cDNA: FLJ23123 fis, clone L 0.71 3.66

ESTs 23.32 370.00

ESTs 0.77 8.50

ESTs 3.43 16.36

ESTs 1.00 167.00

glutamate receptor, ionotropic, AMPA 1 1.00 151.00

small inducible cytokine subfamily A (Cy 46.47 153.00

c-fos induced growth factor (vascular en 0.01 3.08

ESTs 0.60 141.00

ESTs 0.18 2.39

Rho guanine exchange factor (GEF) 15 0.10 2.16

Homo sapiens mRNA; cDNA DKFZp564B2062 (f 0.01 2.53

ESTs 0.46 2.64

ESTs 18.05 296.00

ESTs 1.00 141.00

periaxin 0.56 1.38

ESTs 0.88 4.33

hypothetical protein FLJ23191 0.52 2.08

retinoic acid receptor, alpha 0.79 ZQ\

Wnt inhibitory factor-1 0.35 2.03

serum deprivation response (phosphatidyl 0.13 2.25

ESTs 1.00 116.00

ESTs 1.00 192.00

ESTs 0.20 2.98

Eos Control 0.40 2.40

NM_016369*:Homo sapiens claudin 18 (CLDN 0.53 1.77

zinc finger protein 145 (Kruppel-like, e 0.01 3.18

poly(A)-binding protein, nuclear 1 0.56 1.74

surfactant, pulmonary-associated protein 0.64 1.50

surfactant, pulmonary-associated protein 0.33 1.16

solute carrier family 21 (prostaglandin 0.53 1.55

carbonic anhydrase IV 0.20 1.28

solute carrier family 34 (sodium phospha 0.78 1.90

N-acylsphingosine amidohydrolase (acid c 0.59 1.54
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423354 AB011130 Hs. 127436 

423738 AB002134 Hs.132195 

425211 M18667 Hs.1867 

425438 T62216 Hs.270840 

426828 NM_000020 Hs.172670

427019 AA001732 Hs.173233 

428043 T92248 Hs.2240 

430280 AA381258 Hs.237868 

431433 X65018 Hs.253435 
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15 444325 AW152618 Hs.16757 

450954 AJ904740 Hs^5691 

451558 NM_001089 Hs.26630

453310 X70697 Hs.553 

456855 AF035528 Hs.153863

444342 NM_014398 Hs.10887
400754 
401045 
401083 
402474 

402808
403021 
403438 
403687 
403764 

404277
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35 406387 

406646 M33600 
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406753 AA505665 

406973 M34996 

407246 U82275
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407731 NM_000066 Hs.38069 
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408045 AW138959 
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409389 AB007979 

409718 D86640 

410798 BE178622 

411020 NM_006770

411667 BE160198 

412000 AW576555 

412358 BE047490 

412420 AL035668 

412564 X83703

412869 AA290712 

412870 N22788 
413529 U11674 
413533 BE146973 

413689 BE157286

413724 AA131466 

413800 AJ129238 
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413829 NMJJ01872 Hs.75572 

414376 BE393856 Hs.66915

414577 AJ056548 Hs.72116 

414700 H63202 Hs.38163 

415078 AA311223 Hs.283091

415120 N54464 Hs.34950 

415323 BE269352 Hs.949

415335 AA847758 Hs.111030 

415582 W92445 Hs.165195 

416030 H15261 Hs.21948 

416427 BE244050 Hs.79307 




calcium channel, voltage-dependent, alph
airway trypsin-like protease
progastricsin (pepsinogen C)
ESTs

activin A receptor type II-like 1

hypothetical protein FLJ10970

uteroglobin

interleukin 7 receptor

surfactant, pulmonary-associated protein

Homo sapiens mRNA; cDNA DKFZp564B2082 (f

ESTs

advanced glycosylation end product-speci

ESTs

ESTs

ESTs

receptor (calcitonin) activity modifying
ATP-binding cassette, sub-family A (ABC1
solute carrier family 6 (neurotransmitte
MAD (mothers against decapentaplegic Dr



Hs.266959 
Hs.217493 
Hs.198253 
Hs.94498 



Hs.245123 

Hs.155591 

Hs.141883 

Hs.673 

Hs.50813 

Hs.687 

Hs.51515 

Hs.301281 

Hs.56045 

Hs.16291 

Hs.67726 

Hs.15780 
Hs.24172 
Hs.73853 
Hs.31432 
Hs.82407 
Hs.82407 
Hs.846 

Hs.20631 
Hs.23767 
Hs.192235 
Hs.32241 



416464 NM_000132 Hs.79345



416585 X54162 

416847 L43821 

417148 AA359896 

417370 T28651 

417673 T87281 



Hs.79386 

Hs.80261 

Hs.293885 

Hs.82030 

Hs.16355 



Target Exon

C11001883*:gi|6753278|ref|NP_033938.1| c
NM_016582*:Homo sapiens peptide transpor
NM_004079:Homo sapiens cathepsin S (CTSS
ENSP00000235229:SEMB.
C21000030:gi|9955960|ref|NP_063957.1| AT
NM_031419*:Homo sapiens molecule possess
NM_007037*:Homo sapiens a disintegrin-li
NM_005463:Homo sapiens heterogeneous nuc
NM_019111*:Homo sapiens major histocompa
NM_002944*:Homo sapiens v-ros avian UR2
CD83 antigen (activated B lymphocytes, i
C11001637*:gi|5032241|ref|NP_005732.1| 2
Target Exon
Target Exon

major histocompatibility complex, class
hemoglobin, gamma G
annexin A2

major histocompatibility complex, class
leukocyte immunoglobulin-like receptor,
gb:Human trophoblast hypoxia-regulated f
complement component 8, beta polypeptide
arylacetamide deacetylase (esterase)
ESTs
ESTs

forkhead box F1
ESTs

interleukin 12A (natural killer cell sti
hypothetical protein FLJ20022
cytochrome P450, subfamily IVB, polypept
Homo sapiens mRNA; cDNA DKFZp564G112 (fr
Homo sapiens mRNA, chromosome 1 specific
src homology three (SH3) and cysteine ri
gb:PM3-HT0605-270200-001-a02 HT0605 Homo
macrophage receptor with collagenous str
gb:QV1-HT0413-01020W)59-h03 HT0413 Homo
ATP-binding cassette, sub-family A (ABC1
ESTs

bone morphogenetic protein 2

cardiac ankyrin repeat protein

CXC chemokine ligand 16

CXC chemokine ligand 16

interleukin 8 receptor, beta

gb:QV4-HT0222-01 1 199-01 9*05 HT0222 Homo

zinc finger protein, subfamily 1A, 5 (Pe

hypothetical protein FLJ12666

ESTs

ESTs, Weakly similar to S65657 alpha-1 C-
carboxypeptidase B2 (plasma)
ESTs, Weakly similar to 16.7Kd protein [
hypothetical protein FLJ20992 similar to
ESTs

found in inflammatory zone 3
ESTs

neutrophil cytosolic factor 2 (65kD, chr
ESTs

Homo sapiens cDNA FLJ14237 fis, clone NT
ESTs

Rac/Cdc42 guanine exchange factor (GEF)
coagulation factor VIII, procoagulant co
leiomodin 1 (smooth muscle)
enhancer of filamentation 1 (cas-like do
hypothetical protein FLJ14902
tryptophanyl-tRNA synthetase
ESTs
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1.00 


102.00 


1.00 


70.00 


1.00 


112.00 


0.07 


10.17 
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0.86 


1.95 


1.00


120.00 


0.60 


1.48


1.00 


95.00 


1.00 


136.00 


0.02 


8.07 


1.00 


73.00 


0.70 


3.36 


0.06 


6.56 


0.70 


3.66 


1.00 


114.00 


0.85 


1.30 


0.15 


15.54 
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418067 


AI127958 


Hs.83393


cystatin E/M


0.81 


1.74 


418295 


C01566 


Hs.86671 


ESTs 


1.00 


99.00 


418543 


J03798 


Hs.86948 


small nuclear ribonucleoprotein D1 polyp


1.00 


60.00 


418832 


X04011 


Hs.88974 


cytochrome b-245, beta polypeptide (chro


2.40 


14.74 


418945 


BE246762


Hs.89499


arachidonate 5-lipoxygenase


0.67


3.16


419261 


X07876 


Hs.89791 


wingless-type MMTV integration site fami


1.00 


73.00 


419564 


U08989 


Hs.91139 


solute carrier family 1 (neuronal/epithe


1.00 


192.00


419574 


AK001989 


Hs.91165 


hypothetical protein 


1.00 


94.00 


419968 


X04430 


Hs.93913 


interleukin 6 (interferon, beta 2)


61.16 


500.00 


420256 


U84722 


Hs.76205 


cadherin 5, type 2, VE-cadherin (vascula


0.52 


1.70 


420285 


AA258124 


Hs.293878 


ESTs, Moderately similar to ZN91_HUMAN Z


1.00 


172.00 


420577 


AA278436 


Hs.186649 


ESTs 


1.00 


97.00 


421262 


AA286746 


Hs.9343 


Homo sapiens cDNA FLJ14265 fis, clone PL


1.00 


64.00 


421445 


AA913059 


Hs.104433 


Homo sapiens, clone IMAGE:4054868, mRNA


0.88 


1.51 


421470 


R27496 


Hs.1378 


annexin A3 


0.05 


11.26 


421478 


AI683243 


Hs.97258 


ESTs, Moderately similar to S29539 ribos 


1.00 


73.00 


421563 


NM_006433 


Hs.105806 


granulysin 


0.82 


2.42


421566 


NM_000399


Hs.1395 


early growth response 2 (Krox-20 (Drosop 


5.50 


31.57 


421855 


F06504 


Hs.27384 


ESTs, Moderately similar to ALU4_HUMAN A 


1.00 


129.00 


421913 


AJ934365 


Hs.109439 


osteoglycin (osteoinductive factor, mime 


1.00 


101.00 


421952 


M300900 


Hs.98849 


ESTs, Moderately similar to AF161511 1 H


0.60 


63.60 


422232 


D43945 


Hs.113274 


transcription factor EC 


1.00 


148.00 


422386 


AF105374 


Hs.115830 


heparan sulfate (glucosamine) 3-O-sulfot


1.40 


3.98 


423168 


R34385


Hs.124940 


GTP-binding protein 


0.34 


3.59 


423196 


AK001866 


Hs.125139 


hypothetical protein FLJ11004


0.55 


2.00 


423387 


AJ012074 




vasoactive intestinal peptide receptor 1 


0.09 


2.13 


423424 


AH50241 


Hs.128433 


prostaglandin D2 synthase, hematopoietic


1.00 


141.00 


423456 


AL110151 


Hs.128797 


DKFZP586D0824 protein 


1.00 


66.00 


423696 


Z92546 




Sushi domain (SCR repeat) containing 


0.73 


1.27 


424027 


AW337575 


Hs.201591 


ESTs 


0.54 


2.58 


424212 


NM_005814 


Hs.143131 


glycoprotein A33 (transmembrane) 


0.77 


2.47 


425087 


R62424 


Hs.126059 


ESTs 


1.00 


74.00 


425175 


AF020202 


Hs.155001 


UNC13 (C. elegans)-like


0.85 


1.96 


425771 


BE561776 


Hs.159494 


Bruton agammaglobulinemia tyrosine kinas


1.18 


256 


426486 


BE178285 


Hs.170056 


Homo sapiens mRNA; cDNA DKFZp586B0220 (f


1.00 


76.00 


427507 


AF240467 


Hs.179152 


toll-like receptor 7 


1.00 


63.00 


427618 


NM_000760


Hs.2175 


colony stimulating factor 3 receptor (gr 


0.60 


219 


427732 


NM_002980


Hs.2199 


secretin receptor 


0.97 


1.42 


427952 


AA765368 


Hs.293941 


ESTs, Moderately similar to A53959 throm


1.00 


105.00 


428709 


BE268717 


Hs.104916 


hypothetical protein FLJ21940


1.00 


80.00 


428769 


AW207175 


Hs.106771 


ESTs 


0.09 


255 


428780 


AI478578


Hs.50636 


ESTs 


1.00


98.00 


428833 


AI928355


Hs.185805 


ESTs 


1.00 


113.00 


429657 


D13626 


Hs.2465 


KIAA0001 gene product; putative G-protei


1.00 


52.00 


430212 


AA469153 




gb:nc67fD4.s1 NCLCGAP_Pr1 Homo sapiens 


1.00 


132.00 


430226 


BE245562 


Hs.2551 


adrenergic, beta-2-, receptor, surface


0.11 


15.60 


430376 


AW292053 


Hs.12532 


chromosome 1 open reading frame 21 


1.00 


103.00 


430414 


AW365665 


Hs.120388 


ESTs 


0.50 


6.96 


430656 


AA482900 


Hs.162080 


ESTs 


1.00 


70.00 


430843 


AI734149 


Hs.119514 


ESTs 


1.00 


90.00 


430998 


AF128847 


Hs.204038 


indolethylamine N-methyltransferase


0.29 


1.84 


431217 


NM_013427


Hs.250830 


Rho GTPase activating protein 6 


1.00 


79.00 


431921 


N46466 


Hs.58879 


ESTs 


0.91 


1.67 


432176 


AW090386 


Hs.112278 


arrestin, beta 1 


0.66 


263 


432203 


AA305746 


Hs.49 


macrophage scavenger receptor 1 


1.00 


76.00 


432231 


AA339977 


Hs.274127 


CLST 11240 protein 


0.46 


1.46 


432485 


N90866 


Hs.276770 


CDW52 antigen (CAMPATH-1 antigen) 


0.79 


225 


432522 


D11466 


Hs.51 


phosphatidylinositol glycan, class A (pa


1.93 


4.83 


432596 


AJ224741 


Hs.278461 


matrilin 3


0.04 


5.79 


432850 


X87723 


Hs.3110 


angiotensin receptor 2 


1.00 


167.00 


433138 


AB029496 


Hs.59729 


semaphorin sem2 


0.04 


9.16 


433563 


AI732637


Hs.277901 


ESTs 


1.00 


91.00 


433588 


AI056872


Hs.133386 


ESTs 


120.16 


315.00 


434445 


AI349306 


Hs.11782 


ESTs 


0.60 


1.84 


435496 


AW840171 


Hs.265398 


ESTs, Weakly similar to transformation-r 


1.00 


128.00 


435974 


U29690 


Hs.37744 


Homo sapiens beta-1 adrenergic receptor 


1.00 


108.00 


436061 


AI248584 


Hs.190745


Homo sapiens cDNA: FLJ21326 fis, clone C


1.00 


91.00 


437157 


BE048860 


Hs.120655 


ESTs 


1.00 


87.00 


437207 


T27503 


Hs.15929 


hypothetical protein FLJ12910


1.00 


105.00 


437311 


AA370041 


Hs.9456 


SWI/SNF related, matrix associated, acti


1.00 


71.00 


437439 


H29796 


Hs.269622 


ESTs 


1.00 


115.00 


438199 


AW016531 


Hs.122147 


ESTs 


1.00 


80.00 


439551 


W72062 


Hs.11112 


ESTs 


0.30 


3.10 


440515 


AJ131245 


Hs.7239 


SEC24 (S. cerevisiae) related gene famil 


1.00 


77.00 


440887 


AI799488 


Hs.135905 


ESTs 


1.00 


85.00 


441025 


AA913880 


Hs.176379 


ESTs 


1.00 


82.00


441384 


AA447849 


Hs.268660 


Homo sapiens cDNA: FLJ22182 fis, clone H


0.79


1.89 


441735 


AI738675 


Hs.127346 


ESTs 


1.00 


75.00 


442200 


AW590572 


Hs.235768 


ESTs 


0.78 


5.83 


442832 


AW206560 


Hs.253569 


ESTs 


0.03 


10.68 


442957 


AI949952


Hs.49397 


ESTs 


1.00 


70.00 


443282 


T47764 


Hs.132917 


ESTs 


1.00 


197.00 


443547 


AW271273 


Hs.23767


hypothetical protein FLJ12666


1.00 


253.00 


443951 


F13272 


Hs.111334


ferritin, light polypeptide 


0.55 


209 


444330 


AI597655 


Hs.49265 


ESTs 


1.00 


90.00 
444515


AW204908 


Hs.169979 


ESTs 


1.00 


84.00 


445769 


AI741471 


Hs.23666 


ESTs 


0.02 


4.38 


445908 


R13580 


Hs.13436 


Homo sapiens clone 24425 mRNA sequence


1.00 


97.00 


446291 


BE397753 


Hs.14623 


interferon, gamma-inducible protein 30


0.93 


1.69 


446917 


AI347863


Hs.156672 


ESTs 


1.00 


106.00 


447261 


NM_006691


Hs.17917 


extracellular link domain-containing 1 


0.40 


47.20 


447432 


AW958473 


Hs.301957 


nudix (nucleoside diphosphate linked moi 


1.00 


100.00 


447482 


AB033059 


Hs.18705


KIAA1233 protein


0.05 


8.21 


447997 


H00656 


Hs.29792 


ESTs, Weakly similar to I38022 hypotheti


0.02 


5.42 


448299 


AA497044 


Hs.20887 


hypothetical protein FLJ10392


1.00 


79.00 


448782 


AL050295 


Hs.22039 


KIAA0758 protein


0.42 


1.56 


450575 


NM_005859


Hs.29117 


purine-rich element binding protein A 


0.17 


11.33 


450584 


AA040403 


Hs.60371 


ESTs 


1.00 


94.00 


450693 


AW450461 


Hs.203965 


ESTs 


1.00 


91.00 


450715 


Aj 266484 


Hs.31570 


ESTs, Weakly similar to KIAA1324 protein


1.00 


152.00


451103 


R52804 


Hs.25956 


DKFZP564D206 protein 


1.00 


86.00 


451220 


AF124251 


Hs.26054 


novel SH2-containing protein 3 


0.60 


1.30 


451668 


Z43948 


Hs.326444 


cartilage acidic protein 1 


0.54 


1.91 


452197 


AW023595 


Hs.232048 


ESTs 


1.00 


67.00 


452331 


AA598509 


Hs.29117 


purine-rich element binding protein A 


4.53 


11.07 


452353 


C18825 


Hs.29191 


epithelial membrane protein 2 


0.72 


2.24 


453049 


BE537217 


Hs.30343 


ESTs 


1.00 


68.00 


453107 


NM_016113


Hs.279746 


vanilloid receptor-like protein 1


0.83 


1.70 


453355 


AW295374 


Hs.31412 


Homo sapiens cDNA FLJ11422 fis, clone HE


1.00 


132.00 


453390 


AA862496 


Hs^8482 


ESTs 


1.00 


72.00 


453531 


AA417940 




ESTs, Weakly similar to JC5795 CDEP prot 


1.00 


68.00 


454741 


BE154396 




gb:CM2-HT0342-091299-050-b05 HT0342 Homo 


0.57 


289 


456579 


AA287827 


Hs.284205 


up-regulated by BCG-CWS 


1.00 


82.00 


456672 


AK002016 


Hs.114727


Homo sapiens, clone MGC:16327, mRNA, com


0.79 


1.96 


457400 


AF032906 


Hs.252549 


cathepsin Z 


1.03 


3.25 


457718 
459696 


F18572 


Hs.22978 


ESTs, Weakly similar to ALU4_HUMAN ALU S 


1.00 


113.00 


F03027 




gb:HSC1KA072 normalized infant brain cDN 


1.00 


544.00 



TABLE 10B

Pkey: Unique Eos probeset identifier number
CAT number: Gene cluster number
Accession: Genbank accession numbers
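
The row layout described by this legend (a Pkey, a gene cluster number, then one or more Genbank accession numbers) can be read with a few lines of Python. This is only an illustrative sketch: the names `ClusterRow` and `parse_cluster_row` are hypothetical, and it assumes that each row has been rejoined onto a single line and that the cluster number is written as a single token such as `1232559_1`.

```python
from typing import List, NamedTuple


class ClusterRow(NamedTuple):
    """One row of a 'B' table: probeset, gene cluster, member accessions."""
    pkey: int
    cat_number: str
    accessions: List[str]


def parse_cluster_row(line: str) -> ClusterRow:
    """Split a whitespace-delimited row into its three legend fields."""
    fields = line.split()
    if len(fields) < 3:
        raise ValueError(f"expected Pkey, CAT number and accessions: {line!r}")
    # First token is the Pkey, second the cluster number, the rest accessions.
    return ClusterRow(int(fields[0]), fields[1], fields[2:])


# Example row taken from Table 10B (underscore in the CAT number is assumed).
row = parse_cluster_row("454741 1232559_1 BE154396 AW817959 BE154393")
print(row.pkey, row.cat_number, row.accessions)
```

Rows that wrap across several lines in the printed table would need to be joined before parsing.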



Pkey CAT Number Accession 

408074 103684_1 R20723 AA263003 AA333976 AA334725 AA334151 AW965490 AA310513 AI810530 D31302 AW134897 AA830127 AA046953 AJ668930
C06094 AW104534

411667 1253334_1 BE160198 AW935898 T11520 AW935930 AW856073 AW861034

413533 1375344_1 BE146973 BE146972 BE147042 BE147018 BE146783 BE147020 BE146781 BE147019 BE146766 BE147021 BE146952 BE146767 BE147044
BE146797 BE146776 BE146985 BE146793 BE146768 BE146771 BE146954 BE146760 BE147048 BE147025 BE147030

423387 22779_1 AJ012074 U11087 L13288 X75299 L20295 AW630780 H14880 T28Q37 AI872991 R72136 AW449839 T81622 T79697 T29519 R94105 T83923
R73300 AI797007 R73390 AA961010 H74168 AI689932 BE045543 AI808418 AI608912 AI806573 AW884084 AW872978 AW872985 AA565655
AJ022915 R50647 R73210 H45098 R46451 AW166269 T71132 AI264547 R52146 AI304920 R73391 AW684059 AW884085 H73241 T60038
T79612 R73145 R50549 AI094557 AI668793 R72302 AI564366 W01956 AA418962 W32571 R72840 H45409 R72085 R46356 R46758
AA508805 AA418798 T83751 R94072 T16182 AA928785 AA903896

423696 23112_1 Z92546 AA330586 AI570568 AW341487 AI827050 AW298658 AI792189 AI015693 AI733599 AI572251 AI672488 AW193262 AI244716
AI864375 AI206100 AA912444 AI269365 AI640254 AW772466 AJ867336 AA627604 H16914 AA358477 AA338009

430212 314437_1 AA469153 AI718503 M469225

436532 421802_1 AA721522 AW975443 T93070

453531 97026_1 AA417940 AA036735 T07025

454741 1232559_1 BE154396 AW817959 BE154393



TABLE 10C

Pkey: Unique number corresponding to an Eos probeset

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA
sequence of human chromosome 22," Dunham I. et al., Nature (1999) 402:489-495.
Strand: Indicates DNA strand from which exons were predicted.
Nt_position: Indicates nucleotide positions of predicted exons.
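
The exon-position field described in this legend is a comma-separated list of start-end ranges. A minimal Python sketch of how such a field could be parsed is shown below. The helper names `parse_exons` and `total_exon_length` are hypothetical, and the length calculation assumes that both coordinates of a range are inclusive, which the legend does not state.

```python
from typing import List, Tuple


def parse_exons(nt_position: str) -> List[Tuple[int, int]]:
    """Turn '90044-90184,91111-91345' into [(90044, 90184), (91111, 91345)]."""
    exons = []
    for chunk in nt_position.split(","):
        start_text, end_text = chunk.strip().split("-")
        start, end = int(start_text), int(end_text)
        if start > end:
            raise ValueError(f"range runs backwards: {chunk!r}")
        exons.append((start, end))
    return exons


def total_exon_length(exons: List[Tuple[int, int]]) -> int:
    """Sum exon lengths, treating both endpoints as inclusive (an assumption)."""
    return sum(end - start + 1 for start, end in exons)


# Position string taken from the row for Pkey 401045 in Table 10C.
exons = parse_exons("90044-90184,91111-91345")
print(exons, total_exon_length(exons))
```

Entries whose digits were damaged in reproduction will fail to parse and are best checked against the cited sequence source by hand.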



Pkey Ref Strand Nt_position

400754 7331445 Plus 144559-144684 

401045 8117619 Plus 90044-90184,91111-91345 

401083 3242744 Plus 33192-33360 

402474 7547175 Minus 53526-53628,55755-55920,57530-57757 

402808 6456148 Minus 114964-115136,115461-115585,115931-116047,117666-117771,118004-118102

403021 7547270 Plus 120799-120966

403421 9665041 Minus 126609-126773,139986-140205 

403438 9719679 Plus 90792-90938 

403687 7387384 Plus 9009-9534 

403764 7717105 Minus 118692-118853 

404277 1834458 Minus 91665-91946 

404288 2769644 Plus 3512-3691

404394 3135305 Minus 37121-37205,37491-37762,4105M1140,41322-41593,41773-41919

404518 8151988 Plus 84494-84603 

404916 7341826 Plus 91057-91188 

405106 8079395 Minus 80877-81418 

405257 7329310 Plus 73121-73273

405381 6006920 Minus 7636-8054 
406387 9255180 Plus 116229-116371,117512-117651
TABLE 11A: Genes Distinguishing Adenocarcinoma from Other Lung Diseases and Normal Lung






Table 11A shows about 84 genes upregulated in lung adenocarcinomas relative to other lung tumors, non-malignant lung disease, and normal lung. These genes were selected
from about 59680 probesets on the Eos/Affymetrix Hu03 Genechip array.

Table 11B shows the accession numbers for those Pkey's lacking UnigeneID's for table 11A. For each probeset we have listed the gene cluster number from which the
oligonucleotides were designed. Gene clusters were compiled using sequences derived from Genbank ESTs and mRNAs. These sequences were clustered based on sequence
similarity using Clustering and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers for sequences comprising each cluster are listed in the
"Accession" column.

Table 11C shows the genomic positioning for those Pkey's lacking Unigene ID's and accession numbers in table 11A. For each predicted exon, we have listed the genomic
sequence source used for prediction. Nucleotide locations of each predicted exon are also listed.



Pkey: Unique Eos probeset identifier number
ExAccn: Exemplar Accession number, Genbank accession number
UnigeneID: Unigene number
Unigene Title: Unigene gene title

Pkey 

403329 

406399 

406690 

407869 

407881 

408908 

409103 

409187 

409269 

410076 

410102 

410399 

411908 

412612 

414075 

416208 

417542 

419183 

419502 

419631 

420931 

421155 

421190 

421474 

421515 

421582 

422026 

422095 

422311 

422867 

423472 

423554 

424502 

424544 

424905 

424960 

425523 

426230 

427701 

428585 

428758 

429170 

429263 

429610 

430508 

430985 

431548 

431566 

431986 

432375 

432677 

433556 

433819 

434001 

434424 

434792 

436217 

436749 

436972 

437866 

437935 

438915 

439451 



R1: Average of lung tumors (including squamous cell carcinomas, adenocarcinomas, small cell carcinomas, granulomatous and carcinoid tumors) divided by the
average of normal lung samples

R2: Average of non-malignant lung disease samples (including bronchitis, emphysema, fibrosis, atelectasis, asthma) divided by the average of normal lung samples



ExAccn UnigeneID



M29540 

AI827976

AW072003 

BE296227 

AF251237 

AF154830 

AA576953 

T05387 

AW248508



L27943 

NM_000047

U11862 

AW291168 

J04129 

U60669 

AU076704 

AW188117 

AF044197 

H87879 

U95031 

U76362 

Y11339 

AI910275 

U80736 

AI868872 

AF073515 

L32137 

AF041260 

M90516 

AF242388 

M88700

NM_002497

BE245380 

AB007948 

AA367019 

AA411101 

AB007863 

AA433988 

NM_001394

AA019004 

AB024937 

AI015435 

AA490232 

AI834273 

AF176012 

AA536130 

BE536069 

NM_004482

W56321 

AW511097 

AW950905 

AI811202 

AA549253 

T53925 

AA584890 

AA284679 

AA156781 

AW939591 

AA280174 

AF086270 



Hs.220529 

Hs.24391" 

Hs.40968 

Hs.250822 

Hs.112208

Hs.50966 

Hs.22972 

Hs.7991 

Hs.279727 

Hs.72924 
Hs.74131 
Hs.75741 
Hs.41295 
Hs.82269 
Hs.89663 

Hs.303154 
Hs.100431 
Hs.102267 
Hs.102462 
Hs.104637 
Hs.105352 

Hs.110826 

Hs.282804 

Hs.114948

Hs.1584 

Hs.129057

Hs.1674 

Hs.149585 

Hs.150403 

Hs.153704 

Hs.153952 

Hs.158244 

Hs.241395 

Hs.243886 

Hs.185140 

Hs.98502 

Hs.2359 

Hs.198396 

Hs.211092 

Hs.104637 

Hs.27323 

Hs.9711 

Hs.260720 

Hs.149018 

Hs.2962 

Hs.278611 

Hs.111460

Hs.112765

Hs.3697 

Hs.325335 

Hs.132458 

Hs.107 

Hs.5302 

Hs.25640 

Hs.5940 

Hs.285681 

Hs.278554 



Unigene Title 
Target Exon 

NM_003122*:Homo sapiens serine protease 
carcinoembryonic antigen-related cell ad
hypothetical protein FLJ13612
heparan sulfate (glucosamine) 3-O-sulfot
serine/threonine kinase 15
XAGE-1 protein 

carbamoyl-phosphate synthetase 1, mitoch

hypothetical protein FLJ13352

ESTs 

Homo sapiens cDNA FLJ14035 fis, clone HE
synuclein, gamma (breast cancer-specific
cytidine deaminase 

arylsulfatase E (chondrodysplasia puncta
amiloride binding protein 1 (amine oxida
ESTs, Weakly similar to MUC2_HUMAN MUCIN
progestagen-associated endometrial prote
cytochrome P450, subfamily XXIV (vitamin 
fibrinogen, A alpha polypeptide 
popeye protein 3 

small inducible cytokine B subfamily (Cy 
lysyl oxidase

mucin 5, subtype B, tracheobronchial 
solute carrier family 1 (glutamate trans 
GalNAc alpha-2, 6-sialyltransferase I, l
trefoil factor 1 (breast cancer, estroge 
trinucleotide repeat containing 9 
hypothetical protein FU22704 
cytokine receptor-like factor 1
cartilage oligomeric matrix protein (pse 
breast carcinoma amplified sequence 1 
glutamine-fructose-6-phosphate transamin
lengsin

dopa decarboxylase (aromatic L-amino aci

NIMA (never in mitosis gene a)-related k 

5' nucleotidase (CD73)

KIAA0479 protein 

protease, serine, 1 (trypsin 1) 

nuclear autoantigenic sperm protein (his

KIAA0403 protein 

hypothetical protein FLJ14303

dual specificity phosphatase 4 

ATP-binding cassette, sub-family A (ABC1 

LUNX protein; PLUNC (palate lung and nas 

ESTs 

ESTs, Weakly similar to I78885 serine/th
novel protein 

J domain containing protein 1 

Novel human gene mapping to chromosome 20

S100 calcium-binding protein P

UDP-N-acetyl-alpha-D-galactosamine:polyp

calcium/calmodulin-dependent protein kin

ESTs 

serine (or cysteine) proteinase inhibito 
Homo sapiens cDNA: FLJ23523 fis, clone L
ESTs 

fibrinogen-like 1

lectin, galactoside-binding, soluble, 4
claudin 3 

metallothionein 1E (functional) 
mucin 13, epithelial transmembrane 
Williams-Beuren syndrome chromosome regi
heterochromatin-like protein 1



R1 


R2 


1.00 


61.00 


1.00 


39.00 


226.37 


350.00 


0.77 


1.18 


1.00 


10.00 


7.76 


100 


80.44 


40.00 


1.00 


1.00 


1.00 


1.00 


1.12 


1.50 


9.89 


1.00 


0.92 


1.06 


1.00 


1.00 


1.02 


1.03 


0.84 


1.07 


3.67 


1.00 


1.28 


1.35 


1.00 


1.00 


13.05 


115.00 


1.00 


13.00 


1.00 


8.00 


1.00 


15.00 


1.17 


1.55 


1.46 


1.76 


1.00 


3.00 


1.23 


1.00 


1.00 


52.00 


4.37 


2.34


1.15 


1.78 


1.69 


3.17 


48.13 


72.00 


1.00 


50.00 


1.00 


1.00 


1.00 


59.00 


21.35 


1.00 


1.00 


1.00 


1.00 


35.00 


1.00 


83.00 


7.41


34.00 


1.00 


6.00 


1.05 


1.13 


16.18 


105.00 


1.07 


1.00 


1.59 


1.69 


4.75 


7.27 


0.94 


1.28 


5.66 


15.00 


49.76 


37.00 


1.19 


1.47 


1.65 


1.06 


1.00 


48.00 


1.00 


19.00 


3.71 


8.00 


29.31 


72.00 


1.00 


64.00 


8.52 


44.00 


57.97 


31.00 


1.10 


1.41 


1.59 


1.46 


3.62 


101.00 


1.60 


1.39 


1.00 


1.00 


23.28 


52.00 
439759


AL359055 


Hs.67709 


Homo sapiens mRNA full length insert cDN


1.00 


21.00 


441031 


AJ110664 


Hs.7645 


fibrinogen, B beta polypeptide


1.41 


99.00 


441377 


BE218239 


Hs.202658 


ESTs 


22.03 


1.00 


443614 


AV655386 


Hs.7645 


fibrinogen, B beta polypeptide 


1.00 


16.00 


443813 


AA876372 


Hs.93961 


Homo sapiens mRNA; cDNA DKFZp667O095 (fr


1.20 


1.99 


443991 


NM_002250


Hs.10082 


potassium intermediate/small conductance


5.71 


6.87 


444670 


H58373 


Hs.332938 


hypothetical protein MGC5370


1 Oft 

1.30 


38.00 


444931 


AV652066 


Hs.75113 


general transcription factor IIIA 


1.00 


54.00 


446102 


AW168067 


Hs.317694 


ESTs 


1.00 


1.00 


446163 


AA026880 


Hs.25252 


Homo sapiens cDNA FLJ13603 fis, clone PL


1.00 


36.00 


446469 


BE094848 


Hs.15113 


homogentisate 1,2-dioxygenase (homogenti


1.00 


11.00 


447388 


AW630534 


Hs.76277 


Homo sapiens, clone MGC:9381, mRNA, comp


1.24 


1.16 


447532 


AK000614 


Hs.18791


hypothetical protein FLJ20607


1.23 


1.63 


446243 


AW369771 


Hs.52620 


integrin, beta 8 


15.84 


1.00 


448844 


AI581519


Hs.177164 


ESTs 


1.00 


31.00 


449444 


AW818436 


Hs^3590 


solute carrier family 16 (monocarboxylic


1.00 


83.00 


451807 


W52854 




hypothetical protein FLJ23293 similar to


1.55 


35.00 


452689 


F33868 


Hs.284176 


transferrin 


1.54 


1.44 


453392 


U23762 


Hs.32964


SRY (sex determining region Y)-box 11


1.00


16.00 


453464 


AI884911 


Hs.32989 


receptor (calcitonin) activity modifying 


1.55 


145 


453735 


AI065629 


Hs.125073 


ESTs 


1.01 


1.30 



TABLE 11B 

Pkey: Unique Eos probeset identifier number
CAT number: Gene cluster number 
Accession: Genbank accession numbers 



Pkey CAT Number Accession 

410399 11995_1 BE068889 BE068882 AF044311 AF017256 NM_003087 AF037207 AF010126 AA633976 AA872836 BE298825 BE299889 AHJ16464 AI684600

AI936527 AA804675 AA394097 AI139933 AA946606 BE171313 AA722407 AA293803 AI468480 AA056035 AA055968 AW796957 AI637713
AA410737 H49348AA48M72M411094AA235594AM02624 M443638AW452137M421708AW265 

419502 18535_1 AU076704 T74854 T74860 T72098 T73265 T73873 T69180 T74658 T58786 T60385 T73410 T68781 T67845 T67593 T73952 T67864 T60630

T68367 T68401 T53959 T72360 T72099 T60377 T58961 ni712 T72821 T64738 T74645 T72037 T68688 T72063 T73258 T72826 T64242 
T68220 T74673 T71800 T68355 T61227 T62738 T69317 T53850 T64692 T73768 T73952 T73382 T68914 T70975 T73400 T60531 T73277 
T73203 T70498 T61409 T58925 NM.Q00508 M54982 T68301 T73729 T69445 T60424 T67922 T67736 T66716T67755 T74765 T73819 T58719 
T74756 T60477 T74863 T61 109 T68329 T58850 T71857 T73425 T53736 T68607 T58898 T64309 T72031 T72079 T64305 T71908 T68107 
T71916 T73787 T56035 T64425 T71870 T60476 T61376 T67820 T71 895 T41006 T69441 T68170 T74617 T71958 T69440 T61875 R06796 
H48353 T71914 T53939 T641 21 AA693996 T72525 T67779 T68078 AA01 1465 AA345378 AV654847 AV654272 AV656001 A1064740 T82897 
N33594 M344542 AW805054 A1207457 T61743 AA026737 H94389 AA382695 AA918409 T68044 S82092 T39959 AI017721 AA312395 
AA312919 T40156 H66239 AV652989 H38728 R98521 AV655200 R95790 W03250 W00913 AA344136 AV660126 R97923 AA343596 
AW470774 AV651256 N54417 AA812862 AW182929 All 1 1 192 H61463 H72060 AA344503 H38539 AI27751 1 AV66 1 1 08 AI207625 T47810 
AA235252 T27853 T47778 R95746 H70620 AA701463 AW827166 R98475 C20925 AV657287 T71959 T71313 T73920 T73333 T61 618 T69293 
T69283 T73931 n2178 T72456 AV645639 AV653476 T72957 n2300 T58906 T71457 T70494 T72956 T70495 T68267 T74407 T85778 
AA344726 T27854 T74485 T74101 T73868 T71518 T72304 AA343853 T73909 T68070 T72065 H72149 T73493 T73495 AV645993 R02293 
T70475 T64751 AA344441 AA343657 AA345732 AA344328 AM 10639 AA344603 AF06351 3 T64696 T6851 6 T72223 T60507 T67633 R29500 
T72517 R02292 T60599 T69206 T70452 T74677 R29366 T61277 T74914 T60352 R29675 T74843 AV645792 AA344408 T69197 T72057 
T69368 T69358 T6B258 AV650429 T73341 T61702 T74598 T40095 K02272 T40106 AA343045 AA341908 AA341907 AA342807 AA341964 
T53747 T72042 T62764 AI064899 AA343060 T67832 T72440 T71770 T68091 T69108 T72449 T69167 771289 T68251 AV654844 T64375 
AA345234 T67598 AA01 1 414 T68036 H48262 AI207557 T68219 W86031 T69081 T64232 R93196 T62136 AV650539 H67459 T72978 
AA344583 T60362 H58121 T95711 T72803 T68055 T7171 5 R29036 T72793 T69122 T64595 T62888 T691 39 T68291 T64652 T67971 T46862 
AA693592 AI248502 R29454 T64764 T57001 T73052 T71429 T51176 T58866 AV655414 H90426 AA342489 T73666 T67848 T72512 T53835 
T67837 T7331 7 T74273 T69420 T68245 T74380 T67862 T74474 T56068 

421582 2041_1 AI910275 X00474 X52003 X05030 NM_003225 AA314326 AA308400 AA506787 AA314825 AI571948 AA507595 M614579 AA587613 R83818

AA568312 AA614409 AA307578 A1925552 AW950155 AI910083 M12075 BE074052 AW004668 AA578674 M582084 BE074053 BE074126 
BE074140 AA514776 AA588034 BE074051 8E074068 AW009769 AW050690 AA858276 R55389 AI001051 AW050700 AW750216 AA614539 
BE074045 A1307407 AW602303 BE073575 A1202532 AA524242 AI970839 AI909751 BE076078 A1909749 R55292 

437866 44433_2 AA156781 AW293839 U52054 AA024963 AA778446 BE073977 AW444904 AW602574 BE164040 BE164012 BE163972 BE163974 BE163992

AA837461 AW468444 BE185091 AW468002 AA687333 AA81 1830 AA581806 AI866666 AI572124 M043777 AA040926 020160 AI536733 
AA812489 AW874142 A1471883W84421 AA156850 

451807 8865_1 W52854 AL117600 BE208116 BE208432 BE206239 BE082291 AW953423 AA351619 BE180648 BE140560 W60080 AA865478 N90291

AW450652 AW4495 1 9 AA993634 AI806539 AA351616 AW449522 AI827626 AA904788 AA380381 AA886045 AA774409 BE003229 Z4 1756 



TABLE 11C 

Pkey: Unique number corresponding to an Eos probeset

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA
sequence of human chromosome 22," Dunham I. et al., Nature (1999) 402:489-495.

Strand: Indicates DNA strand from which exons were predicted.

Nt_position: Indicates nucleotide positions of predicted exons.

Pkey Ref Strand Nt_position

403329 8516120 Plus 96450-96598 

406399 9256288 Minus 63448-63554 
TABLE 12A: Genes Distinguishing Squamous Cell Carcinoma from Other Lung Diseases and Normal Lung

Table 12A shows about 72 genes upregulated in squamous cell carcinomas of the lung relative to other lung tumors, non-malignant lung disease, and normal lung. These genes
were selected from about 59680 probesets on the Eos/Affymetrix Hu03 Genechip array.

Table 12B shows the accession numbers for those Pkey's lacking UnigeneID's for table 12A. For each probeset we have listed the gene cluster number from which the
oligonucleotides were designed. Gene clusters were compiled using sequences derived from Genbank ESTs and mRNAs. These sequences were clustered based on sequence
similarity using Clustering and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers for sequences comprising each cluster are listed in the
"Accession" column.

Table 12C shows the genomic positioning for those Pkey's lacking Unigene ID's and accession numbers in table 12A. For each predicted exon, we have listed the genomic
sequence source used for prediction. Nucleotide locations of each predicted exon are also listed.
Hs.11261
4.43


2.90
lymphocyte antigen 6 complex, locus D
1.09
ESTs
1.00


1.00
interleukin-1 homolog 1


1.00
KIAA1313 protein
ESTs


16.76
ESTs
Rhr^eC glycoprotein
TABLE 12B

Pkey: Unique Eos probeset identifier number
CAT number: Gene cluster number
Accession: Genbank accession numbers



Pkey CAT Number Accession 

439285 47065_1 AL133916 N79113 AF085101 N76721 AW950828 AA364013 AW955684 AI346341 AI867454 N54784 AI655270 AJ421279 AW014882

AA775552 N62351 N59253 AA626243 AI341407 BE175639 AA456968 AI358918 AA457077



TABLE 12C 



Pkey: Unique number corresponding to an Eos probeset

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA
sequence of human chromosome 22," Dunham I. et al., Nature (1999) 402:489-495.
Strand: Indicates DNA strand from which exons were predicted.
Nt_position: Indicates nucleotide positions of predicted exons.

Pkey Ref Strand Nt_position

400666 6118496 Plus 17982-18115,20297-20456

401780 7249190 Minus 28397-28617,2892&29045,29135-29296,29411-2^

401781 7249190 Minus 83215-83435,83531-83656,83740-83901,842^^

401785 7249190 Minus 165776-165996,166189-166314,166408-166569,167112-167268,167387-167469,168634-168942

401994 4153858 Minus 42904-43124,43211-43336,44607-44763,45199-45281,46337-46732

402075 8117407 Plus 121907-122035,122804-122921,124019-124161,124455-124610,125672-126076

404996 6007890 Plus 37999-38145,38652-38998,39727-39872,40557-40674,42351-42450
TABLE 13A: Genes Distinguishing Non-Malignant Lung Disease from Lung Tumors and Normal Lung

Table 13A shows about 23 genes upregulated in non-malignant lung disease relative to lung tumors and normal lung. These genes were selected from about 59680 probesets on
the Eos/Affymetrix Hu03 Genechip array.

Table 13B shows the accession numbers for those Pkey's lacking UnigeneID's for table 13A. For each probeset we have listed the gene cluster number from which the
oligonucleotides were designed. Gene clusters were compiled using sequences derived from Genbank ESTs and mRNAs. These sequences were clustered based on sequence
similarity using Clustering and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers for sequences comprising each cluster are listed in the
"Accession" column.

Table 13C shows the genomic positioning for those Pkey's lacking Unigene ID's and accession numbers in table 13A. For each predicted exon, we have listed the genomic
sequence source used for prediction. Nucleotide locations of each predicted exon are also listed.

Pkey: Unique Eos probeset identifier number

ExAccn: Exemplar Accession number, Genbank accession number

UnigeneID: Unigene number

Unigene Title: Unigene gene title

R1: Average of lung tumors (including squamous cell carcinomas, adenocarcinomas, small cell carcinomas, granulomatous and carcinoid tumors) divided by the
average of normal lung samples

R2: Average of non-malignant lung disease samples (including bronchitis, emphysema, fibrosis, atelectasis, asthma) divided by the average of normal lung samples
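
The R1 and R2 definitions above are simple ratios of group averages. The following Python sketch shows one way such ratios could be computed for a single probeset. The function name `fold_change_ratios` and the expression values are invented for illustration only; the averaging is assumed to be an arithmetic mean, which the legend does not specify.

```python
from statistics import mean
from typing import Sequence, Tuple


def fold_change_ratios(
    tumor: Sequence[float],
    non_malignant: Sequence[float],
    normal: Sequence[float],
) -> Tuple[float, float]:
    """Return (R1, R2) for one probeset.

    R1 = mean(tumor samples) / mean(normal lung samples)
    R2 = mean(non-malignant disease samples) / mean(normal lung samples)
    """
    normal_avg = mean(normal)
    if normal_avg == 0:
        raise ValueError("average of normal lung samples is zero")
    return mean(tumor) / normal_avg, mean(non_malignant) / normal_avg


# Made-up expression values, not taken from the tables.
r1, r2 = fold_change_ratios(
    tumor=[120.0, 80.0],
    non_malignant=[30.0, 50.0],
    normal=[20.0, 20.0],
)
print(r1, r2)
```

Under these definitions, a probeset of the kind listed in Table 13A would show a large R2 together with an R1 near 1.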
Hs.131987


ESTs 


1.00 


167.00 


426753 


T89832 


Hs.170278 


ESTs 


1.00 


141.00 


429496 


AA453800 


Hs.192793 


ESTs 


1.00 


moo 


430719 


AA488988 


Hs.293796 


ESTs 


1.00 


133.00 


431089 


BE041395 




ESTs, Weakly similar to unknown protein 


23.32 


941.00 


431385 


BE178536 


Hs.11090 


membrane-spanning 4-domains, subfamily A 


1.00 


157.00 


431728 


NM_007351


Hs.268107


multimerin


1.00 


157.00 


436532 


AA721522 




gbmv54h12.r1 NCLCGAP.Ewl Homo sapiens 


1.00 


2iaoo 


437960


AI669586


Hs.222194 


ESTs 


1.00 


147.00 


438202 


AW169287 


Hs.22588 


ESTs 


1.00 


141.00 


441499 


AW298235 


Hs.101689 


ESTs 


1.00 


167.00 


444513 


AL120214


Hs.7117


glutamate receptor, ionotropic, AMPA 1 


1.00 


151.00 


448253 


H25899 


Hs.201591 


ESTs 


1.00 


141.00 


453636 


R67837 


Hs.169872 


ESTs 


1.00


116.00


458332 


AI000341 


Hs.220491 


ESTs 


1.00 


192.00


459587 


AA031956 




gb:zk15e04.s1 Soares_pregnant_uterus_NbH


1.00 


154.00 



TABLE 13B 

Pkey: Unique Eos probeset identifier number

CAT number: Gene cluster number

Accession: Genbank accession numbers

Pkey CAT Number Accession

431089 327825_1 BE041395 AA491826 AA621946 AA715980 AA666102

436532 421802_1 AA721522 AW975443 T93070



TABLE 13C 

Pkey: Unique number corresponding to an Eos probeset

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA
sequence of human chromosome 22," Dunham I. et al., Nature (1999) 402:489-495.

Strand: Indicates DNA strand from which exons were predicted.

Nt_position: Indicates nucleotide positions of predicted exons.

Pkey Ref Strand Nt_position

402075 8117407 Plus 121907-122035,122804-122921,124019-124161,124455-124610,125672-126076 






WO 02/086443 PCT/US02/12476 

TABLE 14A: Preferred Utility and Subcellular Localization for Potential Lung Disease Targets

Table 14A shows the subcellular localization and preferred utility for the genes appearing in Tables 9A and 10A. mAb symbolizes monoclonal antibody, diag symbolizes
diagnostic, s.m. symbolizes small molecule, and CTL symbolizes cytotoxic lymphocyte ligand. These genes were selected from 59680 probesets on the Eos/Affymetrix Hu03
Genechip array.

Table 14B show the accession numbers for those Pke/s lacking UnigenelD's for table 14A. For each probeset we have listed the gene cluster number from which the 
oOgonudeofides were designed. Gene dusters were comp&ed using sequences derived from Genbank ESTs and mRNAs. These sequences were clustered based on sequence 
similarity using Clustering and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers for sequences comprising each cluster are feted in the 
"Accession' column. 

Table 14C show the genomic positioning for those Pkey's lacking Unigene ID'S and accession numbers in table 14A. For each predicted exon, we have listed the genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also listed. 



Ptey: Unique Eos probeset Identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigenelD: Vntgene number 

Unigene Title: Unigene gene title 

PreWtiiity: Preferred Utifity 

Pred.Loc: Predicted subcellular localization 



Pkey 


ExAccn 


UnigeneID


Unigene Title 


Pref Utility 


Pred. Loc 


400289 


X07820 


Hs.2258 


matrix metalloproteinase 10 (stromelysin


mAb & diag & s.m. 


extracellular 


400303 


AA242758 


Hs.79136 


UV-1 protein, estrogen regulated 


mAb 


plasma membrane 


402075 






ENSP00000251056*:Plasma membrane calcium mAb & diag 


secreted 


407811 


AW190902 


Hs.40098


cysteine knot superfamily 1, BMP antagon


diag 


secreted 


408243 


Y00787 


Hs.624 


interleukin 8 


diag 


secreted 


408790 


AW580227 


Hs.47860 


neurotrophic tyrosine kinase, receptor, 


mAb & s.m. 


plasma membrane 


408908 


BE296227 


Hs.250822 


serine/threonine kinase 15 


s.m. 


cytoplasm 


409041 


AB033025 


Hs.50081 


Hypothetical protein, XP_051860 (KIAA119


CTL & diag 


secreted 


409103 


AF251237 


Hs.112208 


XAGE-1 protein 


CTL 


nuclear 


409420 


Z15008 


Hs.54451 


laminin, gamma 2 (nicein (100kD), kalini 


diag 


secreted 


409632 


W74001 


Hs.55279 


serine (or cysteine) proteinase inhibito 


diag 


secreted 


409757 


NM_001898


Hs.123114 


cystatin SN


diag


extracellular


409893


AW247090


Hs.57101


minichromosome maintenance deficient (S.


CTL


nuclear 


409356 


AW103364 


Hs.727 


inhibin, beta A (activin A, activin AB a


diag


extracellular


410001


AB041036


Hs.57771


kallikrein 11


diag


extracellular 


410407 


X66839 


Hs.63287 


carbonic anhydrase IX


mAb & s.m. 


plasma membrane 


410418 


D31382 


Hs.63325 


transmembrane protease, serine 4 


mAb & diag & s.m. 


plasma membrane 


412140 


AA219691 


Hs.73625 


RAB6 interacting, kinesin-like (rabkines


s.m. 




412719 


AW016610 


Hs.816 


ESTs 


s.m.


nuclear 


414774 


X02419 


Hs.77274 


plasminogen activator, urokinase


diag 


extracellular 


414883 


AA926960 




CDC28 protein kinase 1


s.m. 




415138 


C18356 


Hs.295944 


tissue factor pathway inhibitor 2


CTL & diag


extracellular 


415669 


NM_005025


Hs.78589


serine (or cysteine) proteinase inhibito


mAb & diag & s.m.


secreted 


415817 


U88967 


Hs.78867 


protein tyrosine phosphatase, receptor-t


mAb & s.m.


plasma membrane 


416658 


U03272 


Hs.79432 


fibrillin 2 (congenital contractural ara


diag


extracellular 


417034 


NM_006183


Hs.80962


neurotensin


diag 


extracellular 


417079 


U65590 


Hs.81134 


interleukin 1 receptor antagonist


diag 


extracellular 


417308 


H60720 


Hs.81892 


KIAA0101 gene product


s.m. 


mitochondrial 


417389 


BE260964 


Hs.82045 


midkine (neurite growth-promoting factor


mAb & diag


secreted 


417433 


BE270266 


Hs.82128 


5T4 oncofetal trophoblast glycoprotein 


mAb 


plasma membrane 


417933 


X02308 


Hs.82962 


thymidylate synthetase


s.m. 


endoplasmic reticulum 


418478 


U38945 


Hs.1174 


cyclin-dependent kinase inhibitor 2A (me


s.m. 


cytoplasm 


418506 


AA084248 


Hs.85339 


G protein-coupled receptor 39


mAb & s.m. 


plasma membrane 


418678 


NM_001327


Hs.167379


cancer/testis antigen (NY-ESO-1) 


CTL 


cytoplasmic 


419121 


AA374372 


Hs.89626 


parathyroid hormone-like hormone 


diag 


secreted 


419171 


NM_002846


Hs.89655


protein tyrosine phosphatase, receptor t 


mAb & s.m. 


plasma membrane 


419183 


U60669 


Hs.89663 


cytochrome P450, subfamily XXIV (vitamin


CTL & s.m.


mitochondrial


419216 


AU076718 


Hs.164021


small inducible cytokine subfamily B (Cy 


diag 


secreted 


419235 


AW470411 


Hs.288433 


neurotrimin 


mAb & diag 


plasma membrane 


419452 


U33635 


Hs.90572 


PTK7 protein tyrosine kinase 7 


mAb & s.m. 


plasma membrane 


419556 


U29615 


Hs.91093 


chitinase 1 (chitotriosidase) 


mAb & diag


extracellular


420610 


AI683183


Hs.99348 


distal-less homeo box 5 


CTL 


nuclear 


421110 


AJ250717 


Hs.1355 


cathepsin E 


s.m. & diag


extracellular 


421379 


Y15221 


Hs.103982 


small inducible cytokine subfamily B (Cy 


diag 


secreted 


421474 


U76362 


Hs.104637


solute carrier family 1 (glutamate trans 


mAb & s.m. 


plasma membrane 


421552 


AF026692 


Hs.105700 


secreted frizzled-related protein 4 


diag 


secreted 


421753 


BE314828 


Hs.107911 


ATP-binding cassette, sub-family B (MDR/


mAb & s.m. 


plasma membrane 


421817 


AF146074 


Hs.108660 


ATP-binding cassette, sub-family C (CFTR 


mAb & s.m. 


plasma membrane 


422109 


S73265 


Hs.1473 


gastrin-releasing peptide 


diag 


secreted 


422158 


L10343 


Hs.112341


protease inhibitor 3, skin-derived (SKAL 


diag 


secreted 


422282 


AF019225 


Hs.114309


apolipoprotein L


diag 


secreted 


422283 


AW411307 


Hs.114311 


CDC45 (cell division cycle 45, S.cerevis


s.m. 


nuclear 


422424 


AI186431 


Hs.296638


prostate differentiation factor 


diag 


extracellular 


422765 


AW409701 


Hs.1578 


baculoviral IAP repeat-containing 5 (sur


s.m. 


cytoplasm 


422809 


AK001379 


Hs.121028 


hypothetical protein FLJ10549


s.m. 


nuclear 


422867 


L32137 


Hs.1584 


cartilage oligomeric matrix protein (pse 


diag 


extracellular 


422956 


BE545072 


Hs.122579


ECT2 protein (Epithelial cell transformi 


CTL & s.m. 




423634 


AW959908 


Hs.1690 


heparin-binding growth factor binding pr


diag 




423673 


BE003054 


Hs.1695 


matrix metalloproteinase 12 (macrophage 


mAb & diag & s.m.


secreted 


423961 


D13666


Hs.136348


periostin (OSF-2os)


mAb & diag


extracellular 


424046 


AF027866 


Hs.138202


serine (or cysteine) proteinase inhibito 


diag 


secreted 


424381 


AA285249 


Hs.146329 


protein kinase Chk2 


s.m.


nuclear 
424502


AF242388


Hs.149585


424503 


NM_002205


Hs.149609 


424587 


J05070 


Hs.151738 


425247 


NM_005940


Hs.155324


425322 


U63630 


Hs.155637


425650


NM_001944


Hs.1925 


425734 


AF056209 


Hs.159396 


425776 


U25128 


Hs.155499 


425852 


AK001504 


Hs.159651


426215 


AW963419 


Hs.155223


426427 


M86699 


Hs.169840 


426514 


BE616633 


Hs.170195 


427335 


AA448542 


Hs.251677 


427747 


AW411425 


Hs.180655 


428242 


K55709 


Hs.2250 


428330 


L22524 


Hs.2256 


428450 


NM_014791


Hs.184339


428479 


Y00272 


Hs.334562 


428484 


AF104032 


Hs.184601 


426664 


AK001666 


Hs.189095 


428698 


AA852773 


Hs.334838 


428748 


AW593208 


Hs.98785 


428758 


AA433988 


Hs.98502 


428969 


AF120274 


Hs.194689 


429211 


AF052693 


Hs.198249 


429263 


M019004 


Hs.198396


429547 


AW009166 


Hs.99376 


429610


AB024937 


Hs.211092 


429903 


AL134197 


Hs.93597 


430486 


BE062109 


Hs.241551 


431462 


AW583672 


Hs.256311 


431515 


NM_012152


Hs.258583 


431846 


BE019924 


Hs.271580 


431958 


X63629 


Hs.2877 


432201 


AI538613 


Hs.298241 


433001 


AF217513 


Hs.279905 


435505 


AF200492 


Hs.211238 


436481


AA379597


Hs.5199 


437016 


AU076916 


Hs.5398 


437044 


AL035864 


Hs.69517 


437789 


AI581344


Hs.127812


437852 


BE001836 


Hs.256897


439223 


AW238299 


Hs.250618 


439477 


W69813 


Hs.58042 


439606 


W79123 


Hs.58561 


439738 


BE246502 


Hs.9598 


440006 


AK000517 


Hs.6844 


441362 


BE614410 


Hs.23044 


442117 


AW664964 


Hs.128899


443247 


BE614387 


Hs.333893 


443426 


AF098158 


Hs.9329 


443859 


NM_013409


Hs.9914 


444006 


BE395085 


Hs.10086


444371 


BE540274 


Hs.239 


444381 


BE387335 


Hs.283713 


444781 


NM_014400


Hs.11950 


445537 


AJ245671 


Hs.12844


446619 


AU076643 


Hs.313 


446921 


AB012113 


Hs.16530


447033 


AI357412


Hs.157601


447342 


AI199268


Hs.19322


448243 


AW369771 


Hs.52620 


448844 


AI581519 


Hs.177164 


449048 


Z45051 


Hs.22920 


449722 


BE280074 


Hs.23960 


450001 


NM_001044


Hs.406 


450375 


AA009647 




450701 


H39960 


Hs.288467 


450983 


AA305384 


Hs.25740 


451668 


Z43948 


Hs.326444 


452281 


T93500 


Hs.28792 


452401 


NM_007115


Hs.29352 


452747 


BE153855


Hs.61460 


452838 


U65011 


Hs.30743 


453968 


AA847843 


Hs.62711 


457489 


AI693815 


Hs.127179 



integrin, alpha 5 (fibronectin receptor, 
matrix metalloproteinase 9 (gelatinase B 
matrix metalloproteinase 11 (stromelysin
protein kinase, DNA-activated, catalytic
desmoglein 3 (pemphigus vulgaris antigen
peptidylglycine alpha-amidating monooxyg
parathyroid hormone receptor 2
death receptor 6, TNF superfamily member
stanniocalcin 2
TTK protein kinase 

bone morphogenetic protein 7 (osteogenic 
G antigen 7B 

serine/threonine kinase 12 

leukemia inhibitory factor (cholinergic 

matrix metalloproteinase 7 (matrilysin,

KIAA0175 gene product 

cell division cycle 2, G1 to S and G2 to 

solute carrier family 7 (cationic amino

similar to SALL1 (sal (Drosophila)-like

KIAA1866 protein

Ksp37 protein 

CA125 antigen; mucin 16 



gap junction protein, beta 5 (connexin 3 
ATP-binding cassette, sub-family A (ABC1 
ESTs 

LUNX protein; PLUNC (palate lung and nas 
cyclin-dependent kinase 5, regulatory su 
chloride channel, calcium activated, fam 
granin-like neuroendocrine peptide precu
endothelial differentiation, lysophospha 
uroplakin IB 

cadherin 3, type 1, P-cadherin (placenta 
Transmembrane protease, serine 3 
clone HQ0310PRO0310p1 
interleukin-1 homolog 1 
HSPC150 protein similar to ubiquitin-con
guanine monphosphate synthetase
differentially expressed in Fanconi's an
ESTs, Weakly similar to T17330 hypotheti
ESTs, Weakly similar to dJ365012.1 [H.sa
UL16 binding protein 2

ESTs, Moderately similar to GFR3_HUMAN G
G protein-coupled receptor 87 
sema domain, immunoglobulin domain (Ig), 
NALP2 protein; PYRIN-Containing APAF1-D 
RAD51 (S. cerevisiae) homolog (E coli Re
ESTs; hypothetical protein for IMAGE447 
c-Myc target JP01 

chromosome 20 open reading frame 1 
follistatin

type I transmembrane protein Fn14
forkhead box M1

ESTs, Weakly similar to S64054 hypotheti 
GPI-anchored metastasis-associated prote 
EGF-like-domain, multiple 6 
secreted phosphoprotein 1 (osteopontin, 
small inducible cytokine subfamily A (Cy 
ESTs 

Homo sapiens, Similar to RIKEN cDNA 2010 

integrin, beta 8 

ESTs 

similar to S68401 (cattle) glucose induc
cyclin B1

solute carrier family 6 (neurotransmitte
a disintegrin and metalloproteinase doma
hypothetical protein XP_098151 (leucine-
ERO1 (S. cerevisiae)-like
cartilage acidic protein 1
Homo sapiens cDNA FLJ11041 fis, clone PL
tumor necrosis factor, alpha-induced pro 
Ig superfamily receptor LNIR 



High mobility group (nonhistone chromoso 
cryptic gene 



s.m.


cytoplasmic


mAb & s.m. 


plasma membrane 


mAb & diag & s.m.


extracellular 


secreted 


s.m. 


cytoplasmic


mAb 


plasma membrane 


s.m.




mAb & diag


plasma membrane 


mAb & s.m. 


plasma membrane 


mAb & diag


secreted 


CTL & s.m.


nuclear 


mAb & diag


secreted 


CTL 


cytoplasmic 


s.m. 


cytoplasmic 


diag 




mAb & diag & s.m. 


extracellular 


s.m. 


nuclear 


s.m.


nuclear 


mAb & s.m.


plasma membrane


CTL & s.m. 


nuclear 


mAb 




diag 


extracellular 


diag


mitochondrial


diag


extracellular


mAb & s.m.


plasma membrane 


mAb & s.m.


plasma membrane 


diag 


secreted 


mAb & diag


secreted 


s.m.




mAb & s.m. 


plasma membrane


diag


extracellular 


mAb & s.m. 


plasma membrane 


mAb & diag


plasma membrane


mAb & diag


plasma membrane 


mAb & diag & s.m. 


plasma membrane 


s.m. 
diag


nuclear 
secreted 


s.m.




s.m.


cytoplasm


CTL 


ER 


CTL 


nuclear 


mAb & s.m.


plasma membrane


mAb 


plasma membrane 


mAb & s.m. 




mAb & s.m. 


plasma membrane 


mAb & s.m.


plasma membrane 


s.m.


nuclear 


s.m. 




mAb & s.m.


plasma membrane


CTL


extracellular


CTL 




diag


extracellular


mAb


plasma membrane


s.m. 


nuclear 


diag


secreted


mAb & diag


plasma membrane


mAb & diag


secreted 


diag 


secreted 


diag 


extracellular 


CTL & diag 


secreted 


CTL 




mAb & s.m.


plasma membrane 


mAb & s.m. 




mAb 


plasma membrane 


s.m. 


cytoplasm 


mAb & s.m.


plasma membrane


mAb & diag & s.m.


plasma membrane


mAb & diag


plasma membrane 
secreted 


diag 

mAb & diag


plasma membrane


diag 
diag 


extracellular 


mAb 


plasma membrane


CTL 


nuclear 


CTL & s.m. 


nuclear 


diag 


secreted 



TABLE 14B 



Pkey: Unique Eos probeset identifier number 
CAT number: Gene cluster number
Accession: Genbank accession numbers



Pkey CAT Number Accession
414883 15024.1 AA926960 AA926959 W76521 W24270 W21526 AA037172 BE267636 H83186 AA469909 N86396 AA001348 BE535736 AA081745 BE556245

AA082436 H72525 H77575 N49786 W80565 H78746 BE569085 W04339 R98127 T55938 BE279271 AW960304 T29812 AA476873 BE297387
AA292753 AA177048 NM_001826 X54941 BE314366 AA908783 AI719075 BE270172 BE289819 AA889955 AI204630 W25243 AI935150
AA872039 W72395 T99630 AI422691 H98460 N31428 BE255916 H03265 AI857576 AA776920 AA910644 AA459522 AA293140 AW514667
R75953 AW662396 AA662522 AI865147 AM23153 AW262230 AA584410 AA583187 AW024595 AW069734 AI828998 AA282997 AA875046
AW613002 AA527373 AW972459 AI831360 AA521337 AA100926 AA772418 AA594628 AI033892 W95096 AI034317 AA398727 AI085031
N95210 AI459432 AI041437 AA932124 AA627684 AA935829 AI004827 AI423513 AI094597 H42079 R54703 AI830359 AA617681 AA978045
AA643280 W44561 AI991988 AI537692 AJ090262 AA740817 AI312104 AI911822 AA416871 AI185409 AA129784 AA701623 AI075239
AI139549 AA633648 AI339996 AI336880 AA399239 AJ078708 AI085351 AI362835 AI346818 AI146955 AI989380 AI348243 N92892 AA785850
AI494230 AI278887 AA962596 AI492600 W80435 AA001979 R97424 AI129015 N24127 AA157451 AA235549 AA459292 AA037114 AA129785
W494211 AW059601 AW886710 R92790 N59755 AI3$112B AW589407 H47725 H97534 H48076 H48450 T99631 AW300758 H03431 R76789
AA954344 H77576 R96823 AI457100 N92845 N49682 H42038 BE220698 BE220715 H99552 AA701624 N74173 R54704 H79520 H72923
H03266 BE261919 AA769833 AA480310 AA507454 AA910586 AI203723 AW104725 W25611 W25071 T88980 H03513 T77589 R99156
W95095 R97470 AA702275 T77551 AA911952 H82956 N83673 AA283872

450375 83327.1 AA009647 AA131254 AA374293 AW954405 H04410 AW506284 AA151168 BE157467 BE157601 H04384 W46291 AW663674 H04021 H01532

AA190993 H03231 H59605 H01642 AA852876 AA113758 AA626915 AA746952 AJ161014 AA099554 R69067



TABLE 14C 

Pkey: Unique number corresponding to an Eos probeset 

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA

sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495.

Strand: Indicates DNA strand from which exons were predicted.

Nt_position: Indicates nucleotide positions of predicted exons.

Pkey Ref Strand Nt_position

402075 8117407 Plus 121907-122035,122804-122921,124019-124161,124455-124610,125672-126076 
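
As an illustration only, a row of Table 14C can be split into its four columns and the Nt_position field expanded into individual exon intervals. The sketch below is not part of the original disclosure; the function name `parse_exon_positions` is hypothetical, and it assumes that each interval is written as start-end with both ends inclusive.

```python
def parse_exon_positions(nt_position: str) -> list[tuple[int, int]]:
    """Parse 'start-end,start-end,...' into a list of (start, end) tuples."""
    exons = []
    for span in nt_position.split(","):
        start, end = span.strip().split("-")
        exons.append((int(start), int(end)))
    return exons


# The single data row of Table 14C: Pkey, Ref (GI number), Strand, Nt_position.
row = ("402075 8117407 Plus "
       "121907-122035,122804-122921,124019-124161,124455-124610,125672-126076")
pkey, ref, strand, nt_position = row.split()

exons = parse_exon_positions(nt_position)

# Assumption: coordinates are inclusive, so length = end - start + 1.
exon_lengths = [end - start + 1 for start, end in exons]
total_exonic_length = sum(exon_lengths)
```

Under the inclusive-coordinate assumption, the five predicted exons for Pkey 402075 have lengths 129, 118, 143, 156 and 405 nucleotides.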
TABLE 15A: Information for all sequences in Table 16

Table 15A shows the Seq ID No, Pkey, ExAccn, UnigeneID, and Unigene Title for all of the sequences in Table 16.

Table 15B shows the accession numbers for those Pkey's lacking UnigeneID's for Table 15A. For each probeset we have listed the gene cluster number from which the
oligonucleotides were designed. Gene clusters were compiled using sequences derived from Genbank ESTs and mRNAs. These sequences were clustered based on sequence
similarity using Clustering and Alignment Tools (DoubleTwist, Oakland, California). The Genbank accession numbers for sequences comprising each cluster are listed in the
"Accession" column.

Table 15C shows the genomic positioning for those Pkey's lacking Unigene ID's and accession numbers in Table 15A. For each predicted exon, we have listed the genomic
sequence source used for prediction. Nucleotide locations of each predicted exon are also listed.



Seq ID No: Sequence ID number

Pkey: Unique Eos probeset identifier number

ExAccn: Exemplar Accession number, Genbank accession number

UnigeneID: Unigene number

Unigene Title: Unigene gene title



Seq ID No: 

Seq ID No: 1 & 2
Seq ID No: 3 & 4
Seq ID No: 5 & 6
Seq ID No: 7 & 8 
Seq ID No: 9 & 10 
Seq ID No: 11 & 12 
Seq ID No: 13 & 14
Seq ID No: 15 & 16 
Seq ID No: 17 & 18 
Seq ID No: 19 & 20 
Seq ID No: 21 & 22 
Seq ID No: 23 & 24 
410407
412719 
417034 
430486 
407788 
407788 
407788
407788 
439285 
413753 
120486 
425650 
412140 
423673 
452838 
418663 
418663 
409632 
429610 
406690 
431846 
418830 
424098 
443648 
311034 
408522 
422158 
435505 
417366 
431958 
441020 
423217 
429538 
448733 
444371 
444371 
444371 
422168 
422168 
429259 
426440 
437044 
423662 
428484 
429211 
417389 
423634 
417515 
441362 
425322 
449003 
431009 
409103 
417542 
428471 
418004 
414761 
418203 
447343 
437016 
449230 
446989 
457819 
424687 
X66839

AW016610 

NM_006183 

BE062109 

BE514982 

BE514982 

BE514982 

BE514982 

AL133916 

U17760 

AW368377 

NM_001944

AA219691 

BE003054 

U65011 

AK001100 

AK001100 

W74001 

AB024937 

M29540 

BE019924 

BE513731 

AF077374 

AI085377 

BE567130 

AI541214 

L10343 

AF200492 

BE185289 

X63629 

W79283 

NM_000094

BE182592 

NM_005629

BE540274

BE540274

BE540274

AA586894

AA586894 

AA420450 

BE382756 

AL035864 

AK001035 

AF104032 

AF052693 

BE260964 

AW959908 

L24203 

BE614410 

U63630 

X76342 

BE149762 

AF251237 

J04129 

X57348 

U37519 

AU077228 

X54942 

AA256641 

AU076916 

BE613348 

AK001898 

AA057484 

J05070 



UnigenelD 

Hs.63287 

Hs.816 

Hs.80962 

Hs.241551 

Hs.38991 

Hs.38991 

Hs.38991 

Hs.38991 

Hs.75517 

Hs.137569 

Hs.1925 

Hs.73625 

Hs.1695 

Hs.30743 

Hs.41690 

Hs.41690 

Hs.55279 

Hs.211092 

Hs.220529 

Hs.271580 

Hs.88959 

Hs.139322 

Hs.143610 

Hs.311389 

Hs.46320 

Hs.112341

Hs.211238 

Hs.1076 

Hs.2877 

Hs.35962 

Hs.1640 

Hs.11261

Hs.187958 

Hs.239 

Hs.239 

Hs.239 

Hs.112408

Hs.112408

Hs.292911

Hs.169902 

Hs.69517 

Hs.130881

Hs.184601

Hs.198249

Hs.82045

Hs.1690

Hs.82237 

Hs.23044 

Hs.155637 

Hs.389 

Hs.48956 

Hs.112208 

Hs.82269 

Hs.184510 

Hs.87539

Hs.77256 

Hs.83758 

Hs.236894 

Hs.5398 

Hs.211579 

Hs.16740

Hs.35406 

Hs.151738 



Unigene Title 

carbonic anhydrase IX 
ESTs 

neurotensin 

chloride channel, calcium activated, fam
S100 calcium-binding protein A2 
S100 calcium-binding protein A2 
S100 calcium-binding protein A2 
S100 calcium-binding protein A2 
hypothetical protein FLJ20093
laminin, beta 3 (nicein (125kD), kalinin
tumor protein 63 kDa with strong homolog 
desmoglein 3 (pemphigus vulgaris antigen 
RAB6 interacting, kinesin-like (rabkines 
matrix metalloproteinase 12 (macrophage 
preferentially expressed antigen in mela 
desmocollin 3
desmocollin 3

serine (or cysteine) proteinase inhibito
LUNX protein; PLUNC (palate lung and nas 
carcinoembryonic antigen-related cell ad 
uroplakin 1B

hypothetical protein MGC4816
small proline-rich protein 3
ESTs

ESTs, Highly similar to NKGD_HUMAN NKG2-

Small proline-rich protein SPRK [human, 

protease inhibitor 3, skin-derived (SKAL 

interleukin-1 homolog 1 

small proline-rich protein 1B (cornifin)

cadherin 3, type 1, P-cadherin (placenta 

ESTs 

collagen, type VII, alpha 1 (epidermolys 

small proline-rich protein 2A 

solute carrier family 6 (neurotransmitte 

forkhead box M1

forkhead box M1

forkhead box M1

S100 calcium-binding protein A7 (psorias 
S100 calcium-binding protein A7 (psorias 
Plakophilin 

solute carrier family 2 (facilitated glu 
differentially expressed in Fanconi's an 
B-cell CLL/lymphoma 11A (zinc finger pro
solute carrier family 7 (cationic amino
gap junction protein, beta 5 (connexin 3 
midkine (neurite growth-promoting factor 
heparin-binding growth factor binding pr 
ataxia-telangiectasia group D-associated 
RAD51 (S. cerevisiae) homolog (E coli Re 
protein kinase, DNA-activated, catalytic
alcohol dehydrogenase 7 (class IV), mu o 
gap junction protein, beta 6 (connexin 3 
XAGE-1 protein 



stratifin

aldehyde dehydrogenase 3 family, member 

enhancer of zeste (Drosophila) homolog 2 

CDC28 protein kinase 2 

ESTs, Highly similar to S02392 alpha-2-m

guanine monphosphate synthetase 

melanoma cell adhesion molecule 

hypothetical protein FLJ11036

ESTs, Highly similar to unnamed protein 

matrix metalloproteinase 9 (gelatinase B




Seq ID No: 127 & 128 414430

Seq ID No: 129 & 130 418462

Seq ID No: 131 & 132 100558

Seq ID No: 133 & 134 458933

Seq ID No: 135 & 136 418478

Seq ID No: 137 & 138 418478

Seq ID No: 139 & 140 418478

Seq ID No: 141 & 142 418478

Seq ID No: 143 & 144 446269

Seq ID No: 145 & 146 422765

Seq ID No: 147 & 148 436481

Seq ID No: 149 & 150 440325

Seq ID No: 151 & 152 439606 

Seq ID No: 153 & 154 453884 

Seq ID No: 155 & 156 453884

Seq ID No: 157 & 158 453884

Seq ID No: 159 & 160 453884 

Seq ID No: 161 & 162 404877 

Seq ID No: 163 & 164 413129 

Seq ID No: 165 & 166 413281 

Seq ID No: 167 & 168 444781

Seq ID No: 169 & 170 416819

Seq ID No: 171 & 172 451320

Seq ID No: 173 & 174 418543 

Seq ID No: 175 & 176 454034 

Seq ID No: 177 & 178 425397 

Seq ID No: 179 & 180 415817 

Seq ID No: 181 & 182 415817 

Seq ID No: 183 & 184 415817 

Seq ID No: 185 & 186 415817 

Seq ID No: 187 & 188 415817 

Seq ID No: 189 & 190 419121

Seq ID No: 191 & 192 448993

Seq ID No: 193 & 194 421817

Seq ID No: 195 & 196 430393

Seq ID No: 197 & 198 425057 

Seq ID No: 199 & 200 420462 

Seq ID No: 201 & 202 102963 

Seq ID No: 203 & 204 100576 

Seq ID No: 205 & 206 101175 

Seq ID No: 207 & 208 429038 

Seq ID No: 209 & 210 418678 

Seq ID No: 211 & 212 418678

Seq ID No: 213 & 214 131927 

Seq ID No: 215 & 216 428182 

Seq ID No: 217 & 218 427335 

Seq ID No: 219 & 220 409420 

Seq ID No: 221 & 222 114346 

Seq ID No: 223 & 224 438956 

Seq ID No: 225 & 226 404440 

Seq ID No: 227 & 228 415669 

Seq ID No: 229 & 230 103312 

Seq ID No: 231 & 232 320843 

Seq ID No: 233 429065 

Seq ID No: 234 & 235 446102 

Seq ID No: 236 & 237 330495 

Seq ID No: 238 413573 

Seq ID No: 239 & 240 428479 

Seq ID No: 241 & 242 428479 

Seq ID No: 243 & 244 332180 

Seq ID No: 245 437915 

Seq ID No: 246 & 247 441553 

Seq ID No: 248 & 249 331692 

Seq ID No: 250 & 251 429413 

Seq ID No: 252 & 253 422283 

Seq ID No: 254 & 255 448357 

Seq ID No: 256 & 257 446292 

Seq ID No: 258 & 259 416209

Seq ID No: 260 & 261 453922 

Seq ID No: 262 & 263 424046 

Seq ID No: 264 & 265 439223

Seq ID No: 266 & 267 429228 

Seq ID No: 268 & 269 409757 

Seq ID No: 270 & 271 411089 

Seq ID No: 272 & 273 436511 

Seq ID No: 274 & 275 428969 

Seq ID No: 276 & 277 428969 

Seq ID No: 278 & 279 428969 

Seq ID No: 280 & 281 428969 

Seq ID No: 282 407137 

Seq ID No: 283 & 284 412723 

Seq ID No: 285 & 286 450701 

Seq ID No: 287 & 288 405770 

Seq ID No: 289 & 290 439453 

Seq ID No: 291 & 292 414774 



AI346201 


Hs.76118 


BE001596 


Hs.85266 


105424 


Hs.169610 


AI638429 


Hs.24763


U38945 


Hs.1174 


U38945 


Hs.1174 


U38945 


Hs.1174 


U38945 


Hs.1174 


AW263155 


Hs.14559 


AW409701 


Hs.1578 


AA379597 


Hs.5199 


NM_003812


Hs.7164 


W79123 


Hs.58561 


AA355925 


Hs.36232 


AA355925 


Hs.36232 


AA355925 


Hs.36232 


AA355925 


Hs.36232 


AF292100 


Hs.104613 


AA861271 


Hs.222024 


NM_014400


Hs.11950 


U77735 


Hs.80205 


AW118072




NM_005329 


Hs.85962 


NM_000691


Hs.575 


J04088 


Hs.156346 


U88967 


Hs.78867 


U88967


Hs.78867 


U88967 


Hs.78867 


U88967


Hs.78867 


U88967 


Hs.78867 


AA374372 


Hs.89626 


AI471630 


Hs.8127 


AF146074 


Hs.108660


BE185030 


Hs.241305 


AA826434 


Hs.1619 


AF050147 


Hs.97932 


X02404 


Hs.274534


X00356 


Hs.37058 


U82671 


Hs.36960 


AL023513 


Hs.194766


NM_001327


Hs.167379


NM_001327


Hs.167379 


AJ003112


Hs.34780 


BE386042 


Hs.293317 


AA448542 


Hs.251677 


Z15008 


Hs.54451 


AL137256


Hs.130489


W00847 


Hs.135056 


NM_005025 


Hs.78589 


Y12642 


Hs.3185 


BE069288 


Hs.34744 


AI753247 


Hs.29643 


AW168067 


Hs.317694 


U47924 


Hs.71642 


AI733859 


Hs.149089 


Y00272 


Hs.334562 


Y00272 


Hs.334562 


AF134160 


Hs.7327 


AI637993 


Hs.202312 


AA281219 


Hs.121296 


AI683487 


Hs.152213 


NM_014058


Hs.201877 


AW411307 


Hs.114311


N20169 


Hs.108923 


AF081497 


Hs.279682 


AA236776 


Hs.79078 


AF053306 


Hs.36708 


AF027866 


Hs.138202 


AW238299 


Hs.250618 


AI553633 


Hs.326447 


NM_001898 


Hs.123114 


AA456454 


Hs.214291 


AA721252 


Hs.291502 


AF120274 


Hs.194689 


AF120274 


Hs.194689 


AF120274 


Hs.194689 


AF120274 


Hs.194689 


T97307 




AA648459 


Hs.335951 


H39960 


Hs.288467


BE264974 


Hs.6566 


X02419 


Hs.77274 



ubiquitin carboxyl-terminal esterase L1
integrin, beta 4 

CD44 antigen (homing function and Indian 
RAN binding protein 1 
cyclin-dependent kinase inhibitor 2A (me
cyclin-dependent kinase inhibitor 2A (me
cyclin-dependent kinase inhibitor 2A (me
cyclin-dependent kinase inhibitor 2A (me
hypothetical protein FLJ10540
baculoviral IAP repeat-containing 5 (sur
HSPC150 protein similar to ubiquitin-con
a disintegrin and metalloproteinase doma
G protein-coupled receptor 87 
KIAA0186 gene product
KIAA0186 gene product
KIAA0186 gene product
KIAA0186 gene product
NM_005365:Homo sapiens melanoma antigen,
RP42 homolog
transcription factor BMAL2 
GPI-anchored metastasis-associated prote 
pim-2 oncogene 

diacylglycerol kinase, zeta (104kD)
hyaluronan synthase 3
aldehyde dehydrogenase 3 family, member
topoisomerase (DNA) II alpha (170kD)
protein tyrosine phosphatase, receptor-t 
protein tyrosine phosphatase, receptor-t 
protein tyrosine phosphatase, receptor-t 
protein tyrosine phosphatase, receptor-t 
protein tyrosine phosphatase, receptor-t 
parathyroid hormone-like hormone
KIAA0144 gene product
ATP-binding cassette, sub-family C (CFTR
estrogen-responsive B box protein
achaete-scute complex (Drosophila) homol
chondromodulin I precursor
calcitonin-related polypeptide, beta
calcitonin/calcitonin-related polypeptid
melanoma antigen, family A, 2
seizure related gene 6 (mouse)-like
cancer/testis antigen (NY-ESO-1)
cancer/testis antigen (NY-ESO-1)
doublecortex; lissencephaly, X-linked (d
ESTs, Weakly similar to GGC1_HUMAN G ANT 
G antigen 7B 

laminin, gamma 2 (nicein (100kD), kalini
ATPase, aminophospholipid transporter-li
Human DNA sequence from clone RP5-850E9
NM_021048:Homo sapiens melanoma antigen,
serine (or cysteine) proteinase inhibito 
lysosomal 

Homo sapiens mRNA; cDNA DKFZp547C136 (fr
Homo sapiens cDNA FLJ13103 fis, clone NT
ESTs 

guanine nucleotide binding protein (G pr 
ESTs 

cell division cycle 2, G1 to S and G2 to 
cell division cycle 2, G1 to S and G2 to 
claudin 1 

Homo sapiens clone N11 NTera2D1 teratoca 
ESTs 

wingless-type MMTV integration site fami 
DESC1 protein 

CDC45 (cell division cycle 45, S.cerevis
RAB38, member RAS oncogene family 
Rh type C glycoprotein 
MAD2 (mitotic arrest deficient, yeast h 
budding uninhibited by benzimidazoles 1 
serine (or cysteine) proteinase inhibito 
UL16 binding protein 2
ESTs 

cystatin SN 

cell division cycle 2-like 1 (PITSLRE pr 

ESTs 

artemin 

artemin 

artemin 

artemin 

gb:ye53h05.s1 Soares fetal liver spleen
hypothetical protein AF301222
hypothetical protein XP_098151 (leucine- 
NM_002362:Homo sapiens melanoma antigen, 
thyroid hormone receptor interactor 13 
plasminogen activator, urokinase 
Seq ID No: 295 & 296 437789

Seq ID No: 297 & 298 437789 

Seq ID No: 299 & 300 437789 

Seq ID No: 301 & 302 437789

Seq ID No: 303 & 304 437789 

Seq ID No: 305 & 306 453968 

Seq ID No: 307 & 308 403478 

Seq ID No: 309 441525

Seq ID No: 310 & 311 434105 

Seq ID No: 312 & 313 428810 

Seq ID No: 314 & 315 413691 

Seq ID No: 316 & 317 423934 

Seq ID No: 318 & 319 409228 

Seq ID No: 320 & 321 425734 

Seq ID No: 322 & 323 413582

Seq ID No: 324 & 325 438403 

Seq ID No: 326 & 327 403329 

Seq ID No: 328 & 329 409893

Seq ID No: 330 & 331 119073 

Seq ID No: 332 & 333 113195 

Seq ID No: 334 & 335 102283 

Seq ID No: 336 & 337 101345 

Seq ID No: 338 & 339 103280 

Seq ID No: 340 & 341 102012 

Seq ID No: 342 & 343 105729 

Seq ID No: 344 & 345 134299 

Seq ID No: 346 & 347 412719 

Seq ID No: 348 & 349 422158 

Seq ID No: 350 & 351 128924 

Seq ID No: 352 & 353 100486 

Seq ID No: 354 & 355 419121 

Seq ID No: 356 & 357 409459 

Seq ID No: 358 & 359 330493 

Seq ID No: 360 & 361 417866

Seq ID No: 362 & 363 418113 

Seq ID No: 364 & 365 437016 

Seq ID No: 366 & 367 429612 

Seq ID No: 368 & 369 440704

Seq ID No: 370 & 371 431221 

Seq ID No: 372 & 373 431565 

Seq ID No: 374 & 375 431565 

Seq ID No: 376 & 377 132354 

Seq ID No: 378 & 379 424441 

Seq ID No: 380 & 381 103768 

Seq ID No: 382 & 383 417512 

Seq ID No: 384 & 385 425266 

Seq ID No: 386 & 387 424503 

Seq ID No: 388 & 389 400289 

Seq ID No: 390 & 391 418007 

Seq ID No: 392 & 393 418007 

Seq ID No: 394 & 395 418738 

Seq ID No: 396 & 397 415138 

Seq ID No: 398 & 399 418506 

Seq ID No: 400 & 401 423961 

Seq ID No: 402 & 403 414812 

Seq ID No: 404 & 405 417433 

Seq ID No: 406 & 407 417433 

Seq ID No: 408 & 409 422867

Seq ID No: 410 & 411 428227 

Seq ID No: 412 & 413 444381 

Seq ID No: 414 & 415 400303 

Seq ID No: 416 & 417 411789 

Seq ID No: 418 & 419 428698 

Seq ID No: 420 & 421 450098

Seq ID No: 422 & 423 421552 

Seq ID No: 424 & 425 452747 

Seq ID No: 426 & 427 450375 

Seq ID No: 428 & 429 426215 

Seq ID No: 430 & 431 425247 

Seq ID No: 432 & 433 432201 

Seq ID No: 434 & 435 427585 

Seq ID No: 436 & 437 442117 

Seq ID No: 438 & 439 431211 

Seq ID No: 440 & 441 447033 

Seq ID No: 442 & 443 447033 

Seq ID No: 444 & 445 447033 

Seq ID No: 446 & 447 115522 

Seq ID No: 448 & 449 410418 

Seq ID No: 450 & 451 409041 

Seq ID No: 452 & 453 409041 

Seq ID No: 454 & 455 452461 

Seq ID No: 456 & 457 412420 

Seq ID No: 458 & 459 416658 

Seq ID No: 460 & 461 407811 



M90656 


Hs.151393


AI581344 


Hs.127812 


Al 581344 


Hs.127812 


Al 58 1344 


Hs.127812 


A1581344 


Hs.127812 


AI581344 


Hs.127812 


AA847843 


Hs.62711 


AW241867 


Hs.127728 


AW952124 


Hs.13094 


AF068236 


Hs.193788 


AB023173 


Hs.75478 


U89995 


Hs.159234 


R16811 


Hs.22010 


AF056209 


Hs.159396 


AW295647 


Hs.71331 


AA806607 


Hs.292206 


AW247090 


Hs.57101 


BE245360 


Hs.279477 


H83265 


Hs.8881 


AW161552 


Hs.83381 


NM_005795 


Hs.152175 


U84722 


Hs.76206 


BE259035 


Hs.118400 


H46612 


Hs.293815 


AW580939 


Hs.97199 


AW016610 


Hs.816 


L10343 


Hs.112341 


BE279383 


Hs.26557 


T19006 


Hs.10842 


AA374372 


Hs.89626 


D86407 


Hs.54481 


M27826 




AW067903 


Hs.82772 


AI272141 


Hs.83484 


AU076916 


Hs.5398 


AF062649 


Hs.252587 


M69241 


Hs.162 


AA449015 


Hs.286145 


AF161470 


Hs.260622 


AF161470 


Hs.260622 


BE185289 


Hs.1076 


X14850 


Hs.147097 


AF086009 


Hs.296398 


X76534 


Hs.82226 


J00077 


Hs.155421 


NM_002205 


Hs.149609 


X07820 


Hs.2258 


M13509 


Hs.83169 


M13509 


Hs.83169 


AW388633 


Hs.6682 


C18356 


Hs.295944 


M084248 


Hs.85339 


D13666 


Hs.136348 


X72755 


Hs.77367 


BE270266 


Hs.82128 


BE270266 


Hs.82128 


L32137 


Hs.1584 


AA321649 


Hs.2248 


BE387335 


Hs.283713 


AA242758 


Hs.79136 


AF245505 


Hs.72157 


AA852773 


Hs.334838 


W27249 


Hs.8109 


AF026692 


Hs.105700 


BE153855 


Hs.61460 


AA009647 




AW963419 


Hs.155223 


NM_005940 


Hs.155324 


AI538613 


Hs.298241 


D31152 


Hs.179729 


AW664964 


Hs.128899 


M86849 


Hs.323733 


AI357412 


Hs.157601 


AI357412 


Hs.157601 


AI357412 


Hs.157601 


BE614387 


Hs.333893 


D31382 


Hs.63325 


AB033025 


Hs.50081 


AB033025 


Hs.50081 


N78223 


Hs.108106 


AL035668 


Hs.73853 


U03272 


Hs.79432 


AW190902 


Hs.40098 



glutamate-cysteine ligase, catalytic sub 
ESTs, Weakly similar to T17330 hypotheti 
ESTs, Weakly similar to T17330 hypotheti 
ESTs, Weakly similar to T17330 hypotheti 
ESTs, Weakly similar to T17330 hypotheti 
ESTs, Weakly similar to T17330 hypotheti 
High mobility group (nonhistone chromoso 
NM_022342:Homo sapiens kinesin protein 9 
ESTs 

presenilins associated rhomboid-like pro 
nitric oxide synthase 2A (inducible, hep 
ATPase, Class VI, type 11B 
forkhead box E1 (thyroid transcription f 
ESTs, Weakly similar to 2109260A B cell 
peptidylglycine alpha-amidating monooxyg 
hypothetical protein MGC5350 
ESTs 

unnamed protein product [Homo sapiens] 
minichromosome maintenance deficient (S. 
v-ets erythroblastosis virus E26 oncogen 
ESTs, Weakly similar to S41044 chromosom 
guanine nucleotide binding protein 11 
calcitonin receptor-like 
cadherin 5, type 2, VE-cadherin (vascula 
singed (Drosophila)-like (sea urchin fas 
Homo sapiens HSPC285 mRNA, partial cds 
complement component C1q receptor 
ESTs 

protease inhibitor 3, skin-derived (SKAL 
plakophilin 3 

RAN, member RAS oncogene family 
parathyroid hormone-like hormone 
low density lipoprotein receptor-related 
endogenous retroviral protease 
collagen, type XI, alpha 1 
SRY (sex determining region Y)-box 4 
guanine monophosphate synthetase 
pituitary tumor-transforming 1 
insulin-like growth factor binding prote 
SRB7 (suppressor of RNA polymerase B, ye 
butyrate-induced transcript 1 
butyrate-induced transcript 1 
small proline-rich protein 1B (cornifin) 
H2A histone family, member X 
gb:Homo sapiens full length insert cDNA 
glycoprotein (transmembrane) nmb 
alpha-fetoprotein 

integrin, alpha 5 (fibronectin receptor, 
matrix metalloproteinase 10 (stromelysin 
matrix metalloproteinase 1 (interstitial 
matrix metalloproteinase 1 (interstitial 
solute carrier family 7, (cationic amino 
tissue factor pathway inhibitor 2 
G protein-coupled receptor 39 
periostin (OSF-2os) 
monokine induced by gamma interferon 
5T4 oncofetal trophoblast glycoprotein 
5T4 oncofetal trophoblast glycoprotein 
cartilage oligomeric matrix protein (pse 
small inducible cytokine subfamily B (Cy 
ESTs, Weakly similar to S64054 hypotheti 
LIV-1 protein, estrogen regulated 
Adlican 

KIAA1866 protein 

hypothetical protein FLJ21060 

secreted frizzled-related protein 4 

Ig superfamily receptor LNIR 

a disintegrin and metalloproteinase doma 

stanniocalcin 2 

matrix metalloproteinase 11 (stromelysin 
Transmembrane protease, serine 3 
collagen, type X, alpha 1 (Schmid metaph 
ESTs; hypothetical protein for IMAGE:447 
gap junction protein, beta 2, 26kD (conn 
ESTs 
ESTs 
ESTs 

c-Myc target JPO1 
transmembrane protease, serine 4 
Hypothetical protein, XP_051860 (KIAA119 
Hypothetical protein, XP_051860 (KIAA119 
transcription factor 
bone morphogenetic protein 2 
fibrillin 2 (congenital contractural ara 
cysteine knot superfamily 1, BMP antagon 
Seq ID No: 462 & 463 437852 

Seq ID No: 464 & 465 402075 

Seq ID No: 466 & 467 421110 

Seq ID No: 468 & 469 451668 

Seq ID No: 470 & 471 451668 

Seq ID No: 472 & 473 451668 

Seq ID No: 474 & 475 422282 

Seq ID No: 476 & 477 425852 

Seq ID No: 478 & 479 439738 

Seq ID No: 480 & 481 427747 

Seq ID No: 482 & 483 420281 

Seq ID No: 484 & 485 405932 

Seq ID No: 486 & 487 405932 

Seq ID No: 488 & 489 444342 

Seq ID No: 490 & 491 421379 

Seq ID No: 492 & 493 417079 

Seq ID No: 494 & 495 430890 

Seq ID No: 496 & 497 419721 

Seq ID No: 498 & 499 444471 

Seq ID No: 500 & 501 413063 

Seq ID No: 502 & 503 433800 

Seq ID No: 504 & 505 452401 

Seq ID No: 506 & 507 452401 

Seq ID No: 508 & 509 450001 

Seq ID No: 510 & 511 410407 

Seq ID No: 512 & 513 309931 

Seq ID No: 514 & 515 412719 

Seq ID No: 516 & 517 417034 

Seq ID No: 518 & 519 430486 

Seq ID No: 520 & 521 413753 

Seq ID No: 522 & 523 425650 

Seq ID No: 524 & 525 423673 

Seq ID No: 526 & 527 418663 

Seq ID No: 528 & 529 418663 

Seq ID No: 530 & 531 429610 

Seq ID No: 532 & 533 406690 

Seq ID No: 534 & 535 431846 

Seq ID No: 536 & 537 422158 

Seq ID No: 538 & 539 431958 

Seq ID No: 540 & 541 437044 

Seq ID No: 542 & 543 428484 

Seq ID No: 544 & 545 429211 

Seq ID No: 546 & 547 417389 

Seq ID No: 548 & 549 431009 

Seq ID No: 550 & 551 417542 

Seq ID No: 552 & 553 449230 

Seq ID No: 554 & 555 410555 
Seq ID No: 556 & 557 410555 

Seq ID No: 558 & 559 424687 

Seq ID No: 560 & 561 418462 

Seq ID No: 562 & 563 410274 

Seq ID No: 564 & 565 439606 
Seq ID No: 566 & 567 404877 
Seq ID No: 568 & 569 444781 

Seq ID No: 570 & 571 418543 
Seq ID No: 572 & 573 415817 
Seq ID No: 574 & 575 415817 
Seq ID No: 576 & 577 415817 
Seq ID No: 578 & 579 415817 

Seq ID No: 580 & 581 415817 

Seq ID No: 582 & 583 415817 
Seq ID No: 584 & 585 421617 

Seq ID No: 586 & 587 418678 

Seq ID No: 588 & 589 418678 
Seq ID No: 590 & 591 409420 
Seq ID No: 592 & 593 332180 
Seq ID No: 594 & 595 408790 
Seq ID No: 596 & 597 408790 
Seq ID No: 598 & 599 439223 
Seq ID No: 600 & 601 409757 
Seq ID No: 602 & 603 428969 
Seq ID No: 604 & 605 428969 
Seq ID No: 606 & 607 428969 
Seq ID No: 608 & 609 428969 
Seq ID No: 610 & 611 450701 
Seq ID No: 612 & 613 450701 
Seq ID No: 614 & 615 414774 
Seq ID No: 616 & 617 407944 
Seq ID No: 618 & 619 407944 
Seq ID No: 620 & 621 457489 
Seq ID No: 622 & 623 429547 
Seq ID No: 624 & 625 407242 
Seq ID No: 626 & 627 407242 
Seq ID No: 628 & 629 407242 
Seq ID No: 630 & 631 444006 



BE001836 Hs.256897 

AJ250717 Hs.1355 

Z43948 Hs.326444 

Z43948 Hs.326444 

Z43948 Hs.326444 

AF019225 Hs.114309 

AK001504 Hs.159651 



QCO46502 


Hs.9598 


AW4114? 1 ! 


Hs.180655 


Alfi9^GQ3 

MJ0£OG3J 


Hs.323494 


NM 0143QA 


Hs.10887 


Y15221 


Hs.103982 


U0vJJ3U 


Hs.81134 




Hs.2699 


MlVl_UU IOOU 


Hs.288650 




Hs.11217 


A! 0157T7 

MJ.UOO/ Of 


Hs.75184 


AI0343R1 

MIUO*fOO 1 


Hs.135150 


MM 00711*1 


Hs.29352 


MM 0071 m 


Hs.29352 


Mm 001044 


Hs.406 


ADO 003 


H«Lfi3?fi7 

rlo.uO£0 1 


AWU1RR3 

MVVOH 1 QOO 




nwu too iu 


Hcfllfi 


MM fulfil A3 
)MIV1_UU0 100 


Wk flOQR9 
no.auju^ 


QCfifi910Q 


He 941 R51 


I I177P0 
U I / f Ou 


He 7R<\17 


mm nniQAit 


Me 1Q9R 
ns. iota 


bcUUoUo4 


Ue 1 RQR 
US. 1030 


Avnnnno 

MftUU 1 1 w 


Me 41RQ0 

ns.** 1 03 u 


Ai^nnnno 

MAUUl IUU 


He41RQ0 


ARn?AQT7 


We 911009 
n&.c 1 iu3a 


MOQtlAf) 
MZoWU 


Ue 99059Q 


RP01QQ94 
DCU 1 S3* 1 * 


He 971 RR0 


L IUOmj 


He 119^41 
ns. 1 1 to 4 * i 


A00O&9 


Hs.2877 


Al fW>AR4 


Hs 69517 


Mr lUHUJi 


Ue 184R01 


AF052693 


Hs.198249 


BE260964 


Hs.82045 


DC IH3JP* 


He 4ftQ^R 


f0419Q 


He R99RQ 
no.0££03 


OtO IOOhO 


He 911R7Q 




He FA1 1 1 


1 \OOPAQ 


He R431 1 


rncn7n 

JUOv/U 


He 1R177A 


OtUU IOMO 


He R<;9RR 


A A^RI AIT7 
AAuO lOUf 


He R17R9 




He RRRR1 
nS.ODOD 1 


KIM 014400 


He 11QR0 


MM OflRWi 


He RRQR9 


I 1QQQC7 
\J003Of 


He 7RRR7 
nS./ODOf 


j tftftqfi7 

uooyof 


He 7RRR7 
nS.f 000/ 




He 7RRR7 
ns./OOO/ 


i/otjyo/ 


We 7RRR7 

ns./ooo f 


| IQQQC7 


He 7RRR7 
ns./ooo/ 


1 IQ0QC7 

uooyof 


We 7RRR7 


AC4A£fY?A 

Arl4oU/«» 


Ue IftRRRA 
nS.lUOODu 




Ue 1R777Q 

ns. Jo/o/3 


mm nnno7 
NMJJU1 J*' 


He 1R717Q 
nS.lO/O/3 


7n;nnft 


Ue RAAR1 


Ar JO'tlDU 


He 7197 

ns./0£/ 




He A7RR0 

ns.**/oou 


AVA/5R0997 
AWOOU/Zi 


He 47RR0 


AVA/97R9QQ 


He 9R0R1R 
n5.il j UO I O 


MM O01AQR 
nlVL,w J oao 


He 191114 


API 90974 
Mr \L\)e.l 1 


He 1Q4RRQ 

no. 1 3HU03 


AP1 90974 
Mr \(MCI H 


He 1Q46RQ 

no. 1 3HO03 


AC1 90974 

Mr >£\tcl H 


We 1Q4RRQ 

no. 1 3HVJ03 


AP1 90974 
Mr IcMct H 


He 1Q4RRQ 
no. 1 3*1003 


H39960 


Hs.288457 


H39960 


Hs.288467 


X02419 


Hs.77274 


R34008 


Hs.239727 


R34008 


Hs.239727 


AI693815 


Hs.127179 


AW009166 


Hs.99376 


M18728 




M18728 




M18728 




BE395085 


Hs.10086 



ESTs, Weakly similar to dJ365O12.1 [H.sa 
ENSP00000251056*:Plasma membrane calcium 
cathepsin E 

cartilage acidic protein 1 
cartilage acidic protein 1 
cartilage acidic protein 1 
apolipoprotein L 

death receptor 6, TNF superfamily member 
sema domain, immunoglobulin domain (Ig), 
serine/threonine kinase 12 
Predicted cation efflux pump 
C15000305:gi|3806122|gb|AAC69198.1| (AF0 
C15000305:gi|3806122|gb|AAC69198.1| (AF0 
similar to lysosome-associated membrane 
small inducible cytokine subfamily B (Cy 
Interleukin 1 receptor antagonist 
glypican 1 
aquaporin 4 
KIAA0877 protein 

chitinase 3-like 1 (cartilage glycoprote 
lung type-I cell membrane-associated gly 
tumor necrosis factor, alpha-induced pro 
tumor necrosis factor, alpha-induced pro 
solute carrier family 6 (neurotransmitte 
carbonic anhydrase IX 

gb:hd13dOU1 Soares_NFL_T_GBC_S1 Homo s 
ESTs 

neurotensin 

chloride channel, calcium activated, fam 
laminin, beta 3 (nicein (125kD), kalinin 
desmoglein 3 (pemphigus vulgaris antigen 
matrix metalloproteinase 12 (macrophage 
desmocollin 3 
desmocollin 3 

LUNX protein; PLUNC (palate lung and nas 
carcinoembryonic antigen-related cell ad 
uroplakin 1B 

protease inhibitor 3, skin-derived (SKAL 
cadherin 3, type 1, P-cadherin (placenta 
differentially expressed in Fanconi's an 
solute carrier family 7 (cationic amino 
gap junction protein, beta 5 (connexin 3 
midkine (neurite growth-promoting factor 
gap junction protein, beta 6 (connexin 3 
progestagen-associated endometrial prote 
melanoma cell adhesion molecule 
a disintegrin and metalloproteinase doma 
a disintegrin and metalloproteinase doma 
matrix metalloproteinase 9 (gelatinase B 
integrin, beta 4 
hypoxia-inducible protein 2 
G protein-coupled receptor 87 
NM_005365:Homo sapiens melanoma antigen, 
GPI-anchored metastasis-associated prote 
hyaluronan synthase 3 
protein tyrosine phosphatase, receptor-t 
protein tyrosine phosphatase, receptor-t 
protein tyrosine phosphatase, receptor-t 
protein tyrosine phosphatase, receptor-t 
protein tyrosine phosphatase, receptor-t 
protein tyrosine phosphatase, receptor-t 
ATP-binding cassette, sub-family C (CFTR 
cancer/testis antigen (NY-ESO-1) 
cancer/testis antigen (NY-ESO-1) 
laminin, gamma 2 (nicein (100kD), kalini 
claudin 1 

neurotrophic tyrosine kinase, receptor, 

neurotrophic tyrosine kinase, receptor, 

UL16 binding protein 2 

cystatin SN 

artemin 

artemin 

artemin 

artemin 

hypothetical protein XP_098151 (leucine- 
hypothetical protein XP_098151 (leucine- 
plasminogen activator, urokinase 
desmocollin 2 
desmocollin 2 
cryptic gene 
ESTs 

gb:Human nonspecific crossreacting antig 
gb:Human nonspecific crossreacting antig 
gb:Human nonspecific crossreacting antig 
type I transmembrane protein Fn14 
Seq ID No: 632 & 633 
Seq ID No: 634 & 635 
Seq ID No: 636 & 637 
Seq ID No: 638 & 639 
Seq ID No: 640 & 641 
Seq ID No: 642 & 643 
Seq ID No: 644 & 645 
Seq ID No: 646 & 647 
Seq ID No: 648 & 649 
Seq ID No: 650 & 651 
Seq ID No: 652 & 653 
Seq ID No: 654 & 655 
Seq ID No: 656 & 657 
Seq ID No: 658 & 659 
Seq ID No: 660 & 661 
Seq ID No: 662 & 663 
Seq ID No: 664 & 665 
Seq ID No: 666 & 667 
Seq ID No: 668 & 669 
Seq ID No: 670 & 671 
Seq ID No: 672 & 673 
Seq ID No: 674 & 675 
Seq ID No: 676 & 677 
Seq ID No: 678 & 679 
Seq ID No: 680 & 681 
Seq ID No: 682 & 683 
Seq ID No: 684 & 685 
Seq ID No: 686 & 687 
Seq ID No: 





NM_003816 


Hs.2442 


4921 HQ 


373265 


Hs.1473 




AW470411 

M»""' WT • ■ 


Hs.288433 




245051 


Hs.22920 


4 1 ID 


Al 1076718 


Hs.164021 


431489 
HO I Hue 


AW583672 


Hs.256311 




AW369771 


Hs.52620 


49fi427 


M86699 

VV100U3 3 


Hs.169840 


445537 


AJ245671 


Hs.12844 


49997R 


AF072873 

Mr u i tot o 


Hs.114218 


428450 


NM_014791 


Hs.184339 


f+OO 1 3 


ai K)7e643 


Hs.313 




U23752 


Hs.32954 


426514 


RE616633 


Hs.170195 


425776 


U25128 


Hs.159499 


425776 


U25128 


Hs.159499 


431515 


NM_012152 


Hs.258583 


419452 


U33635 


Hs.90572 

1 U»|7VJ* Am 




M62096 


Hs.293185 


432653 


M62096 


Hs.293185 


439fi*3 




Hs.293185 




M62096 


Hs.293185 


4mnni 


AR041036 


Hs.57771 

no. Ji i # I 


426501 


AW043782 






R38438 


Hs.182575 




AA151349 


Hs 19S77 


422424 


AM 86431 


Hs.296638 


428330 


L22524 


Hs.2256 


420610 


AI683183 


Hs.99348 



a disintegrin and metalloproteinase doma 
gastrin-releasing peptide 
neurotrimin 

similar to S68401 (cattle) glucose induc 
small inducible cytokine subfamily B (Cy 
granin-like neuroendocrine peptide precu 
integrin, beta 8 
TTK protein kinase 
EGF-like-domain, multiple 6 
frizzled (Drosophila) homolog 6 
KIAA0175 gene product 
secreted phosphoprotein 1 (osteopontin, 
SRY (sex determining region Y)-box 11 
bone morphogenetic protein 7 (osteogenic 
parathyroid hormone receptor 2 
parathyroid hormone receptor 2 
endothelial differentiation, lysophospha 
PTK7 protein tyrosine kinase 7 
ESTs, Weakly similar to JC7328 amino aci 
ESTs, Weakly similar to JC7328 amino aci 
ESTs, Weakly similar to JC7328 amino aci 
ESTs, Weakly similar to JC7328 amino aci 
kallikrein 11 
ESTs 

solute carrier family 15 (H??? transport 
CGI-147 protein 
prostate differentiation factor 
matrix metalloproteinase 7 (matrilysin, 
distal-less homeo box 5 



TABLE 15B 



Pkey: Unique Eos probeset identifier number 
CAT number: Gene cluster number 
Accession: Genbank accession numbers 



Pkey 
309931 
330493 
439285 



451320 



CAT Number 
AW341683 
33264_5 
47065_1 



450375 83327_1 



86576_1 



Accession 

M27826 R78416 AA307645 AW957879 AW957800 AA633529 H03662 

AL133916 N79113 AF086101 N76721 AW950828 AA364013 AW955684 AI346341 AI867454 N54784 AI655270 AI421279 AW014882 
AA775552 N62351 N59253 AA626243 AI341407 BE175639 AA456968 AI358918 AA457077 

AA009647 AA131254 AA374293 AW954405 H04410 AW606284 AA151166 BE157467 BE157601 H04384 W46291 AW663674 H04021 H01532 
AA190993 H03231 H59605 H01642 AA852876 AA113758 AA626915 AA746952 AI161014 AA099554 R69067 
AW118072 AI631982 T15734 AA224195 AI701458 W20198 F26326 AA890570 N90552 AW071907 AJ671352 AI375892 T03517 R88265 
AI124088 AA224388 AI084316 AJ354686 T33652 AI140719 AI720211 T03490 AI372637 T15415 AW205836 AA630384 T03515 T33230 
AA017131 AA443303 T33623 AI222556 T33511 T33785 AI419606 D55612 



TABLE 15C 



Pkey: Unique number corresponding to an Eos probeset 
Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495. 
Strand: Indicates DNA strand from which exons were predicted. 
Nt_position: Indicates nucleotide positions of predicted exons. 

Pkey Ref Strand Nt_position 
402075 8117407 Plus 121907-122035,122804-122921,124019-124161,124455-124610,125672-126076 
403329 8516120 Plus 96450-96598 
403478 9958258 Plus 116458-116564 
404440 7528051 Plus 80430-81581 
404877 1519284 Plus 1095-2107 
405770 2735037 Plus 61057-62075 
405932 7767812 Minus 123525-123713 
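
The nucleotide-position entries of Table 15C are comma-separated lists of start-end ranges, one per predicted exon. Purely as an illustration, the following Python sketch parses such an entry and sums the exon lengths. The function names `parse_exons` and `total_exon_length` are hypothetical and not part of this disclosure, and the ranges are assumed to be 1-based and inclusive.

```python
def parse_exons(field):
    """Split a field such as '1095-2107' or 'a-b,c-d' into (start, end) tuples."""
    exons = []
    for part in field.split(","):
        start, end = part.strip().split("-")
        exons.append((int(start), int(end)))
    return exons


def total_exon_length(field):
    """Sum the exon lengths, assuming 1-based inclusive coordinates."""
    return sum(end - start + 1 for start, end in parse_exons(field))


# Entry listed for Pkey 402075 in Table 15C.
entry = "121907-122035,122804-122921,124019-124161,124455-124610,125672-126076"
print(parse_exons(entry)[0], total_exon_length(entry))
```

Under the inclusive-coordinate assumption, the five exons of this entry sum to 951 nucleotides.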



WO 02/086443 

PCT/US02/12476 

Table 16 

Seq ID NO: 1 DNA sequence 
Nucleic Acid Accession #: NM_001216 
Coding sequence: 43..1422 

1 11 21 31 41 51 

I I I I I I 

GCCCGTACAC ACCGTGTGCT GGGACACCCC ACAGTCAGCC GCATGGCTCC CCTGTGCCCC 60 
AGCCCCTGGC TCCCTCTGTT GATCCCGGCC CCTGCTCCAG GCCTCACTGT GCAACTGCTG 120 
CTGTCACTGC TGCTTCTGAT GCCTGTCCAT CCCCAGAGGT TGCCCCGGAT GCAGGAGGAT 180 
TCCCCCTTGG GAGGAGGCTC TTCTGGGGAA GATGACCCAC TGGGCGAGGA GGATCTGCCC 240 
AGTGAAGAGG ATTCACCCAG AGAGGAGGAT CCACCCGGAG AGGAGGATCT ACCTGGAGAG 300 
GAGGATCTAC CTGGAGAGGA GGATCTACCT GAAGTTAAGC CTAAATCAGA AGAAGAGGGC 360 
TCCCTGAAGT TAGAGGATCT ACCTACTGTT GAGGCTCCTG GAGATCCTCA AGAACCCCAG 420 
AATAATGCCC ACAGGGACAA AGAAGGGGAT GACCAGAGTC ATTGGCGCTA TGGAGGCGAC 480 
CCGCCCTGGC CCCGGGTGTC CCCAGCCTGC GCGGGCCGCT TCCAGTCCCC GGTGGATATC 540 
CGCCCCCAGC TCGCCGCCTT CTGCCCGGCC CTGCGCCCCC TGGAACTCCT GGGCTTCCAG 600 
CTCCCGCCGC TCCCAGAACT GCGCCTGCGC AACAATGGCC ACAGTGTGCA ACTGACCCTG 660 
CCTCCTGGGC TAGAGATGGC TCTGGGTCCC GGGCGGGAGT ACCGGGCTCT GCAGCTGCAT 720 
CTGCACTGGG GGGCTGCAGG TCGTCCGGGC TCGGAGCACA CTGTGGAAGG CCACCGTTTC 780 
CCTGCCGAGA TCCACGTGGT TCACCTCAGC ACCGCCTTTG CCAGAGTTGA CGAGGCCTTG 840 
GGGCGCCCGG GAGGCCTGGC CGTGTTGGCC GCCTTTCTGG AGGAGGGCCC GGAAGAAAAC 900 
AGTGCCTATG AGCAGTTGCT GTCTCGCTTG GAAGAAATCG CTGAGGAAGG CTCAGAGACT 960 
CAGGTCCCAG GACTGGACAT ATCTGCACTC CTGCCCTCTG ACTTCAGCCG CTACTTCCAA 1020 
TATGAGGGGT CTCTGACTAC ACCGCCCTGT GCCCAGGGTG TCATCTGGAC TGTGTTTAAC 1080 
CAGACAGTGA TGCTGAGTGC TAAGCAGCTC CACACCCTCT CTGACACCCT GTGGGGACCT 1140 
GGTGACTCTC GGCTACAGCT GAACTTCCGA GCGACGCAGC CTTTGAATGG GCGAGTGATT 1200 
GAGGCCTCCT TCCCTGCTGG AGTGGACAGC AGTCCTCGGG CTGCTGAGCC AGTCCAGCTG 1260 
AATTCCTGCC TGGCTGCTGG TGACATCCTA GCCCTGGTTT TTGGCCTCCT TTTTGCTGTC 1320 
ACCAGCGTCG CGTTCCTTGT GCAGATGAGA AGGCAGCACA GAAGGGGAAC CAAAGGGGGT 1380 
GTGAGCTACC GCCCAGCAGA GGTAGCCGAG ACTGGAGCCT AGAGGCTGGA TCTTGGAGAA 1440 
TGTGAGAAGC CAGCCAGAGG CATCTGAGGG GGAGCCGGTA ACTGTCCTGT CCTGCTCATT 1500 
ATGCCACTTC CTTTTAACTG CCAAGAAATT TTTTAAAATA AATATTTATA AT 

Seq ID NO: 2 Protein sequence: 
Protein Accession #: NP_001207 

1 11 21 31 41 51 

I I I I I I 

MAPLCPSPWL PLLIPAPAPG LTVQLLLSLL LLMPVHPQRL PRMQEDSPLG GGSSGEDDPL 60 
GEEDLPSEED SPREEDPPGE EDLPGEEDLP GEEDLPEVKP KSEEEGSLKL EDLPTVEAPG 120 
DPQEPQNNAH RDKEGDDQSH WRYGGDPPWP RVSPACAGRF QSPVDIRPQL AAFCPALRPL 180 
ELLGFQLPPL PELRLRNNGH SVQLTLPPGL EMALGPGREY RALQLHLHWG AAGRPGSEHT 240 
VEGHRFPAEI HVVHLSTAFA RVDEALGRPG GLAVLAAFLE EGPEENSAYE QLLSRLEEIA 300 
EEGSETQVPG LDISALLPSD FSRYFQYEGS LTTPPCAQGV IWTVFNQTVM LSAKQLHTLS 360 
DTLWGPGDSR LQLNFRATQP LNGRVIEASF PAGVDSSPRA AEPVQLNSCL AAGDILALVF 420 
GLLFAVTSVA FLVQMRRQHR RGTKGGVSYR PAEVAETGA 

Seq ID NO: 3 DNA sequence 
Nucleic Acid Accession #: BC013923 
Coding sequence: 438-1391 

1 11 21 31 41 51 

I I I I I I 

AGCGGGGTTG TCTATTAACT TGTTCAAAAA GTATCAGGAG TTGTCAAGGC AGAGAAGAGA 60 
GTGTTTGCAA AAGGGGGAAA GTAGTTTGCT GCCTCTTTAA GACTAGGACT GAGAGAAAGA 120 
AGAGGAGAGA GAAAGAAAGG GAGAGAAGTT TGAGCCCCAG GCTTAAGCCT TTCCAAAAAA 180 
TAATAATAAC AATCATCGGC GGCGGCAGGA TCGGCCAGAG GAGGAGGGAA GCGCTTTTTT 240 
TGATCCTGAT TCCAGTTTGC CTCTCTCTTT TTTTCCCCCA AATTATTCTT CGCCTGATTT 300 
TCCTCGCGGA GCCCTGCGCT CCCGACACCC CCGCCCGCCT CCCCTCCTCC TCTCCCCCCG 360 
CCCGCGGGCC CCCCAAAGTC CCGGCCGGGC CGAGGGTCGG CGGCCGCCGG CGGGCCGGGC 420 
CCGCGCACAG CGCCCGCATG TACAACATGA TGGAGACGGA GCTGAAGCCG CCGGGCCCGC 480 
AGCAAACTTC GGGGGGCGGC GGCGGCAACT CCACCGCGGC GGCGGCCGGC GGCAACCAGA 540 
AAAACAGCCC GGACCGCGTC AAGCGGCCCA TGAATGCCTT CATGGTGTGG TCCCGCGGGC 600 
AGCGGCGCAA GATGGCCCAG GAGAACCCCA AGATGCACAA CTCGGAGATC AGCAAGCGCC 660 
TGGGCGCCGA GTGGAAACTT TTGTCGGAGA CGGAGAAGCG GCCGTTCATC GACGAGGCTA 720 
AGCGGCTGCG AGCGCTGCAC ATGAAGGAGC ACCCGGATTA TAAATACCGG CCCCGGCGGA 780 
AAACCAAGAC GCTCATGAAG AAGGATAAGT ACACGCTGCC CGGCGGGCTG CTGGCCCCCG 840 
GCGGCAATAG CATGGCGAGC GGGGTCGGGG TGGGCGCCGG CCTGGGCGCG GGCGTGAACC 900 
AGCGCATGGA CAGTTACGCG CACATGAACG GCTGGAGCAA CGGCAGCTAC AGCATGATGC 960 
AGGACCAGCT GGGCTACCCG CAGCACCCGG GCCTCAATGC GCACGGCGCA GCGCAGATGC 1020 
AGCCCATGCA CCGCTACGAC GTGAGCGCCC TGCAGTACAA CTCCATGACC AGCTCGCAGA 1080 
CCTACATGAA CGGCTCGCCC ACCTACAGCA TGTCCTACTC GCAGCAGGGC ACCCCTGGCA 1140 
TGGCTCTTGG CTCCATGGGT TCGGTGGTCA AGTCCGAGGC CAGCTCCAGC CCCCCTGTGG 1200 
TTACCTCTTC CTCCCACTCC AGGGCGCCCT GCCAGGCCGG GGACCTCCGG GACATGATCA 1260 
GCATGTATCT CCCCGGCGCC GAGGTGCCGG AACCCGCCGC CCCCAGCAGA CTTCACATGT 1320 
CCCAGCACTA CCAGAGCGGC CCGGTGCCCG GCACGGCCAT TAACGGCACA CTGCCCCTCT 1380 
CACACATGTG AGGGCCGGAC AGCGAACTGG AGGGGGGAGA AATTTTCAAA GAAAAACGAG 1440 
GGAAATGGGA GGGGTGCAAA AGAGGAGAGT AAGAAACAGC ATGGAGAAAA CCCGGTACGC 1500 
TCAAAAAAAA AAAAAAAAAA AAAATCCCAT CACCCACAGC AAATGACAGC TGCAAAAGAG 1560 
AACACCAATC CCATCCACAC TCACGCAAAA ACCGCGATGC CGACAAGAAA ACTTTTATGA 1620 
GAGAGATCCT GGACTTCTTT TKGGGGGACT ATTTTTGTAC AGAGAAAACC TGGGGAGGGT 1680 
GGGGAGGGCG GGGGAATGGA CCTTGTATAG ATCTGGAGGA AAGAAAGCTA CGAAAAACTT 1740 
TTTAAAAGTT CTAGTGGTAC GGTAGGAGCT TTGCAGGAAG TTTGCAAAAG TCTTTACCAA 1800 
TAATATTTAG AGCTAGTCTC CAAGCGACGA AAAAAATGTT TTAATATTTG CAAGCAACTT 1860 
TTGTACAGTA TTTATCGAGA TAAACATGGC AATCAAAATG TCCATTGTTT ATAAGCTGAG 1920 




AATTTGCCAA TATTTTTCAA GGAGAGGCTT CTTGCTGAAT TTTGATTCTG CAGCTGAAAT 1980 

TTAGGACAGT TGCAAACGTG AAAAGAAGAA AATTATTCAA ATTTGGACAT TTTAATTGTT 2040 

TAAAAATTGT ACAAAAGGAA AAAATTAGAA TAAGTACTGG CGAACCATCT CTGTGGTCTT 2100 

GTTTAAAAAG GGCAAAAGTT TTAGACTGTA CTAAATTTTA TAACTTACTG TTAAAAGCAA 2160 

AAATGGCCAT GCAGGTTGAC ACCGTTGGTA ATTTATAATA GCTTTTGTTC GATCCCAACT 2220 

TTCCATTTTG TTCAGATAAA AAAAACCATG AAATTACTGT GTTTGAAATA TTTTCTTATG 2280 

GTTTGTAATA TTTCTGTAAA TTTATTGTGA TATTTTAAGG TTTTCCCCCC TTTATTTTCC 2340 

GTAGTTGTAT TTTAAAAGAT TCGGCTCTGT ATTATTTGAA TCAGTCTGCC GAGAATCCAT 2400 

GTATATATTT GAACTAATAT CATCCTTATA ACAGGTACAT TTTCAACTTA AGTTTTTACT 2460 

CCATTATGCA CAGTTTGAGA TAAATAAATT TTTGAAATAT GGACACTGAA AAAAAAAAAA 2520 

AAAAAAACAA AACAAAAAAA CAAAAAACAA AAACAGAAAA AACAAAAAAA AAAACAAAAC 2580 

CACAACACAA AAACAAAAAA AAAAAAAAGA AACAAACACA CAACACAACA CAACACAAAA 2640 
CCACAACACA AACAACAACA CACAGAGGG 

Seq ID NO: 4 Protein sequence: 
Protein Accession #: CAA83435.1 



1 11 21 31 41 51 

I I I I I I 

MYNMMETELK PPGPQQTSGG GGGNSTAAAA GGNQKNSPDR VKRPMNAFMV WSRGQRRKMA 60 
QENPKMHNSE ISKRLGAEWK LLSETEKRPF IDEAKRLRAL HMKEHPDYKY RPRRKTKTLM 120 
KKDKYTLPGG LLAPGGNSMA SGVGVGAGLG AGVNQRMDSY AHMNGWSNGS YSMMQDQLGY 180 
PQHPGLNAHG AAQMQPMHRY DVSALQYNSM TSSQTYMNGS PTYSMSYSQQ GTPGMALGSM 240 
GSVVKSEASS SPPVVTSSSH SRAPCQAGDL RDMISMYLPG AEVPEPAAPS RLHMSQHYQS 300 
GPVPGTAING TLPLSHM 



Seq ID NO: 5 DNA sequence 
Nucleic Acid Accession #: U91618 
Coding sequence: 29-541 



1 11 21 31 41 51 

I I I I I I 

CGGACTTGGC TTGTTAGAAG GCTGAAAGAT GATGGCAGGA ATGAAAATCC AGCTTGTATG 60 
CATGCTACTC CTGGCTTTCA GCTCCTGGAG TCTGTGCTCA GATTCAGAAG AGGAAATGAA 120 
AGCATTAGAA GCAGATTTCT TGACCAATAT GCATACATCA AAGATTAGTA AAGCACATGT 180 
TCCCTCTTGG AAGATGACTC TGCTAAATGT TTGCAGTCTT GTAAATAATT TGAACAGCCC 240 
AGCTGAGGAA ACAGGAGAAG TTCATGAAGA GGAGCTTGTT GCAAGAAGGA AACTTCCTAC 300 
TGCTTTAGAT GGCTTTAGCT TGGAAGCAAT GTTGACAATA TACCAGCTCC ACAAAATCTG 360 
TCACAGCAGG GCTTTTCAAC ACTGGGAGTT AATCCAGGAA GATATTCTTG ATACTGGAAA 420 
TGACAAAAAT GGAAAGGAAG AAGTCATAAA GAGAAAAATT CCTTATATTC TGAAACGGCA 480 
GCTGTATGAG AATAAACCCA GAAGACCCTA CATACTCAAA AGAGATTCTT ACTATTACTG 540 
AGAGAATAAA TCATTTATTT ACATGTGATT GTGATTCATC ATCCCTTAAT TAAATATCAA 600 
ATTATATTTG TGTGAAAATG TGACAAACAC ACTTATCTGT CTCTTCTACA ATTGTGGTTT 660 
ATTGAATGTG TTTTTCTGCA CTAATAGAAA TTAGACTAAG TGTTTTCAAA TAAATCTAAA 720 
TCTTCAAAAA AAAAAAAAAA AAATGGGGCC GCAATT 



Seq ID NO: 6 Protein sequence: 
Protein Accession #: AAB50564 

1 11 21 31 41 51 

I I I I I I 

MMAGMKIQLV CMLLLAFSSW SLCSDSEEEM KALEADFLTN MHTSKISKAH VPSWKMTLLN 60 
VCSLVNNLNS PAEETGEVHE EELVARRKLP TALDGFSLEA MLTIYQLHKI CHSRAFQHWE 120 
LIQEDILDTG NDKNGKEEVI KRKIPYILKR QLYENKPRRP YILKRDSYYY 
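
Each DNA entry in Table 16 lists a "Coding sequence" range, and the protein entry that follows it is the corresponding translation. As a rough consistency check, offered only as a sketch, the range 29-541 given for Seq ID NO: 5 can be tested for a whole number of codons, and the first ten codons can be translated and compared with the start of Seq ID NO: 6. The helper names `codon_count` and `translate` are hypothetical; the coordinates are assumed to be 1-based and inclusive, and the standard genetic code is assumed.

```python
# Standard genetic code in the conventional TCAG ordering (an assumption;
# the disclosure does not state which translation table was used).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def codon_count(start, end):
    """Number of codons in a 1-based inclusive coding range."""
    length = end - start + 1
    if length % 3 != 0:
        raise ValueError("coding range is not a multiple of three")
    return length // 3


def translate(dna):
    """Translate complete codons of an upper-case DNA string; '*' marks a stop."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First 60 nucleotides of Seq ID NO: 5 as listed in Table 16.
first_row = (
    "CGGACTTGGC" "TTGTTAGAAG" "GCTGAAAGAT"
    "GATGGCAGGA" "ATGAAAATCC" "AGCTTGTATG"
)
cds_start = 29
print(codon_count(29, 541))
print(translate(first_row[cds_start - 1:cds_start - 1 + 30]))
```

The range spans 171 codons, that is 170 amino acids plus a stop codon, which agrees with the 170 residues listed for Seq ID NO: 6, and the translated fragment reads MMAGMKIQLV.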



Seq ID NO: 7 DNA sequence 

Nucleic Acid Accession #: NM_006536.2 

Coding sequence: 109-2940 



1 11 21 31 41 51 

I I I I I I 

ACCTAAAACC TTGCAAGTTC AGGAAGAAAC CATCTGCATC CATATTGAAA ACCTGACACA 60 
ATGTATGCAG CAGGCTCAGT GTGAGTGAAC TGGAGGCTTC TCTACAACAT GACCCAAAGG 120 
AGCATTGCAG GTCCTATTTG CAACCTGAAG TTTGTGACTC TCCTGGTTGC CTTAAGTTCA 180 
GAACTCCCAT TCCTGGGAGC TGGAGTACAG CTTCAAGACA ATGGGTATAA TGGATTGCTC 240 
ATTGCAATTA ATCCTCAGGT ACCTGAGAAT CAGAACCTCA TCTCAAACAT TAAGGAAATG 300 
ATAACTGAAG CTTCATTTTA CCTATTTAAT GCTACCAAGA GAAGAGTATT TTTCAGAAAT 360 
ATAAAGATTT TAATACCTGC CACATGGAAA GCTAATAATA ACAGCAAAAT AAAACAAGAA 420 
TCATATGAAA AGGCAAATGT CATAGTGACT GACTGGTATG GGGCACATGG AGATGATCCA 480 
TACACCCTAC AATACAGAGG GTGTGGAAAA GAGGGAAAAT ACATTCATTT CACACCTAAT 540 
TTCCTACTGA ATGATAACTT AACAGCTGGC TACGGATCAC GAGGCCGAGT GTTTGTCCAT 600 
GAATGGGCCC ACCTCCGTTG GGGTGTGTTC GATGAGTATA ACAATGACAA ACCTTTCTAC 660 
ATAAATGGGC AAAATCAAAT TAAAGTGACA AGGTGTTCAT CTGACATCAC AGGCATTTTT 720 
GTGTGTGAAA AAGGTCCTTG CCCCCAAGAA AACTGTATTA TTAGTAAGCT TTTTAAAGAA 780 
GGATGCACCT TTATCTACAA TAGCACCCAA AATGCAACTG CATCAATAAT GTTCATGCAA 840 
AGTTTATCTT CTGTGGTTGA ATTTTGTAAT GCAAGTACCC ACAACCAAGA AGCACCAAAC 900 
CTACAGAACC AGATGTGCAG CCTCAGAAGT GCATGGGATG TAATCACAGA CTCTGCTGAC 960 
TTTCACCACA GCTTTCCCAT GAATGGGACT GAGCTTCCAC CTCCTCCCAC ATTCTCGCTT 1020 
GTACAGGCTG GTGACAAAGT GGTCTGTTTA GTGCTGGATG TGTCCAGCAA GATGGCAGAG 1080 
GCTGACAGAC TCCTTCAACT ACAACAAGCC GCAGAATTTT ATTTGATGCA GATTGTTGAA 1140 
ATTCATACCT TCGTGGGCAT TGCCAGTTTC GACAGCAAAG GAGAGATCAG AGCCCAGCTA 1200 
CACCAAATTA ACAGCAATGA TGATCGAAAG TTGCTGGTTT CATATCTGCC CACCACTGTA 1260 
TCAGCTAAAA CAGACATCAG CATTTGTTCA GGGCTTAAGA AAGGATTTGA GGTGGTTGAA 1320 
AAACTGAATG GAAAAGCTTA TGGCTCTGTG ATGATATTAG TGACCAGCGG AGATGATAAG 1380 
CTTCTTGGCA ATTGCTTACC CACTGTGCTC AGCAGTGGTT CAACAATTCA CTCCATTGCC 1440 
CTGGGTTCAT CTGCAGCCCC AAATCTGGAG GAATTATCAC GTCTTACAGG AGGTTTAAAG 1500 

TTCTTTGTTC CAGATATATC AAACTCCAAT AGCATGATTG ATGCTTTCAG TAGAATTTCC 1560 

TCTGGAACTG GAGACATTTT CCAGCAACAT ATTCAGCTTG AAAGTACAGG TGAAAATGTC 1620 

AAACCTCACC ATCAATTGAA AAACACAGTG ACTGTGGATA ATACTGTGGG CAACGACACT 1680 

ATGTTTCTAG TTACGTGGCA GGCCAGTGGT CCTCCTGAGA TTATATTATT TGATCCTGAT 1740 

GGACGAAAAT ACTACACAAA TAATTTTATC ACCAATCTAA CTTTTCGGAC AGCTAGTCTT 1800 

TGGATTCCAG GAACAGCTAA GCCTGGGCAC TGGACTTACA CCCTGAACAA TACCCATCAT 1860 

TCTCTGCAAG CCCTGAAAGT GACAGTGACC TCTCGCGCCT CCAACTCAGC TGTGCCCCCA 1920 

GCCACTGTGG AAGCCTTTGT GGAAAGAGAC AGCCTCCATT TTCCTCATCC TGTGATGATT 1980 

TATGCCAATG TGAAACAGGG ATTTTATCCC ATTCTTAATG CCACTGTCAC TGCCACAGTT 2040 

GAGCCAGAGA CTGGAGATCC TGTTACGCTG AGACTCCTTG ATGATGGAGC AGGTGCTGAT 2100 

GTTATAAAAA ATGATGGAAT TTACTCGAGG TATTTTTTCT CCTTTGCTGC AAATGGTAGA 2160 

TATAGCTTGA AAGTGCATGT CAATCACTCT CCCAGCATAA GCACCCCAAC CCACTCTATT 2220 

CCAGGGAGTC ATGCTATGTA TGTACCAGGT TACACAGCAA ACGGTAATAT TCAGATGAAT 2280 

GCTCCAAGGA AATCAGTAGG CAGAAATGAG GAGGAGCGAA AGTGGGGCTT TAGCCGAGTC 2340 

AGCTCAGGAG GCTCCTTTTC AGTGCTGGGA GTTCCAGCTG GCCCCCACCC TGATGTGTTT 2400 

CCACCATGCA AAATTATTGA CCTGGAAGCT GTAAAAGTAG AAGAGGAATT GACCCTATCT 2460 

TGGACAGCAC CTGGAGAAGA CTTTGATCAG GGCCAGGCTA CAAGCTATGA AATAAGAATG 2520 

AGTAAAAGTC TACAGAATAT CCAAGATGAC TTTAACAATG CTATTTTAGT AAATACATCA 2580 

AAGCGAAATC CTCAGCAAGC TGGCATCAGG GAGATATTTA CGTTCTCACC CCAGATTTCC 2640 

ACGAATGGAC CTGAACATCA GCCAAATGGA GAAACACATG AAAGCCACAG AATTTATGTT 2700 

GCAATACGAG CAATGGATAG GAACTCCTTA CAGTCTGCTG TATCTAACAT TGCCCAGGCG 2760 

CCTCTGTTTA TTCCCCCCAA TTCTGATCCT GTACCTGCCA GAGATTATCT TATATTGAAA 2820 

GGAGTTTTAA CAGCAATGGG TTTGATAGGA ATCATTTGCC TTATTATAGT TGTGACACAT 2880 

CATACTTTAA GCAGGAAAAA GAGAGCAGAC AAGAAAGAGA ATGGAACAAA ATTATTATAA 2940 

ATAAATATCC AAAGTGTCTT CCTTCTTAGA TATAAGACCC ATGGCCTTCG ACTACAAAAA 3000 

CATACTAACA AAGTCAAATT AACATCAAAA CTGTATTAAA ATGCATTGAG TTTTTGTACA 3060 

ATACAGATAA GATTTTTACA TGGTAGATCA ACAATTCTTT TTGGGGGTAG ATTAGAAAAC 3120 

CCTTACACTT TGGCTATGAA CAAATAATAA AAATTATTCT TTAAAGTAAT GTCTTTAAAG 3180 

GCAAAGGGAA GGGTAAAGTC GGACCAGTGT CAAGGAAAGT TTGTTTTATT GAGGTGGAAA 3240 

AATAGCCCCA AGCAGAGAAA AGGAGGGTAG GTCTGCATTA TAACTGTCTG TGTGAAGCAA 3300 

TCATTTAGTT ACTTTGATTA ATTTTTCTTT TCTCCTTATC TGTGCAGTAC AGGTTGCTTG 3360 

TTTACATGAA GATCATGCTA TATTTTATAT ATGTAGCCCC TAATGCAAAG CTCTTTACCT 3420 

CTTGCTATTT TGTTATATAT ATTTCAGATG ACATCTCCCT GCTAATGCTC AGAGATCTTT 3480 

TTTCACTGTA AGAGGTAACC TTTAACAATA TGGGTATTAC CTTTGTCTCT TCATACCGGT 3540 

TTTATGACAA AGGTCTATTG AATTTATTTG TNTGTAAGTT TCTACTCCCA TCAAAGCAGC 3600 

TTTCTAAGTT TATTGCCTTG GGTTATTATG GAATGATAGT TATAGCCCCN TATAATGCCT 3660 
TACCTAGGAA A 

Seq ID NO: 8 Protein sequence: 
Protein Accession #: NP_006527.1 

1 11 21 31 41 51 

I I I I I I 

MTQRSIAGPI CNLKFVTLLV ALSSELPFLG AGVQLQDNGY NGLLIAINPQ VPENQNLISN 60 
IKEMITEASF YLFNATKRRV FFRNIKILIP ATWKANNNSK IKQESYEKAN VIVTDWYGAH 120 
GDDPYTLQYR GCGKEGKYIH FTPNFLLNDN LTAGYGSRGR VFVHEWAHLR WGVFDEYNND 180 
KPFYINGQNQ IKVTRCSSDI TGIFVCEKGP CPQENCIISK LFKEGCTFIY NSTQNATASI 240 
MFMQSLSSVV EFCNASTHNQ EAPNLQNQMC SLRSAWDVIT DSADFHHSFP MNGTELPPPP 300 
TFSLVQAGDK VVCLVLDVSS KMAEADRLLQ LQQAAEFYLM QIVEIHTFVG IASFDSKGEI 360 
RAQLHQINSN DDRKLLVSYL PTTVSAKTDI SICSGLKKGF EVVEKLNGKA YGSVMILVTS 420 
GDDKLLGNCL PTVLSSGSTI HSIALGSSAA PNLEELSRLT GGLKFFVPDI SNSNSMIDAF 480 
SRISSGTGDI FQQHIQLEST GENVKPHHQL KNTVTVDNTV GNDTMFLVTW QASGPPEIIL 540 
FDPDGRKYYT NNFITNLTFR TASLWIPGTA KPGHWTYTLN NTHHSLQALK VTVTSRASNS 600 
AVPPATVEAF VERDSLHFPH PVMIYANVKQ GFYPILNATV TATVEPETGD PVTLRLLDDG 660 
AGADVIKNDG IYSRYFFSFA ANGRYSLKVH VNHSPSISTP THSIPGSHAM YVPGYTANGN 720 
IQMNAPRKSV GRNEEERKWG FSRVSSGGSF SVLGVPAGPH PDVFPPCKII DLEAVKVEEE 780 
LTLSWTAPGE DFDQGQATSY EIRMSKSLQN IQDDFNNAIL VNTSKRNPQQ AGIREIFTFS 840 
PQISTNGPEH QPNGETHESH RIYVAIRAMD RNSLQSAVSN IAQAPLFIPP NSDPVPARDY 900 
LILKGVLTAM GLIGIICLII VVTHHTLSRK KRADKKENGT KLL 



Seq ID NO: 9 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 336-632 



1 11 21 31 41 51 

I I I I I I 

CTCCCCTCAC CCCGGTCCAG GATGCCCAGT CCCCACGACA CCTCCCACTT CCCACTGTGG 60 
CCTGGGTGGG CTCAGGGGCT GCCCTTGACC TGGCCTAGAG CCCTCCCCCA GCTGGTGGTG 120 
GAGCTGGCAC TCTCTGGGAG GGAGGGGGCT GGGAGGGAAT GAGTGGGAAT GGCAAGAGGC 180 
CAGGGTTTGG TGGGATCAGG TTGAGGCAGG TTTGGTTTCC TTAAAATGCC AAGTTGGGGG 240 
CCAGTGGGGC CCACATATAA ATCCTCACCC TGGGAGCCTG GCTGCCTTGC TCTCCTTCCT 300 
GGGTCTGTCT CTGCCACCTG GTCTGCCACA GATCCATGAT GTGCAGTTCT CTGGAGCAGG 360 
CGCTGGCTGT GCTGGTCACT ACCTTCCACA AGTACTCCTG CCAAGAGGGC GACAAGTTCA 420 
AGCTGAGTAA GGGGGAAATG AAGGAACTTC TGCACAAGGA GCTGCCCAGC TTTGTGGGGG 480 
AGAAAGTGGA TGAGGAGGGG CTGAAGAAGC TGATGGGCAG CCTGGATGAG AACAGTGACC 540 
AGCAGGTGGA CTTCCAGGAG TATGCTGTTT TCCTGGCACT CATCACTGTC ATGTGCAATG 600 
ACTTCTTCCA GGGCTGCCCA GACCGACCCT GAAGCAGAAC TCTTGACTTC CTGCCATGGA 660 
TCTCTTGGGC CCAGGACTGT TGATGCCTTT GAGTTTTGTA TTCAATAAAC TTTTTTTGTC 720 
TGTTGATAAT ATTTTAATTG CTCAGTGATG TTCCATAACC CGGCTGGCTC AGCTGGAGTG 780 
CTGGGAGATG AGGGCCTCCT GGATCCTGCT CCCTTCTGGG CTCTGACTCT CCTGGAAATC 840 
TCTCCAAGGC CAGAGCTATG CTTTAGGTCT CAATTTTGGA ATTTCAAACA CCAGCAAAAA 900 
ATTGGAAATC GAGATAGGTT GCTGACTTTT ATTTTGTCAA ATAAAGATAT TAAAAAAGGC 960 
AAATACCA 



Seq ID NO: 10 Protein sequence:
Protein Accession #: NP_005969.1

1 11 21 31 41 51

I I I I I I

MMCSSLEQAL AVLVTTFHKY SCQEGDKFKL SKGEMKELLH KELPSFVGEK VDEEGLKKLM 60 

GSLDENSDQQ VDFQEYAVFL ALITVMCNDF FQGCPDRP 



Seq ID NO: 11 DNA sequence 
Nucleic Acid Accession #: Eos sequence
Coding sequence: 336-626

1 11 21 31 41 51

I I I I I I

CTCCCCTCAC CCCGGTCCAG GATGCCCAGT CCCCACGACA CCTCCCACTT CCCACTGTGG 60

CCTGGGTGGG CTCAGGGGCT GCCCTTGACC TGGCCTAGAG CCCTCCCCCA GCTGGTGGTG 120 

GAGCTGGCAC TCTCTGGGAG GGAGGGGGCT GGGAGGGAAT GAGTGGGAAT GGCAAGAGGC 180 

CAGGGTTTGG TGGGATCAGG TTGAGGCAGG TTTGGTTTCC TTAAAATGCC AAGTTGGGGG 240 

CCAGTGGGGC CCACATATAA ATCCTCACCC TGGGAGCCTG GCTGCCTTGC TCTCCTTCCT 300 

GGGTCTGTCT CTGCCACCTG GTCTGCCACA GATCCATGAT GTGCAGTTCT CTGGAGCAGG 360

CGCTGGCTGT GCTGGTCACT ACCTTCCACA AGTACTCCTG CCAAGAGGGC GACAAGTTCA 420 

AGCTGAGTAA GGGGGAAATG AAGGAACTTC TGCACAAGGA GCTGCCCAGC TTTGTGGGGC 480

ATTCCAGAGA ACCATGTGCT GTGAGGGCCT TCCGAGTCCA TCTGTTTAAT CCTGTCATTG 540 

GAGACTTGAG AAACCAGAGC CCAGAAGGGA AAAGTGATTG TCCCAAGATC ACACAGCACT 600 

GGAGAAAGTG GATGAGGAGG GGCTGAAGAA GCTGATGGGC AGCCTGGATG AGAACAGTGA 660

CCAGCAGGTG GACTTCCAGG AGTATGCTGT TTTCCTGGCA CTCATCACTG TCATGTGCAA 720 

TGACTTCTTC CAGGGCTGCC CAGACCGACC CTGAAGCAGA ACTCTTGACT TCCTGCCATG 780 

GATCTCTTGG GCCCAGGACT GTTGATGCCT TTGAGTTTTG TATTCAATAA ACTTTTTTTG 840 

TCTGTTGATA ATATTTTAAT TGCTCAGTGA TGTTCCATAA CCCGGCTGGC TCAGCTGGAG 900 

TGCTGGGAGA TGAGGGCCTC CTGGATCCTG CTCCCTTCTG GGCTCTGACT CTCCTGGAAA 960

TCTCTCCAAG GCCAGAGCTA TGCTTTAGGT CTCAATTTTG GAATTTCAAA CACCAGCAAA 1020 

AAATTGGAAA TCGAGATAGG TTGCTGACTT TTATTTTGTC AAATAAAGAT ATTAAAAAAG 1080 
GCAAATACCA 

Seq ID NO: 12 Protein sequence:

Protein Accession #: Eos sequence

1 11 21 31 41 51

I I I I I I

MMCSSLEQAL AVLVTTFHKY SCQEGDKFKL SKGEMKELLH KELPSFVGHS REPCAVRAFR 60
VHLFNPVIGD LRNQSPEGKS DCPKITQHWR KWMRRG 



Seq ID NO: 13 DNA sequence 
Nucleic Acid Accession #: Eos sequence
Coding sequence: 58-354

1 11 21 31 41 51

I I I I I I

gtgagctcac catgtggggg tgaggctgag agaaaacaag TACACAGCCA CAGATCCATG 60

ATGTGCAGTT CTCTGGAGCA GGCGCTGGCT GTGCTGGTCA CTACCTTCCA CAAGTACTCC 120 

TGCCAAGAGG GCGACAAGTT CAAGCTGAGT AAGGGGGAAA TGAAGGAACT TCTGCACAAG 180 

GAGCTGCCCA GCTTTGTGGG GGAGAAAGTG GATGAGGAGG GGCTGAAGAA GCTGATGGGC 240 

AGCCTGGATG AGAACAGTGA CCAGCAGGTG GACTTCCAGG AGTATGCTGT TTTCCTGGCA 300 

CTCATCACTG TCATGTGCAA TGACTTCTTC CAGGGCTGCC CAGACCGACC CTGAAGCAGA 360

ACTCTTGACT TCCTGCCATG GATCTCTTGG GCCCAGGACT GTTGATGCCT TTGAGTTTTG 420 

TATTCAATAA ACTTTTTTTG TCTGTTGATA ATATTTTAAT TGCTCAGTGA TGTTCCATAA 480 

CCCGGCTGGC TCAGCTGGAG TGCTGGGAGA TGAGGGCCTC CTGGATCCTG CTCCCTTCTG 540 

GGCTCTGACT CTCCTGGAAA TCTCTCCAAG GCCAGAGCTA TGCTTTAGGT CTCAATTTTG 600 

GAATTTCAAA CACCAGCAAA AAATTGGAAA TCGAGATAGG TTGCTGACTT TTATTTTGTC 660
AAATAAAGAT ATTAAAAAAG GCAAATACCA 



Seq ID NO: 14 Protein sequence: 
Protein Accession #: NP_005969.1

1 11 21 31 41 51

I I I I I I

MMCSSLEQAL AVLVTTFHKY SCQEGDKFKL SKGEMKELLH KELPSFVGEK VDEEGLKKLM 60 
GSLDENSDQQ VDFQEYAVFL ALITVMCNDF FQGCPDRP 

Seq ID NO: 15 DNA sequence 

Nucleic Acid Accession #: Eos sequence

Coding sequence: 62-358

1 11 21 31 41 51

I I I I I I 

GGAGGGTGTG CCGCTGAGTC ACTGCCTGGG CATCTGGGCC TGGAACCTCG GCCACAGATC 60 

CATGATGTGC AGTTCTCTGG AGCAGGCGCT GGCTGTGCTG GTCACTACCT TCCACAAGTA 120 

CTCCTGCCAA GAGGGCGACA AGTTCAAGCT GAGTAAGGGG GAAATGAAGG AACTTCTGCA 180

CAAGGAGCTG CCCAGCTTTG TGGGGGAGAA AGTGGATGAG GAGGGGCTGA AGAAGCTGAT 240

GGGCAGCCTG GATGAGAACA GTGACCAGCA GGTGGACTTC CAGGAGTATG CTGTTTTCCT 300 

GGCACTCATC ACTGTCATGT GCAATGACTT CTTCCAGGGC TGCCCAGACC GACCCTGAAG 360 

CAGAACTCTT GACTTCCTGC CATGGATCTC TTGGGCCCAG GACTGTTGAT GCCTTTGAGT 420 

TTTGTATTCA ATAAACTTTT TTTGTCTGTT GATAATATTT TAATTGCTCA GTGATGTTCC 480

ATAACCCGGC TGGCTCAGCT GGAGTGCTGG GAGATGAGGG CCTCCTGGAT CCTGCTCCCT 540

TCTGGGCTCT GACTCTCCTG GAAATCTCTC CAAGGCCAGA GCTATGCTTT AGGTCTCAAT 600 

TTTGGAATTT CAAACACCAG CAAAAAATTG GAAATCGAGA TAGGTTGCTG ACTTTTATTT 660 




TGTCAAATAA AGATATTAAA AAAGGCAAAT ACCA 



Seq ID NO: 16 Protein sequence: 
Protein Accession #: NP_005969.1

1 11 21 31 41 51

I I I I I I

MMCSSLEQAL AVLVTTFHKY SCQEGDKFKL SKGEMKELLH KELPSFVGEK VDEEGLKKLM 60
GSLDENSDQQ VDFQEYAVFL ALITVMCNDF FQGCPDRP



Seq ID NO: 17 DNA sequence 

Nucleic Acid Accession #: Eos sequence

Coding sequence: 939-2372

1 11 21 31 41 51

I I I I I I

AAGACGGATT CTCAGACAAG GCTTGCAAAT GCCCCGCAGC CATCATTTAA CTGCACCCGC 60 

AGAATAGTTA CGGTTTGTCA CCCGACCCTC CCGGATCGCC TAATTTGTCC CTAGTGAGAC 120 

CCCGAGGCTC TGCCCGCGCC TGGCTTCTTC GTAGCTGGAT GCATATCGTG CTCCGGGCAG 180 

CGCGGGCGCA GGGCACGCGT TCGCGCACAC CCTAGCACAC ATGAACACGC GCAAGAGCTG 240 

AACCAAGCAC GGTTTCCATT TCAAAAAGGG AGACAGCCTC TACCGCGATT GTAGAAGAGA 300 

CTGTGGTGTG AATTAGGGAC CGGGAGGCGT CGAACGGAGG AACGGTTCAT CTTAGAGACT 360 

AATTTTCTGG AGTTTCTGCC CCTGCTCTGC GTCAGCCCTC ACGTCACTTC GCCAGCAGTA 420 

GCAGAGGCGG CGGCGGCGGC TCCCGGAATT GGGTTGGAGC AGGAGCCTCG CTGGCTGCTT 480 

CGCTCGCGCT CTACGCGCTC AGTCCCCGGC GGTAGCAGGA GCCTGGACCC AGGCGCCGCC 540 

GGCGGGCGTG AGGCGCCGGA GCCCGGCCTC GAGGTGCATA CCGGACCCCC ATTCGCATCT 600 

AACAAGGAAT CTGCGCCCCA GAGAGTCCCG GGAGCGCCGC CGGTCGGTGC CCGGCGCGCC 660 

GGGCCATGCA GCGACGGCCG CCGCGGAGCT CCGAGCAGCG GTAGCGCCCC CCTGTAAAGC 720 

GGTTCGCTAT GCCGGGGCCA CTGTGAACCC TGCCGCCTGC CGGAACACTC TTCGCTCCGG 780 

ACCAGCTCAG CCTCTGATAA GCTGGACTCG GCACGCCCGC AACAAGCACC GAGGAGTTAA 840 

GAGAGCCGCA AGCGCAGGGA AGGCCTCCCC GCACGGGTGG GGGAAAGCGG CCGGTGCAGC 900 

GCGGGGACAG GCACTCGGGC TGGCACTGGC TGCTAGGGAT GTCGTCCTGG ATAAGGTGGC 960 

ATGGACCCGC CATGGCGCGG CTCTGGGGCT TCTGCTGGCT GGTTGTGGGC TTCTGGAGGG 1020 

CCGCTTTCGC CTGTCCCACG TCCTGCAAAT GCAGTGCCTC TCGGATCTGG TGCAGCGACC 1080 

CTTCTCCTGG CATCGTGGCA TTTCCGAGAT TGGAGCCTAA CAGTGTAGAT CCTGAGAACA 1140 

TCACCGAAAT TTTCATCGCA AACCAGAAAA GGTTAGAAAT CATCAACGAA GATGATGTTG 1200 

AAGCTTATGT GGGACTGAGA AATCTGACAA TTGTGGATTC TGGATTAAAA TTTGTGGCTC 1260 

ATAAAGCATT TCTGAAAAAC AGCAACCTGC AGCACATCAA TTTTACCCGA AACAAACTGA 1320 

CGAGTTTGTC TAGGAAACAT TTCCGTCACC TTGACTTGTC TGAACTGATC CTGGTGGGCA 1380 

ATCCATTTAC ATGCTCCTGT GACATTATGT GGATCAAGAC TCTCCAAGAG GCTAAATCCA 1440 

GTCCAGACAC TCAGGATTTG TACTGCCTGA ATGAAAGCAG CAAGAATATT CCCCTGGCAA 1500 

ACCTGCAGAT ACCCAATTGT GGTTTGCCAT CTGCAAATCT GGCCGCACCT AACCTCACTG 1560 

TGGAGGAAGG AAAGTCTATC ACATTATCCT GTAGTGTGGC AGGTGATCCG GTTCCTAATA 1620 

TGTATTGGGA TGTTGGTAAC CTGGTTTCCA AACATATGAA TGAAACAAGC CACACACAGG 1680 

GCTCCTTAAG GATAACTAAC ATTTCATCCG ATGACAGTGG GAAGCAGATC TCTTGTGTGG 1740 

CGGAAAATCT TGTAGGAGAA GATCAAGATT CTGTCAACCT CACTGTGCAT TTTGCACCAA 1800 

CTATCACATT TCTCGAATCT CCAACCTCAG ACCACCACTG GTGCATTCCA TTCACTGTGA 1860 

AAGGCAACCC CAAACCAGCG CTTCAGTGGT TCTATAACGG GGCAATATTG AATGAGTCCA 1920 

AATACATCTG TACTAAAATA CATGTTACCA ATCACACGGA GTACCACGGC TGCCTCCAGC 1980 

TGGATAATCC CACTCACATG AACAATGGGG ACTACACTCT AATAGCCAAG AATGAGTATG 2040 

GGAAGGATGA GAAACAGATT TCTGCTCACT TCATGGGCTG GCCTGGAATT GACGATGGTG 2100 

CAAACCCAAA TTATCCTGAT GTAATTTATG AAGATTATGG AACTGCAGCG AATGACATCG 2160 

GGGACACCAC GAACAGAAGT AATGAAATCC CTTCCACAGA CGTCACTGAT AAAACCGGTC 2220 

GGGAACATCT CTCGGTCTAT GCTGTGGTGG TGATTGCGTC TGTGGTGGGA TTTTGCCTTT 2280 

TGGTAATGCT GTTTCTGCTT AAGTTGGCAA GACACTCCAA GTTTGGCATG AAAGGTTTTG 2340 

TTTTGTTTCA TAAGATCCCA CTGGATGGGT AGCTGAAATA AAGGAAAAGA CAGAGAAAGG 2400 

GGCTGTGGTG CTTGTTGGTT GATGCTGCCA TGTAAGCTGG ACTCCTGGGA CTGCTGTTGG 2460 

CTTATCCCGG GAAGTGCTGC TTATCTGGGG TTTTCTGGTA GATGTGGGCG GTGTTTGGAG 2520 

GCTGTACTAT ATGAAGCCTG CATATACTGT GAGCTGTGAT TGGGGAACAC CAATGCAGAG 2580 

GTAACTCTCA GGCAGCTAAG CAGCACCTCA AGAAAACATG TTAAATTAAT GCTTCTCTTC 2640 

TTACAGTAGT TCAAATACAA AACTGAAATG AAATCCCATT GGATTGTACT TCTCTTCTGA 2700 

AAAGTGTGCT TTTTGACCCT ACTGGACATT TATTGACTTA ATTGCTTCTG TTTATTAAAA 2760 

TTGACCTGCA AAGTTAAAAA AAAATTAAAG TTGAGAACAG GTATAAGTGC ACACTGAATA 2820 

GTCTAATCTA CATGTAACAC ATATTTTAGT GTGATTTTCT ATACTCTAAT CAGCACTGAA 2880 

TTCAGAGGGT TTGACTTTTT CATCTATAAC ACAGTGACTA AAAGAGTTAA GGGTATATAT 2940 

ACCATCACTT TGGGACTTGG TAGTATTATT AAAAGGTTAT TTCCTTCACT GTCAATAAAA 3000 

GTCCAAATGT TTAGCTTAGG TCTGAGAGTC AAACAATGTT AAGGATTGTC TTAAAGTTCC 3060 

TTAGCCAGCA AAACAAAACA AAACAAAACA AACAAATGAA AAACGTTTAA AAAGAAGAAG 3120 

AAGAAAAAAA ACAAGAACAA GCAGCAACAG CTGTTTTGTT GGGGCTATAG ATTTAAGTTA 3180

GGCATAGTCA ATTTCAGAAT AACTAAGAGT GGAATATATG CATATGGTGA AATTATAACC 3240 

TTGCCCTTTT TTATTTGCCC TCTGCGATCC ACCTGCTTTT TAGAAGTCTG CCGAGTGAGA 3300

AGGCCACAGT ATCTCATGCT GTTTGCATTA CAGAACTGCA GCTTTTCTAC TCTGAAAAGG 3360 

CCTGGGAGCA GAATGGCTGG CCTGCTGTGA GCAGGAGAGG AGATTCTAAG AAGGATAGTC 3420 

CCCCCTACAA CATACTGTCA TACTGCTGGG TTTTCATGGG TAGGAAAGCT TGTCCTGACC 3480 

CCAGCAGCAA AGAGGTGGCA GGTCGCTAAT GAATATATGC TTTATAATGT CCTTCTTCAT 3540 

TGCTGAGAGG GCAGCCTTAG AGCTGTGGAT TTCTGCATCC CCCCTGAGTC TGACCCATGG 3600 

ACACCTGTTT CATTCACTTT AGCATCACAG TGACCTTTGT ATGCTCTGTT CAGTCTGTGT 3660 

CAGGCAGTAT GCTTGTCCTG AAGAGAGGTT TGGCTATCCC CACCCCACCC CACCCCACCC 3720 

TGTTCCTTTT TTATCAGGAG GACTTCAGAG CCAGGCCTGC AGCATTTTGT TTGAAAACAC 3780 

AATCAGCTCT GACAGTTAGA CATGCACACA GACGCCATAG CTGGATTGGA AACATTGATG 3840 

TTTTAAAAAT TTATTTTTTT TGGAAATAGT TGCACAAATG CTGCAATTTA GCTTTAAGGT 3900 

TCTATAGATT TTTAACTAGT CCAACACAGT CAGAAACATT GTTTTGAATC CTCTGTAAAC 3960 

CAAGGCATTA ATCTTAATAA ACCAGGATCC ATTTAGGTAC CACTTGATAT AAAAAGGATA 4020 

TCCATAATGA ATATTTTATA CTGCATCCTT TACATTAGCC ACTAAATACG TTATTGCTTG 4080 

ATGAAGACCT TTCACAGAAT CCTATGGATT GCAGCATTTC ACTTGGCTAC TTCATACCCA 4140
TGCCTTAAAG AGGGGCAGTT TCTCAAAAGC AGAAACATGC CGCCAGTTCT CAAGTTTTCC 4200

TCCTAACTCC ATTTGAATGT AAGGGCAGCT GGCCCCCAAT GTGGGGAGGT CCGAACATTT 4260 

TCTGAATTCC CATTTTCTTG TTCGCGGCTA AATGACAGTT TCTGTCATTA CTTAGATTCC 4320 

GATCTTTCCC AAAGGTGTTG ATTTACAAAG AGGCCAGCTA ATAGCAGAAA TCATGACCCT 4380 

GAAAGAGAGA TGAAATTCAA GCTGTGAGCC AGGCAGGAGC TCAGTATGGC AAAGGTTCTT 4440 

GAGAATCAGC CATTTGGTAC AAAAAAGATT TTTAAAGCTT TTATGTTATA CCATGGAGCC 4500 

ATAGAAAGGC TATGGATTGT TTAAGAACTA TTTTAAAGTG TTCCAGACCC AAAAAGGAAA 4560 

AATAAAAAAA AAGGAATATT TGTACCCAAC AGCTAGAAGG ATTGCAAGGT AGATTTTTGT 4620 

TTTAAAATGG AGAGAAGTGG ACAGATAAGG CCATTTAATA TATCAAAGAT CAGTTGACAT 4680 
CTCCTAGGGA ATGATGAAAA CAGCAGGCTA T 

Seq ID NO: 18 Protein sequence: 
Protein Accession #: CAA5357X

1 11 21 31 41 51 

I I I I I I 

MSSWIRWHGP AMARLWGFCW LVVGFWRAAF ACPTSCKCSA SRIWCSDPSP GIVAFPRLEP 60

NSVDPENITE IFIANQKRLE IINEDDVEAY VGLRNLTIVD SGLKFVAHKA FLKNSNLQHI 120

NFTRNKLTSL SRKHFRHLDL SELILVGNPF TCSCDIMWIK TLQEAKSSPD TQDLYCLNES 180 

SKNIPLANLQ IPNCGLPSAN LAAPNLTVEE GKSITLSCSV AGDPVPNMYW DVGNLVSKHM 240 

NETSHTQGSL RITNISSDDS GKQISCVAEN LVGEDQDSVN LTVHFAPTIT FLESPTSDHH 300 

WCIPFTVKGN PKPALQWFYN GAILNESKYI CTKIHVTNHT EYHGCLQLDN PTHMNNGDYT 360 

LIAKNEYGKD EKQISAHFMG WPGIDDGANP NYPDVIYEDY GTAANDIGDT TNRSNEIPST 420 
DVTDKTGREH LSVYAVVVIA SVVGFCLLVM LFLLKLARHS KFGMKGFVLF HKIPLDG



Seq ID NO: 19 DNA sequence 

Nucleic Acid Accession #: NM_000228 

Coding sequence: 82-3600 

1 11 21 31 41 51 

I I I I I I 

GCTTTCAGGC GATCTGGAGA AAGAACGGCA GAACACACAG CAAGGAAAGG TCCTTTCTGG 60

GGATCACCCC ATTGGCTGAA GATGAGACCA TTCTTCCTCT TGTGTTTTGC CCTGCCTGGC 120 

CTCCTGCATG CCCAACAAGC CTGCTCCCGT GGGGCCTGCT ATCCACCTGT TGGGGACCTG 180 

CTTGTTGGGA GGACCCGGTT TCTCCGAGCT TCATCTACCT GTGGACTGAC CAAGCCTGAG 240 

ACCTACTGCA CCCAGTATGG CGAGTGGCAG ATGAAATGCT GCAAGTGTGA CTCCAGGCAG 300 

CCTCACAACT ACTACAGTCA CCGAGTAGAG AATGTGGCTT CATCCTCCGG CCCCATGCGC 360 

TGGTGGCAGT CCCAGAATGA TGTGAACCCT GTCTCTCTGC AGCTGGACCT GGACAGGAGA 420 

TTCCAGCTTC AAGAAGTCAT GATGGAGTTC CAGGGGCCCA TGCCCGCCGG CATGCTGATT 480 

GAGCGCTCCT CAGACTTCGG TAAGACCTGG CGAGTGTACC AGTACCTGGC TGCCGACTGC 540 

ACCTCCACCT TCCCTCGGGT CCGCCAGGGT CGGCCTCAGA GCTGGCAGGA TGTTCGGTGC 600 

CAGTCCCTGC CTCAGAGGCC TAATGCACGC CTAAATGGGG GGAAGGTCCA ACTTAACCTT 660 

ATGGATTTAG TGTCTGGGAT TCCAGCAACT CAAAGTCAAA AAATTCAAGA GGTGGGGGAG 720 

ATCACAAACT TGAGAGTCAA TTTCACCAGG CTGGCCCCTG TGCCCCAAAG GGGCTACCAC 780 

CCTCCCAGCG CCTACTATGC TGTGTCCCAG CTCCGTCTGC AGGGGAGCTG CTTCTGTCAC 840 

GGCCATGCTG ATCGCTGCGC ACCCAAGCCT GGGGCCTCTG CAGGCCCCTC CACCGCTGTG 900 

CAGGTCCACG ATGTCTGTGT CTGCCAGCAC AACACTGCCG GCCCAAATTG TGAGCGCTGT 960 

GCACCCTTCT ACAACAACCG GCCCTGGAGA CCGGCGGAGG GCCAGGACGC CCATGAATGC 1020 

CAAAGGTGCG ACTGCAATGG GCACTCAGAG ACATGTCACT TTGACCCCGC TGTGTTTGCC 1080 

GCCAGCCAGG GGGCATATGG AGGTGTGTGT GACAATTGCC GGGACCACAC CGAAGGCAAG 1140 

AACTGTGAGC GGTGTCAGCT GCACTATTTC CGGAACCGGC GCCCGGGAGC TTCCATTCAG 1200 

GAGACCTGCA TCTCCTGCGA GTGTGATCCG GATGGGGCAG TGCCAGGGGC TCCCTGTGAC 1260 

CCAGTGACCG GGCAGTGTGT GTGCAAGGAG CATGTGCAGG GAGAGCGCTG TGACCTATGC 1320 

AAGCCGGGCT TCACTGGACT CACCTACGCC AACCCGCAGG GCTGCCACCG CTGTGACTGC 1380 

AACATCCTGG GGTCCCGGAG GGACATGCCG TGTGACGAGG AGAGTGGGCG CTGCCTTTGT 1440 

CTGCCCAACG TGGTGGGTCC CAAATGTGAC CAGTGTGCTC CCTACCACTG GAAGCTGGCC 1500 

AGTGGCCAGG GCTGTGAACC GTGTGCCTGC GACCCGCACA ACTCCCCTCA GCCCACAGTG 1560 

CAACCAGTTC ACAGGGCAGT GCCCTGTCGG GAAGGCTTTG GTGGCCTGAT GTGCAGCGCT 1620 

GCAGCCATCC GCCAGTGTCC AGACCGGACC TATGGAGACG TGGCCACAGG ATGCCGAGCC 1680 

TGTGACTGTG ATTTCCGGGG AACAGAGGGC CCGGGCTGCG ACAAGGCATC AGGCCGCTGC 1740 

CTCTGCCGCC CTGGCTTGAC CGGGCCCCGC TGTGACCAGT GCCAGCGAGG CTACTGCAAT 1800 

CGCTACCCGG TGTGCGTGGC CTGCCACCCT TGCTTCCAGA CCTATGATGC GGACCTCCGG 1860 

GAGCAGGCCC TGCGCTTTGG TAGACTCCGC AATGCCACCG CCAGCCTGTG GTCAGGGCCT 1920 

GGGCTGGAGG ACCGTGGCCT GGCCTCCCGG ATCCTAGATG CAAAGAGTAA GATTGAGCAG 1980 

ATCCGAGCAG TTCTCAGCAG CCCCGCAGTC ACAGAGCAGG AGGTGGCTCA GGTGGCCAGT 2040 

GCCATCCTCT CCCTCAGGCG AACTCTCCAG GGCCTGCAGC TGGATCTGCC CCTGGAGGAG 2100 

GAGACGTTGT CCCTTCCGAG AGACCTGGAG AGTCTTGACA GAAGCTTCAA TGGTCTCCTT 2160 

ACTATGTATC AGAGGAAGAG GGAGCAGTTT GAAAAAATAA GCAGTGCTGA TCCTTCAGGA 2220 

GCCTTCCGGA TGCTGAGCAC AGCCTACGAG CAGTCAGCCC AGGCTGCTCA GCAGGTCTCC 2280 

GACAGCTCGC GCCTTTTGGA CCAGCTCAGG GACAGCCGGA GAGAGGCAGA GAGGCTGGTG 2340 

CGGCAGGCGG GAGGAGGAGG AGGCACCGGC AGCCCCAAGC TTGTGGCCCT GAGGCTGGAG 2400 

ATGTCTTCGT TGCCTGACCT GACACCCACC TTCAACAAGC TCTGTGGCAA CTCCAGGCAG 2460 

ATGGCTTGCA CCCCAATATC ATGCCCTGGT GAGCTATGTC CCCAAGACAA TGGCACAGCC 2520 

TGTGGCTCCC GCTGCAGGGG TGTCCTTCCC AGGGCCGGTG GGGCCTTCTT GATGGCGGGG 2580 

CAGGTGGCTG AGCAGCTGCG GGGCTTCAAT GCCCAGCTCC AGCGGACCAG GCAGATGATT 2640 

AGGGCAGCCG AGGAATCTGC CTCACAGATT CAATCCAGTG CCCAGCGCTT GGAGACCCAG 2700 

GTGAGCGCCA GCCGCTCCCA GATGGAGGAA GATGTCAGAC GCACACGGCT CCTAATCCAG 2760 

CAGGTCCGGG ACTTCCTAAC AGACCCCGAC ACTGATGCAG CCACTATCCA GGAGGTCAGC 2820 

GAGGCCGTGC TGGCCCTGTG GCTGCCCACA GACTCAGCTA CTGTTCTGCA GAAGATGAAT 2880

GAGATCCAGG CCATTGCAGC CAGGCTCCCC AACGTGGACT TGGTGCTGTC CCAGACCAAG 2940 

CAGGACATTG CGCGTGCCCG CCGGTTGCAG GCTGAGGCTG AGGAAGCCAG GAGCCGAGCC 3000 

CATGCAGTGG AGGGCCAGGT GGAAGATGTG GTTGGGAACC TGCGGCAGGG GACAGTGGCA 3060 

CTGCAGGAAG CTCAGGACAC CATGCAAGGC ACCAGCCGCT CCCTTCGGCT TATCCAGGAC 3120 

AGGGTTGCTG AGGTTCAGCA GGTACTGCGG CCAGCAGAAA AGCTGGTGAC AAGCATGACC 3180 

AAGCAGCTGG GTGACTTCTG GACACGGATG GAGGAGCTCC GCCACCAAGC CCGGCAGCAG 3240 

GGGGCAGAGG CAGTCCAGGC CCAGCAGCTT GCGGAAGGTG CCAGCGAGCA GGCATTGAGT 3300 

GCCCAAGAGG GATTTGAGAG AATAAAACAA AAGTATGCTG AGTTGAAGGA CCGGTTGGGT 3360 
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CAGAGTTCCA TGCTGGGTGA GCAGGGTGCC 
GAGCTGTTTG GGGAGACCAT GGAGATGATG 
CTGCGGGGCA GCCAGGCCAT CATGCTGCGC 
GTGGAGCAGA TCCGTGACCA CATCAATGGG 
TGCTACAGCT TCCAGCCCGT TGCCCCACTC 
GATTGGGTTG GAATGCTTTC CATCTCCAGG 
GACCACCCCT GGTGTGTAGC TAGTAAGATT 
GGGACAGTTA CACTTGACAG ACAAAGATGG 
CTCTCAAGTC AAGGAAGCTG GGCTGGGCAG 
GGAATCCTGG ACCAAGCACA AAAACTTAAC 
AAAATCTTTG G 

Seq ID NO: 20 Protein sequence: 
Protein Accession 8: NP 000219 



PCT/US02/12476 



CGGATCCAGA 
GACAGGATGA 
TCGGCGGACC 
CGCGTGCTCT 
ATCTGCCGCC 
AGACTTTCAT 
ACCCTGAGCT 
TGGAGATTGG 
TATCCCCCGC 
AAAAGTGATG 



GTGTGAAGAC 
AAGACATGGA 
TGACAGGACT 
ACTATGCCAC 
TTTGCTTTTG 
GCAGCCTAAA 
GCAGCTGAGC 
CATGCCATTG 
CTTTAGTTCT 
TAAAAATGAA 



AGAGGCAGAG 
GTTGGAGCTG 
GGAGAAGCGT 
CTGCAAGTGA 
GTTGGGGGCA 
GTACAGCCTG 
CTGAGCCAAT 
AAACTAAGAG 
CCACTGGGGA 
AAGCCAAATA 



1 
I 

MRPFFLLCFA 
EWQMKCCKCD 
MEFQGPMPAG 
NARLNGGKVQ 
VSQLRLQGSC 
PWRPAEGQDA 
HYFRNRRPGA 
TYANPQGCHR 
CACDPHNSPQ 
TEGPGCDKAS 
RLRNATASLW 
TLQGLQLDLP 
AYEQSAQAAQ 
TPTFNKLCGN 
GFNAQLQRTR 
DPDTDAATIQ 
RLQAEAEEAR 
VLRPAEKLVT 
IKQKYAELKD 
MLRSADLTGL 



11 
I 

LPGLLHAQQA 
SRQPHNYYSH 
MLIERSSDFG 
LNLMDLVSGI 
FCHGHADRCA 
HECQRCDCNG 
SIQETCISCE 
CDCNILGSRR 
PTVQPVHRAV 
GRCLCRPGLT 
SGPGIiEDRGL 
IiEEETLSLPR 
QVSDSSRLLD 
SRQMACTPIS 
QMIRAAEESA 
EVSEAVLALW 
SRAHAVEGQV 
SMTKQLGDFW 
RLGQSSMLGE 
EKRVEQIRDH 



21 

I 

CSRGACYPPV 
RVENVASSSG 
KTWRVYQYLA 
PATQSQKIQE 
PKPGASAGPS 
HSETCHFDPA 
CDPDGAVPGA 
DMPCDEESGR 
PCREGFGGLM 
GPRCDQCQRG 
ASRILDAKSK 
DLESLDRSFN 
QLRDSRREAE 
CPGELCPQDN 
SQIQSSAQRL 
LPTDSATVLQ 
EDWGNLRQG 
TRMEELRHQA 
QGARIQSVKT 
INGRVLYYAT 



31 
I 

GDLLVGRTRF 
PMRWWQSQND 
ADCTSTFPRV 
VGEITNLRVN 
TAVQVHDVCV 
VFAASQGAYG 
PCDPVTGQCV 
CLCLPNWGP 
CSAAAIRQCP 
YCNRYPVCVA 
IEQIRAVLSS 
GLLTMYQRKR 
RLVRQAGGGG 
GTACGSRCRG 
ETQVSASRSQ 
KMNEIQAIAA 
TVALQEAQDT 
RQQGAEAVQA 
EAEELFGETM 
CK 



41 
I 

LRASSTCGLT 
VNPVSLQLDL 
RQGRPQSWQD 
FTRLAPVPQR 
CQHNTAGPNC 
GVCDNCRDHT 
CKEHVQGERC 
KCDQCAPYHW 
DRTYGDVATG 
CHPCFQTYDA 
PAVTEQEVAQ 
EQFEKISSAD 
GTGSPKLVAL 
VLPRAGGAFL 
MEEDVRRTRL 
RLPNVDLVLS 
MQGTSRSLRL 
QQLAEGASEQ 
EMMDRMKDME 



51 
I 

KPETYCTQYG 
DRRFQLQEVM 
VROQSLPQRP 
GYHPPSAYYA 
ERCAPFYNNR 
EGKNCERCQL 
DLCKPGFTGL 
KLASGQGCEP 
CRACDCDFRG 
DLREQALRFG 
VASAILSLRR 
PSGAFRMLST 
RLEMSSLPDL 
MAGQVAEQLR 
LIQQVRDFLT 
QTKQDIARAR 
IQDRVAEVQQ 
ALSAQEGFER 
LELLRGSQAI 



Seq ID NO: 21 DNA sequence 
Nucleic Acid Accession #: NMj)03722 
Coding sequence: 145-1491 



TCGTTGATAT 
ACAGTACTGC 
AAAGAAAGTT 
CCAGAGGTTT 
ATTGACTTGA 
AGCATGGACT 
ACGAACCTGG 
AGTCCCTATA 
CCCAGCTCCA 
CCAGGCCCGC 
TGGACGTATT 
CAGATCAAGG 
. AAAAAAGCTG 
GAATTCAACG 
CATGCCCAGT 
CCACCCCAGG 
TGTGTTGGAG 
GGGCAAGTCC 
AGGAAGGCGG 
GATGGTACGA 
AAACGAAGAT 
GAAATGCTGT 
ATTGAAACGT 
CTTTCAGCCT 
GACGTCTTCT 
TCTATATTTT 
TGTGTGTGCG 
CCCAACTGCT 
TTACAAGAAA 
GAACCACTGT 
GAAAGGGGCA 
AATTCACAGG 
AAAAAAGTTG 
CCCTTTTAAT 
TACTGCTGGG 
TTTGTGAGAA 
GCTGTGTACC 
CATGAAACCC 
CTCATTTTGT 
TGTTTACCAT 
AATTTGCTTA 
CTGATACTGT 
AGACGTGTTA 



11 
I 

CAAAGACAGT 
CCTGACCCTT 
ATTACCGATC 
TCCAGCATAT 
ACTTTGTGGA 
GTATCCGCAT 
GGCTCCTGAA 
ACACAGACCA 
CCTTCGATGC 
ACAGTTTCGA 
CCACTGAACT 
TGATGACCCC 
AGCACGTCAC 
AGGGACAGAT 
ATGTAGAAGA 
TTGGCACTGA 
GGATGAACCG 
TGGGCCGACG 
ATGAAGATAG 
AGCGCCCGTT 
CCCCAGATGA 
TGAAGATCAA 
ACAGGCAACA 
GCTTCAGGAA 
TTAGACATTC 
AAGTGTGTGT 
TGTGTATCTA 
CAAAGGCACA 
GGATGTTTTC 
GTTTGTCTGT 
TTAAGATGTT 
GAAGCTTTTG 
TTATTGTCTG 
GCTGGTCATG 
CAGCGAGGTG 
CTTGCATTAT 
TGCCTCTGCC 
TGGAAGACCT 
GCTTTTAATA 
TATTCAAAGC 
ATTAGAGCTT 
TCAGTGCATT 
AAATCAGCAC 



21 
I 

TGAAGGAAAT 
ACATCCAGCG 
CACCATGTCC 
CTGGGATTTT 
TGAACCATCA 
GCAGGACTCG 
CAGCATGGAC 
CGCGCAGAAC 
TCTCTCTCCA 
CGTGTCCTTC 
GAAGAAACTC 
ACCTCCTCAG 
GGAGGTGGTG 
TGCCCCTCCT 
TCCCATCACA 
ATTCACGACA 
CCGTCCAATT 
CTGCTTTGAG 
CATCAGAAAG 
TCGTCAGAAC 
TGAACTGTTA 
AGAGTCCCTG 
GCAACAGCAG 
TGAGCTTGTG 
CAAGCCCCCA 
GTTGTATTTC 
GCCCTCATAA 
AAGCCACTAG 
TGCAGATTTT 
GAGCTTTCTG 
TATTGGAACC 
AGCAGGTCTC 
TGCATAAGTA 
TAATAATATT 
ATCATTACCA 
TTGTGTCCTC 
ACTGTATGTT 
ACTACAAAAA 
GAAAGACAAA 
TCAAAATAGA 
CTATCCCTCA 
TAGCCAGGAG 
TCCTGGACTG 



31 
I 

GAATTTTGAA 
TTTCGTAGAA 
CAGAGCACAC 
CTGGAACAGC 
GAAGATGGTG 
GACCTGAGTG 
CAGCAGATTC 
AGCGTCACGG 
TCACCCGCCA 
CAGCAGTCGA 
TACTGCCAAA 
GGAGCTGTTA 
AAGCGGTGCC 
AGTCATTTGA 
GGAAGACAGA 
GTCTTGTACA 
TTAATCATTG 
GCCCGGATCT 
CAGCAAGTTT 
ACACATGGTA 
TACTTACCAG 
GAACTCATGC 
CAGCACCAGC 
GAGCCCCGGA 
AACCGATCAG 
CATGTGTATA 
ACAGGACTTG 
TGAGAGAATC 
GTATCCTTAG 
TTGTTTCCTG 
CTTTTCTGTC 
AAACTTAAGA 
AGTTGTAGGT 
G CAAGTAGTA 
AAAGTAATCA 
CCCTCATGTG 
GGCATCTGTT 
AACTGTTGTT 
TCCACCCCAG 
ATTTGAAGCC 
AGCCTACCTA 
ACTTACGTTT 
GAAATTAAAG 



41 
I 

ACTTCACGGT 
ACCCAGCTCA 
AGACAAATGA 
CTATATGTTC 
CGACAAACAA 
ACCCCATGTG 
AGAACGGCTC 
CGCCCTCGCC 
TCCCCTCCAA 
GCACCGCCAA 
TTGCAAAGAC 
TCCGCGCCAT 
CCAACCATGA 
TTCGAGTAGA 
GTGTGCTGGT 
ATTTCATGTG 
TTACTCTGGA 
GTGCTTGCCC 
CGGACAGTAC 
TCCAGATGAC 
TGAGGGGCCG 
AGTACCTTCC 
ACTTACTTCA 
GAGAAACTCC 
TGTACCCATA 
TGTGAGTGTG 
AAGACACTTT 
TTTTGAAGGG 
ACCGGCCATT 
GGAGGGAGGG 
TTCTTCTGTT 
TGTCTTTTTA 
GACTGAGAGA 
AGAAACGAAG 
ACTTTGTGGG 
TAGGTAGAAC 
ATGCTAAAGT 
TGGCCCCCAT 
TAATATTGCC 
CTCTCACAAA 
CCATAAAACC 
TGAGTAAGTG 
ATTGAAAGGG 



51 
I 

GTGCCACCCT 
TTTCTCTTGG 
ATTCCTCAGT 
AGTTCAGCCC 
GATTGAGATT 
GCCACAGTAC 
CTCGTCCACC 
CTACGCACAG 
CACCGACTAC 
GTCGGCCACC 
ATGCCCCATC 
GCCTGTCTAC 
GCTGAGCCGT 
GGGGAACAGC 
ACCTTATGAG 
TAACAGCAGT 
AACCAGAGAT 
AGGAAGAGAC 
AAAGAACGGT 
ATCCATCAAG 
TGAGACTTAT 
TCAGCACACA 
GAAACATCTC 
AAAACAATCT 
GAGCCCTATC 
TGTGTGTGTA 
GGCTCAGAGA 
ACTCAAACCT 
GGTGGGTGAG 
GTCAGGTGGG 
GTTTTTCTAA 
AGAAAAGGAG 
CTCAGTCAGA 
GTGTCAAGTG 
TGGAGAGTTC 
ATTTCTTAAT 
TTTTCTTGTA 
AGCAGGTGAA 
CTTACGTAGT 
ATCTGTGATT 
AGCCATATTA 
AGATCCAAGC 
TAGACTACTT 



3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 



60 
120 
160 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
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TTCTTTTTTT TACTCAAAAG TTTAGAGAAT CTCTGTTTCT TTCCATTTTA AAAACATATT 2640 
TTAAGATAAT AGCATAAAGA CTTTAAAAAT GTTCCTCCCC TCCATCTTCC CACACCCAGT 2700 
CACCAGCACT GTATTTTCTG TCACCAAGAC AATGATTTCT TGTTATTGAG GCTGTTGCTT 2760 
TTGTGGATGT GTGATTTTAA TTTTCAATAA ACTTTTGCAT CTTGGTTTAA AAGAAA 

Seq ID NO: 22 Protein sequence: 
Protein Accession #: NP_003713 

1 11 21 31 41 51 

I I I I I I 

MSQSTQTNEF LSPEVFQHIW DFLEQPICSV QPIDLNFVDE PSEDGATNKI EISMDCIRMQ 60
DSDLSDPMWP QYTNLGLLNS MDQQIQNGSS STSPYNTDHA QNSVTAPSPY AQPSSTFDAL 120
SPSPAIPSNT DYPGPHSFDV SFQQSSTAKS ATWTYSTELK KLYCQIAKTC PIQIKVMTPP 180
PQGAVIRAMP VYKKAEHVTE VVKRCPNHEL SREFNEGQIA PPSHLIRVEG NSHAQYVEDP 240
ITGRQSVLVP YEPPQVGTEF TTVLYNFMCN SSCVGGMNRR PILIIVTLET RDGQVLGRRC 300 
FEARICACPG RDRKADEDSI RKQQVSDSTK NGDGTKRPFR QNTHGIQMTS IKKRRSPDDE 360 
LLYLPVRGRE TYEMLLKIKE SLELMQYLPQ HTIETYRQQQ QQQHQHLLQK HLLSACFRNE 420 
LVEPRRETPK QSDVFFRHSK PPNRSVYP 



Seq ID NO: 23 DNA sequence 

Nucleic Acid Accession #: NM_001944.1

Coding sequence: 84-3083 

1 11 21 31 41 51 

I I I I I I 

TTTTCTTAGA CATTAACTGC AGACGGCTGG CAGGATAGAA GCAGCGGCTC ACTTGGACTT 60 

TTTCACCAGG GAAATCAGAG ACAATGATGG GGCTCTTCCC CAGAACTACA GGGGCTCTGG 120 

CCATCTTCGT GGTGGTCATA TTGGTTCATG GAGAATTGCG AATAGAGACT AAAGGTCAAT 180 

ATGATGAAGA AGAGATGACT ATGCAACAAG CTAAAAGAAG GCAAAAACGT GAATGGGTGA 240 

AATTTGCCAA ACCCTGCAGA GAAGGAGAAG ATAACTCAAA AAGAAACCCA ATTGCCAAGA 300 

TTACTTCAGA TTACCAAGCA ACCCAGAAAA TCACCTACCG AATCTCTGGA GTGGGAATCG 360

ATCAGCCGCC TTTTGGAATC TTTGTTGTTG ACAAAAACAC TGGAGATATT AACATAACAG 420 

CTATAGTCGA CCGGGAGGAA ACTCCAAGCT TCCTGATCAC ATGTCGGGCT CTAAATGCCC 480 

AAGGACTAGA TGTAGAGAAA CCACTTATAC TAACGGTTAA AATTTTGGAT ATTAATGATA 540 

ATCCTCCAGT ATTTTCACAA CAAATTTTCA TGGGTGAAAT TGAAGAAAAT AGTGCCTCAA 600 

ACTCACTGGT GATGATACTA AATGCCACAG ATGCAGATGA ACCAAACCAC TTGAATTCTA 660 

AAATTGCCTT CAAAATTGTC TCTCAGGAAC CAGCAGGCAC ACCCATGTTC CTCCTAAGCA 720 

GAAACACTGG GGAAGTCCGT ACTTTGACCA ATTCTCTTGA CCGAGAGCAA GCTAGCAGCT 780 

ATCGTCTGGT TGTGAGTGGT GCAGACAAAG ATGGAGAAGG ACTATCAACT CAATGTGAAT 840 

GTAATATTAA AGTGAAAGAT GTCAACGATA ACTTCCCAAT GTTTAGAGAC TCTCAGTATT 900 

CAGCACGTAT TGAAGAAAAT ATTTTAAGTT CTGAATTACT TCGATTTCAA GTAACAGATT 960 

TGGATGAAGA GTACACAGAT AATTGGCTTG CAGTATATTT CTTTACCTCT GGGAATGAAG 1020 

GAAATTGGTT TGAAATACAA ACTGATCCTA GAACTAATGA AGGCATCCTG AAAGTGGTGA 1080 

AGGCTCTAGA TTATGAACAA CTACAAAGCG TGAAACTTAG TATTGCTGTC AAAAACAAAG 1140 

CTGAATTTCA CCAATCAGTT ATCTCTCGAT ACCGAGTTCA GTCAACCCCA GTCACAATTC 1200 

AGGTAATAAA TGTAAGAGAA GGAATTGCAT TCCGTCCTGC TTCCAAGACA TTTACTGTGC 1260 

AAAAAGGCAT AAGTAGCAAA AAATTGGTGG ATTATATCCT GGGAACATAT CAAGCCATCG 1320 

ATGAGGACAC TAACAAAGCT GCCTCAAATG TCAAATATGT CATGGGACGT AACGATGGTG 1380 

GATACCTAAT GATTGATTCA AAAACTGCTG AAATCAAATT TGTCAAAAAT ATGAACCGAG 1440 

ATTCTACTTT CATAGTTAAC AAAACAATCA CAGCTGAGGT TCTGGCCATA GATGAATACA 1500 

CGGGTAAAAC TTCTACAGGC ACGGTATATG TTAGAGTACC CGATTTCAAT GACAATTGTC 1560 

CAACAGCTGT CCTCGAAAAA GATGCAGTTT GCAGTTCTTC ACCTTCCGTG GTTGTCTCCG 1620 

CTAGAACACT GAATAATAGA TACACTGGCC CCTATACATT TGCACTGGAA GATCAACCTG 1680 

TAAAGTTGCC TGCCGTATGG AGTATCACAA CCCTCAATGC TACCTCGGCC CTCCTCAGAG 1740 

CCCAGGAACA GATACCTCCT GGAGTATACC ACATCTCCCT GGTACTTACA GACAGTCAGA 1800 

ACAATCGGTG TGAGATGCCA CGCAGCTTGA CACTGGAAGT CTGTCAGTGT GACAACAGGG 1860 

GCATCTGTGG AACTTCTTAC CCAACCACAA GCCCTGGGAC CAGGTATGGC AGGCCGCACT 1920 

CAGGGAGGCT GGGGCCTGCC GCCATCGGCC TGCTGCTCCT TGGTCTCCTG CTGCTGCTGT 1980 

TGGCCCCCCT TCTGCTGTTG ACCTGTGACT GTGGGGCAGG TTCTACTGGG GGAGTGACAG 2040 

GTGGTTTTAT CCCAGTTCCT GATGGCTCAG AAGGAACAAT TCATCAGTGG GGAATTGAAG 2100 

GAGCCCATCC TGAAGACAAG GAAATCACAA ATATTTGTGT GCCTCCTGTA ACAGCCAATG 2160 

GAGCCGATTT CATGGAAAGT TCTGAAGTTT GTACAAATAC GTATGCCAGA GGCACAGCGG 2220 

TGGAAGGCAC TTCAGGAATG GAAATGACCA CTAAGCTTGG AGCAGCCACT GAATCTGGAG 2280 

GTGCTGCAGG CTTTGCAACA GGGACAGTGT CAGGAGCTGC TTCAGGATTC GGAGCAGCCA 2340 

CTGGAGTTGG CATCTGTTCC TCAGGGCAGT CTGGAACCAT GAGAACAAGG CATTCCACTG 2400 

GAGGAACCAA TAAGGACTAC GCTGATGGGG CGATAAGCAT GAATTTTCTG GACTCCTACT 2460 

TTTCTCAGAA AGCATTTGCC TGTGCGGAGG AAGACGATGG CCAGGAAGCA AATGACTGCT 2520 

TGTTGATCTA TGATAATGAA GGCGCAGATG CCACTGGTTC TCCTGTGGGC TCCGTGGGTT 2580 

GTTGCAGTTT TATTGCTGAT GACCTGGATG ACAGCTTCTT GGACTCACTT GGACCCAAAT 2640 

TTAAAAAACT TGCAGAGATA AGCCTTGGTG TTGATGGTGA AGGCAAAGAA GTTCAGCCAC 2700 

CCTCTAAAGA CAGCGGTTAT GGGATTGAAT CCTGTGGCCA TCCCATAGAA GTCCAGCAGA 2760 

CAGGATTTGT TAAGTGCCAG ACTTTGTCAG GAAGTCAAGG AGCTTCTGCT TTGTCCGCCT 2820 

CTGGGTCTGT CCAGCCAGCT GTTTCCATCC CTGACCCTCT GCAGCATGGT AACTATTTAG 2880 

TAACGGAGAC TTACTCGGCT TCTGGTTCCC TCGTGCAACC TTCCACTGCA GGCTTTGATC 2940 

CACTTCTCAC ACAAAATGTG ATAGTGACAG AAAGGGTGAT CTGTCCCATT TCCAGTGTTC 3000 

CTGGCAACCT AGCTGGCCCA ACGCAGCTAC GAGGGTCACA TACTATGCTC TGTACAGAGG 3060 

ATCCTTGCTC CCGTCTAATA TGACCAGAAT GAGCTGGAAT ACCACACTGA CCAAATCTGG 3120 

ATCTTTGGAC TAAAGTATTC AAAATAGCAT AGCAAAGCTC ACTGTATTGG GCTAATAATT 3180 

TGGCACTTAT TAGCTTCTCT CATAAACTGA TCACGATTAT AAATTAAATG TTTGGGTTCA 3240 

TACCCCAAAA GCAATATGTT GTCACTCCTA ATTCTCAAGT ACTATTCAAA TTGTAGTAAA 3300 
TCTTAAAGTT TTTCAAAACC CTAAAATCAT ATTCGC 

Seq ID NO: 24 Protein sequence: 
Protein Accession #: NP_001935.1

1 11 21 31 41 51

I I I I I I

MMGLFPRTTG ALAIFVVVIL VHGELRIETK GQYDEEEMTM QQAKRRQKRE WVKFAKPCRE 60

GEDNSKRNPI AKITSDYQAT QKITYRISGV GIDQPPFGIF VVDKNTGDIN ITAIVDREET 120

PSFLITCRAL NAQGLDVEKP LILTVKILDI NDNPPVFSQQ IFMGEIEENS ASNSLVMILN 180

ATDADEPNHL NSKIAFKIVS QEPAGTPMFL LSRNTGEVRT LTNSLDREQA SSYRLVVSGA 240

DKDGEGLSTQ CECNIKVKDV NDNFPMFRDS QYSARIEENI LSSELLRFQV TDLDEEYTDN 300

WLAVYFFTSG NEGNWFEIQT DPRTNEGILK VVKALDYEQL QSVKLSIAVK NKAEFHQSVI 360

SRYRVQSTPV TIQVINVREG IAFRPASKTF TVQKGISSKK LVDYILGTYQ AIDEDTNKAA 420

SNVKYVMGRN DGGYLMIDSK TAEIKFVKNM NRDSTFIVNK TITAEVLAID EYTGKTSTGT 480

VYVRVPDFND NCPTAVLEKD AVCSSSPSVV VSARTLNNRY TGPYTFALED QPVKLPAVWS 540

ITTLNATSAL LRAQEQIPPG VYHISLVLTD SQNNRCEMPR SLTLEVCQCD NRGICGTSYP 600 

TTSPGTRYGR PHSGRLGPAA IGLLLLGLLL LLLAPLLLLT CDCGAGSTGG VTGGFIPVPD 660 

GSEGTIHQWG IEGAHPEDKE ITNICVPPVT ANGADFMESS EVCTNTYARG TAVEGTSGME 720 

MTTKLGAATE SGGAAGFATG TVSGAASGFG AATGVGICSS GQSGTMRTRH STGGTNKDYA 780 

DGAISMNFLD SYFSQKAFAC AEEDDGQEAN DCLLIYDNEG ADATGSPVGS VGCCSFIADD 840 

LDDSFLDSLG PKFKKLAEIS LGVDGEGKEV QPPSKDSGYG IESCGHPIEV QQTGFVKCQT 900 

LSGSQGASAL SASGSVQPAV SIPDPLQHGN YLVTETYSAS GSLVQPSTAG FDPLLTQNVI 960 
VTERVICPIS SVPGNLAGPT QLRGSHTMLC TEDPCSRLI 

Seq ID NO: 25 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 56-1642 

1 11 21 31 41 51 

I I I I I I 

AGTATCCCAG GAGGAGCAAG TGGCACGTCT TCGGACCTAG GCTGCCCCTG CCGTCATGTC 60 

GCAAGGGATC CTTTCTCCGC CAGCGGGCTT GCTGTCCGAT GACGATGTCG TAGTTTCTCC 120 

CATGTTTGAG TCCACAGCTG CAGATTTGGG GTCTGTGGTA CGCAAGAACC TGCTATCAGA 180 

CTGCTCTGTC GTCTCTACCT CCCTAGAGGA CAAGCAGCAG GTTCCATCTG AGGACAGTAT 240 

GGAGAAGGTG AAAGTATACT TGAGGGTTAG GCCCTTGTTA CCTTCAGAGT TGGAACGACA 300 

GGAAGATCAG GGTTGTGTCC GTATTGAGAA TGTGGAGACC CTTGTTCTAC AAGCACCCAA 360 

GGACTCTTTT GCCCTGAAGA GCAATGAACG GGGAATTGGC CAAGCCACAC ACAGGTTCAC 420 

CTTTTCCCAG ATCTTTGGGC CAGAAGTGGG ACAGGCATCC TTCTTCAACC TAACTGTGAA 480 

GGAGATGGTA AAGGATGTAC TCAAAGGGCA GAACTGGCTC ATCTATACAT ATGGAGTCAC 540 

TAACTCAGGG AAAACCCACA CGATTCAAGG TACCATCAAG GATGGAGGGA TTCTCCCCCG 600 

GTCCCTGGCG CTGATCTTCA ATAGCCTCCA AGGCCAACTT CATCCAACAC CTGATCTGAA 660 

GCCCTTGCTC TCCAATGAGG TAATCTGGCT AGACAGCAAG CAGATCCGAC AGGAGGAAAT 720 

GAAGAAGCTG TCCCTGCTAA ATGGAGGCCT CCAAGAGGAG GAGCTGTCCA CTTCCTTGAA 780 

GAGGAGTGTC TACATCGAAA GTCGGATAGG TACCAGCACC AGCTTCGACA GTGGCATTGC 840 

TGGGCTCTCT TCTATCAGTC AGTGTACCAG CAGTAGCCAG CTGGATGAAA CAAGTCATCG 900 

ATGGGCACAG CCAGACACTG CCCCACTACC TGTCCCGGCA AACATTCGCT TCTCCATCTG 960 

GATCTCATTC TTTGAGATCT ACAACGAACT GCTTTATGAC CTATTAGAAC CGCCTAGCCA 1020 

ACAGCGCAAG AGGCAGACTT TGCGGCTATG CGAGGATCAA AATGGCAATC CCTATGTGAA 1080 

AGATCTCAAC TGGATTCATG TGCAAGATGC TGAGGAGGCC TGGAAGCTCC TAAAAGTGGG 1140 

TCGTAAGAAC CAGAGCTTTG CCAGCACCCA CCTCAACCAG AACTCCAGCC GCAGTCACAG 1200 

CATCTTCTCA ATCAGGATCC TACACCTTCA GGGGGAAGGA GATATAGTCC CCAAGATCAG 1260 

CGAGCTGTCA CTCTGTGATC TGGCTGGCTC AGAGCGCTGC AAAGATCAGA AGAGTGGTGA 1320 

ACGGTTGAAG GAAGCAGGAA ACATTAACAC CTCTCTACAC ACCCTGGGCC GCTGTATTGC 1380 

TGCCCTTCGT CAAAACCAGC AGAACCGGTC AAAGCAGAAC CTGGTTCCCT TCCGTGACAG 1440 

CAAGTTGACT CGAGTGTTCC AAGGTTTCTT CACAGGCCGA GGCCGTTCCT GCATGATTGT 1500 

CAATGTGAAT CCCTGTGCAT CTACCTATGA TGAAACTCTT CATGTGGCCA AGTTCTCAGC 1560 

CATTGCTAGC CAGGTGACTT GTGCATGCCC CACCTATGCA ACTGGGATTC CCATCCCTGC 1620 

ACTCGTTCAT CAAGGAACAT AGTCTTCAGG TATCCCCCAG CTTAGAGAAA GGGGCTAAGG 1680 

CAGACACAGG CCTTGATGAT GATATTGAAA ATGAAGCTGA CATCTCCATG TATGGCAAAG 1740 

AGGAGCTCCT ACAAGTTGTG GAAGCCATGA AGACACTGCT TTTGAAGGAA CGACAGGAAA 1800 

AGCTACAGCT GGAGATGCAT CTCCGAGATG AAATTTGCAA TGAGATGGTA GAACAGATGC 1860 

AACAGCGGGA ACAGTGGTGC AGTGAACATT TGGACACCCA AAAGGAACTA TTGGAGGAAA 1920 

TGTATGAAGA AAAACTAAAT ATCCTCAAGG AGTCACTGAC AAGTTTTTAC CAAGAAGAGA 1980 

TTCAGGAGCG GGATGAAAAG ATTGAAGAGC TAGAAGCTCT CTTGCAGGAA GCCAGACAAC 2040 

AGTCAGTGGC CCATCAGCAA TCAGGGTCTG AATTGGCCCT ACGGCGGTCA CAAAGGTTGG 2100 

CAGCTTCTGC CTCCACCCAG CAGCTTCAGG AGGTTAAAGC TAAATTACAG CAGTGCAAAG 2160 

CAGAGCTAAA CTCTACCACT GAAGAGTTGC ATAAGTATCA GAAAATGTTA GAACCACCAC 2220 

CCTCAGCCAA GCCCTTCACC ATTGATGTGG ACAAGAAGTT AGAAGAGGGC CAGAAGAATA 2280 

TAAGGCTGTT GCGGACAGAG CTTCAGAAAC TTGGTGAGTC TCTCCAATCA GCAGAGAGAG 2340 

CTTGTTGCCA CAGCACTGGG GCAGGAAAAC TTCGTCAAGC CTTGACCACT TGTGATGACA 2400 

TCTTAATCAA ACAGGACCAG ACTCTGGCTG AACTGCAGAA CAACATGGTG CTAGTGAAAC 2460 

TGGACCTTCG GAAGAAGGCA GCATGTATTG CTGAGCAGTA TCATACTGTG TTGAAACTCC 2520 

AAGGCCAGGT TTCTGCCAAA AAGCGCCTTG GTACCAACCA GGAAAATCAG CAACCAAACC 2580 

AACAACCACC AGGGAAGAAA CCATTCCTTC GAAATTTACT TCCCCGAACA CCAACCTGCC 2640 

AAAGCTCAAC AGACTGCAGC CCTTATGCCC GGATCCTACG CTCACGGCGT TCCCCTTTAC 2700 

TCAAATCTGG GCCTTTTGGC AAAAAGTACT AAGGCTGTGG GGAAAGAGAA GAGCAGTCAT 2760 

GGCCCTGAGG TGGGTCAGCT ACTCTCCTGA AGAAATAGGT CTCTTTTATG CTTTACCATA 2820 

TATCAGGAAT TATATCCAGG ATGCAATACT CAGACACTAG CTTTTTTCTC ACTTTTGTAT 2880 

TATAACCACC TATGTAATCT CATGTTGTTG TTTTTTTTTA TTTACTTATA TGATTTCTAT 2940 

GCACACAAAA ACAGTTATAT TAAAGATATT ATTGTTCACA TTTTTTATTG AATTCCAAAT 3000 
GTAGCAAAAT CATTAAAACA AATTATAAAA GGGACAGAAA AA 

Seq ID NO: 26 Protein sequence: 
Protein Accession #: Eos sequence 

1 11 21 31 41 51 

I I I I I I 

MSQGILSPPA GLLSDDDVW SPMFESTAAD LGSWRKNLL SDCSWSTSL EDKQQVPSED 60 

SMEKVKVYLR VRPLLPSELE RQEDQGCVRI ENVETIiVLQA PKDSFALKSN ERGIGQATHR 120 

FTFSQIFGPE VGQASFFNLT VKEMVKDVLK GQNWLIYTYG VTNSGKTHTI QGTIKDGGIL 180 

PRSLALIFNS LQGQLHPTPD LKPLLSNEVI WLDSKQIRQE EMKKLSLLNG GLQEEELSTS 240 

LKRSVYIESR IGTSTSFDSG IAGLSSISQC TSSSQLDETS HRWAQPDTAP LPVPANIRFS 300 
IWISPPBIVN ELLYDLLEPP SQQRKRQTLR LCEDQNGNPY VKDLNWIHVQ DAEEAWKLLK 360
VGRKNQSFAS THLNQNSSRS HSIFSIRILH LQGEGDIVPK ISELSLCDLA GSERCKDQKS 420
GERLKEAGNI NTSLHTLGRC IAALRQNQQN RSKQNLVPFR DSKLTRVFQG FFTGRGRSCM 480
IVNVNPCAST YDETLHVAKF SAIASQVTCA CPTYATGIPI PALVHQGT

Seq ID NO: 27 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 13-1424 



1 11 21 31 41 51
I I I I I I
TAGAAGTTTA CAATGAAGTT TCTTCTAATA CTGCTCCTGC AGGCCACTGC TTCTGGAGCT 60
CTTCCCCTGA ACAGCTCTAC AAGCCTGGAA AAAAATAATG TGCTATTTGG TGAAAGATAC 120
TTAGAAAAAT TTTATGGCCT TGAGATAAAC AAACTTCCAG TGACAAAAAT GAAATATAGT 180
GGAAACTTAA TGAAGGAAAA AATCCAAGAA ATGCAGCACT TCTTGGGTCT GAAAGTGACC 240
GGGCAACTGG ACACATCTAC CCTGGAGATG ATGCACGCAC CTCGATGTGG AGTCCCCGAT 300
GTCCATCATT TCAGGGAAAT GCCAGGGGGG CCCGTATGGA GGAAACATTA TATCACCTAC 360
AGAATCAATA ATTACACACC TGACATGAAC CGTGAGGATG TTGACTACGC AATCCGGAAA 420
GCTTTCCAAG TATGGAGTAA TGTTACCCCC TTGAAATTCA GCAAGATTAA CACAGGCATG 480
GCTGACATTT TGGTGGTTTT TGCCCGTGGA GCTCATGGAG ACTTCCATGC TTTTGATGGC 540
AAAGGTGGAA TCCTAGCCCA TGCTTTTGGA CCTGGATCTG GCATTGGAGG GGATGCACAT 600
TTCGATGAGG ACGAATTCTG GACTACACAT TCAGGAGGCA CAAACTTGTT CCTCACTGCT 660
GTTCACGAGA TTGGCCATTC CTTAGGTCTT GGCCATTCTA GTGATCCAAA GGCCGTAATG 720
TTCCCCACCT ACAAATATGT TGACATCAAC ACATTTCGCC TCTCTGCTGA TGACATACGT 780
GGCATTCAGT CCCTGTATGG AGACCCAAAA GAGAACCAAC GCTTGCCAAA TCCTGACAAT 840
TCAGAACCAG CTCTCTGTGA CCCCAATTTG AGTTTTGATG CTGTCACTAC CGTGGGAAAT 900
AAGATCTTTT TCTTCAAAGA CAGGTTCTTC TGGCTGAAGG TTTCTGAGAG ACCAAAGACC 960
AGTGTTAATT TAATTTCTTC CTTATGGCCA ACCTTGCCAT CTGGCATTGA AGCTGCTTAT 1020
GAAATTGAAG CCAGAAATCA AGTTTTTCTT TTTAAAGATG ACAAATACTG GTTAATTAGC 1080
AATTTAAGAC CAGAGCCAAA TTATCCCAAG AGCATACATT CTTTTGGTTT TCCTAACTTT 1140
GTGAAAAAAA TTGATGCAGC TGTTTTTAAC CCACGTTTTT ATAGGACCTA CTTCTTTGTA 1200
GATAACCAGT ATTGGAGGTA TGATGAAAGG AGACAGATGA TGGACCCTGG TTATCCCAAA 1260
CTGATTACCA AGAACTTCCA AGGAATCGGG CCTAAAATTG ATGCAGTCTT CTACTCTAAA 1320
AACAAATACT ACTATTTCTT CCAAGGATCT AACCAATTTG AATATGACTT CCTACTCCAA 1380
CGTATCACCA AAACACTGAA AAGCAATAGC TGGTTTGGTT GTTGAAAATG GTGTAATTAA 1440
TGGTTTTTGT TAGTTCACTT CAGCTTAATA AGTATTTATT GCATATTTGC TATGTCCTCA 1500
GTGTACCACT ACTTAGAGAT ATGTATCATA AAAATAAAAT CTGTAAACCA TAGGTAATGA 1560
TTATATAAAA TACATAATAT TTTTCAATTT TGAAAACTCT AATTGTCCAT TCTTGCTTGA 1620
CTCTACTATT AAGTTTGAAA ATAGTTACCT TCAAAGCAAG ATAATTCTAT TTGAAGCATG 1680
CTCTGTAAGT TGCTTCCTAA CATCCTTGGA CTGAGAAATT ATACTTACTT CTGGCATAAC 1740
TAAAATTAAG TATATATATT TTGGCTCAAA TAAAATTG


Seq ID NO: 28 Protein sequence:
Protein Accession #: Eos sequence

1 11 21 31 41 51
I I I I I I
MKFLLILLLQ ATASGALPLN SSTSLEKNNV LFGERYLEKF YGLEINKLPV TKMKYSGNLM 60
KEKIQEMQHF LGLKVTGQLD TSTLEMMHAP RCGVPDVHHF REMPGGPVWR KHYITYRINN 120
YTPDMNREDV DYAIRKAFQV WSNVTPLKFS KINTGMADIL VVFARGAHGD FHAFDGKGGI 180
LAHAPGPGSG IGGDAHFDED EFWTTHSGGT NLFLTAVHEI GHSLGLGHSS DPKAVMFPTY 240
KYVDINTFRL SADDIRGIQS LYGDPKENQR LPNPDNSEPA LCDPNLSFDA VTTVGNKIFF 300
FKDRFFWLKV SERPKTSVNL ISSLWPTLPS GIEAAYEIEA RNQVFLFKDD KYWLISNLRP 360
EPNYPKSIHS FGFPNFVKKI DAAVFNPRFY RTYPFVDNQY WRYDERRQMM DPGYPKLITK 420
NFQGIGPKID AVFYSKNKYY YFFQGSNQFE YDFLLQRITK TLKSNSWFGC

Seq ID NO: 29 DNA sequence

Nucleic Acid Accession #: NM_006115.1

Coding sequence: 236-1765

1 11 21 31 41 51
I I I I I I
GCTTCAGGGT ACAGCTCCCC CGCAGCCAGA AGCCGGGCCT GCAGCCCCTC AGCACCGCTC 60
CGGGACACCC CACCCGCTTC CCAGGCGTGA CCTGTCAACA GCAACTTCGC GGTGTGGTGA 120
ACTCTCTGAG GAAAAACCAT TTTGATTATT ACTCTCAGAC GTGCGTGGCA ACAAGTGACT 180
GAGACCTAGA AATCCAAGCG TTGGAGGTCC TGAGGCCAGC CTAAGTCGCT TCAAAATGGA 240
ACGAAGGCGT TTGTGGGGTT CCATTCAGAG CCGATACATC AGCATGAGTG TGTGGACAAG 300
CCCACGGAGA CTTGTGGAGC TGGCAGGGCA GAGCCTGCTG AAGGATGAGG CCCTGGCCAT 360
TGCCGCCCTG GAGTTGCTGC CCAGGGAGCT CTTCCCGCCA CTCTTCATGG CAGCCTTTGA 420
CGGGAGACAC AGCCAGACCC TGAAGGCAAT GGTGCAGGCC TGGCCCTTCA CCTGCCTCCC 480
TCTGGGAGTG CTGATGAAGG GACAACATCT TCACCTGGAG ACCTTCAAAG CTGTGCTTGA 540
TGGACTTGAT GTGCTCCTTG CCCAGGAGGT TCGCCCCAGG AGGTGGAAAC TTCAAGTGCT 600
GGATTTACGG AAGAACTCTC ATCAGGACTT CTGGACTGTA TGGTCTGGAA ACAGGGCCAG 660
TCTGTACTCA TTTCCAGAGC CAGAAGCAGC TCAGCCCATG ACAAAGAAGC GAAAAGTAGA 720
TGGTTTGAGC ACAGAGGCAG AGCAGCCCTT CATTCCAGTA GAGGTGCTCG TAGACCTGTT 780
CCTCAAGGAA GGTGCCTGTG ATGAATTGTT CTCCTACCTC ATTGAGAAAG TGAAGCGAAA 840
GAAAAATGTA CTACGCCTGT GCTGTAAGAA GCTGAAGATT TTTGCAATGC CCATGCAGGA 900
TATCAAGATG ATCCTGAAAA TGGTGCAGCT GGACTCTATT GAAGATTTGG AAGTGACTTG 960
TACCTGGAAG CTACCCACCT TGGCGAAATT TTCTCCTTAC CTGGGCCAGA TGATTAATCT 1020
GCGTAGACTC CTCCTCTCCC ACATCCATGC ATCTTCCTAC ATTTCCCCGG AGAAGGAAGA 1080
GCAGTATATC GCCCAGTTCA CCTCTCAGTT CCTCAGTCTG CAGTGCCTGC AGGCTCTCTA 1140
TGTGGACTCT TTATTTTTCC TTAGAGGCCG CCTGGATCAG TTGCTCAGGC ACGTGATGAA 1200
CCCCTTGGAA ACCCTCTCAA TAACTAACTG CCGGCTTTCG GAAGGGGATG TGATGCATCT 1260
GTCCCAGAGT CCCAGCGTCA GTCAGCTAAG TGTCCTGAGT CTAAGTGGGG TCATGCTGAC 1320
CGATGTAAGT CCCGAGCCCC TCCAAGCTCT GCTGGAGAGA GCCTCTGCCA CCCTCCAGGA 1380
CCTGGTCTTT GATGAGTGTG GGATCACGGA TGATCAGCTC CTTGCCCTCC TGCCTTCCCT 1440
GAGCCACTGC TCCCAGCTTA CAACCTTAAG CTTCTACGGG AATTCCATCT CCATATCTGC 1500
CTTGCAGAGT CTCCTGCAGC ACCTCATCGG GCTGAGCAAT CTGACCCACG TGCTGTATCC 1560
TGTCCCCCTG GAGAGTTATG AGGACATCCA TGGTACCCTC CACCTGGAGA GGCTTGCCTA 1620
TCTGCATGCC AGGCTCAGGG AGTTGCTGTG TGAGTTGGGG CGGCCCAGCA TGGTCTGGCT 1680
TAGTGCCAAC CCCTGTCCTC ACTGTGGGGA CAGAACCTTC TATGACCCGG AGCCCATCCT 1740
GTGCCCCTGT TTCATGCCTA ACTAGCTGGG TGCACATATC AAATGCTTCA TTCTGCATAC 1800
TTGGACACTA AAGCCAGGAT GTGCATGCAT CTTGAAGCAA CAAAGCAGCC ACAGTTTCAG 1860
ACAAATGTTC AGTGTGAGTG AGGAAAACAT GTTCAGTGAG GAAAAAACAT TCAGACAAAT 1920
GTTCAGTGAG GAAAAAAAGG GGAAGTTGGG GATAGGCAGA TGTTGACTTG AGGAGTTAAT 1980
GTGATCTTTG GGGAGATACA TCTTATAGAG TTAGAAATAG AATCTGAATT TCTAAAGGGA 2040
GATTCTGGCT TGGGAAGTAC ATGTAGGAGT TAATCCCTGT GTAGACTGTT GTAAAGAAAC 2100
TGTTGAAAAT AAAGAGAAGC AATGTGAAGC AAAAAAAAAA AAAAAAAA

Seq ID NO: 30 Protein sequence:
Protein Accession #: NP_006106.1



Seq ID NO: 31 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 64-2754 



1 11 21 31 41 51
I I I I I I
GGCAGGTCTC GCTCTCGGCA CCCTCCCGGC GCCCGCGTTC TCCTGGCCCT GCCCGGCATC 60
CCGATGGCCG CCGCTGGGCC CCGGCGCTCC GTGCGCGGAG CCGTCTGCCT GCATCTGCTG 120
CTGACCCTCG TGATCTTCAG TCGTGATGGT GAAGCCTGCA AAAAGGTGAT ACTTAATGTA 180
CCTTCTAAAC TAGAGGCAGA CAAAATAATT GGCAGAGTTA ATTTGGAAGA GTGCTTCAGG 240
TCTGCAGACC TCATCCGGTC AAGTGATCCT GATTTCAGAG TTCTAAATGA TGGGTCAGTG 300
TACACAGCCA GGGCTGTTGC GCTGTCTGAT AAGAAAAGAT CATTTACCAT ATGGCTTTCT 360
GACAAAAGGA AACAGACACA GAAAGAGGTT ACTGTGCTGC TAGAACATCA GAAGAAGGTA 420
TCGAAGACAA GACACACTAG AGAAACTGTT CTCAGGCGTG CCAAGAGGAG ATGGGCACCT 480
ATTCCTTGCT CTATGCAAGA GAATTCCTTG GGCCCTTTCC CATTGTTTCT TCAACAAGTT 540
GAATCTGATG CAGCACAGAA CTATACTGTC TTCTACTCAA TAAGTGGACG TGGAGTTGAT 600
AAAGAACCTT TAAATTTGTT TTATATAGAA AGAGACACTG GAAATCTATT TTGCACTCGG 660
CCTGTGGATC GTGAAGAATA TGATGTTTTT GATTTGATTG CTTATGCGTC AACTGCAGAT 720
GGATATTCAG CAGATCTGCC CCTCCCACTA CCCATCAGGG TAGAGGATGA AAATGACAAC 780
CACCCTGTTT TCACAGAAGC AATTTATAAT TTTGAAGTTT TGGAAAGTAG TAGACCTGGT 840
ACTACAGTGG GGGTGGTTTG TGCCACAGAC AGAGATGAAC CGGACACAAT GCATACGCGC 900
CTGAAATACA GCATTTTGCA GCAGACACCA AGGTCACCTG GGCTCTTTTC TGTGCATCCC 960
AGCACAGGCG TAATCACCAC AGTCTCTCAT TATTTGGACA GAGAGGTTGT AGACAAGTAC 1020
TCATTGATAA TGAAAGTACA AGACATGGAT GGCCAGTTTT TTGGATTGAT AGGCACATCA 1080
ACTTGTATCA TAACAGTAAC AGATTCAAAT GATAATGCAC CCACTTTCAG ACAAAATGCT 1140
TATGAAGCAT TTGTAGAGGA AAATGCATTC AATGTGGAAA TCTTACGAAT ACCTATAGAA 1200
GATAAGGATT TAATTAACAC TGCCAATTGG AGAGTCAATT TTACCATTTT AAAGGGAAAT 1260
GAAAATGGAC ATTTCAAAAT CAGCACAGAC AAAGAAACTA ATGAAGGTGT TCTTTCTGTT 1320
GTAAAGCCAC TGAATTATGA AGAAAACCGT CAAGTGAACC TGGAAATTGG AGTAAACAAT 1380
GAAGCGCCAT TTGCTAGAGA TATTCCCAGA GTGACAGCCT TGAACAGAGC CTTGGTTACA 1440
GTTCATGTGA GGGATCTGGA TGAGGGGCCT GAATGCACTC CTGCAGCCCA ATATGTGCGG 1500
ATTAAAGAAA ACTTAGCAGT GGGGTCAAAG ATCAACGGCT ATAAGGCATA TGACCCCGAA 1560
AATAGAAATG GCAATGGTTT AAGGTACAAA AAATTGCATG ATCCTAAAGG TTGGATCACC 1620
ATTGATGAAA TTTCAGGGTC AATCATAACT TCCAAAATCC TGGATAGGGA GGTTGAAACT 1680
CCCAAAAATG AGTTGTATAA TATTACAGTC CTGGCAATAG ACAAAGATGA TAGATCATGT 1740
ACTGGAACAC TTGCTGTGAA CATTGAAGAT GTAAATGATA ATCCACCAGA AATACTTCAA 1800
GAATATGTAG TCATTTGCAA ACCAAAAATG GGGTATACCG ACATTTTAGC TGTTGATCCT 1860
GATGAACCTG TCCATGGAGC TCCATTTTAT TTCAGTTTGC CCAATACTTC TCCAGAAATC 1920
AGTAGACTGT GGAGCCTCAC CAAAGTTAAT GATACAGCTG CCCGTCTTTC ATATCAGAAA 1980
AATGCTGGAT TTCAAGAATA TACCATTCCT ATTACTGTAA AAGACAGGGC CGGCCAAGCT 2040
GCAACAAAAT TATTGAGAGT TAATCTGTGT GAATGTACTC ATCCAACTCA GTGTCGTGCG 2100
ACTTCAAGGA GTACAGGAGT AATACTTGGA AAATGGGCAA TCCTTGCAAT ATTACTGGGT 2160
ATAGCACTGC TCTTTTCTGT ATTGCTAACT TTAGTATGTG GAGTTTTTGG TGCAACTAAA 2220
GGGAAACGTT TTCCTGAAGA TTTAGCACAG CAAAACTTAA TTATATCAAA CACAGAAGCA 2280
CCTGGAGACG ATAGAGTGTG CTCTGCCAAT GGATTTATGA CCCAAACTAC CAACAACTCT 2340
AGCCAAGGTT TTTGTGGTAC TATGGGATCA GGAATGAAAA ATGGAGGGCA GGAAACCATT 2400
GAAATGATGA AAGGAGGAAA CCAGACCTTG GAATCCTGCC GGGGGGCTGG GCATCATCAT 2460
ACCCTGGACT CCTGCAGGGG AGGACACACG GAGGTGGACA ACTGCAGATA CACTTACTCG 2520
GAGTGGCACA GTTTTACTCA ACCCCGTCTC GGTGAAAAAT TGCATCGATG TAATCAGAAT 2580
GAAGACCGCA TGCCATCCCA AGATTATGTC CTCACTTATA ACTATGAGGG AAGAGGATCT 2640
CCAGCTGGTT CTGTGGGCTG CTGCAGTGAA AAGCAGGAAG AAGATGGCCT TGACTTTTTA 2700
AATAATTTGG AACCCAAATT TATTACATTA GCAGAAGCAT GCACAAAGAG ATAATGTCAC 2760
AGTGCTACAA TTAGGTCTTT GTCAGACATT CTGGAGGTTT CCAAAAATAA TATTGTAAAG 2820
TTCAATTTCA ACATGTATGT ATATGATGAT TTTTTTCTCA ATTTTGAATT ATGCTACTCA 2880
CCAATTTATA TTTTTAAAGC CAGTTGTTGC TTATCTTTTC CAAAAAGTGA AAAATGTTAA 2940
AACAGACAAC TGGTAAATCT CAAACTCCAG CACTGGAATT AAGGTCTCTA AAGCATCTGC 3000
TCTTTTTTTT TTTTACGGAT ATTTTAGTAA TAAATATGCT GGATAAATAT TAGTCCAACA 3060
ATAGCTAAGT TATGCTAATA TCACATTATT ATGTATTCAC TTTAAGTGAT AGTTTAAAAA 3120
ATAAACAAGA AATATTGAGT ATCACTATGT GAAGAAAGTT TTGGAAAAGA AACAATGAAG 3180
ACTGAATTAA ATTAAAAATG TTGCAGCTCA TAAAGAATTG GGACTCACCC CTACTGCACT 3240
ACCAAATTCA TTTGACTTTG GAGGCAAAAT GTGTTGAAGT GCCCTATGAA GTAGCAATTT 3300
TCTATAGGAA TATAGTTGGA AATAAATGTG TGTGTGTATA TTATTATTAA TCAATGCAAT 3360
ATTTAAAATG AAATGAGAAC AAAGAGGAAA ATGGTAAAAA CTTGAAATGA GGCTGGGGTA 3420
TAGTTTGTCC TACAATAGAA AAAAGAGAGA GCTTCCTAGG CCTGGGCTCT TAAATGCTGC 3480
ATTATAACTG AGTCTATGAG GAAATAGTTC CTGTCCAATT TGTGTAATTT GTTTAAAATT 3540
GTAAATAAAT TAAACTTTTC TGGTTTCTGT GGGAAGGAAA TAGGGAATCC AATGGAACAG 3600
TAGCTTTGCT TTGCAGTCTG TTTCAAGATT TCTGCATCCA CAAGTTAGTA GCAAACTGGG 3660
GAATACTCGC TGCAGCTGGG GTTCCCTGCT TTTTGGTAGC AAGGGTCCAG AGATGAGGTG 3720
TTTTTTTCGG GGAGCTAATA ACAAAAACAT TTTAAAACTT ACCTTTACTG AAGTTAAATC 3780
CTCTATTGCT GTTTCTATTC TCTCTTATAG TGACCAACAT CTTTTTAATT TAGATCCAAA 3840
TAACCATGTC CTCCTAGAGT TTAGAGGCTA GAGGGAGCTG AGGGGAGGAT CTTACTGAAA 3900
GCACCCTGGG GAGATTGATT GTCCTTAAAC CTAAGCCCCA CAAACTTGAC ACCTGATCAG 3960
GTCTGGGAGC TACAAAATTT CATTTTTCTC CTCACTGCCC TTCTTCTGAG TGGCATTGGC 4020
CTGAATCAAG GAAAGCCAGG CCTTGTGGGC CCCCTTCTTT CGGCTTTCTG CTAAAGCAAC 4080
ACCTCCAGCA GAGATTCCCT TAAGTGACTC CAGGTTTTCC ACCATCCTTC AGCGTGAATT 4140
AATTTTTAAT CAGTTTGCTT TCTCCAGAGA AATTTTAAAA TAATAGAAGA AATAGAAATT 4200
TTGAATGTAT AAAAGAAAAA GATCAAGTTG TCATTTTAGA ACAGAGGGAA CTTTGGGAGA 4260
AAGCAGCCCA AGTAGGTTAT TTGTACAGTC AGAGGGCAAC AGGAAGATGC AGGCCTTCAA 4320
GGGCAAGGAG AGGCCACAAG GAATATGGGT GGGAGTAAAA GCAACATCGT CTGCTTCATA 4380
CTTTTTCCTA GGCTTGGCAC TGCCTTTTCC TTTCTCAGGC CAATGGCAAC TGCCATTTGA 4440
GTCCGGTGAG GGATCAGCCA ACCTCTTCTC TATGGCTCAC CTTATTTGGA GTGAGAAATC 4500
AAGGAGACAG AGCTGACTGC ATGATGAGTC TGAAGGCATT TGCAGGATGA GCCTGAACTG 4560
GTTGTGCAGA ACAAACAAGG CATTCATGGG AATTGTTGTA TTCCTTCTGC AGCCCTCCTT 4620
CTGGGCACTA AGAAGGTCTA TGAATTAAAT GCCTATCTAA AATTCTGATT TATTCCTACA 4680
TTTTCTGTTT TCTAATTTGA CCCTAAAATC TATGTGTTTT AGACTTAGAC TTTTTATTGC 4740
CCCCCCCCCC TTTTTTTTTG AGACGGAGTC TCGCTCTGAC GCACAGGCTG GAGTGCAGTG 4800
GCTCCGATCT CTGCTCACTG AAAGCTCCGC CTCCCGGGTT CATGCCATTC TCCTGCCTCA 4860
GCCTCCTGAG TAGCTGGGAC TACAGGCGCC CACCACCACG CCCGGCTAAT TTTTTGTATT 4920
TTTAATAGAG ACGGGGTTTC ACTGTGTTAG CCAGGATGGT CTCGATCTCC TGACCTCGTG 4980
ATCCGCCTGC CTCGGCCTCC CAAAGTGCTG GGATTACAGG CATGACCCAC CGCTCCCGGC 5040
CTTGTTTTCC GTTTAAAGTC GTCTTCTTTT AATGTAATCA TTTTGAACAT GTGTGAAAGT 5100
TGATCATACG AATTGGATCA ATCTTGAAAT ACTCAACCAA AAGACAGTCG AGAAGCCAGG 5160
GGGAGAAAGA ACTCAGGGCA CAAAATATTG GTCTGAGAAT GGAATTCTCT GTAAGCCTAG 5220
TTGCTGAAAT TTCCTGCTGT AACCAGAAGC CAGTTTTATC TAACGGCTAC TGAAACACCC 5280
ACTGTGTTTT GCTCACTCCC TCACTCACCG ATCAAAACCT GCTACCTCCC CAAGACTTTA 5340
CTAGTGCCGA TAAACTTTCT CAAAGAGCAA CCAGTATCAC TTCCCTGTTT ATAAAACCTC 5400
TAACCATCTC TTTGTTCTTT GAACATGCTG AAAACCACCT GGTCTGCATG TATGCCCGAA 5460
TTTGTAATTC TTTTCTCTCA AATGAAAATT TAATTTTAGG GATTCATTTC TATATTTTCA 5520
CATATGTAGT ATTATTATTT CCTTATATGT GTAAGGTGAA ATTTATGGTA TTTGAGTGTG 5580
CAAGAAAATA TATTTTTAAA GCTTTCATTT TTCCCCCAGT GAATGATTTA GAATTTTTTA 5640
TGTAAATATA CAGAATGTTT TTTCTTACTT TTATAAGGAA GCAGCTGTCT AAAATGCAGT 5700
GGGGTTTGTT TTGCAATGTT TTAAACAGAG TTTTAGTATT GCTATTAAAA GAAGTTACTT 5760
TGCTTTTAAA GAAACTTGGC TGCTTAAAAT AAGCAAAAAT TGGATGCATA AAGTAATATT 5820
TACAGATGTG GGGAGATGTA ATAAAACAAT ATTAACTTGG TTTCTTGTTT TTGCTGTATT 5880
TAGAGATTAA ATAATTCTAA GATGATCACT TTGCAAAATT ATGCTTATGG CTGGCATGGA 5940
AATAGAAATA CTCAATTATG TCTTTGTTGT ATTAATGGGG AATATTTTGG ACAATGTTTC 6000
ATTATCAAAT TGTCGACATC ATTAATATAT ATTGTAATGT TGGGAAGAGA TCACTATTTT 6060
GAAGCACAGC TTTACAGATG AGTATCTATG ATACATATGT ATAATAAATT TTGATCGGGT 6120
ATTAAAAGTA TTAGAAGGTG GTTATAATTG CAGAGTATTC CATGAATAGT ACACTGACAC 6180
AGGGGTTTTA CTTTGAGGAC CAGTGTAGTC AAGGGAAAAC ATGAGTTAAA AAGAAAAGCA 6240
GGCAATATTG CAGTCTTGAT TCTGCCACTT ACAGGATAGA TAATGCCTGA ACTTTAATGA 6300
CAAGATGATC CAACCATAAA GGTGCTCTGT GCTTCACAGT GAATCTTTTC CCCATGCAGG 6360
AGTGTGCTCC CCTACAAACG TTAAGACTGA TCATTTCAAA AATCTATTAG CTATATCAAA 6420
AGCCTTACAT TTTAATATAG GTTGAACCAA AATTTCAATT CCAGTAACTT CTATTGTAAC 6480
CATTATTTTT GTGTATGTCT TCAAGAATGT TCATTGGATT TTTGTTTGTA ATAGTAAAAT 6540
ACCGGATACA TTTCACGTGT CCTTCAGTAT TGATTTGGTT GAATATTGGG TCATAATGGT 6600
TGAGAAGCAT GGACACTAGA GCCAGAATGC TTGGATATGA ATCCTGGATC TGTCACTTAC 6660
TTCTGTGTGA CCTTTGAAAG GCTACTTATT TCCTCTCTTA GCTTTCTCAT TAAAATCAAT 6720
GAACAATGCC AGCCTCATGG GGTTGTTGAA TGATTAAATT AGTTAATATA CCTAAAGTAC 6780



ATAGAACACT GCCTGCACAT AGTAAAAGAA TTATAAGTGT GAGGTAGTTG GTAAAATTAT 6840

GTAGTTGGAT ATACTACCGA ACAATATCTA ATCTCTTTTT AGGGAAATAA AGTTTGTGCA 6900 
TATATATAAT CCCGAAACAT G 

Seq ID NO: 32 Protein sequence: 
Protein Accession #: NP_001932.1 

1 11 21 31 41 51

I I I I I I

MAAAGPRRSV RGAVCLHLLL TLVIFSRDGE ACKKVILNVP SKLEADKIIG RVNLEECFRS 60 

ADLIRSSDPD FRVLNDGSVY TARAVALSDK KRSFTIWLSD KRKQTQKEVT VLLEHQKKVS 120 

KTRHTRETVL RRAKRRWAPI PCSMQENSLG PFPLFLQQVE SDAAQNYTVF YSISGRGVDK 180 

EPLNLFYIER DTGNLFCTRP VDREEYDVFD LIAYASTADG YSADLPLPLP IRVEDENDNH 240 

PVFTEAIYNF EVLESSRPGT TVGVVCATDR DEPDTMHTRL KYSILQQTPR SPGLFSVHPS 300

TGVITTVSHY LDREVVDKYS LIMKVQDMDG QFFGLIGTST CIITVTDSND NAPTFRQNAY 360

EAFVEENAFN VEILRIPIED KDLINTANWR VNFTILKGNE NGHFKISTDK ETNEGVLSVV 420

KPLNYEENRQ VNLEIGVNNE APFARDIPRV TALNRALVTV HVRDLDEGPE CTPAAQYVRI 480 

KENLAVGSKI NGYKAYDPEN RNGNGLRYKK LHDPKGWITI DEISGSIITS KILDREVETP 540 

KNELYNITVL AIDKDDRSCT GTLAVNIEDV NDNPPEILQE YVVICKPKMG YTDILAVDPD 600

EPVHGAPFYF SLPNTSPEIS RLWSLTKVND TAARLSYQKN AGFQEYTIPI TVKDRAGQAA 660

TKLLRVNLCE CTHPTQCRAT SRSTGVILGK WAILAILLGI ALLFSVLLTL VCGVFGATKG 720

KRFPEDLAQQ NLIISNTEAP GDDRVCSANG FMTQTTNNSS QGFCGTMGSG MKNGGQETIE 780

MMKGGNQTLE SCRGAGHHHT LDSCRGGHTE VDNCRYTYSE WHSFTQPRLG EKLHRCNQNE 840
DRMPSQDYVL TYNYEGRGSP AGSVGCCSEK QEEDGLDFLN NLEPKFITLA EACTKR 



Seq ID NO: 33 DNA sequence

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 64-2583 

1 11 21 31 41 51 

I I I I I I 

GGCAGGTCTC GCTCTCGGCA CCCTCCCGGC GCCCGCGTTC TCCTGGCCCT GCCCGGCATC 60 

CCGATGGCCG CCGCTGGGCC CCGGCGCTCC GTGCGCGGAG CCGTCTGCCT GCATCTGCTG 120 

CTGACCCTCG TGATCTTCAG TCGTGATGGT GAAGCCTGCA AAAAGGTGAT ACTTAATGTA 180 

CCTTCTAAAC TAGAGGCAGA CAAAATAATT GGCAGAGTTA ATTTGGAAGA GTGCTTCAGG 240 

TCTGCAGACC TCATCCGGTC AAGTGATCCT GATTTCAGAG TTCTAAATGA TGGGTCAGTG 300 

TACACAGCCA GGGCTGTTGC GCTGTCTGAT AAGAAAAGAT CATTTACCAT ATGGCTTTCT 360 

GACAAAAGGA AACAGACACA GAAAGAGGTT ACTGTGCTGC TAGAACATCA GAAGAAGGTA 420 

TCGAAGACAA GACACACTAG AGAAACTGTT CTCAGGCGTG CCAAGAGGAG ATGGGCACCT 480 

ATTCCTTGCT CTATGCAAGA GAATTCCTTG GGCCCTTTCC CATTGTTTCT TCAACAAGTT 540 

GAATCTGATG CAGCACAGAA CTATACTGTC TTCTACTCAA TAAGTGGACG TGGAGTTGAT 600 

AAAGAACCTT TAAATTTGTT TTATATAGAA AGAGACACTG GAAATCTATT TTGCACTCGG 660 

CCTGTGGATC GTGAAGAATA TGATGTTTTT GATTTGATTG CTTATGCGTC AACTGCAGAT 720 

GGATATTCAG CAGATCTGCC CCTCCCACTA CCCATCAGGG TAGAGGATGA AAATGACAAC 780 

CACCCTGTTT TCACAGAAGC AATTTATAAT TTTGAAGTTT TGGAAAGTAG TAGACCTGGT 840 

ACTACAGTGG GGGTGGTTTG TGCCACAGAC AGAGATGAAC CGGACACAAT GCATACGCGC 900 

CTGAAATACA GCATTTTGCA GCAGACACCA AGGTCACCTG GGCTCTTTTC TGTGCATCCC 960 

AGCACAGGCG TAATCACCAC AGTCTCTCAT TATTTGGACA GAGAGGTTGT AGACAAGTAC 1020 

TCATTGATAA TGAAAGTACA AGACATGGAT GGCCAGTTTT TTGGATTGAT AGGCACATCA 1080 

ACTTGTATCA TAACAGTAAC AGATTCAAAT GATAATGCAC CCACTTTCAG ACAAAATGCT 1140 

TATGAAGCAT TTGTAGAGGA AAATGCATTC AATGTGGAAA TCTTACGAAT ACCTATAGAA 1200 

GATAAGGATT TAATTAACAC TGCCAATTGG AGAGTCAATT TTACCATTTT AAAGGGAAAT 1260 

GAAAATGGAC ATTTCAAAAT CAGCACAGAC AAAGAAACTA ATGAAGGTGT TCTTTCTGTT 1320 

GTAAAGCCAC TGAATTATGA AGAAAACCGT CAAGTGAACC TGGAAATTGG AGTAAACAAT 1380 

GAAGCGCCAT TTGCTAGAGA TATTCCCAGA GTGACAGCCT TGAACAGAGC CTTGGTTACA 1440 

GTTCATGTGA GGGATCTGGA TGAGGGGCCT GAATGCACTC CTGCAGCCCA ATATGTGCGG 1500 

ATTAAAGAAA ACTTAGCAGT GGGGTCAAAG ATCAACGGCT ATAAGGCATA TGACCCCGAA 1560 

AATAGAAATG GCAATGGTTT AAGGTACAAA AAATTGCATG ATCCTAAAGG TTGGATCACC 1620 

ATTGATGAAA TTTCAGGGTC AATCATAACT TCCAAAATCC TGGATAGGGA GGTTGAAACT 1680 

CCCAAAAATG AGTTGTATAA TATTACAGTC CTGGCAATAG ACAAAGATGA TAGATCATGT 1740 

ACTGGAACAC TTGCTGTGAA CATTGAAGAT GTAAATGATA ATCCACCAGA AATACTTCAA 1800 

GAATATGTAG TCATTTGCAA ACCAAAAATG GGGTATACCG ACATTTTAGC TGTTGATCCT 1860 

GATGAACCTG TCCATGGAGC TCCATTTTAT TTCAGTTTGC CCAATACTTC TCCAGAAATC 1920 

AGTAGACTGT GGAGCCTCAC CAAAGTTAAT GATACAGCTG CCCGTCTTTC ATATCAGAAA 1980 

AATGCTGGAT TTCAAGAATA TACCATTCCT ATTACTGTAA AAGACAGGGC CGGCCAAGCT 2040 

GCAACAAAAT TATTGAGAGT TAATCTGTGT GAATGTACTC ATCCAACTCA GTGTCGTGCG 2100 

ACTTCAAGGA GTACAGGAGT AATACTTGGA AAATGGGCAA TCCTTGCAAT ATTACTGGGT 2160 

ATAGCACTGC TCTTTTCTGT ATTGCTAACT TTAGTATGTG GAGTTTTTGG TGCAACTAAA 2220 

GGGAAACGTT TTCCTGAAGA TTTAGCACAG CAAAACTTAA TTATATCAAA CACAGAAGCA 2280 

CCTGGAGACG ATAGAGTGTG CTCTGCCAAT GGATTTATGA CCCAAACTAC CAACAACTCT 2340 

AGCCAAGGTT TTTGTGGTAC TATGGGATCA GGAATGAAAA ATGGAGGGCA GGAAACCATT 2400 

GAAATGATGA AAGGAGGAAA CCAGACCTTG GAATCCTGCC GGGGGGCTGG GCATCATCAT 2460 

ACCCTGGACT CCTGCAGGGG AGGACACACG GAGGTGGACA ACTGCAGATA CACTTACTCG 2520 

GAGTGGCACA GTTTTACTCA ACCCCGTCTC GGTGAAGAAT CCATTAGAGG ACACACTGGT 2580 

TAAAAATTAA ACATAAAAGA AATTGCATCG ATGTAATCAG AATGAAGACC GCATGCCATC 2640 

CCAAGATTAT GTCCTCACTT ATAACTATGA GGGAAGAGGA TCTCCAGCTG GTTCTGTGGG 2700 

CTGCTGCAGT GAAAAGCAGG AAGAAGATGG CCTTGACTTT TTAAATAATT TGGAACCCAA 2760 

ATTTATTACA TTAGCAGAAG CATGCACAAA GAGATAATGT CACAGTGCTA CAATTAGGTC 2820 

TTTGTCAGAC ATTCTGGAGG TTTCCAAAAA TAATATTGTA AAGTTCAATT TCAACATGTA 2880 

TGTATATGAT GATTTTTTTC TCAATTTTGA ATTATGCTAC TCACCAATTT ATATTTTTAA 2940 

AGCCAGTTGT TGCTTATCTT TTCCAAAAAG TGAAAAATGT TAAAACAGAC AACTGGTAAA 3000 

TCTCAAACTC CAGCACTGGA ATTAAGGTCT CTAAAGCATC TGCTCTTTTT TTTTTTTACG 3060 

GATATTTTAG TAATAAATAT GCTGGATAAA TATTAGTCCA ACAATAGCTA AGTTATGCTA 3120 

ATATCACATT ATTATGTATT CACTTTAAGT GATAGTTTAA AAAATAAACA AGAAATATTG 3180 

AGTATCACTA TGTGAAGAAA GTTTTGGAAA AGAAACAATG AAGACTGAAT TAAATTAAAA 3240 

ATGTTGCAGC TCATAAAGAA TTGGGACTCA CCCCTACTGC ACTACCAAAT TCATTTGACT 3300 
TTGGAGGCAA AATGTGTTGA AGTGCCCTAT GAAGTAGCAA TTTTCTATAG GAATATAGTT 3360

GGAAATAAAT GTGTGTGTGT ATATTATTAT TAATCAATGC AATATTTAAA ATGAAATGAG 3420 

AACAAAGAGG AAAATGGTAA AAACTTGAAA TGAGGCTGGG GTATAGTTTG TCCTACAATA 3480 

GAAAAAAGAG AGAGCTTCCT AGGCCTGGGC TCTTAAATGC TGCATTATAA CTGAGTCTAT 3540 

GAGGAAATAG TTCCTGTCCA ATTTGTGTAA TTTGTTTAAA ATTGTAAATA AATTAAACTT 3600 

TTCTGGTTTC TGTGGGAAGG AAATAGGGAA TCCAATGGAA CAGTAGCTTT GCTTTGCAGT 3660 

CTGTTTCAAG ATTTCTGCAT CCACAAGTTA GTAGCAAACT GGGGAATACT CGCTGCAGCT 3720 

GGGGTTCCCT GCTTTTTGGT AGCAAGGGTC CAGAGATGAG GTGTTTTTTT CGGGGAGCTA 3780 

ATAACAAAAA CATTTTAAAA CTTACCTTTA CTGAAGTTAA ATCCTCTATT GCTGTTTCTA 3840 

TTCTCTCTTA TAGTGACCAA CATCTTTTTA ATTTAGATCC AAATAACCAT GTCCTCCTAG 3900 

AGTTTAGAGG CTAGAGGGAG CTGAGGGGAG GATCTTACTG AAAGCACCCT GGGGAGATTG 3960 

ATTGTCCTTA AACCTAAGCC CCACAAACTT GACACCTGAT CAGGTCTGGG AGCTACAAAA 4020 

TTTCATTTTT CTCCTCACTG CCCTTCTTCT GAGTGGCATT GGCCTGAATC AAGGAAAGCC 4080 

AGGCCTTGTG GGCCCCCTTC TTTCGGCTTT CTGCTAAAGC AACACCTCCA GCAGAGATTC 4140 

CCTTAAGTGA CTCCAGGTTT TCCACCATCC TTCAGCGTGA ATTAATTTTT AATCAGTTTG 4200 

CTTTCTCCAG AGAAATTTTA AAATAATAGA AGAAATAGAA ATTTTGAATG TATAAAAGAA 4260 

AAAGATCAAG TTGTCATTTT AGAACAGAGG GAACTTTGGG AGAAAGCAGC CCAAGTAGGT 4320 

TATTTGTACA GTCAGAGGGC AACAGGAAGA TGCAGGCCTT CAAGGGCAAG GAGAGGCCAC 4380 

AAGGAATATG GGTGGGAGTA AAAGCAACAT CGTCTGCTTC ATACTTTTTC CTAGGCTTGG 4440 

CACTGCCTTT TCCTTTCTCA GGCCAATGGC AACTGCCATT TGAGTCCGGT GAGGGATCAG 4500 

CCAACCTCTT CTCTATGGCT CACCTTATTT GGAGTGAGAA ATCAAGGAGA CAGAGCTGAC 4560 

TGCATGATGA GTCTGAAGGC ATTTGCAGGA TGAGCCTGAA CTGGTTGTGC AGAACAAACA 4620 

AGGCATTCAT GGGAATTGTT GTATTCCTTC TGCAGCCCTC CTTCTGGGCA CTAAGAAGGT 4680 

CTATGAATTA AATGCCTATC TAAAATTCTG ATTTATTCCT ACATTTTCTG TTTTCTAATT 4740 

TGACCCTAAA ATCTATGTGT TTTAGACTTA GACTTTTTAT TGCCCCCCCC CCCTTTTTTT 4800 

TTGAGACGGA GTCTCGCTCT GACGCACAGG CTGGAGTGCA GTGGCTCCGA TCTCTGCTCA 4860 

CTGAAAGCTC CGCCTCCCGG GTTCATGCCA TTCTCCTGCC TCAGCCTCCT GAGTAGCTGG 4920 

GACTACAGGC GCCCACCACC ACGCCCGGCT AATTTTTTGT ATTTTTAATA GAGACGGGGT 4980 

TTCACTGTGT TAGCCAGGAT GGTCTCGATC TCCTGACCTC GTGATCCGCC TGCCTCGGCC 5040 

TCCCAAAGTG CTGGGATTAC AGGCATGACC CACCGCTCCC GGCCTTGTTT TCCGTTTAAA 5100 

GTCGTCTTCT TTTAATGTAA TCATTTTGAA CATGTGTGAA AGTTGATCAT ACGAATTGGA 5160 

TCAATCTTGA AATACTCAAC CAAAAGACAG TCGAGAAGCC AGGGGGAGAA AGAACTCAGG 5220 

GCACAAAATA TTGGTCTGAG AATGGAATTC TCTGTAAGCC TAGTTGCTGA AATTTCCTGC 5280 

TGTAACCAGA AGCCAGTTTT ATCTAACGGG TACTGAAACA CCCACTGTGT TTTGCTCACT 5340 

CCCACTCACC GATCAAAACC TGCTACCTCC CCAAGACTTT ACTAGTGCCG ATAAACTTTC 5400 

TCAAAGAGCA ACCAGTATCA CTTCCCTGTT TATAAAACCT CTAACCATCT CTTTGTTCTT 5460 

TGAACATGCT GAAAACCACC TGGTCTGCAT GTATGCCCGA ATTTGTAATT CTTTTCTCTC 5520 

AAATGAAAAT TTAATTTTAG GGATTCATTT CTATATTTTC ACATATGTAG TATTATTATT 5580 

TCCTTATATG TGTAAGGTGA AATTTATGGT ATTTGAGTGT GCAAGAAAAT ATATTTTTAA 5640 

AGCTTTCATT TTTCCCCCAG TGAATGATTT AGAATTTTTT ATGTAAATAT ACAGAATGTT 5700 

TTTTCTTACT TTTATAAGGA AGCAGCTGTC TAAAATGCAG TGGGGTTTGT TTTGCAATGT 5760 

TTTAAACAGA GTTTTAGTAT TGCTATTAAA AGAAGTTACT TTGCTTTTAA AGAAACTTGG 5820 

CTGCTTAAAA TAAGCAAAAA TTGGATGCAT AAAGTAATAT TTACAGATGT GGGGAGATGT 5880 

AATAAAACAA TATTAACTTG GCTGCTTAAA ATAAGCAAAA ATTGGATGCA TAAAGTAATA 5940 

TTTACAGATG TGGGGAGATG TAATAAAACA ATATTAACTT GGTTTCTTGT TTTTGCTGTA 6000 

TTTAGAGATT AAATAATTCT AAGATGATCA CTTTGCAAAA TTATGCTTAT GGCTGGCATG 6060 

GAAATAGAAA TACTCAATTA TGTCTTTGTT GTATTAATGG GGAATATTTT GGACAATGTT 6120 

TCATTATCAA ATTGTCGACA TCATTAATAT ATATTGTAAT GTTGGGAAGA GATCACTATT 6180 

TTGAAGCACA GCTTTACAGA TGAGTATCTA TGATACATAT GTATAATAAA TTTTGATCGG 6240 

GTATTAAAAG TATTAGAAGG TGGTTATAAT TGCAGAGTAT TCCATGAATA GTACACTGAC 6300 

ACAGGGGTTT TACTTTGAGG ACCAGTGTAG TCAAGGGAAA ACATGAGTTA AAAAGAAAAG 6360 

CAGGCAATAT TGCAGTCTTG ATTCTGCCAC TTACAGGATA GATAATGCCT GAACTTTAAT 6420 

GACAAGATGA TCCAACCATA AAGGTGCTCT GTGCTTCACA GTGAATCTTT TCCCCATGCA 6480 

GGAGTGTGCT CCCCTACAAA CGTTAAGACT GATCATTTCA AAAATCTATT AGCTATATCA 6540 

AAAGCCTTAC ATTTTAATAT AGGTTGAACC AAAATTTCAA TTCCAGTAAC TTCTATTGTA 6600 

ACCATTATTT TTGTGTATGT CTTCAAGAAT GTTCATTGGA TTTTTGTTTG TAATAGTAAA 6660 

ATACCGGATA CATTTCACGT GTCCTTCAGT ATTGATTTGG TTGAATATTG GGTCATAATG 6720 

GTTGAGAAGC ATGGACACTA GAGCCAGAAT GCTTGGATAT GAATCCTGGA TCTGTCACTT 6780 

ACTTCTGTGT GACCTTTGAA AGGCTACTTA TTTCCTCTCT TAGCTTTCTC ATTAAAATCA 6840 

ATGAACAATG CCAGCCTCAT GGGGTTGTTG AATGATTAAA TTAGTTAATA TACCTAAAGT 6900 

ACATAGAACA CTGCCTGCAC ATAGTAAAAG AATTATAAGT GTGAGGTAGT TGGTAAAATT 6960 

ATGTAGTTGG ATATACTACC GAACAATATC TAATCTCTTT TTAGGGAAAT AAAGTTTGTG 7020 
CATATATATA ATCCCGAAAC ATG 

Seq ID NO: 34 Protein sequence: 
Protein Accession #: NP_077741.1

1 11 21 31 41 51 

I I I I I I 

MAAAGPRRSV RGAVCLHLLL TLVIFSRDGE ACKKVILNVP SKLEADKIIG RVNLEECFRS 60 

ADLIRSSDPD FRVLNDGSVY TARAVALSDK KRSFTIWLSD KRKQTQKEVT VLLEHQKKVS 120 

KTRHTRETVL RRAKRRWAPI PCSMQENSLG PFPLFLQQVE SDAAQNYTVF YSISGRGVDK 180 

EPLNLFYIER DTGNLFCTRP VDREEYDVFD LIAYASTADG YSADLPLPLP IRVEDENDNH 240 

PVFTEAIYNF EVLESSRPGT TVGVVCATDR DEPDTMHTRL KYSILQQTPR SPGLFSVHPS 300

TGVITTVSHY LDREVVDKYS LIMKVQDMDG QFFGLIGTST CIITVTDSND NAPTFRQNAY 360

EAFVEENAFN VEILRIPIED KDLINTANWR VNFTILKGNE NGHFKISTDK ETNEGVLSVV 420

KPLNYEENRQ VNLEIGVNNE APFARDIPRV TALNRALVTV HVRDLDEGPE CTPAAQYVRI 480 

KENLAVGSKI NGYKAYDPEN RNGNGLRYKK LHDPKGWITI DEISGSIITS KILDREVETP 540 

KNELYNITVL AIDKDDRSCT GTLAVNIEDV NDNPPEILQE YVVICKPKMG YTDILAVDPD 600

EPVHGAPFYF SLPNTSPEIS RLWSLTKVND TAARLSYQKN AGFQEYTIPI TVKDRAGQAA 660

TKLLRVNLCE CTHPTQCRAT SRSTGVILGK WAILAILLGI ALLFSVLLTL VCGVFGATKG 720 

KRFPEDLAQQ NLIISNTEAP GDDRVCSANG FMTQTTNNSS QGFCGTMGSG MKNGGQETIE 780 
MMKGGNQTLE SCRGAGHHHT LDSCRGGHTE VDNCRYTYSE WHSFTQPRLG EESIRGHTG 

Seq ID NO: 35 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 146-1273
GGGAGTGGGC GTGGCGGTGC TGCCCAGGTG AGCCACCGCT GCTTCTGCCC AGACACGGTC 60

GCCTCCACAT CCAGGTCTTT GTGCTCCTCG CTTGCCTGTT CCTTTTCCAC GCATTTTCCA 120 

GGATAACTGT GACTCCAGGC CCGCAATGGA TGCCCTGCAA CTAGCAAATT CGGCTTTTGC 180 

CGTTGATCTG TTCAAACAAC TATGTGAAAA GGAGCCACTG GGCAATGTCC TCTTCTCTCC 240 

AATCTGTCTC TCCACCTCTC TGTCACTTGC TCAAGTGGGT GCTAAAGGTG ACACTGCAAA 300 

TGAAATTGGA CAGGTTCTTC ATTTTGAAAA TGTCAAAGAT ATACCCTTTG GATTTCAAAC 360 

AGTAACATCG GATGTAAACA AACTTAGTTC CTTTTACTCA CTGAAACTAA TCAAGCGGCT 420 

CTACGTAGAC AAATCTCTGA ATCTTTCTAC AGAGTTCATC AGCTCTACGA AGAGACCCTA 480 

TGCAAAGGAA TTGGAAACTG TTGACTTCAA AGATAAATTG GAAGAAACGA AAGGTCAGAT 540 

CAACAACTCA ATTAAGGATC TCACAGATGG CCACTTTGAG AACATTTTAG CTGACAACAG 600 

TGTGAACGAC CAGACCAAAA TCCTTGTGGT TAATGCTGCC TACTTTGTTG GCAAGTGGAT 660 

GAAGAAATTT CCTGAATCAG AAACAAAAGA ATGTCCTTTC AGACTCAACA AGACAGACAC 720 

CAAACCAGTG CAGATGATGA ACATGGAGGC CACGTTCTGT ATGGGAAACA TTGACAGTAT 780 

CAATTGTAAG ATCATAGAGC TTCCTTTTCA AAATAAGCAT CTCAGCATGT TCATCCTACT 840 

ACCCAAGGAT GTGGAGGATG AGTCCACAGG CTTGGAGAAG ATTGAAAAAC AACTCAACTC 900 

AGAGTCACTG TCACAGTGGA CTAATCCCAG CACCATGGCC AATGCCAAGG TCAAACTCTC 960 

CATTCCAAAA TTTAAGGTGG AAAAGATGAT TGATCCCAAG GCTTGTCTGG AAAATCTAGG 1020 

GCTGAAACAT ATCTTCAGTG AAGACACATC TGATTTCTCT GGAATGTCAG AGACCAAGGG 1080 

AGTGGCCCTA TCAAATGTTA TCCACAAAGT GTGCTTAGAA ATAACTGAAG ATGGTGGGGA 1140 

TTCCATAGAG GTGCCAGGAG CACGGATCCT GCAGCACAAG GATGAATTGA ATGCTGACCA 1200 

TCCCTTTATT TACATCATCA GGCACAACAA AACTCGAAAC ATCATTTTCT TTGGCAAATT 1260 

CTGTTCTCCT TAAGTGGCAT AGCCCATGTT AAGTCCTCCC TGACTTTTCT GTGGATGCCG 1320 

ATTTCTGTAA ACTCTGCATC CAGAGATTCA TTTTCTAGAT ACAATAAATT GCTAATGTTG 1380 

CTGGATCAGG AAGCCGCCAG TACTTGTCAT ATGTAGCCTT CACACAGATA GACCTTTTTT 1440 

TTTTTCCAAT TCTATCTTTT GTTTCCTTTT TTCCCATAAG ACAATGACAT ACGCTTTTAA 1500 

TGAAAAGGAA TCACGTTAGA GGAAAAATAT TTATTCATTA TTTGTCAAAT TGTCCGGGGT 1560 

AGTTGGCAGA AATACAGTCT TCCACAAAGA AAATTCCTAT AAGGAAGATT TGGAAGCTCT 1620 

TCTTCCCAGC ACTATGCTTT CCTTCTTTGG GATAGAGAAT GTTCCAGACA TTCTCGCTTC 1680 

CCTGAAAGAC TGAAGAAAGT GTAGTGCATG GGACCCACGA AACTGCCCTG GCTCCAGTGA 1740 

AACTTGGGCA CATGCTCAGG CTACTATAGG TCCAGAAGTC CTTATGTTAA GCCCTGGCAG 1800 

GCAGGTGTTT ATTAAAATTC TGAATTTTGG GGATTTTCAA AAGATAATAT TTTACATACA 1860 

CTGTATGTTA TAGAACTTCA TGGATCAGAT CTGGGGCAGC AACCTATAAA TCAACACCTT 1920 

AATATGCTGC AACAAAATGT AGAATATTCA GACAAAATGG ATACATAAAG ACTAAGTAGC 1980 

CCATAAGGGG TCAAAATTTG CTGCCAAATG CGTATGCCAC CAACTTACAA AAACACTTCG 2040 

TTCGCAGAGC TTTTCAGATT GTGGAATGTT GGATAAGGAA TTATAGACCT CTAGTAGCTG 2100 

AAATGCAAGA CCCCAAGAGG AAGTTCAGAT CTTAATATAA ATTCACTTTC ATTTTTGATA 2160 

GCTGTCCCAT CTGGTCATGT GGTTGGCACT AGACTGGTGG CAGGGGCTTC TAGCTGACTC 2220 

GCACAGGGAT TCTCACAATA GCCGATATCA GAATTTGTGT TGAAGGAACT TGTCTCTTCA 2280 

TCTAATATGA TAGCGGGAAA AGGAGAGGAA ACTACTGCCT TTAGAAAATA TAAGTAAAGT 2340 

GATTAAAGTG CTCACGTTAC CTTGACACAT AGTTTTTCAG TCTATGGGTT TAGTTACTTT 2400 

AGATGGCAAG CATGTAACTT ATATTAATAG TAATTTGTAA AGTTGGGTGG ATAAGCTATC 2460 

CCTGTTGCCG GTTCATGGAT TACTTCTCTA TAAAAAATAT ATATTTACCA AAAAATTTTG 2520 

TGACATTCCT TCTCCCATCT CTTCCTTGAC ATGCATTGTA AATAGGTTCT TCTTGTTCTG 2580 
AGATTCAATA TTGAATTTCT CCTATGCTAT TGACAATAAA ATATTATTGA ACTACC 

Seq ID NO: 36 Protein sequence: 
Protein Accession #: NP_002630.1 

1 11 21 31 41 51 
MDALQLANSA FAVDLFKQLC EKEPLGNVLF SPICLSTSLS LAQVGAKGDT ANEIGQVLHF 60 

ENVKDIPFGF QTVTSDVNKL SSFYSLKLIK RLYVDKSLNL STEFISSTKR PYAKELETVD 120 

FKDKLEETKG QINNSIKDLT DGHFENILAD NSVNDQTKIL VVNAAYFVGK WMKKFPESET 180 

KECPFRLNKT DTKPVQMMNM EATFCMGNID SINCKIIELP FQNKHLSMFI LLPKDVEDES 240 

TGLEKIEKQL NSESLSQWTN PSTMANAKVK LSIPKFKVEK MIDPKACLEN LGLKHIFSED 300 

TSDFSGMSET KGVALSNVIH KVCLEITEDG GDSIEVPGAR ILQHKDELNA DHPFIYIIRH 360 
NKTRNIIFFG KFCSP 



Seq ID NO: 37 DNA sequence 

Nucleic Acid Accession #: NM_016583 

Coding sequence: 72-842 

1 11 21 31 41 51 

I I I I I I 

GGAGTGGGGG AGAGAGAGGA GACCAGGACA GCTGCTGAGA CCTCTAAGAA GTCCAGATAC 60 

TAAGAGCAAA GATGTTTCAA ACTGGGGGCC TCATTGTCTT CTACGGGCTG TTAGCCCAGA 120 

CCATGGCCCA GTTTGGAGGC CTGCCCGTGC CCCTGGACCA GACCCTGCCC TTGAATGTGA 180 

ATCCAGCCCT GCCCTTGAGT CCCACAGGTC TTGCAGGAAG CTTGACAAAT GCCCTCAGCA 240 

ATGGCCTGCT GTCTGGGGGC CTGTTGGGCA TTCTGGAAAA CCTTCCGCTC CTGGACATCC 300 

TGAAGCCTGG AGGAGGTACT TCTGGTGGCC TCCTTGGGGG ACTGCTTGGA AAAGTGACGT 360 

CAGTGATTCC TGGCCTGAAC AACATCATTG ACATAAAGGT CACTGACCCC CAGCTGCTGG 420 

AACTTGGCCT TGTGCAGAGC CCTGATGGCC ACCGTCTCTA TGTCACCATC CCTCTCGGCA 480 

TAAAGCTCCA AGTGAATACG CCCCTGGTCG GTGCAAGTCT GTTGAGGCTG GCTGTGAAGC 540 

TGGACATCAC TGCAGAAATC TTAGCTGTGA GAGATAAGCA GGAGAGGATC CACCTGGTCC 600 

TTGGTGACTG CACCCATTCC CCTGGAAGCC TGCAAATTTC TCTGCTTGAT GGACTTGGCC 660 

CCCTCCCCAT TCAAGGTCTT CTGGACAGCC TCACAGGGAT CTTGAATAAA GTCCTGCCTG 720 

AGTTGGTTCA GGGCAACGTG TGCCCTCTGG TCAATGAGGT TCTCAGAGGC TTGGACATCA 780 

CCCTGGTGCA TGACATTGTT AACATGCTGA TCCACGGACT ACAGTTTGTC ATCAAGGTCT 840 

AAGCCTTCCA GGAAGGGGCT GGCCTCTGCT GAGCTGCTTC CCAGTGCTCA CAGATGGCTG 900 

GCCCATGTGC TGGAAGATGA CACAGTTGCC TTCTCTCCGA GGAACCTGCC CCCTCTCCTT 960 

TCCCACCAGG CGTGTGTAAC ATCCCATGTG CCTCACCTAA TAAAATGGCT CTTCTTCTGC 1020 
AAAAAAAAAA AAAAAAAAAA AAAAAAAAA 
Seq ID NO: 38 Protein sequence: 
Protein Accession #: NP_057667 
MFQTGGLIVF YGLLAQTMAQ FGGLPVPLDQ TLPLNVNPAL PLSPTGLAGS LTNALSNGLL 60 
SGGLLGILEN LPLLDILKPG GGTSGGLLGG LLGKVTSVIP GLNNIIDIKV TDPQLLELGL 120 
VQSPDGHRLY VTIPLGIKLQ VNTPLVGASL LRLAVKLDIT AEILAVRDKQ ERIHLVLGDC 180 
THSPGSLQIS LLDGLGPLPI QGLLDSLTGI LNKVLPELVQ GNVCPLVNEV LRGLDITLVH 240 
DIVNMLIHGL QFVIKV 



Seq ID NO: 39 DNA sequence 

Nucleic Acid Accession #: NM_004363.1 

Coding sequence: 115-2223 

1 11 21 31 41 51 

I I I I I I 

CTCAGGGCAG AGGGAGGAAG GACAGCAGAC CAGACAGTCA CAGCAGCCTT GACAAAACGT 60 

TCCTGGAACT CAAGCTCTTC TCCACAGAGG AGGACAGAGC AGACAGCAGA GACCATGGAG 120 

TCTCCCTCGG CCCCTCCCCA CAGATGGTGC ATCCCCTGGC AGAGGCTCCT GCTCACAGCC 180 

TCACTTCTAA CCTTCTGGAA CCCGCCCACC ACTGCCAAGC TCACTATTGA ATCCACGCCG 240 

TTCAATGTCG CAGAGGGGAA GGAGGTGCTT CTACTTGTCC ACAATCTGCC CCAGCATCTT 300 

TTTGGCTACA GCTGGTACAA AGGTGAAAGA GTGGATGGCA ACCGTCAAAT TATAGGATAT 360 

GTAATAGGAA CTCAACAAGC TACCCCAGGG CCCGCATACA GTGGTCGAGA GATAATATAC 420 

CCCAATGCAT CCCTGCTGAT CCAGAACATC ATCCAGAATG ACACAGGATT CTACACCCTA 480 

CACGTCATAA AGTCAGATCT TGTGAATGAA GAAGCAACTG GCCAGTTCCG GGTATACCCG 540 

GAGCTGCCCA AGCCCTCCAT CTCCAGCAAC AACTCCAAAC CCGTGGAGGA CAAGGATGCT 600 

GTGGCCTTCA CCTGTGAACC TGAGACTCAG GACGCAACCT ACCTGTGGTG GGTAAACAAT 660 

CAGAGCCTCC CGGTCAGTCC CAGGCTGCAG CTGTCCAATG GCAACAGGAC CCTCACTCTA 720 

TTCAATGTCA CAAGAAATGA CACAGCAAGC TACAAATGTG AAACCCAGAA CCCAGTGAGT 780 

GCCAGGCGCA GTGATTCAGT CATCCTGAAT GTCCTCTATG GCCCGGATGC CCCCACCATT 840 

TCCCCTCTAA ACACATCTTA CAGATCAGGG GAAAATCTGA ACCTCTCCTG CCACGCAGCC 900 

TCTAACCCAC CTGCACAGTA CTCTTGGTTT GTCAATGGGA CTTTCCAGCA ATCCACCCAA 960 

GAGCTCTTTA TCCCCAACAT CACTGTGAAT AATAGTGGAT CCTATACGTG CCAAGCCCAT 1020 

AACTCAGACA CTGGCCTCAA TAGGACCACA GTCACGACGA TCACAGTCTA TGCAGAGCCA 1080 

CCCAAACCCT TCATCACCAG CAACAACTCC AACCCCGTGG AGGATGAGGA TGCTGTAGCC 1140 

TTAACCTGTG AACCTGAGAT TCAGAACACA ACCTACCTGT GGTGGGTAAA TAATCAGAGC 1200 

CTCCCGGTCA GTCCCAGGCT GCAGCTGTCC AATGACAACA GGACCCTCAC TCTACTCAGT 1260 

GTCACAAGGA ATGATGTAGG ACCCTATGAG TGTGGAATCC AGAACGAATT AAGTGTTGAC 1320 

CACAGCGACC CAGTCATCCT GAATGTCCTC TATGGCCCAG ACGACCCCAC CATTTCCCCC 1380 

TCATACACCT ATTACCGTCC AGGGGTGAAC CTCAGCCTCT CCTGCCATGC AGCCTCTAAC 1440 

CCACCTGCAC AGTATTCTTG GCTGATTGAT GGGAACATCC AGCAACACAC ACAAGAGCTC 1500 

TTTATCTCCA ACATCACTGA GAAGAACAGC GGACTCTATA CCTGCCAGGC CAATAACTCA 1560 

GCCAGTGGCC ACAGCAGGAC TACAGTCAAG ACAATCACAG TCTCTGCGGA GCTGCCCAAG 1620 

CCCTCCATCT CCAGCAACAA CTCCAAACCC GTGGAGGACA AGGATGCTGT GGCCTTCACC 1680 

TGTGAACCTG AGGCTCAGAA CACAACCTAC CTGTGGTGGG TAAATGGTCA GAGCCTCCCA 1740 

GTCAGTCCCA GGCTGCAGCT GTCCAATGGC AACAGGACCC TCACTCTATT CAATGTCACA 1800 

AGAAATGACG CAAGAGCCTA TGTATGTGGA ATCCAGAACT CAGTGAGTGC AAACCGCAGT 1860 

GACCCAGTCA CCCTGGATGT CCTCTATGGG CCGGACACCC CCATCATTTC CCCCCCAGAC 1920 

TCGTCTTACC TTTCGGGAGC GAACCTCAAC CTCTCCTGCC ACTCGGCCTC TAACCCATCC 1980 

CCGCAGTATT CTTGGCGTAT CAATGGGATA CCGCAGCAAC ACACACAAGT TCTCTTTATC 2040 

GCCAAAATCA CGCCAAATAA TAACGGGACC TATGCCTGTT TTGTCTCTAA CTTGGCTACT 2100 

GGCCGCAATA ATTCCATAGT CAAGAGCATC ACAGTCTCTG CATCTGGAAC TTCTCCTGGT 2160 

CTCTCAGCTG GGGCCACTGT CGGCATCATG ATTGGAGTGC TGGTTGGGGT TGCTCTGATA 2220 

TAGCAGCCCT GGTGTAGTTT CTTCATTTCA GGAAGACTGA CAGTTGTTTT GCTTCTTCCT 2280 

TAAAGCATTT GCAACAGCTA CAGTCTAAAA TTGCTTCTTT ACCAAGGATA TTTACAGAAA 2340 

AGACTCTGAC CAGAGATCGA GACCATCCTA GCCAACATCG TGAAACCCCA TCTCTACTAA 2400 

AAATACAAAA ATGAGCTGGG CTTGGTGGCG CGCACCTGTA GTCCCAGTTA CTCGGGAGGC 2460 

TGAGGCAGGA GAATCGCTTG AACCCGGGAG GTGGAGATTG CAGTGAGCCC AGATCGCACC 2520 

ACTGCACTCC AGTCTGGCAA CAGAGCAAGA CTCCATCTCA AAAAGAAAAG AAAAGAAGAC 2580 

TCTGACCTGT ACTCTTGAAT ACAAGTTTCT GATACCACTG CACTGTCTGA GAATTTCCAA 2640 

AACTTTAATG AACTAACTGA CAGCTTCATG AAACTGTCCA CCAAGATCAA GCAGAGAAAA 2700 

TAATTAATTT CATGGGACTA AATGAACTAA TGAGGATTGC TGATTCTTTA AATGTCTTGT 2760 

TTCCCAGATT TCAGGAAACT TTTTTTCTTT TAAGCTATCC ACTCTTACAG CAATTTGATA 2820 

AAATATACTT TTGTGAACAA AAATTGAGAC ATTTACATTT TCTCCCTATG TGGTCGCTCC 2880 

AGACTTGGGA AACTATTCAT GAATATTTAT ATTGTATGGT AATATAGTTA TTGCACAAGT 2940 
TCAATAAAAA TCTGCTCTTT GTATAACAGA AAAA 

Seq ID NO: 40 Protein sequence: 
Protein Accession #: NP_004354.1 

1 11 21 31 41 51 

I I I I I I 

MESPSAPPHR WCIPWQRLLL TASLLTFWNP PTTAKLTIES TPFNVAEGKE VLLLVHNLPQ 60 

HLFGYSWYKG ERVDGNRQII GYVIGTQQAT PGPAYSGREI IYPNASLLIQ NIIQNDTGFY 120 

TLHVIKSDLV NEEATGQFRV YPELPKPSIS SNNSKPVEDK DAVAFTCEPE TQDATYLWWV 180 

NNQSLPVSPR LQLSNGNRTL TLFNVTRNDT ASYKCETQNP VSARRSDSVI LNVLYGPDAP 240 

TISPLNTSYR SGENLNLSCH AASNPPAQYS WFVNGTFQQS TQELFIPNIT VNNSGSYTCQ 300 

AHNSDTGLNR TTVTTITVYA EPPKPFITSN NSNPVEDEDA VALTCEPEIQ NTTYLWWVNN 360 

QSLPVSPRLQ LSNDNRTLTL LSVTRNDVGP YECGIQNELS VDHSDPVILN VLYGPDDPTI 420 

SPSYTYYRPG VNLSLSCHAA SNPPAQYSWL IDGNIQQHTQ ELFISNITEK NSGLYTCQAN 480 

NSASGHSRTT VKTITVSAEL PKPSISSNNS KPVEDKDAVA FTCEPEAQNT TYLWWVNGQS 540 

LPVSPRLQLS NGNRTLTLFN VTRNDARAYV CGIQNSVSAN RSDPVTLDVL YGPDTPIISP 600 

PDSSYLSGAN LNLSCHSASN PSPQYSWRIN GIPQQHTQVL FIAKITPNNN GTYACFVSNL 660 
ATGRNNSIVK SITVSASGTS PGLSAGATVG IMIGVLVGVA LI 
Seq ID NO: 41 DNA sequence 

Nucleic Acid Accession #: NM_006952.1 

Coding sequence: 11-793 

1 11 21 31 41 51 

I I I I I I 

AATCCCGACA ATGGCGAAAG ACAACTCAAC TGTTCGTTGC TTCCAGGGCC TGCTGATTTT 60 

TGGAAATGTG ATTATTGGTT GTTGCGGCAT TGCCCTGACT GCGGAGTGCA TCTTCTTTGT 120 

ATCTGACCAA CACAGCCTCT ACCCACTGCT TGAAGCCACC GACAACGATG ACATCTATGG 180 

GGCTGCCTGG ATCGGCATAT TTGTGGGCAT CTGCCTCTTC TGCCTGTCTG TTCTAGGCAT 240 

TGTAGGCATC ATGAAGTCCA GCAGGAAAAT TCTTCTGGCG TATTTCATTC TGATGTTTAT 300 

AGTATATGCC TTTGAAGTGG CATCTTGTAT CACAGCAGCA ACACAACGAG ACTTTTTCAC 360 

ACCCAACCTC TTCCTGAAGC AGATGCTAGA GAGGTACCAA AACAACAGCC CTCCAAACAA 420 

TGATGACCAG TGGAAAAACA ATGGAGTCAC CAAAACCTGG GACAGGCTCA TGCTCCAGGA 480 

CAATTGCTGT GGCGTAAATG GTCCATCAGA CTGGCAAAAA TACACATCTG CCTTCCGGAC 540 

TGAGAATAAT GATGCTGACT ATCCCTGGCC TCGTCAATGC TGTGTTATGA ACAATCTTAA 600 

AGAACCTCTC AACCTGGAGG CTTGTAAACT AGGCGTGCCT GGTTTTTATC ACAATCAGGG 660 

CTGCTATGAA CTGATCTCTG GTCCAATGAA CCGACACGCC TGGGGGGTTG CCTGGTTTGG 720 

ATTTGCCATT CTCTGCTGGA CTTTTTGGGT TCTCCTGGGT ACCATGTTCT ACTGGAGCAG 780 
AATTGAATAT TAAGAA 

Seq ID NO: 42 Protein sequence: 
Protein Accession #: NP_008883.1 

1 11 21 31 41 51 

I I I I I I 

MAKDNSTVRC FQGLLIFGNV IIGCCGIALT AECIFFVSDQ HSLYPLLEAT DNDDIYGAAW 60 
IGIFVGICLF CLSVLGIVGI MKSSRKILLA YFILMFIVYA FEVASCITAA TQRDFFTPNL 120 
FLKQMLERYQ NNSPPNNDDQ WKNNGVTKTW DRLMLQDNCC GVNGPSDWQK YTSAFRTENN 180 
DADYPWPRQC CVMNNLKEPL NLEACKLGVP GFYHNQGCYE LISGPMNRHA WGVAWFGFAI 240 
LCWTFWVLLG TMFYWSRIEY 



Seq ID NO: 43 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 83-2605 

1 11 21 31 41 51 

I I I I I I 

GCCGGACAGA TCTGCGCGTA TCCTGGAGCC GGCCCAGTTG TGAACTAGGA GAGCTTTGGG 60 

ACCTCTGTCC CAAGCAAGAG AGATGAATGG AGAGTATAGA GGCAGAGGAT TTGGACGAGG 120 

AAGATTTCAA AGCTGGAAAA GGGGAAGAGG TGGTGGGAAC TTCTCAGGAA AATGGAGAGA 180 

AAGAGAACAC AGACCTGATC TGAGTAAAAC CACAGGAAAA CGTACTTCTG AACAAACCCC 240 

ACAGTTTTTG CTTTCAACAA AGACCCCACA GTCAATGCAG TCAACATTGG ATCGATTCAT 300 

ACCATATAAA GGCTGGAAGC TTTATTTCTC TGAAGTTTAC AGCGATAGCT CTCCTTTGAT 360 

TGAGAAGATT CAAGCATTTG AAAAATTTTT CACAAGGCAT ATTGATTTGT ATGACAAGGA 420 

TGAAATAGAA AGAAAGGGAA GTATTTTGGT AGATTTTAAA GAACTGACAG AAGGTGGTGA 480 

AGTAACTAAC TTGATACCAG ATATAGCAAC TGAACTAAGA GATGCACCTG AGAAAACCTT 540 

GGCTTGCATG GGTTTGGCAA TACATCAGGT GTTAACTAAG GACCTTGAAA GGCATGCAGC 600 

TGAGTTACAA GCCCAGGAAG GATTGTCTAA TGATGGAGAA ACAATGGTAA ATGTGCCACA 660 

TATTCATGCA AGGGTGTACA ACTATGAGCC TTTGACACAG CTCAAGAATG TCAGAGCAAA 720 

TTACTATGGA AAATACATTG CTCTAAGAGG GACAGTGGTT CGTGTCAGTA ATATAAAGCC 780 

TCTTTGCACC AAGATGGCTT TTCTTTGTGC TGCATGTGGA GAAATTCAGA GCTTTCCTCT 840 

TCCAGATGGA AAATACAGTC TTCCCACAAA GTGTCCTGTG CCTGTGTGTC GAGGCAGGTC 900 

ATTTACTGCT CTCCGCAGCT CTCCTCTCAC AGTTACGATG GACTGGCAGT CAATCAAAAT 960 

CCAGGAATTG ATGTCTGATG ATCAGAGAGA AGCAGGTCGG ATTCCACGAA CAATAGAATG 1020 

TGAGCTTGTT CATGATCTTG TGGATAGCTG TGTCCCGGGA GACACAGTGA CTATTACTGG 1080 

AATTGTCAAA GTCTCAAATG CGGAAGAAGG TTCTCGAAAT AAGAATGACA AGTGTATGTT 1140 

CCTTTTGTAT ATTGAAGCAA ATTCTATTAG TAATAGCAAA GGACAGAAAA CAAAGAGTTC 1200 

TGAGGATGGG TGTAAGCATG GAATGTTGAT GGAGTTCTCA CTTAAAGACC TTTATGCCAT 1260 

CCAAGAGATT CAAGCTGAAG AAAACCTGTT TAAACTCATT GTCAACTCGC TTTGCCCTGT 1320 

CATTTTTGGT CATGAACTTG TTAAAGCAGG TTTGGCATTA GCACTCTTTG GAGGAAGCCA 1380 

GAAATACGCA GATGACAAAA ACAGAATTCC AATTCGGGGA GACCCCCACA TCCTTGTTGT 1440 

TGGAGATCCA GGCCTAGGAA AAAGTCAAAT GCTACAGGCA GCGTGCAATG TTGCCCCACG 1500 

TGGCGTGTAT GTTTGTGGTA ACACCACGAC CACCTCTGGT CTGACGGTAA CTCTTTCAAA 1560 

AGATAGTTCC TCTGGAGATT TTGCTTTGGA AGCTGGTGCC CTGGTACTTG GTGATCAAGG 1620 

TATTTGTGGA ATCGATGAAT TTGATAAGAT GGGGAATCAA CATCAAGCCT TGTTGGAAGC 1680 

CATGGAGCAG CAAAGTATTA GTCTTGCTAA GGCTGGTGTG GTTTGTAGCC TTCCTGCAAG 1740 

AACTTCCATT ATTGCTGCTG CAAATCCAGT TGGAGGACAT TACAATAAAG CCAAAACAGT 1800 

TTCTGAGAAT TTAAAAATGG GGAGTGCACT ACTATCCAGA TTTGATTTGG TCTTTATCCT 1860 

GTTAGATACT CCAAATGAGC ATCATGATCA CTTACTCTCT GAACATGTGA TTGCAATAAG 1920 

AGCTGGAAAG CAGAGAACCA TTAGCAGTGC CACAGTAGCT CGTATGAATA GTCAAGATTC 1980 

AAATACTTCC GTACTTGAAG TAGTTTCTGA GAAGCCATTA TCAGAAAGAC TAAAGGTGGT 2040 

TCCTGGAGAA ACAATAGATC CCATTCCCCA CCAGCTATTG AGAAAGTACA TTGGCTATGC 2100 

TCGGCAGTAT GTGTACCCAA GGCTATCCAC AGAAGCTGCT CGAGTTCTTC AAGATTTTTA 2160 

CCTTGAGCTC CGGAAACAGA GCCAGAGGTT AAATAGCTCA CCAATCACTA CCAGGCAGCT 2220 

GGAATCTTTG ATTCGTCTGA CAGAGGCACG AGCAAGGTTG GAATTGAGAG AGGAAGCAAC 2280 

CAAAGAAGAC GCTGAGGATA TAGTGGAAAT TATGAAATAT AGCATGCTAG GAACTTACTC 2340 

TGATGAATTT GGGAACCTAG ATTTTGAGCG ATCCCAGCAT GGTTCTGGAA TGAGCAACAG 2400 

GTCAACAGCG AAAAGATTTA TTTCTGCTCT CAACAACGTT GCTGAAAGAA CTTATAATAA 2460 

TATATTTCAA TTTCATCAAC TTCGGCAGAT TGCCAAAGAA CTAAACATTC AGGTTGCTGA 2520 

TTTTGAAAAT TTTATTGGAT CACTAAATGA CCAGGGTTAC CTCTTGAAAA AAGGCCCAAA 2580 

AGTTTACCAG CTTCAAACTA TGTAAAAGGA CTTCACCAAG TTAGGGCCTC CTGGGTTTAT 2640 

TGCAGATTAA AGCCATCTCA GTGAAGATAT GCGTGCACGC ACAGACAGAC AGACACACAC 2700 

ACACACACAC ACACACACAC ACACACACAC ACACACAGTC AAATACTGTT CTCTGAAAAA 2760 

TGATGTCCCA AAAGTATTAT AATAGGAAAA AAGCATTAAA TATAATAAAC TAATTTAAGA 2820 
AGTGATAAAG TCTCCAGATG CAGTAGCTCA CACTGTAATC ACAGTGACTC AGGAGGCTGA 2880 

GGTGAGAGGA TTCCTTGAGG CCAGGGTTCG AGACCAACCT TGGGCAACAT AGCAAGACCC 2940 

CATTTCTTAA AAAAAAAAAA AAAAAATTTA AACTTAGCTG GGTATGGTGG CACATGCCTA 3000 

TAGTCTCAGC TACTTGTGAG GCTGAGGCAG GAGGATTCTT TGAGCCCAGG AGTTTGAGGT 3060 

TACAGTGAGC CACAATCACA CCAATCACTG CACTCCAGCC TGGGCAATAA AGTAACTCTT 3120 

GACTCAAAAA AATAAAAAAA ATTGTAGTGG TAGCCATGTG TTAATTGTTA AATAAATTCT 3180 

CCAAAGGGCT AAAAGTAAAT TACTTATAAA TTTTTTATAG TTGTATTTTT GACCTGCCTT 3240 

TTATATGTAT GAATATTTCA TAGTTTTGCA TATCAGATGT AGGCATACAG ACAAATACAT 3300 

AAACCAATGA ATATATTACA TATTCTGTGT TCCAATAAAA CTTTATTTAT GGACACTAAA 3360 

ATTTGAATTT CATAAAATTT TCCCATGTCA AGAATACAAA ATACTTGAGT TTTGTTTTTA 3420 

GCTATTTAAT AATAGGTCTC ATTTATTCCA CAGGCTGTAG TTTGTAGTCT TGCTTGAAAC 3480 

AATAGAAACA GACTGATTAA GCAGGAGAAG TTTTTTGAAA GAATTTTGTT TGGCTCACGG 3540 

AATTATTAGA AGGCAGGTGA ACCAGGAGGG TAAGCTTCCA GCAGCAATTT GTAAAACCAT 3600 

GCCTTAGAAT TGGACTAAGG AAGAAGCTGC TGACACTCCA CTGCCACACA GGGCACTGGA 3660 

AGAAAGTGCT GCTGCCTCCC TGCCCCACCT TTGCCACTTC TGCAGCAGGA ATAGGTAGAA 3720 

GAATGCCCCC ACCCGCACCG GAACAGCAAC AAAAGGATTC TGCATGAGAT GCCTCCCTAA 3780 

ATTGCTGAAT TCAAAAAAGA AGTTGCATAC AAAGACATCT GATTGAAAAA GGGTATGTTA 3840 

TATGCCCCTT TCATAGGCTG CTAGGGAGTT TTCCTGGTTC TACTTTCAGG TGGTGGGATC 3900 

AATAAGACCA GAATTTCTCA TATGTTGTGA GAGGATTCAA ATGTTACAGG GTTGCCAGCC 3960 

AAACTATCAA TCATGTATAA ATCCAACAAA CACTTTGTAA CATACAAGAA CTCAGGAAAT 4020 

GTGAACCATT GTTGGAGAAT CTACTAAAAT ACGGCTTCCC GCAAACGAAG ATGAATGGAA 4080 

AATGTAAATA AAAAGAACTG GCAGTGTATA TCAGATGTTT AACTATAGGA CCAGAACTAA 4140 

GATGTGGAGA CTATTGCCAT AGACCACAAT GTAAATTTTT AAGTGAGGAA GGAAAAATCA 4200 

GGAATCAAAA GGGGCCAGGT GCAGTGGCTC ACATCTATAA TCCCAGAGCT TTGGGAGTTC 4260 

GAGGCAGGAG GATCACTTGA AGCCAGTTTT GAGACCAGCC TATGCAACAC ATTGAGACCC 4320 

TATCTCTACA AAAAATAGAT TAGCTGGGCA CGGTGGTGCA TGCCTATTGT CCTACCTACT 4380 

GTGGAGGCTG AAGTAGGAAA TCACTTGAGC CCGAGAGTTT GAGGTTACAG TGAGCTATGA 4440 
TTATACCACT GCACTCCAGC CTGGGCAAGA GAGCAAGACC TTGTCTCTT 

Seq ID NO: 44 Protein sequence: 
Protein Accession #: CAB55276.2 



1 11 21 31 41 51 

I I I I I I 

MNGEYRGRGF GRGRFQSWKR GRGGGNFSGK WREREHRPDL SKTTGKRTSE QTPQFLLSTK 60 
TPQSMQSTLD RFIPYKGWKL YFSEVYSDSS PLIEKIQAFE KFFTRHIDLY DKDEIERKGS 120 
ILVDFKELTE GGEVTNLIPD IATELRDAPE KTLACMGLAI HQVLTKDLER HAAELQAQEG 180 
LSNDGETMVN VPHIHARVYN YEPLTQLKNV RANYYGKYIA LRGTVVRVSN IKPLCTKMAF 240 
LCAACGEIQS FPLPDGKYSL PTKCPVPVCR GRSFTALRSS PLTVTMDWQS IKIQELMSDD 300 
QREAGRIPRT IECELVHDLV DSCVPGDTVT ITGIVKVSNA EEGSRNKNDK CMFLLYIEAN 360 
SISNSKGQKT KSSEDGCKHG MLMEFSLKDL YAIQEIQAEE NLFKLIVNSL CPVIFGHELV 420 
KAGLALALFG GSQKYADDKN RIPIRGDPHI LVVGDPGLGK SQMLQAACNV APRGVYVCGN 480 
TTTTSGLTVT LSKDSSSGDF ALEAGALVLG DQGICGIDEF DKMGNQHQAL LEAMEQQSIS 540 
LAKAGVVCSL PARTSIIAAA NPVGGHYNKA KTVSENLKMG SALLSRFDLV FILLDTPNEH 600 
HDHLLSEHVI AIRAGKQRTI SSATVARMNS QDSNTSVLEV VSEKPLSERL KVVPGETIDP 660 
IPHQLLRKYI GYARQYVYPR LSTEAARVLQ DFYLELRKQS QRLNSSPITT RQLESLIRLT 720 
EARARLELRE EATKEDAEDI VEIMKYSMLG TYSDEFGNLD FERSQHGSGM SNRSTAKRFI 780 
SALNNVAERT YNNIFQFHQL RQIAKELNIQ VADFENFIGS LNDQGYLLKK GPKVYQLQTM 



Seq ID NO: 45 DNA sequence 

Nucleic Acid Accession #: NM_005416. 

Coding sequence: 149..658 



1 11 21 31 41 51 

I I I I I I 

ACCAGATCCC AGAGGCTGAA CACCTCGACC TTCTCTGCAC AGCAGATGAT CCCTGAGCAG 60 
CTGAAGACCA GAAAAGCCAC TAAGACTTTC TGCTTAATTC AGGAGCTTAG AGGATTCTTC 120 
AAAGAGTGTG TCCACGATCC TTTGAAGCAT GAGTTCTTAC CAGCAGAAGC AGACCTTTAC 180 
CCCACCACCT CAGCTTCAAC AGCAGCAGGT GAAACAACCC AGCCAGCCTC CACCTCAGGA 240 
AATATTTGTT CCCACAACCA AGGAGCCATG CCACTCAAAG GTTCCACAAC CTGGAAACAC 300 
AAAGATTCCA GAGCCAGGCT GTACCAAGGT CCCTGAGCCA GGCTGTACCA AGGTCCCTGA 360 
GCCAGGCTGT ACCAAGGTCC CTGAGCCAGG TTGTACCAAG GTCCCTGAGC CAGGCTGTAC 420 
CAAGGTCCCT GAGCCAGGTT GTACCAAGGT CCCTGAGCCA GGCTACACCA AGGTCCCTGA 480 
ACCAGGCAGC ATCAAGGTCC CTGACCAAGG CTTCATCAAG TTTCCTGAGC CAGGTGCCAT 540 
CAAAGTTCCT GAGCAAGGAT ACACCAAAGT TCCTGTGCCA GGCTACACAA AGCTACCAGA 600 
GCCATGTCCT TCAACGGTCA CTCCAGGCCC AGCTCAGCAG AAGACCAAGC AGAAGTAATT 660 
TGGTGCACAG ACAAGCCCTT GAGAAGCCAA CCACCAGATG CTGGACACCC TCTTCCCATC 720 
TGTTTCTGTG TCTTAATTGT CTGTAGACCT TGTAATCAGC ACATTGTCAC CCCAAGCCAT 780 
AGTCTCTCTC TTATTTGTAT CCTAAAAATA CGTACTATAA AGCTTTTGTT CACACACACT 840 
CTGAAGAATC CTGTAAGCCC CTGAATTAAG CAGAAAGTCT TCATGGCTTT TCTGGTCTTC 900 
GGCTGCTCAG GGTTCATCTG AAGATTCGAA TGAAAAGAAA TGCATGTTTC CTGCTCTTCC 960 
CTCATTAAAT TGCTTTTAAT TCCA 



Seq ID NO: 46 Protein sequence: 
Protein Accession #: NP_005407.1 



1 11 21 31 41 51 

I I I I I I 

MSSYQQKQTF TPPPQLQQQQ VKQPSQPPPQ EIFVPTTKEP CHSKVPQPGN TKIPEPGCTK 60 
VPEPGCTKVP EPGCTKVPEP GCTKVPEPGC TKVPEPGCTK VPEPGYTKVP EPGSIKVPDQ 120 
GFIKFPEPGA IKVPEQGYTK VPVPGYTKLP EPCPSTVTPG PAQQKTKQK 

Seq ID NO: 47 DNA sequence 

Nucleic Acid Accession #: Eos sequence 



1 11 21 31 41 51 

I I I I I I 

GCGTCGTGTG CAGGCGTCCC CGGGCTGTGG ATAATTAGAC ACGTTCTTCC CTCATTGCCC 60 
AAGGCTCGTT AGAATTCGCC CTAGAGCTGT ATCATGTATT TTCTTTCAAA TTAACTTTGC 120 
TTGCAATTAA GCTTAGGGAA CCAGCAACAA AAGCAAACTT GGCCCGAGGT CGTTCACCGC 180 
GAAAATGGAT TAGAGAAACT TCTTCCCCGA TTTAAGGGGA AAGATTCCTG CGGCCAGCGC 240 
TTTGGGGAAA GTGCCCCGAC CGCAGAGGCG ACGACAGGGG AGCAGGAAGC TGCTCACGGT 300 
AGTCGGCGTT GGCGGCAGCG GTGGCCTTCC TCATCTGGGC GATGTGGGCT CCTAGAAGAG 360 
TAAGGATAAC ATCCTGGAAA TGACTTCTGT ACGGTTTGAG CCCAACTGCA CACTCATGAC 420 
TTGGAGCTGC CCTGTGGAGT TACAGTTTAC CAAACACATT CATGAACATA ATCTCATTTA 480 
CTAAAAACTT TGTGAGAATT TTCTTTTACT AAAATTTTTT CTTATTACAA A 



Seq ID NO: 48 DNA sequence: 

Nucleic Acid Accession #: CAT cluster 



1 11 21 31 41 51 

I I I I I I 

TTCCAAATTT TTTTTTTTGT AATAAGAAAA AATTTTAGTA AAAGAAAATT CTCACAAAGT 60 
TTTTAGTAAA TGAGATTATG TTCATGAATG TGTTTGGTAA ACTGTAACTC CACAGGGCAG 120 
CTCCAAGTCA TGAGTGTGCA GTTGGGCTCA AACCGTACAG AAGTCATTTC CAGGATGTTA 180 
TCCTTACTCT TCTCGGAGCC CACATCGCCC AGATGAGGAA GGCCACCGCT GCCGCCAACG 240 
CCGACTACCG TGAGCAGCTT CCTGCTCCCC TGTCGTCGCC TCTGCGGTCG GGGCACTTTC 300 
CCCAAAGCGC TGGCCGCAGG AATCTTTCCC CTTAAATCGG GGAAGAAGTT TCTCTAATCC 360 
ATTTTCGCGG TGAACGACCT CGGGCCAAGT TTGCTTTTGT TGCTGGTTCC CTAAGCTTAA 420 
TTGCAAGCAA AGTTAATTTG AAAGAAAATA CATGATACAG CTCTAGGGCG AATTCTAACG 480 
AGCCTTGGGC AATGAGGGAA GAACGTGTCT AGTTATCCAC AGCCCGGGGA CGCCTGCACA 540 
CGACGCT 



Seq ID NO: 49 DNA sequence 

Nucleic Acid Accession #: CAT cluster 



1 11 21 31 41 51 

I I I I I I 

TCTTTCTTCT GCTGCTCGTT TGTCTCTCCT GTGCTCTTCT TCTTTCTTTC CCTCGCCGCT 60 
CCTGCCGACC TCTGTTGTCT CTTCTCTGAT GGCGGGGGGC GGGAGAAGCT GACCGGTGAG 120 
ACCGTAGACC CGAAACCATT GGGTGTCACA AGCCGGTCGC CGGCTTTTTT GGGAGAACCC 180 
GACACATGCA GACCAGTTTT CCTGGAACNG CATGACCATG TTATTACTAT GGGCCGCCTC 240 
CCCAACCAAA GTGTTTAAAA CTTTTTAGGG CACCCCCAAA ATTTTTTTTT TTTTTTTTTT 300 
TTCATTTAAA AAACTCTAAT ATTTATATTA AATACAAAGA TACCCAAACC CTTTATGCTT 360 
CTTTCTCTGA TCTGTGTCTT TTTTCTTTGA CAGCATCTCC ATTTTTTTTC TGCTGCTTCA 420 
TCGCTGTAGC CATGGGAATC CGTTTCATTA TTATGGTAGC AATATGGAGT GCTGTATTCC 480 
TAAAGAAACT GACACAGGAG AATCACTTGA ACTTGGGAGG CAGAGTTTGC AGTGAGCCGA 540 
GATTGAACCA GTGCACTCCA GCCTTGGCAG CGGAGCAAGA TTCTGTCACA GTTCCTGAAG 600 
TGCTGGTATC GTCCTGCAGC CCCATCCTCG GTTCCATTGC GCTGCCAGGC AGGGTGCTGG 660 
GACGTGGGGA GAGCTGGTCT ATATATCCGG GTGAAGCTCA GCTGTGGCAC ACCTTGGATG 720 
CCGGGTCTCT CCTGGCCCCG GGGACCTAGT ATTTTTGCCA CGAGTGTACA CCAAACAAAG 780 
GAGACAGCAT CATTTATGAG CCTGCAGCAT CCACCCTACT GCTGTATCCA GTTTCCATTG 840 
ACTG 



Seq ID NO: 50 DNA sequence 
Nucleic Acid Accession #: L05187 
Coding sequence: 1991..2260 



1 11 21 31 41 51 

I I I I I I 

CTGCAGGGAG GCAGGTAGAA AAGGCTTTTG GGTTTTCAGG TGGGGGGCAG TCTAGCCTGA 60 
TCAGAAAGGA GGAAAAGGCC AGGGCAGATG TCTGGGTGGA GTGAAGGGAA AAAGTGATCC 120 
CAGAAGAAGG ATTAGCCCCT GAAAGTCCCT GAAGTAGGAG AAGGGTAAAG GTGTGGTTGG 180 
TGAAGGAAAG CAGGTTTTCC CAGATTAGCA ACCAGTCAGG GGGAGGAAGG TGAGAGTGGG 240 
AGAGTCATAA GTAAATTATT CTGAATGTGT GTAGTTTAAT GGAATTGGGA AAAAGATGGG 300 
GGAAATGGAT GGAAGGTCTT GGACTCTGAG ACAAGGGGTC TATAATCAGT CCATTTCATT 360 
ATTTCTAGCT TCCACCTTCA CCAAGGCAGA CAAGGAGGGC CCACCTCAGC TCCTCTGCTC 420 
CCCCTCCCTT TCCCACCTAT TCATGTGTGC AAGAGTGCCC TGTCCCACAG AACACGGGGA 480 
ACAACCATCT CAATGACAAG GACAGCAGGT GGCAAGGCTC AACAGGACTC AGATGTCCCC 540 
CCAGGGTTAA CTCATGAAAC CCTCCATGAA GCCTGCTGCT CACCCCTCCC TCAAGGCAAG 600 
CCCTGCACCT GGGTCTGAGG ATGAGGGTGG CAGTGAAAAT TAGGCCAGTG ACATCATTTT 660 
CAGCCAGCTA GTGCCAAAAA ATATCAGGTG GTGTTCATCA AATAAGCCGA GCCAACCGGT 720 
GATGAGGATG GTAGTGTGAG TCATGTGTGA CAGGTGAGGA ATGAAAACAG AGTGCCCGAG 780 
AGCTTCTATT TCCTTGAGGC AGGGCTCATT CATCTTATAA AAGCCAGCTG GCCATTGCCT 840 
TCACACCAAA CCCAAGGGAC CACACAGCCC ATTCTGCTCC GTATACCAGG TAAGTCTCTG 900 
ATTGCAACAA ACTGGCAATT CTAGTGTACT TTTTCATTAT TAGAAATTAG CTAAAGGCAA 960 
ATATGTGTAA GCAGGTTAAT CCAGGGTTTC AATGGGAGAT AGAGAATAGT GGAATATCTT 1020 
TATTTTAAGT TAAATTACAG TCTGGATTTG AAAGGACCTT AGAGATGGTT AGGGCTCCCA 1080 
CCTCAGTAGA TAGTCATTGA ACTGGGAGTC CTGGAGAAGA TTGTTCAAAT GCCCATGGGA 1140 
AGTTCATAGC AGAACTAGAA CTCAGGCCAG AGCACTCTCA GTAACACTGC AATTTCCCCC 1200 
TGACAAGATA TTTATAGAAA TTTTAATTTA TTAGATGGAT CTCTACTGAG CATTTATTCC 1260 
ATTTAAGGCA GTATGCTAGG CACTTTGGAC AAATCAATGC CCTAACGTAC TTACTTAACA 1320 
AACATAAAAC CTAGCAGGAA GGTAATACAT ATATATAAAT AAATGAAATG CAAAGTAGAT 1380 
AGTAATTGGC ATGACGGAGA TGGGCAGAGA AGGGCTGTGC ACTTTTGGGA GACTTGCTCA 1440 
AGGAGACCTC TAGGGTGTCA AGTGATGTGA GCTATGATGG AGGGGTATTT GGACAAGCAG 1500 
AGATGGGAAG AAAAGCATTT GGAAGGGACT GTGTAAGCAC AGACCAGAAG CAAAACCATA 1560 
GAGGCTTAGA TGAATATAAA GCCATCCTAT AAGTCACAGG CTTTCTACAT GGTACTAGGA 1620 
GAGGAAAGTG GTCTGATGCC ATTTTCCAAA AGACCTAATA TGCGGACCTC ATGTCCCTCA 1680 
GAAGCCAGCT TTAGTAGGGC ATTTTTCCAG AACAGATATA AGGTGCCTTG GGTAGGAAGG 1740 
GAGCCAAGAA GAGAACTCCA ATAAAATGGA GCAGAAGAAA TTGCCTTTTA GCTCCTCCTC 1800 
TTCAAAGGGC CTGAAAATTA TCCAAGCTTA TTTCATTTTT AAATGTAATG GGGGAGCTAA 1860 

GGGAGATGAA AGGCTTTCTC TTCTAAAGGG TCCTGAAATA AAATCTGTTT GGCATTGAAT 1920 

TTGTATCCAT CTTTCTTTAA TTGAATCACT GTGTCAGCTT TCTGTCTCTA GAAAAAAACA 1980 

CATTTGAAGC ATGAATTCTC AGCAGCAGAA GCAGCCTTGC ACCCCACCCC CTCAGCCTCA 2040 

GCAGCAGCAG GTGAAACAAC CTTGCCAGCC TCCACCCCAG GAACCATGCA TCCCCAAAAC 2100 

CAAGGAGCCC TGCCAACCCA AGGTGCCTGA GCCCTGCCAC CCCAAAGTGC CTGAGCCCTG 2160 

CCAGCCCAAG ATTCCAGAGC CCTGCCAGCC CAAGGTGCCT GAGCCCTGCC CTTCAACGGT 2220 

CACTCCAGCA CCAGCCCAGC AGAAGACCAA GCAGAAGTAA TGTGGTCCAC AGCCATGCCC 2280 

TTGAGGAGCT GGCCACTGGA TACTGAACAC CCTACTCCAT TCTGCTTATG AATCCCATTT 2340 

GCCTATTGAC CCTGCAGTTA GCATGCTGTC ACCCTGAATC ATAATCGCTC CTTTGCACCT 2400 

CTAAAAAGAT GTCCCTTACC CTCATTCTGG AGGCTCCTGA GCCTCTGCGT AAGGCTGAAC 2460 

GTCTCACTGA CTGAGCTAGT CTTCTTGTTG CTCGGGTGCA TTTGAGGATG GATTTGGGGA 2520 
AGGTCAAGTG ACCATCCCTA G 






Seq ID NO: 51 Protein sequence: 
Protein Accession #: AAC26838 

1 11 21 31 41 51 

I I I I I I 

MNSQQQKQPC TPPPQPQQQQ VKQPCQPPPQ EPCIPKTKEP CQPKVPEPCH PKVPEPCQPK 60 
IPEPCQPKVP EPCPSTVTPA PAQQKTKQK 



Seq ID NO: 52 DNA sequence 

Nucleic Acid Accession #: NM_002638.1 

Coding sequence: 120-473 



1 11 21 31 41 51 

I I I I I I 

CAATACAGCT AAGGAATTAT CCCTTGTAAA TACCACAGAC CCGCCCTGGA GCCAGGCCAA 60 
GCTGGACTGC ATAAAGATTG GTATGGCCTT AGCTCTTAGC CAAACACCTT CCTGACACCA 120 
TGAGGGCCAG CAGCTTCTTG ATCGTGGTGG TGTTCCTCAT CGCTGGGACG CTGGTTCTAG 180 
AGGCAGCTGT CACGGGAGTT CCTGTTAAAG GTCAAGACAC TGTCAAAGGC CGTGTTCCAT 240 
TCAATGGACA AGATCCCGTT AAAGGACAAG TTTCAGTTAA AGGTCAAGAT AAAGTCAAAG 300 
CGCAAGAGCC AGTCAAAGGT CCAGTCTCCA CTAAGCCTGG CTCCTGCCCC ATTATCTTGA 360 
TCCGGTGCGC CATGTTGAAT CCCCCTAACC GCTGCTTGAA AGATACTGAC TGCCCAGGAA 420 
TCAAGAAGTG CTGTGAAGGC TCTTGCGGGA TGGCCTGTTT CGTTCCCCAG TGAAGGGAGC 480 
CGGTCCTTGC TGCACCTGTG CCGTCCCCAG AGCTACAGGC CCCATCTGGT CCTAAGTCCC 540 
TGCTGCCCTT CCCCTTCCCA CACTGTCCAT TCTTCCTCCC ATTCAGGATG CCCACGGCTG 600 
GAGCTGCCTC TCTCATCCAC TTTCCAATAA A 



Seq ID NO: 53 Protein sequence: 
Protein Accession #: NP_002629.1 

1 11 21 31 41 51 

I I I I I I 

MRASSFLIVV VFLIAGTLVL EAAVTGVPVK GQDTVKGRVP FNGQDPVKGQ VSVKGQDKVK 60 
AQEPVKGPVS TKPGSCPIIL IRCAMLNPPN RCLKDTDCPG IKKCCEGSCG MACFVPQ 

Seq ID NO: 54 DNA sequence 

Nucleic Acid Accession #: NM_019618 

Coding sequence: 75-584 



1 11 21 31 41 51 

I I I I I I 

GGCACGAGCC ACGATTCAGT CCCCTGGACT GTAGATAAAG ACCCTTTCTT GCCAGGTGCT 60 
GAGACAACCA CACTATGAGA GGCACTCCAG GAGACGCTGA TGGTGGAGGA AGGGCCGTCT 120 
ATCAATCAAT GTGTAAACCT ATTACTGGGA CTATTAATGA TTTGAATCAG CAAGTGTGGA 180 
CCCTTCAGGG TCAGAACCTT GTGGCAGTTC CACGAAGTGA CAGTGTGACC CCAGTCACTG 240 
TTGCTGTTAT CACATGCAAG TATCCAGAGG CTCTTGAGCA AGGCAGAGGG GATCCCATTT 300 
ATTTGGGAAT CCAGAATCCA GAAATGTGTT TGTATTGTGA GAAGGTTGGA GAACAGCCCA 360 
CATTGCAGCT AAAAGAGCAG AAGATCATGG ATCTGTATGG CCAACCCGAG CCCGTGAAAC 420 
CCTTCCTTTT CTACCGTGCC AAGACTGGTA GGACCTCCAC CCTTGAGTCT GTGGCCTTCC 480 
CGGACTGGTT CATTGCCTCC TCCAAGAGAG ACCAGCCCAT CATTCTGACT TCAGAACTTG 540 
GGAAGTCATA CAACACTGCC TTTGAATTAA ATATAAATGA CTGAACTCAG CCTAGAGGTG 600 
GCAGCTTGGT CTTTGTCTTA AAGTTTCTGG TTCCCAATGT GTTTTCGTCT ACATTTTCTT 660 
AGTGTCATTT TCACGCTGGT GCTGAGACAG GGGCAAGGCT GCTGTTATCA TCTCATTTTA 720 
TAATGAAGAA GAAGCAATTA CTTCATAGCA ACTGAAGAAC AGGATGTGGC CTCAGAAGCA 780 
GGAGAGCTGG GTGGTATAAG GCTGTCCTCT CAAGCTGGTG CTGTGTAGGC CACAAGGCAT 840 
CTGCATGAGT GACTTTAAGA CTCAAAGACC AAACACTGAG CTTTCTTCTA GGGGTGGGTA 900 
TGAAGATGCT TCAGAGCTCA TGCGCGTTAC CCACGATGGC ATGACTAGCA CAGAGCTGAT 960 
CTCTGTTTCT GTTTTGCTTT ATTCCCTCTT GGGATGATAT CATCCAGTCT TTATATGTTG 1020 
CCAATATACC TCATTGTGTG TAATAGAACC TTCTTAGCAT TAAGACCTTG TAAACAAAAA 1080 
TAATTCTTGT GTTAAGTTAA ATCATTTTTG TCCTAATTGT AATGTGTAAT CTTAAAGTTA 1140 
AATAAACTTT GTGTATTTAT ATAATAAAAA AAAAAAAAAA AAA 



Seq ID NO: 55 Protein sequence: 
Protein Accession #: NP_062564 

1 11 21 31 41 51 

I I I I I I 

MRGTPGDADG GGRAVYQSMC KPITGTINDL NQQVWTLQGQ NLVAVPRSDS VTPVTVAVIT 60 
CKYPEALEQG RGDPIYLGIQ NPEMCLYCEK VGEQPTLQLK EQKIMDLYGQ PEPVKPFLFY 120 
RAKTGRTSTL ESVAFPDWFI ASSKRDQPII LTSELGKSYN TAFELNIND 

Seq ID NO: 56 DNA sequence 

Nucleic Acid Accession #: NM_003125 

Coding sequence: 65-334 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 



60 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 



60 
120 



208 



WO 02/086443 



PCT/US02/12476 



31 - 41 51 

I I I 

CTCTCTTCAC ACCAGGACCA GCCACTGTTG 60 

CTGCATCCCA CCCCCTCAGC TTCAGCAGCA 120 

TCAGGAACCA TGCATCCCCA AAACCAAGGA 180 
CCACCCCAAA GTGCCTGAGC CCTGCCAGCC ' 240 

GCCTGAGCCC TGCCCTTCAA TAGTCACTCC 300 

GTAATGTGGT CCACAGCCAT GCCCTTGAGG 360 

CCATTCTGTG TATGAGTCCC ATTTGCCTTG 420 

ATGTGCTATG AAGCTTTCTT TCCTACACAC 480 

AGTACCAGAG CTAGTTTTCA GCTGCTCAGA 540 

CAAATGATTC AGCTCCCTTA TACCCCCATT 600 



31 41 51 

1 i I 

EPCIPKTKEP CHPKVPEPCH PKVPEPCQPK 60 



Seq ID NO:. 58 DNA sequence 

Nucleic Acid Accession 8: NM_001793.2 

Coding sequence: 71-2560 

30 1 11 21 31 41 51 

I I I I I I 

AAAGGGGCAA GAGCTGAGCG GAACACCGGC CCGCCGTCGC GGCAGCTGCT TCACCCCTCT 60 

CTCTGCAGCC ATGGGGCTCC CTCGTGGACC TCTCGCGTCT CTCCTCCTTC TCCAGGTTTG 120 

CTGGCTGCAG TGCGCGGCCT CCGAGCCGTG CCGGGCGGTC TTCAGGGAGG CTGAAGTGAC 180 

35 CTTGGAGGCG GGAGGCGCGG AGCAGGAGCC CGGCCAGGCG CTGGGGAAAG TATTCATGGG 240 

CTGCCCTGGG CAAGAGCCAG CTCTGTTTAG CACTGATAAT GATGACTTCA CTGTGCGGAA 300 

TGGCGAGACA GTCCAGGAAA GAAGGTCACT GAAGGAAAGG AATCCATTGA AGATCTTCCC 360 

ATCCAAACGT ATCTTACGAA GACACAAGAG AGATTGGGTG GTTGCTCCAA TATCTGTCCC 420 

TGAAAATGGC AAGGGTCCCT TCCCCCAGAG ACTGAATCAG CTCAAGTCTA ATAAAGATAG 480 

40 AGACACCAAG ATTTTCTACA GCATCACGGG GCCGGGGGCA GACAGCCCCC CTGAGGGTGT 540 

CTTCGCTGTA GAGAAGGAGA CAGGCTGGTT GTTGTTGAAT AAGCCACTGG ACCGGGAGGA 600 

GATTGCCAAG TATGAGCTCT TTGGCCACGC TGTGTCAGAG AATGGTGCCT CAGTGGAGGA 660 

CCCCATGAAC ATCTCCATCA TCGTGACCGA CCAGAATGAC CACAAGCCCA AGTTTACCCA 720 

GGACACCTTC CGAGGGAGTG TCTTAGAGGG AGTCCTACCA GGTACTTCTG TGATGCAGGT 780 

45 GACAGCCACG GATGAGGATG ATGCCATCTA CACCTACAAT GGGGTGGTTG CTTACTCCAT 840 

CCATAGCCAA GAACCAAAGG ACCCACACGA CCTCATGTTC ACCATTCACC GGAGCACAGG 900 

CACCATCAGC GTCATCTCCA GTGGCCTGGA CCGGGAAAAA GTCCCTGAGT ACACACTGAC 960 

CATCCAGGCC ACAGACATGG ATGGGGACGG CTCCACCACC ACGGCAGTGG CAGTAGTGGA 1020 

GATCCTTGAT GCCAATGACA ATGCTCCCAT GTTTGACCCC CAGAAGTACG AGGCCCATGT 1080 

50 GCCTGAGAAT GCAGTGGGCC ATGAGGTGCA GAGGCTGACG GTCACTGATC TGGACGCCCC 1140 

CAACTCACCA GCGTGGCGTG CCACCTACCT TATCATGGGC GGTGACGACG GGGACCATTT 1200 

TACCATCACC ACCCACCCTG AGAGCAACCA GGGCATCCTG ACAACCAGGA AGGGTTTGGA 1260 

TTTTGAGGCC AAAAACCAGC ACACCCTGTA CGTTGAAGTG ACCAACGAGG CCCCTTTTGT 1320 

GCTGAAGCTC CCAACCTCCA CAGCCACCAT AGTGGTCCAC GTGGAGGATG TGAATGAGGC 1380 

55 ACCTGTGTTT GTCCCACCCT CCAAAGTCGT TGAGGTCCAG GAGGGCATCC CCACTGGGGA 1440 

GCCTGTGTGT GTCTACACTG CAGAAGACCC TGACAAGGAG AATCAAAAGA TCAGCTACCG 1500 

CATCCTGAGA GACCCAGCAG GGTGGCTAGC CATGGACCCA GACAGTGGGC AGGTCACAGC 1560 

TGTGGGCACC CTCGACCGTG AGGATGAGCA GTTTGTGAGG AACAACATCT ATGAAGTCAT 1620 

GGTCTTGGCC ATGGACAATG GAAGCCCTCC CACCACTGGC ACGGGAACCC TTCTGCTAAC 1680 

60 ACTGATTGAT GTCAATGACC ATGGCCCAGT CCCTGAGCCC CGTCAGATCA CCATCTGCAA 1740 

CCAAAGCCCT GTGCGCCAGG TGCTGAACAT CACGGACAAG GACCTGTCTC CCCACACCTC 1800 

CCCTTTCCAG GCCCAGCTCA CAGATGACTC AGACATCTAC TGGACGGCAG AGGTCAACGA 1860 

GGAAGGTGAC ACAGTGGTCT TGTCCCTGAA GAAGTTCCTG AAGCAGGATA CATATGACGT 1920 

GCACCTTTCT CTGTCTGACC ATGGCAACAA AGAGCAGCTG ACGGTGATCA GGGCCACTGT 1980 

65 GTGCGACTGC CATGGCCATG TCGAAACCTG CCCTGGACCC TGGAAGGGAG GTTTCATCCT 2040 

CCCTGTGCTG GGGGCTGTCC TGGCTCTGCT GTTCCTCCTG CTGGTGCTGC TTTTGTTGGT 2100 

GAGAAAGAAG CGGAAGATCA AGGAGCCCCT CCTACTCCCA GAAGATGACA CCCGTGACAA 2160 

CGTCTTCTAC TATGGCGAAG AGGGGGGTGG CGAAGAGGAC CAGGACTATG ACATCACCCA 2220 

GCTCCACCGA GGTCTGGAGG CCAGGCCGGA GGTGGTTCTC CGCAATGACG TGGCACCAAC 2280 

70 CATCATCCCG ACACCCATGT ACCGTCCTCG GCCAGCCAAC CCAGATGAAA TCGGCAACTT 2340 

TATAATTGAG AACCTGAAGG CGGCTAACAC AGACCCCACA GCCCCGCCCT ACGACACCCT 2400 

CTTGGTGTTC GACTATGAGG GCAGCGGCTC CGACGCCGCG TCCCTGAGCT CCCTCACCTC 2460 

CTCCGCCTCC GACCAAGACC AAGATTACGA TTATCTGAAC GAGTGGGGCA GCCGCTTCAA 2520 

GAAGCTGGCA GACATGTACG GTGGCGGGGA GGACGACTAG GCGGCCTGCC TGCAGGGCTG 2580 

GGGACCAAAC GTCAGGCCAC AGAGCATCTC CAAGGGGTCT CAGTTCCCCC TTCAGCTGAG 2640 

GACTTCGGAG CTTGTCAGGA AGTGGCCGTA GCAACTTGGC GGAGACAGGC TATGAGTCTG 2700 

ACGTTAGAGT GGTTGCTTCC TTAGCCTTTC AGGATGGAGG AATGTGGGCA GTTTGACTTC 2760 

AGCACTGAAA ACCTCTCCAC CTGGGCCAGG GTTGCCTCAG AGGCCAAGTT TCCAGAAGCC 2820 

TCTTACCTGC CGTAAAATGC TCAACCCTGT GTCCTGGGCC TGGGCCTGCT GTGACTGACC 2880 

TACAGTGGAC TTTCTCTCTG GAATGGAACC TTCTTAGGCC TCCTGGTGCA ACTTAATTTT 2940 

TTTTTTTAAT GCTATCTTCA AAACGTTAGA GAAAGTTCTT CAAAAGTGCA GCCCAGAGCT 3000 

GCTGGGCCCA CTGGCCGTCC TGCATTTCTG GTTTCCAGAC CCCAATGCCT CCCATTCGGA 3060 

TGGATCTCTG CGTTTTTATA CTGAGTGTGC CTAGGTTGCC CCTTATTTTT TATTTTCCCT 3120 

GTTGCGTTGC TATAGATGAA GGGTGAGGAC AATCGTGTAT ATGTACTAGA ACTTTTTTAT 3180 

TAAAGAAACT TTTCCCAGAA AAAAA 

Seq ID NO: 59 Protein sequence: 






1 11 21 

I I I 

AGCAGTTCTA AGGGACCATA CAGAGTATTC 
CAGCATGAGT TCCCAGCAGC AGAAGCAGCC 
GCAGGTGAAA CAGCCTTGCC AGCCTCCACC 
GCCCTGCCAC CCCAAGGTGC CTGAGCCCTG 
CAAGCTTCCA GAGCCATGCC ACCCCAAGGT 
AGCACCAGCC CAGCAGAAGA CCAAGCAGAA 
AGCCGGCCAC CAGATGCTGA ATCCCCTATC 
CAATTAGCAT TCTGTCTCCC CCAAAAAAGA 
TCTGAGTCTC TGAATGAAGC TGAAGGTCTT 
ATTCATCTGA AGAGAGACTT AAGATGAAAG 
AAATTCACTT TCAATTCCA 






Seq ID NO: 57 Protein sequence: 
Protein Accession #: NP_003116 

1 11 21 

I I I 

MSSQQQKQPC IPPPQLQQQQ VKQPCQPPPQ 
LPEPCHPKVP EPCPSIVTPA PAQQKTKQK 




Protein Accession #: NP_001784.2 






1 11 21 31 41 51 

| | | | | |

MGLPRGPLAS LLLLQVCWLQ CAASEPCRAV FREAEVTLEA GGAEQEPGQA LGKVFMGCPG 60 

QEPALFSTDN DDFTVRNGET VQERRSLKER NPLKIFPSKR ILRRHXRDWV VAPISVPENG 120 

KGPFPQRLNQ LKSNKDRDTK IFYSITGPGA DSPPEGVFAV EKETGWLLLN KPLDREEIAK 180 

YELFGHAVSE NGASVEDPMN ISIIVTDQND HKPKFTQDTF RGSVLEGVLP GTSVMQVTAT 240 

DEDDAIYTYN GVVAYSIHSQ EPKDPHDLMF TIHRSTGTIS VISSGLDREK VPEYTLTIQA 300 

TDMDGDGSTT TAVAVVEILD ANDNAPMFDP QKYEAHVPEN AVGHEVQRLT VTDLDAPNSP 360 

AWRATYLIMG GDDGDHFTIT THPESNQGIL TTRKGLDFEA KNQHTLYVEV TNEAPFVLKL 420 

PTSTATIVVH VEDVNEAPVF VPPSKVVEVQ EGIPTGEPVC VYTAEDPDKE NQKISYRILR 480 

DPAGWLAMDP DSGQVTAVGT LDREDEQFVR NNIYEVMVLA MDNGSPPTTG TGTLLLTLID 540 

VNDHGPVPEP RQITICNQSP VRQVLNITDK DLSPHTSPFQ AQLTDDSDIY WTAEVNEEGD 600 

TVVLSLKKFL KQDTYDVHLS LSDHGNKEQL TVIRATVCDC HGHVETCPGP WKGGFILPVL 660 

GAVLALLFLL LVLLLLVRKK RKIKEPLLLP EDDTRDNVFY YGEEGGGEED QDYDITQLHR 720 

GLEARPEVVL RNDVAPTIIP TPMYRPRPAN PDEIGNFIIE NLKAANTDPT APPYDTLLVF 780 
DYEGSGSDAA SLSSLTSSAS DQDQDYDYLN EWGSRFKKLA DMYGGGEDD 

Seq ID NO: 60 DNA sequence 

Nucleic Acid Accession #: Eos sequence 
Coding sequence: 162-428 



1 11 21 31 41 51 

| | | | | |

GCGTTCCGTT GGCGGCGGAT TCGAACGTTC GGACTGAGGT TTTTCTGCCT GAAGAAGCGT 60 

CATACGGACC GGATTGTTTT CGCTGGCCCA GTGTCCCCGG AGCTTGTGTG CGATACAGAG 120 

AGCACCTCGG AAGCTGAGGC AGCTGGTACT TGACAGAGAG GATGGCGCTG TCGACCATAG 180 

TCTCCCAGAG GAAGCAGATA AAGCGGAAGG CTCCCCGTGG CTTTCTAAAG CGAGTCTTCA 240 

AGCGAAAGAA GCCTCAACTT CGTCTGGAGA AAAGTGGTGA CTTATTGGTC CATCTGAACT 300 

GTTTACTGTT TGTTCATCGA TTAGCAGAAG AGTCCAGGAC AAACGCTTGT GCGAGTAAAT 360 

GTAGAGTCAT TAACAAGGAG CATGTACTGG CCGCAGCAAA GGTAATTCTA AAGAAGAGCA 420 

GAGGTTAGAA GTCAAAGAAC ATATTCTTGA AAGTTATGAT GCATTCTTTT GGGTGGTAAC 480 
AGATCATAAA GACATTTTTT ACACATCAGT TAATATGGGA TTATTAAATA TTGG 






Seq ID NO: 61 Protein sequence: 
Protein Accession #: Eos sequence 



1 11 21 31 41 51 

| | | | | |

MALSTIVSQR KQIKRKAPRG FLKRVFKRKK PQLRLEKSGD LLVHLNCLLF VHRLAEESRT 60 
NACASKCRVI NKEHVLAAAK VILKKSRG 



Seq ID NO: 62 DNA sequence 
Nucleic Acid Accession #: NM_000094.2 
Coding sequence: 99-8933 

1 11 21 31 41 51 

| | | | | |

GGGCTGGAGG GGCGCTGGGC TCGGACCTGC CAAGGCCACC GCAGGGGGGA GCAAGGGACA 60 

GAGGCGGGGG TCCTAGCTGA CGGCTTTTAC TGCCTAGGAT GACGCTGCGG CTTCTGGTGG 120 

CCGCGCTCTG CGCCGGGATC CTGGCAGAGG CGCCCCGAGT GCGAGCCCAG CACAGGGAGA 180 

GAGTGACCTG CACGCGCCTT TACGCCGCTG ACATTGTGTT CTTACTGGAT GGCTCCTCAT 240 

CCATTGGCCG CAGCAATTTC CGCGAGGTCC GCAGCTTTCT CGAAGGGCTG GTGCTGCCTT 300 

TCTCTGGAGC AGCCAGTGCA CAGGGTGTGC GCTTTGCCAC AGTGCAGTAC AGCGATGACC 360 

CACGGACAGA GTTCGGCCTG GATGCACTTG GCTCTGGGGG TGATGTGATC CGCGCCATCC 420 

GTGAGCTTAG CTACAAGGGG GGCAACACTC GCACAGGGGC TGCAATTCTC CATGTGGCTG 480 

ACCATGTCTT CCTGCCCCAG CTGGCCCGAC CTGGTGTCCC CAAGGTCTGC ATCCTGATCA 540 

CAGACGGGAA GTCCCAGGAC CTGGTGGACA CAGCTGCCCA AAGGCTGAAG GGGCAGGGGG 600 

TCAAGCTATT TGCTGTGGGG ATCAAGAATG CTGACCCTGA GGAGCTGAAG CGAGTTGCCT 660 

CACAGCCAAC CTCCGACTTC TTCTTCTTCG TCAATGACTT CAGCATCTTG AGGACACTAC 720 

TGCCCCTCGT TTCCCGGAGA GTGTGCACGA CTGCTGGTGG CGTGCCTGTG ACCCGACCTC 780 

CGGATGACTC GACCTCTGCT CCACGAGACC TGGTGCTGTC TGAGCCAAGC AGCCAATCCT 840 

TGAGAGTACA GTGGACAGCG GCCAGTGGCC CTGTGACTGG CTACAAGGTC CAGTACACTC 900 

CTCTGACGGG GCTGGGACAG CCACTGCCGA GTGAGCGGCA GGAGGTGAAC GTCCCAGCTG 960 

GTGAGACCAG TGTGCGGCTG CGGGGTCTCC GGCCACTGAC CGAGTACCAA GTGACTGTGA 1020 

TTGCCCTCTA CGCCAACAGC ATCGGGGAGG CTGTGAGCGG GACAGCTCGG ACCACTGCCC 1080 

TAGAAGGGCC GGAACTGACC ATCCAGAATA CCACAGCCCA CAGCCTCCTG GTGGCCTGGC 1140 

GGAGTGTGCC AGGTGCCACT GGCTACCGTG TGACATGGCG GGTCCTCAGT GGTGGGCCCA 1200 

CACAGCAGCA GGAGCTGGGC CCTGGGCAGG GTTCAGTGTT GCTGCGTGAC TTGGAGCCTG 1260 

GCACGGACTA TGAGGTGACC GTGAGCACCC TATTTGGCCG CAGTGTGGGG CCCGCCACTT 1320 

CCCTGATGGC TCGCACTGAC GCTTCTGTTG AGCAGACCCT GCGCCCGGTC ATCCTGGGCC 1380 

CCACATCCAT CCTCCTTTCC TGGAACTTGG TGCCTGAGGC CCGTGGCTAC CGGTTGGAAT 1440 

GGCGGCGTGA GACTGGCTTG GAGCCACCGC AGAAGGTGGT ACTGCCCTCT GATGTGACCC 1500 

GCTACCAGTT GGATGGGCTG CAGCCGGGCA CTGAGTACCG CCTCACACTC TACACTCTGC 1560 

TGGAGGGCCA CGAGGTGGCC ACCCCTGCAA CCGTGGTTCC CACTGGACCA GAGCTGCCTG 1620 

TGAGCCCTGT AACAGACCTG CAAGCCACCG AGCTGCCCGG GCAGCGGGTG CGAGTGTCCT 1680 

GGAGCCCAGT CCCTGGTGCC ACCCAGTACC GCATCATTGT GCGCAGCACC CAGGGGGTTG 1740 

AGCGGACCCT GGTGCTTCCT GGGAGTCAGA CAGCATTCGA CTTGGATGAC GTTCAGGCTG 1800 

GGCTTAGCTA CACTGTGCGG GTGTCTGCTC GAGTGGGTCC CCGTGAGGGC AGTGCCAGTG 1860 

TCCTCACTGT CCGCCGGGAG CCGGAAACTC CACTTGCTGT TCCAGGGCTG CGGGTTGTGG 1920 

TGTCAGATGC AACGCGAGTG AGGGTGGCCT GGGGACCCGT CCCTGGAGCC AGTGGATTTC 1980 

GGATTAGCTG GAGCACAGGC AGTGGTCCGG AGTCCAGCCA GACACTGCCC CCAGACTCTA 2040 

CTGCCACAGA CATCACAGGG CTGCAGCCTG GAACCACCTA CCAGGTGGCT GTGTCGGTAC 2100 

TGCGAGGCAG AGAGGAGGGC CCTGCTGCAG TCATCGTGGC TCGAACGGAC CCACTGGGCC 2160 

CAGTGAGGAC GGTCCATGTG ACTCAGGCCA GCAGCTCATC TGTCACCATT ACCTGGACCA 2220 

GGGTTCCTGG CGCCACAGGA TACAGGGTTT CCTGGCACTC AGCCCACGGC CCAGAGAAAT 2280 




CCCAGTTGGT TTCTGGGGAG GCCACGGTGG CTGAGCTGGA TGGACTGGAG CCAGATACTG 2340 

AGTATACGGT GCATGTGAGG GCCCATGTGG CTGGCGTGGA TGGGCCCCCT GCCTCTGTGG 2400 

TTGTGAGGAC TGCCCCTGAG CCTGTGGGTC GTGTGTCGAG GCTGCAGATC CTCAATGCTT 2460 

CCAGCGACGT TCTACGGATC ACCTGGGTAG GGGTCACTGG AGCCACAGCT TACAGACTGG 2520 

CCTGGGGCCG GAGTGAAGGC GGCCCCATGA GGCACCAGAT ACTCCCAGGA AACACAGACT 2580 

CTGCAGAGAT CCGGGGTCTC GAAGGTGGAG TCAGCTACTC AGTGCGAGTG ACTGCACTTG 2640 

TCGGGGACCG CGAGGGCACA CCTGTCTCCA TTGTTGTCAC TACGCCGCCT GAGGCTCCGC 2700 

CAGCCCTGGG GACGCTTCAC GTGGTGCAGC GCGGGGAGCA CTCGCTGAGG CTGCGCTGGG 2760 

AGCCGGTGCC CAGAGCGCAG GGCTTCCTTC TGCACTGGCA ACCTGAGGGT GGCCAGGAAC 2820 

AGTCCCGGGT CCTGGGGCCC GAGCTCAGCA GCTATCACCT GGACGGGCTG GAGCCAGCGA 2880 

CACAGTACCG CGTGAGGCTG AGTGTCCTAG GGCCGGCTGG AGAAGGGCCC TCTGCAGAGG 2940 

TGACTGCGCG CACTGAGTCA CCTCGTGTTC CAAGCATTGA ACTACGTGTG GTGGACACCT 3000 

CGATCGACTC GGTGACTTTG GCCTGGACTC CAGTGTCCAG GGCATCCAGC TACATCCTAT 3060 

CCTGGCGGCC ACTCAGAGGC CCTGGCCAGG AAGTGCCTGG GTCCCCGCAG ACACTTCCAG 3120 

GGATCTCAAG CTCCCAGCGG GTGACAGGGC TAGAGCCTGG CGTCTCTTAC ATCTTCTCCC 3180 

TGACGCCTGT CCTGGATGGT GTGCGGGGTC CTGAGGCATC TGTCACACAG ACGCCAGTGT 3240 

GCCCCCGTGG CCTGGCGGAT GTGGTGTTCC TACCACATGC CACTCAAGAC AATGCTCACC 3300 

GTGCGGAGGC TACGAGGAGG GTCCTGGAGC GTCTGGTGTT GGCACTTGGG CCTCTTGGGC 3360 

CACAGGCAGT TCAGGTTGGC CTGCTGTCTT ACAGTCATCG GCCCTCCCCA CTGTTCCCAC 3420 

TGAATGGCTC CCATGACCTT GGCATTATCT TGCAAAGGAT CCGTGACATG CCCTACATGG 3480 

ACCCAAGTGG GAACAACCTG GGCACAGCCG TGGTCACAGC TCACAGATAC ATGTTGGCAC 3540 

CAGATGCTCC TGGGCGCCGC CAGCACGTAC CAGGGGTGAT GGTTCTGCTA GTGGATGAAC 3600 

CCTTGAGAGG TGACATATTC AGCCCCATCC GTGAGGCCCA GGCTTCTGGG CTTAATGTGG 3660 

TGATGTTGGG AATGGCTGGA GCGGACCCAG AGCAGCTGCG TCGCTTGGCG CCGGGTATGG 3720 

ACTCTGTCCA GACCTTCTTC GCCGTGGATG ATGGGCCAAG CCTGGACCAG GCAGTCAGTG 3780 

GTCTGGCCAC AGCCCTGTGT CAGGCATCCT TCACTACTCA GCCCCGGCCA GAGCCCTGCC 3840 

CAGTGTATTG TCCAAAGGGC CAGAAGGGGG AACCTGGAGA GATGGGCCTG AGAGGACAAG 3900 

TTGGGCCTCC TGGCGACCCT GGCCTCCCGG GCAGGACCGG TGCTCCCGGC CCCCAGGGGC 3960 

CCCCTGGAAG TGCCACTGCC AAGGGCGAGA GGGGCTTCCC TGGAGCAGAT GGGCGTCCAG 4020 

GCAGCCCTGG CCGCGCCGGG AATCCTGGGA CCCCTGGAGC CCCTGGCCTA AAGGGCTCTC 4080 

CAGGGTTGCC TGGCCCTCGT GGGGACCCGG GAGAGCGAGG ACCTCGAGGC CCAAAGGGGG 4140 

AGCCGGGGGC TCCCGGACAA GTCATCGGAG GTGAAGGACC TGGGCTTCCT GGGCGGAAAG 4200 

GGGACCCTGG ACCATCGGGC CCCCCTGGAC CTCGTGGACC ACTGGGGGAC CCAGGACCCC 4260 

GTGGCCCCCC AGGGCTTCCT GGAACAGCCA TGAAGGGTGA CAAAGGCGAT CGTGGGGAGC 4320 

GGGGTCCCCC TGGACCAGGT GAAGGTGGCA TTGCTCCTGG GGAGCCTGGG CTGCCGGGTC 4380 

TTCCCGGAAG CCCTGGACCC CAAGGCCCCG TTGGCCCCCC TGGAAAGAAA GGAGAAAAAG 4440 

GTGACTCTGA GGATGGAGCT CCAGGCCTCC CAGGACAACC TGGGTCTCCG GGTGAGCAGG 4500 

GCCCACGGGG ACCTCCTGGA GCTATTGGCC CCAAAGGTGA CCGGGGCTTT CCAGGGCCCC 4560 

TGGGTGAGGC TGGAGAGAAG GGCGAACGTG GACCCCCAGG CCCAGCGGGA TCCCGGGGGC 4620 

TGCCAGGGGT TGCTGGACGT CCTGGAGCCA AGGGTCCTGA AGGGCCACCA GGACCCACTG 4680 

GCCGCCAAGG AGAGAAGGGG GAGCCTGGTC GCCCTGGGGA CCCTGCAGTG GTGGGACCTG 4740 

CTGTTGCTGG ACCCAAAGGA GAAAAGGGAG ATGTGGGGCC CGCTGGGCCC AGAGGAGCTA 4800 

CCGGAGTCCA AGGGGAACGG GGCCCACCCG GCTTGGTTCT TCCTGGAGAC CCTGGCCCCA 4860 

AGGGAGACCC TGGAGACCGG GGTCCCATTG GCCTTACTGG CAGAGCAGGA CCCCCAGGTG 4920 

ACTCAGGGCC TCCTGGAGAG AAGGGAGACC CTGGGCGGCC TGGCCCCCCA GGACCTGTTG 4980 

GCCCCCGAGG ACGAGATGGT GAAGTTGGAG AGAAAGGTGA CGAGGGTCCT CCGGGTGACC 5040 

CGGGTTTGCC TGGAAAAGCA GGCGAGCGTG GCCTTCGGGG GGCACCTGGA GTTCGGGGGC 5100 

CTGTGGGTGA AAAGGGAGAC CAGGGAGATC CTGGAGAGGA TGGACGAAAT GGCAGCCCTG 5160 

GATCATCTGG ACCCAAGGGT GACCGTGGGG AGCCGGGTCC CCCAGGACCC CCGGGACGGC 5220 

TGGTAGACAC AGGACCTGGA GCCAGAGAGA AGGGAGAGCC TGGGGACCGC GGACAAGAGG 5280 

GTCCTCGAGG GCCCAAGGGT GATCCTGGCC TCCCTGGAGC CCCTGGGGAA AGGGGCATTG 5340 

AAGGGTTTCG GGGACCCCCA GGCCCACAGG GGGACCCAGG TGTCCGAGGC CCAGCAGGAG 5400 

AAAAGGGTGA CCGGGGTCCC CCTGGGCTGG ATGGCCGGAG CGGACTGGAT GGGAAACCAG 5460 

GAGCCGCTGG GCCCTCTGGG CCGAATGGTG CTGCAGGCAA AGCTGGGGAC CCAGGGAGAG 5520 

ACGGGCTTCC AGGCCTCCGT GGAGAACAAG GCCTCCCTGG CCCCTCTGGT CCCCCTGGAT 5580 

TACCGGGAAA GCCAGGCGAG GATGGGAAAC CTGGCCTGAA TGGAAAAAAC GGAGAACCTG 5640 

GGGACCCTGG AGAAGACGGG AGGAAGGGAG AGAAAGGAGA TTCAGGCGCC TCTGGGAGAG 5700 

AAGGTCGTGA TGGCCCCAAG GGTGAGCGTG GAGCTCCTGG TATCCTTGGA CCCCAGGGGC 5760 

CTCCAGGCCT CCCAGGGCCA GTGGGCCCTC CTGGCCAGGG TTTTCCTGGT GTCCCAGGAG 5820 

GCACGGGCCC CAAGGGTGAC CGTGGGGAGA CTGGATCCAA AGGGGAGCAG GGCCTCCCTG 5880 

GAGAGCGTGG CCTGCGAGGA GAGCCTGGAA GTGTGCCGAA TGTGGATCGG TTGCTGGAAA 5940 

CTGCTGGCAT CAAGGCATCT GCCCTGCGGG AGATCGTGGA GACCTGGGAT GAGAGCTCTG 6000 

GTAGCTTCCT GCCTGTGCCC GAACGGCGTC GAGGCCCCAA GGGGGACTCA GGCGAACAGG 6060 

GCCCCCCAGG CAAGGAGGGC CCCATCGGCT TTCCTGGAGA ACGCGGGCTG AAGGGCGACC 6120 

GTGGAGACCC TGGCCCTCAG GGGCCACCTG GTCTGGCCCT TGGGGAGAGG GGCCCCCCCG 6180 

GGCCTTCCGG CCTTGCCGGG GAGCCTGGAA AGCCTGGTAT TCCCGGGCTC CCAGGCAGGG 6240 

CTGGGGGTGT GGGAGAGGCA GGAAGGCCAG GAGAGAGGGG AGAACGGGGA GAGAAAGGAG 6300 

AACGTGGAGA ACAGGGCAGA GATGGCCCTC CTGGACTCCC TGGAACCCCT GGGCCCCCCG 6360 

GACCCCCTGG CCCCAAGGTG TCTGTGGATG AGCCAGGTCC TGGACTCTCT GGAGAACAGG 6420 

GACCCCCTGG ACTCAAGGGT GCTAAGGGGG AGCCGGGCAG CAATGGTGAC CAAGGTCCCA 6480 

AAGGAGACAG GGGTGTGCCA GGCATCAAAG GAGACCGGGG AGAGCCTGGA CCGAGGGGTC 6540 

AGGACGGCAA CCCGGGTCTA CCAGGAGAGC GTGGTATGGC TGGGCCTGAA GGGAAGCCGG 6600 

GTCTGCAGGG TCCAAGAGGC CCCCCTGGCC CAGTGGGTGG TCATGGAGAC CCTGGACCAC 6660 

CTGGTGCCCC GGGTCTTGCT GGCCCTGCAG GACCCCAAGG ACCTTCTGGC CTGAAGGGGG 6720 

AGCCTGGAGA GACAGGACCT CCAGGACGGG GCCTGACTGG ACCTACTGGA GCTGTGGGAC 6780 

TTCCTGGACC CCCCGGCCCT TCAGGCCTTG TGGGTCCACA GGGGTCTCCA GGTTTGCCTG 6840 

GACAAGTGGG GGAGACAGGG AAGCCGGGAG CCCCAGGTCG AGATGGTGCC AGTGGAAAAG 6900 

ATGGAGACAG AGGGAGCCCT GGTGTGCCAG GGTCACCAGG TCTGCCTGGC CCTGTCGGAC 6960 

CTAAAGGAGA ACCTGGCCCC ACGGGGGCCC CTGGACAGGC TGTGGTCGGG CTCCCTGGAG 7020 

CAAAGGGAGA GAAGGGAGCC CCTGGAGGCC TTGCTGGAGA CCTGGTGGGT GAGCCGGGAG 7080 

CCAAAGGTGA CCGAGGACTG CCAGGGCCGC GAGGCGAGAA GGGTGAAGCT GGCCGTGCAG 7140 

GGGAGCCCGG AGACCCTGGG GAAGATGGTC AGAAAGGGGC TCCAGGACCC AAAGGTTTCA 7200 

AGGGTGACCC AGGAGTCGGG GTCCCGGGCT CCCCTGGGCC TCCTGGCCCT CCAGGTGTGA 7260 

AGGGAGATCT GGGCCTCCCT GGCCTGCCCG GTGCTCCTGG TGTTGTTGGG TTCCCGGGTC 7320 

AGACAGGCCC TCGAGGAGAG ATGGGTCAGC CAGGCCCTAG TGGAGAGCGG GGTCTGGCAG 7380 

GCCCCCCAGG GAGAGAAGGA ATCCCAGGAC CCCTGGGGCC ACCTGGACCA CCGGGGTCAG 7440 

TGGGACCACC TGGGGCCTCT GGACTCAAAG GAGACAAGGG AGACCCTGGA GTAGGGCTGC 7500 

CTGGGCCCCG AGGCGAGCGT GGGGAGCCAG GCATCCGGGG TGAAGATGGC CGCCCCGGCC 7560 

AGGAGGGACC CCGAGGACTC ACGGGGCCCC CTGGCAGCAG GGGAGAGCGT GGGGAGAAGG 7620 

GTGATGTTGG GAGTGCAGGA CTAAAGGGTG ACAAGGGAGA CTCAGCTGTG ATCCTGGGGC 7680 

CTCCAGGCCC ACGGGGTGCC AAGGGGGACA TGGGTGAACG AGGGCCTCGG GGCTTGGATG 7740 

GTGACAAAGG ACCTCGGGGA GACAATGGGG ACCCTGGTGA CAAGGGCAGC AAGGGAGAGC 7800 

CTGGTGACAA GGGCTCAGCC GGGTTGCCAG GACTGCGTGG ACTCCTGGGA CCCCAGGGTC 7860 

AACCTGGTGC AGCAGGGATC CCTGGTGACC CGGGATCCCC AGGAAAGGAT GGAGTGCCTG 7920 

GTATCCGAGG AGAAAAAGGA GATGTTGGCT TCATGGGTCC CCGGGGCCTC AAGGGTGAAC 7980 

GGGGAGTGAA GGGAGCCTGT GGCCTTGATG GAGAGAAGGG AGACAAGGGA GAAGCTGGTC 8040 

CCCCAGGCCG CCCCGGGCTG GCAGGACACA AAGGAGAGAT GGGGGAGCCT GGTGTGCCGG 8100 

GCCAGTCGGG GGCCCCTGGC AAGGAGGGCC TGATCGGTCC CAAGGGTGAC CGAGGCTTTG 8160 

ACGGGCAGCC AGGCCCCAAG GGTGACCAGG GCGAGAAAGG GGAGCGGGGA ACCCCAGGAA 8220 

TTGGGGGCTT CCCAGGCCCC AGTGGAAATG ATGGCTCTGC TGGTCCCCCA GGGCCACCTG 8280 

GCAGTGTTGG TCCCAGAGGC CCCGAAGGAC TTCAGGGCCA GAAGGGTGAG CGAGGTCCCC 8340 

CCGGAGAGAG AGTGGTGGGG GCTCCTGGGG TCCCTGGAGC TCCTGGCGAG AGAGGGGAGC 8400 

AGGGGCGGCC AGGGCCTGCC GGTCCTCGAG GCGAGAAGGG AGAAGCTGCA CTGACGGAGG 8460 

ATGACATCCG GGGCTTTGTG CGCCAAGAGA TGAGTCAGCA CTGTGCCTGC CAGGGCCAGT 8520 

TCATCGCATC TGGATCACGA CCCCTCCCTA GTTATGCTGC AGACACTGCC GGCTCCCAGC 8580 

TCCATGCTGT GCCTGTGCTC CGCGTCTCTC ATGCAGAGGA GGAAGAGCGG GTACCCCCTG 8640 

AGGATGATGA GTACTCTGAA TACTCCGAGT ATTCTGTGGA GGAGTACCAG GACCCTGAAG 8700 

CTCCTTGGGA TAGTGATGAC CCCTGTTCCC TGCCACTGGA TGAGGGCTCC TGCACTGCCT 8760 

ACACCCTGCG CTGGTACCAT CGGGCTGTGA CAGGCAGCAC AGAGGCCTGT CACCCTTTTG 8820 

TCTATGGTGG CTGTGGAGGG AATGCCAACC GTTTTGGGAC CCGTGAGGCC TGCGAGCGCC 8880 

GCTGCCCACC CCGGGTGGTC CAGAGCCAGG GGACAGGTAC TGCCCAGGAC TGAGGCCCAG 8940 

ATAATGAGCT GAGATTCAGC ATCCCCTGGA GGAGTCGGGG TCTCAGCAGA ACCCCACTGT 9000 

CCCTCCCCTT GGTGCTAGAG GCTTGTGTGC ACGTGAGCGT GCGAGTGCAC GTCCGTTATT 9060 

TCAGTGACTT GGTCCCGTGG GTCTAGCCTT CCCCCCTGTG GACAAACCCC CATTGTGGCT 9120 

CCTGCCACCC TGGCAGATGA CTCACTGTGG GGGGGTGGCT GTGGGCAGTG AGCGGATGTG 9180 

ACTGGCGTCT GACCCGCCCC TTGACCCAAG CCTGTGATGA CATGGTGCTG ATTCTGGGGG 9240 
GCATTAAAGC TGCTGTTTTA AAAGGCAAAA AA 

Seq ID NO: 63 Protein sequence: 
Protein Accession #: NP_000085.1 

1 11 21 31 41 51 

I I I I I I 

MTLRLLVAAL CAGILAEAPR VRAQHRERVT CTRLYAADIV FLLDGSSSIG RSNFREVRSF 60 

LEGLVLPFSG AASAQGVRFA TVQYSDDPRT EFGLDALGSG GDVIRAIREL SYKGGNTRTG 120 

AAILHVADHV FLPQLARPGV PKVCILITDG KSQDLVDTAA QRLKGQGVKL FAVGIKNADP 180 

EELKRVASQP TSDFFFFVND FSILRTLLPL VSRRVCTTAG GVPVTRPPDD STSAPRDLVL 240 

SEPSSQSLRV QWTAASGPVT GYKVQYTPLT GLGQPLPSER QEVNVPAGET SVRLRGLRPL 300 

TEYQVTVIAL YANSIGEAVS GTARTTALEG PELTIQNTTA HSLLVAWRSV PGATGYRVTW 360 

RVLSGGPTQQ QELGPGQGSV LLRDLEPGTD YEVTVSTLFG RSVGPATSLM ARTDASVEQT 420 

LRPVILGPTS ILLSWNLVPE ARGYRLEWRR ETGLEPPQKV VLPSDVTRYQ LDGLQPGTEY 480 

RLTLYTLLEG HEVATPATVV PTGPELPVSP VTDLQATELP GQRVRVSWSP VPGATQYRII 540 

VRSTQGVERT LVLPGSQTAF DLDDVQAGLS YTVRVSARVG PREGSASVLT VRREPETPLA 600 

VPGLRVVVSD ATRVRVAWGP VPGASGFRIS WSTGSGPESS QTLPPDSTAT DITGLQPGTT 660 

YQVAVSVLRG REEGPAAVIV ARTDPLGPVR TVHVTQASSS SVTITWTRVP GATGYRVSWH 720 

SAHGPEKSQL VSGEATVAEL DGLEPDTEYT VHVRAHVAGV DGPPASVVVR TAPEPVGRVS 780 

RLQILNASSD VLRITWVGVT GATAYRLAWG RSEGGPMRHQ ILPGNTDSAE IRGLEGGVSY 840 

SVRVTALVGD REGTPVSIVV TTPPEAPPAL GTLHVVQRGE HSLRLRWEPV PRAQGFLLHW 900 

QPEGGQEQSR VLGPELSSYH LDGLEPATQY RVRLSVLGPA GEGPSAEVTA RTESPRVPSI 960 

ELRVVDTSID SVTLAWTPVS RASSYILSWR PLRGPGQEVP GSPQTLPGIS SSQRVTGLEP 1020 

GVSYIFSLTP VLDGVRGPEA SVTQTPVCPR GLADVVFLPH ATQDNAHRAE ATRRVLERLV 1080 

LALGPLGPQA VQVGLLSYSH RPSPLFPLNG SHDLGIILQR IRDMPYMDPS GNNLGTAVVT 1140 

AHRYMLAPDA PGRRQHVPGV MVLLVDEPLR GDIFSPIREA QASGLNVVML GMAGADPEQL 1200 

RRLAPGMDSV QTFFAVDDGP SLDQAVSGLA TALCQASFTT QPRPEPCPVY CPKGQKGEPG 1260 

EMGLRGQVGP PGDPGLPGRT GAPGPQGPPG SATAKGERGF PGADGRPGSP GRAGNPGTPG 1320 

APGLKGSPGL PGPRGDPGER GPRGPKGEPG APGQVIGGEG PGLPGRKGDP GPSGPPGPRG 1380 

PLGDPGPRGP PGLPGTAMKG DKGDRGERGP PGPGEGGIAP GEPGLPGLPG SPGPQGPVGP 1440 

PGKKGEKGDS EDGAPGLPGQ PGSPGEQGPR GPPGAIGPKG DRGFPGPLGE AGEKGERGPP 1500 

GPAGSRGLPG VAGRPGAKGP EGPPGPTGRQ GEKGEPGRPG DPAVVGPAVA GPKGEKGDVG 1560 

PAGPRGATGV QGERGPPGLV LPGDPGPKGD PGDRGPIGLT GRAGPPGDSG PPGEKGDPGR 1620 

PGPPGPVGPR GRDGEVGEKG DEGPPGDPGL PGKAGERGLR GAPGVRGPVG EKGDQGDPGE 1680 

DGRNGSPGSS GPKGDRGEPG PPGPPGRLVD TGPGAREKGE PGDRGQEGPR GPKGDPGLPG 1740 

APGERGIEGF RGPPGPQGDP GVRGPAGEKG DRGPPGLDGR SGLDGKPGAA GPSGPNGAAG 1800 

KAGDPGRDGL PGLRGEQGLP GPSGPPGLPG KPGEDGKPGL NGKNGEPGDP GEDGRKGEKG 1860 

DSGASGREGR DGPKGERGAP GILGPQGPPG LPGPVGPPGQ GFPGVPGGTG PKGDRGETGS 1920 

KGEQGLPGER GLRGEPGSVP NVDRLLETAG IKASALREIV ETWDESSGSF LPVPERRRGP 1980 

KGDSGEQGPP GKEGPIGFPG ERGLKGDRGD PGPQGPPGLA LGERGPPGPS GLAGEPGKPG 2040 

IPGLPGRAGG VGEAGRPGER GERGEKGERG EQGRDGPPGL PGTPGPPGPP GPKVSVDEPG 2100 

PGLSGEQGPP GLKGAKGEPG SNGDQGPKGD RGVPGIKGDR GEPGPRGQDG NPGLPGERGM 2160 

AGPEGKPGLQ GPRGPPGPVG GHGDPGPPGA PGLAGPAGPQ GPSGLKGEPG ETGPPGRGLT 2220 

GPTGAVGLPG PPGPSGLVGP QGSPGLPGQV GETGKPGAPG RDGASGKDGD RGSPGVPGSP 2280 

GLPGPVGPKG EPGPTGAPGQ AVVGLPGAKG EKGAPGGLAG DLVGEPGAKG DRGLPGPRGE 2340 

KGEAGRAGEP GDPGEDGQKG APGPKGFKGD PGVGVPGSPG PPGPPGVKGD LGLPGLPGAP 2400 

GVVGFPGQTG PRGEMGQPGP SGERGLAGPP GREGIPGPLG PPGPPGSVGP PGASGLKGDK 2460 

GDPGVGLPGP RGERGEPGIR GEDGRPGQEG PRGLTGPPGS RGERGEKGDV GSAGLKGDKG 2520 

DSAVILGPPG PRGAKGDMGE RGPRGLDGDK GPRGDNGDPG DKGSKGEPGD KGSAGLPGLR 2580 

GLLGPQGQPG AAGIPGDPGS PGKDGVPGIR GEKGDVGFMG PRGLKGERGV KGACGLDGEK 2640 

GDKGEAGPPG RPGLAGHKGE MGEPGVPGQS GAPGKEGLIG PKGDRGFDGQ PGPKGDQGEK 2700 

GERGTPGIGG FPGPSGNDGS AGPPGPPGSV GPRGPEGLQG QKGERGPPGE RVVGAPGVPG 2760 

APGERGEQGR PGPAGPRGEK GEAALTEDDI RGFVRQEMSQ HCACQGQFIA SGSRPLPSYA 2820 

ADTAGSQLHA VPVLRVSHAE EEERVPPEDD EYSEYSEYSV EEYQDPEAPW DSDDPCSLPL 2880 

DEGSCTAYTL RWYHRAVTGS TEACHPFVYG GCGGNANRFG TREACERRCP PRVVQSQGTG 2940 
TAQD 
Seq ID NO: 64 DNA sequence 

Nucleic Acid Accession #: NM_006945 

Coding sequence: 1-219 

ATGTCTTATC AACAGCAGCA GTGCAAGCAG CCCTGCCAGC CACCTCCTGT GTGCCCCACG 60 
CCAAAGTGCC CAGAGCCATG TCCACCCCCG AAGTGCCCTG AGCCCTGCCC ACCACCAAAG 120 
TGTCCACAGC CCTGCCCACC TCAGCAGTGC CAGCAGAAAT ATCCTCCTGT GACACCTTCC 180 
CCACCCTGCC AGCCAAAGTA TCCACCGAAG AGCAAGTAA 

Seq ID NO: 65 Protein sequence: 
Protein Accession #: NP_008876 

MSYQQQQCKQ PCQPPPVCPT PKCPEPCPPP KCPEPCPPPK CPQPCPPQQC QQKYPPVTPS 60 

PPCQPKYPPK SK 

Seq ID NO: 66 DNA sequence 

Nucleic Acid Accession #: NM_005629.1 

Coding sequence: 639-2546 

1 11 21 31 41 51 

I I I I I I 

TAGTCGGAGC GAGGTGGCGA GTCGCTGAGC CCGCCGCGGC CCCGAGAGCG GCTGCAGCCG 60 
CCGCCGCCGG GAAGGAGAGG GCGAGGCGCG CCCGAGCCGC CGCCGCCGCC GCCACCGCCG 120 
CCGCCGCCAC CACCGCCACC GGAGTCGCGG GCCAGCCGGG CAGCCTCCGC GGGCCCCGGC 180 
CGGGGCGGGG GGCGCGGGCC ACAGGCCCCT GCTCCGGCCG TCGTTTGCAG ACCGCGGGCG 240 
CCGATGTCGC CCGCGCCCCG TTAGGATGAG TCTCGGGTCG GGCGAGGAGC CGCCGCAGCC 300 
GCCGCCGCCC GAGCCGCGGG CAGGAGCCTC GGGAGCCGCC GCCGCCGCCG CCGCCGCCCG 360 
GCCGGGCCCC GACGCCGCCC GCGCGCCCCC GGGCCCCCGA CACACATGAG ATTCTTCAGG 420 
CTCACTTTCA AGTGCTTCGT GGACTGCTTC TGACTGCGCC GCCCGCGCCC CGCACCCCGC 480 
CGTCCGCCCG CCGCCCCGTC CCCCGGCCCG GCCGCCCCCC GGCCCCCGGC CGGCCCGCGC 540 
CCTCGGGGCC CTCCCCGGTG CCGCCGGTGC CCCCCGCCTG ACCGCCGCCC CCCGTGAGGC 600 
GCCGCGACCC CGGCCCGGCC GTGCGGCCCG CCGGGGCCAT GGCGAAGAAG AGCGCCGAGA 660 
ACGGCATCTA TAGCGTGTCC GGCGACGAGA AGAAGGGCCC CCTCATCGCG CCCGGGCCCG 720 
ACGGQGCCCC GGCCAAGGGC GACGGCCCCG TGGGCCTGGG GACACCCGGC GGCCGCCTGG 780 
CCGTGCCGCC GCGCGAGACC TGGACGCGCC AGATGGACTT CATCATGTCG TGCGTGGGCT 840 
TCGCCGTGGG CTTGGGCAAC GTGTGGCGCT TCCCCTACCT GTGCTACAAG AACGGCGGAG 900 
GTGTGTTCCT TATTCCCTAC GTCCTGATCG CCCTGGTTGG AGGAATCCCC ATTTTCTTCT 960 
TAGAGATCTC GCTGGGCCAG TTCATGAAGG CCGGCAGCAT CAATGTCTGG AACATCTGTC 1020 
CCCTGTTCAA AGGCCTGGGC TACGCCTCCA TGGTGATCGT CTTCTACTGC AACACCTACT 1080 
ACATCATGGT GCTGGCCTGG GGCTTCTATT ACCTGGTCAA GTCCTTTACC ACCACGCTGC 1140 
CCTGGGCCAC ATGTGGCCAC ACCTGGAACA CTCCCGACTG CGTGGAGATC TTCCGCCATG 1200 
AAGACTGTGC CAATGCCAGC CTGGCCAACC TCACCTGTGA CCAGCTTGCT GACCGCCGGT 1260 
CCCCTGTCAT CGAGTTCTGG GAGAACAAAG TCTTGAGGCT GTCTGGGGGA CTGGAGGTGC 1320 
CAGGGGCCCT CAACTGGGAG GTGACCCTTT GTCTGCTGGC CTGCTGGGTG CTGGTCTACT 1380 
TCTGTGTCTG GAAGGGGGTC AAATCCACGG GAAAGATCGT GTACTTCACT GCTACATTCC 1440 
CCTACGTGGT CCTGGTCGTG CTGCTGGTGC GTGGAGTGCT GCTGCCTGGC GCCCTGGATG 1500 
GCATCATTTA CTATCTCAAG CCTGACTGGT CAAAGCTGGG GTCCCCTCAG GTGTGGATAG 1560 
ATGCGGGGAC CCAGATTTTC TTTTCTTACG CCATTGGCCT GGGGGCCCTC ACAGCCCTGG 1620 
GCAGCTACAA CCGCTTCAAC AACAACTGCT ACAAGGACGC CATCATCCTG GCTCTCATCA 1680 
ACAGTGGGAC CAGCTTCTTT GCTGGCTTCG TGGTCTTCTC CATCCTGGGC TTCATGGCTG 1740 
CAGAGCAGGG CGTGCACATC TCCAAGGTGG CAGAGTCAGG GCCGGGCCTG GCCTTCATCG 1800 
CCTACCCGCG GGCTGTCACG CTGATGCCAG TGGCCCCACT CTGGGCTGCC CTGTTCTTCT 1860 
TCATGCTGTT GCTGCTTGGT CTCGACAGCC AGTTTGTAGG TGTGGAGGGC TTCATCACCG 1920 
GCCTCCTCGA CCTCCTCCCG GCCTCCTACT ACTTCCGTTT CCAAAGGGAG ATCTCTGTGG 1980 
CCCTCTGTTG TGCCCTCTGC TTTGTCATCG ATCTCTCCAT GGTGACTGAT GGCGGGATGT 2040 
ACGTCTTCCA GCTGTTTGAC TACTACTCGG CCAGCGGCAC CACCCTGCTC TGGCAGGCCT 2100 
TTTGGGAGTG CGTGGTGGTG GCCTGGGTGT ACGGAGCTGA CCGCTTCATG GACGACATTG 2160 
CCTGTATGAT CGGGTACCGA CCTTGCCCCT GGATGAAATG GTGCTGGTCC TTCTTCACCC 2220 
CGCTGGTCTG CATGGGCATC TTCATCTTCA ACGTTGTGTA CTACGAGCCG CTGGTCTACA 2280 
ACAACACCTA CGTGTACCCG TGGTGGGGTG AGGCCATGGG CTGGGCCTTC GCCCTGTCCT 2340 
CCATGCTGTG CGTGCCGCTG CACCTCCTGG GCTGCCTCCT CAGGGCCAAG GGCACCATGG 2400 
CTGAGCGCTG GCAGCACCTG ACCCAGCCCA TCTGGGGCCT CCACCACTTG GAGTACCGAG 2460 
CTCAGGACGC AGATGTCAGG GGCCTGACCA CCCTGACCCC AGTGTCCGAG AGCAGCAAGG 2520 
TCGTCGTGGT GGAGAGTGTC ATGTGACAAC TCAGCTCACA TCACCAGCTC ACCTCTGGTA 2580 
GCCATAGCAG CCCCTGCTTC AGCCCCACCG CACCCCTCCA GGGGGCCTGC CTTTCCCTGA 2640 
CACTTTTGGG GTCTGCCTGG GGGAGGAGGG GAGAAAGCAC CATGAGTGCT CACTAAAACA 2700 
ACTTTTTCCA TTTTTAATAA AACGCCAAAA ATATCACAAC CCACCAAAAA TAGATGCCTC 2760 
TCCCCCTCCA GCCCTAGCCG AGCTGGTCCT AGGCCCCGCC TAGTGCCCCA CCCCCACCCA 2820 
CAGTGCTGCA CTCCTCCTGC CCCTGCCACG CCCACCCCCT GCCCACCTCT CCAGGCTCTG 2880 
CTCTGCAGCA CACCCGTGGG TGACCCCTCA CCCCAGAAGC AGCAGTGGCA GCTTGGGAAA 2940 
TGTGAGGAAG GGAAGGAGGG AGAGACGGGA GGGAGGAGAG AGAGGAGAAG GGAGGCAGGG 3000 
GAGGGGCAGC AGAACCAAGG CAAATATTTC AGCTGGGCTA TACCCCTCTC CCCATCCCTG 3060 
TTATAGAAGC TTAGAGAGCC AGCCAGCAAT GGAACCTTCT GGTTCCTGCG CCAATCGCCA 3120 
CCAGTATCAA TTGTGTGAGC TTGGGTGCGA GTGCACGCGT GCGTGAGTAC GGAGAGTATA 3180 
TATAGATCTC TATCTCTTAG CAAAGGTGAA TGCCAGATGT AAATGGCGCC TCTGGGCAAA 3240 
GGAGGCTTGT ATTTTGCACA TTTTATAAAA ACTTGAGAGA ATGAGATTTC TGCTTGTATA 3300 
TTTCTAAAAA GAGGAAGGAG CCCAAACCAT CCTCTCCTTA CCACTCCCAT CCCTGTGAGC 3360 
CCTACCTTAC CCCTCTGCCC CTAGCCAAGG AGTGTGAATT TATAGATCTA ACTTTCATAG 3420 
GCAAAACAAA AGCTTCGAGC TGTTGCGTGT GTGAGTCTGT TGTGTGGATG TGCGTGTGTG 3480 
GTCCCCAGCC CCAGACTGGA TTGGAAAAGT GCATGGTGGG GGCCTCGGGG CTGTCCCCAC 3540 
GCTGTCCCTT TGCCACAAGT CTGTGGGGCA AGAGGCTGCA ATATTCCGTC CTGGGTGTCT 3600 
GGGCTGCTAA CCTGGCCTGC TCAGGCTTCC CACCCTGTGC GGGGCACACC CCCAGGAAGG 3660 
GACCCTGGAC ACGGCTCCCA CGTCCAGGCT TAAGGTGGAT GCACTTCCCG CACCTCCAGT 3720 
CTTCTGTGTA GCAGCTTTAA CCCACGTTTG TCTGTCACGT CCAGTCCCGA GACGGCTGAG 3780 
TGACCCCAAG AAAGGCTTCC CCGACACCCA GACAGAGGCT GCAGGGCTGG GGCTGGGTGA 3840 
GGGTGGCGGG CCTGCGGGGA CATTCTACTG TGCTAAAAAG CCACTGCAGA CATAGCAATA 3900 
AAAACATGTC ATTTTCC 

Seq ID NO: 67 Protein sequence: 
Protein Accession #: NP_005620.1 

1 11 21 31 41 51 

I I I I I I 

MAKKSAENGI YSVSGDEKKG PLIAPGPDGA PAKGDGPVGL GTPGGRLAVP PRETWTRQMD 60 

FIMSCVGFAV GLGNVWRFPY LCYKNGGGVF LIPYVLIALV GGIPIFFLEI SLGQFMKAGS 120 

INVWNICPLF KGLGYASMVI VFYCNTYYIM VLAWGFYYLV KSFTTTLPWA TCGHTWNTPD 180 

CVEIFRHEDC ANASLANLTC DQLADRRSPV IEFWENKVLR LSGGLEVPGA LNWEVTLCLL 240 

ACWVLVYFCV WKGVKSTGKI VYFTATFPYV VLVVLLVRGV LLPGALDGII YYLKPDWSKL 300 

GSPQVWIDAG TQIFFSYAIG LGALTALGSY NRFNNNCYKD AIILALINSG TSFFAGFVVF 360 

SILGFMAAEQ GVHISKVAES GPGLAFIAYP RAVTLMPVAP LWAALFFFML LLLGLDSQFV 420 

GVEGFITGLL DLLPASYYFR FQREISVALC CALCFVIDLS MVTDGGMYVF QLFDYYSASG 480 

TTLLWQAFWE CVVVAWVYGA DRFMDDIACM IGYRPCPWMK WCWSFFTPLV CMGIFIFNVV 540 

YYEPLVYNNT YVYPWWGEAM GWAFALSSML CVPLHLLGCL LRAKGTMAER WQHLTQPIWG 600 
LHHLEYRAQD ADVRGLTTLT PVSESSKVVV VESVM 



Seq ID NO: 68 DNA sequence 

Nucleic Acid Accession #: NM_021953.1 

Coding sequence: 178-2469 

1 11 21 31 41 51 

I I I I I I 

GGCACGAGGG GGACCCGGCC GGTCCGGCGC GAGCCCCCGT CCGGGGCCCT GGCTCGGCCC 60 

CCAGGTTGGA GGAGCCCGGA GCCCGCCTTC GGAGCTACGG CCTAACGGCG GCGGCGACTG 120 

CAGTCTGGAG GGTCCACACT TGTGATTCTC AATGGAGAGT GAAAACGCAG ATTCATAATG 180 

AAAGCTAGCC CCCGTCGGCC ACTGATTCTC AAAAGACGGA GGCTGCCCCT TCCTGTTCAA 240 

AATGCCCCAA GTGAAACATC AGAGGAGGAA CCTAAGAGAT CCCCTGCCCA ACAGGAGTCT 300 

AATCAAGCAG AGGCCTCCAA GGAAGTGGCG GAGTCCAACT CTTGCAAGTT TCCAGCTGGG 360 

ATCAAGATTA TTAACCACCC CACCATGCCC AACACGCAAG TAGTGGCCAT CCCCAACAAT 420 

GCTAATATTC ACAGCATCAT CACAGCACTG ACTGCCAAGG GAAAAGAGAG TGGCAGTAGT 480 

GGGCCCAACA AATTCATCCT CATCAGCTGT GGGGGAGCCC CAACTCAGCC TCCAGGACTC 540 

CGGCCTCAAA CCCAAACCAG CTATGATGCC AAAAGGACAG AAGTGACCCT GGAGACCTTG 600 

GGACCAAAAC CTGCAGCTAG GGATGTGAAT CTTCCTAGAC CACCTGGAGC CCTTTGCGAG 660 

CAGAAACGGG AGACCTGTGC AGATGGTGAG GCAGCAGGCT GCACTATCAA CAATAGCCTA 720 

TCCAACATCC AGTGGCTTCG AAAGATGAGT TCTGATGGAC TGGGCTCCCG CAGCATCAAG 780 

CAAGAGATGG AGGAAAAGGA GAATTGTCAC CTGGAGCAGC GACAGGTTAA GGTTGAGGAG 840 

CCTTCGAGAC CATCAGCGTC CTGGCAGAAC TCTGTGTCTG AGCGGCCACC CTACTCTTAC 900 

ATGGCCATGA TACAATTCGC CATCAACAGC ACTGAGAGGA AGCGCATGAC TTTGAAAGAC 960 

ATCTATACGT GGATTGAGGA CCACTTTCCC TACTTTAAGC ACATTGCCAA GCCAGGCTGG 1020 

AAGAACTCCA TCCGCCACAA CCTTTCCCTG CACGACATGT TTGTCCGGGA GACGTCTGCC 1080 

AATGGCAAGG TCTCCTTCTG GACCATTCAC CCCAGTGCCA ACCGCTACTT GACATTGGAC 1140 

CAGGTGTTTA AGCCACTGGA CCCAGGGTCT CCACAATTGC CCGAGCACTT GGAATCACAG 1200 

CAGAAACGAC CGAATCCAGA GCTCCGCCGG AACATGACCA TCAAAACCGA ACTCCCCCTG 1260 

GGCGCACGGC GGAAGATGAA GCCACTGCTA CCACGGGTCA GCTCATACCT GGTACCTATC 1320 

CAGTTCCCGG TGAACCAGTC ACTGGTGTTG CAGCCCTCGG TGAAGGTGCC ATTGCCCCTG 1380 

GCGGCTTCCC TCATGAGCTC AGAGCTTGCC CGCCATAGCA AGCGAGTCCG CATTGCCCCC 1440 

AAGGTGCTGC TAGCTGAGGA GGGGATAGCT CCTCTTTCTT CTGCAGGACC AGGGAAAGAG 1500 

GAGAAACTCC TGTTTGGAGA AGGGTTTTCT CCTTTGCTTC CAGTTCAGAC TATCAAGGAG 1560 

GAAGAAATCC AGCCTGGGGA GGAAATGCCA CACTTAGCGA GACCCATCAA AGTGGAGAGC 1620 

CCTCCCTTGG AAGAGTGGCC CTCCCCGGCC CCATCTTTCA AAGAGGAATC ATCTCACTCC 1680 

TGGGAGGATT CGTCCCAATC TCCCACCCCA AGACCCAAGA AGTCCTACAG TGGGCTTAGG 1740 

TCCCCAACCC GGTGTGTCTC GGAAATGCTT GTGATTCAAC ACAGGGAGAG GAGGGAGAGG 1800 

AGCCGGTCTC GGAGGAAACA GCATCTACTG CCTCCCTGTG TGGATGAGCC GGAGCTGCTC 1860 

TTCTCAGAGG GGCCCAGTAC TTCCCGCTGG GCCGCAGAGC TCCCGTTCCC AGCAGACTCC 1920 

TCTGACCCTG CCTCCCAGCT CAGCTACTCC CAGGAAGTGG GAGGACCTTT TAAGACACCC 1980 

ATTAAGGAAA CGCTGCCCAT CTCCTCCACC CCGAGCAAAT CTGTCCTCCC CAGAACCCCT 2040 

GAATCCTGGA GGCTCACGCC CCCAGCCAAA GTAGGGGGAC TGGATTTCAG CCCAGTACAA 2100 

ACCTCCCAGG GTGCCTCTGA CCCCTTGCCT GACCCCCTGG GGCTGATGGA TCTCAGCACC 2160 

ACTCCCTTGC AAAGTGCTCC CCCCCTTGAA TCACCGCAAA GGCTCCTCAG TTCAGAACCC 2220 

TTAGACCTCA TCTCCGTCCC CTTTGGCAAC TCTTCTCCCT CAGATATAGA CGTCCCCAAG 2280 

CCAGGCTCCC CGGAGCCACA GGTTTCTGGC CTTGCAGCCA ATCGTTCTCT GACAGAAGGC 2340 

CTGGTCCTGG ACACAATGAA TGACAGCCTC AGCAAGATCC TGCTGGACAT CAGCTTTCCT 2400 

GGCCTGGACG AGGACCCACT GGGCCCTGAC AACATCAACT GGTCCCAGTT TATTCCTGAG 2460 

CTACAGTAGA GCCCTGCCCT TGCCCCTGTG CTCAAGCTGT CCACCATCCC GGGCACTCCA 2520 

AGGCTCAGTG CACCCCAAGC CTCTGAGTGA GGACAGCAGG CAGGGACTGT TCTGCTCCTC 2580 

ATAGCTCCCT GCTGCCTGAT TATGCAAAAG TAGCAGTCAC ACCCTAGCCA CTGCTGGGAC 2640 

CTTGTGTTCC CCAAGAGTAT CTGATTCCTC TGCTGTCCCT GCCAGGAGCT GAAGGGTGGG 2700 

AACAACAAAG GCAATGGTGA AAAGAGATTA GGAACCCCCC AGCCTGTTTC CATTCTCTGC 2760 

CCAGCAGTCT CTTACCTTCC CTGATCTTTG CAGGGTGGTC CGTGTAAATA GTATAAATTC 2820 

TCCAAATTAT CCTCTAATTA TAAATGTAAG CTTATTTCCT TAGATCATTA TCCAGAGACT 2880 

GCCAGAAGGT GGGTAGGATG ACCTGGGGTT TCAATTGACT TCTGTTCCTT GCTTTTAGTT 2940 

TTGATAGAAG GGAAGACCTG CAGTGCACGG TTTCTTCCAG GCTGAGGTAC CTGGATCTTG 3000 

GGTTCTTCAC TGCAGGGACC CAGACAAGTG GATCTGCTTG CCAGAGTCCT TTTTGCCCCT 3060 

CCCTGCCACC TCCCCGTGTT TCCAAGTCAG CTTTCCTGCA AGAAGAAATC CTGGTTAAAA 3120 

AAGTCTTTTG TATTGGGTCA GGAGTTGAAT TTGGGGTGGG AGGATGGATG CAACTGAAGC 3180 

AGAGTGTGGG TGCCCAGATG TGCGCTATTA GATGTTTCTC TGATAATGTC CCCAATCATA 3240 

CCAGGGAGAC TGGCATTGAC GAGAACTCAG GTGGAGGCTT GAGAAGGCCG AAAGGGCCCC 3300 

TGACCTGCCT GGCTTCCTTA GCTTGCCCCT CAGCTTTGCA AAGAGCCACC CTAGGCCCCA 3360 

GCTGACCGCA TGGGTGTGAG CCAGCTTGAG AACACTAACT ACTCAATAAA AGCGAAGGTG 3420 
GACCNAAAAA AAAAAAAAAA AAAA 
Seq ID NO: 69 Protein sequence: 
Protein Accession #: NP_068772.1 

1 11 21 31 41 51 

I I I I I I 

MKASPRRPLI LKRRRLPLPV QNAPSETSEE EPKRSPAQQE SNQAEASKEV AESNSCKFPA 60 

GIKIINHPTM PNTQVVAIPN NANIHSIITA LTAKGKESGS SGPNKFILIS CGGAPTQPPG 120 

LRPQTQTSYD AKRTEVTLET LGPKPAARDV NLPRPPGALC EQKRETCADG EAAGCTINNS 180 

LSNIQWLRKM SSDGLGSRSI KQEMEEKENC HLEQRQVKVE EPSRPSASWQ NSVSERPPYS 240 

YMAMIQFAIN STERKRMTLK DIYTWIEDHF PYFKHIAKPG WKNSIRHNLS LHDMFVRETS 300 

ANGKVSFWTI HPSANRYLTL DQVFKPLDPG SPQLPEHLES QQKRPNPELR RNMTIKTELP 360 

LGARRKMKPL LPRVSSYLVP IQFPVNQSLV LQPSVKVPLP LAASLMSSEL ARHSKRVRIA 420 

PKVLLAEEGI APLSSAGPGK EEKLLFGEGF SPLLPVQTIK EEEIQPGEEM PHLARPIKVE 480 

SPPLEEWPSP APSFKEESSH SWEDSSQSPT PRPKKSYSGL RSPTRCVSEM LVIQHRERRE 540 

RSRSRRKQHL LPPCVDEPEL LFSEGPSTSR WAAELPFPAD SSDPASQLSY SQEVGGPFKT 600 

PIKETLPISS TPSKSVLPRT PESWRLTPPA KVGGLDFSPV QTSQGASDPL PDPLGLMDLS 660 

TTPLQSAPPL ESPQRLLSSE PLDLISVPFG NSSPSDIDVP KPGSPEPQVS GLAANRSLTE 720 
GLVLDTMNDS LSKILLDISF PGLDEDPLGP DNINWSQFIP ELQ 



Seq ID NO: 70 DNA sequence 

Nucleic Acid Accession #: BC006529.1 

Coding sequence: 178-2424 

1 11 21 31 41 51 

I I I I I I 

GGCACGAGGG GGACCCGGCC GGTCCGGCGC GAGCCCCCGT CCGGGGCCCT GGCTCGGCCC 60 

CCAGGTTGGA GGAGCCCGGA GCCCGCCTTC GGAGCTACGG CCTAACGGCG GCGGCGACTG 120 

CAGTCTGGAG GGTCCACACT TGTGATTCTC AATGGAGAGT GAAAACGCAG ATTCATAATG 180 

AAAACTAGCC CCCGTCGGCC ACTGATTCTC AAAAGACGGA GGCTGCCCCT TCCTGTTCAA 240 

AATGCCCCAA GTGAAACATC AGAGGAGGAA CCTAAGAGAT CCCCTGCCCA ACAGGAGTCT 300 

AATCAAGCAG AGGCCTCCAA GGAAGTGGCA GAGTCCAACT CTTGCAAGTT TCCAGCTGGG 360 

ATCAAGATTA TTAACCACCC CACCATGCCC AACACGCAAG TAGTGGCCAT CCCCAACAAT 420 

GCTAATATTC ACAGCATCAT CACAGCACTG ACTGCCAAGG GAAAAGAGAG TGGCAGTAGT 480 

GGGCCCAACA AATTCATCCT CATCAGCTGT GGGGGAGCCC CAACTCAGCC TCCAGGACTC 540 

CGGCCTCAAA CCCAAACCAG CTATGATGCC AAAAGGACAG AAGTGACCCT GGAGACCTTG 600 

GGACCAAAAC CTGCAGCTAG GGATGTGAAT CTTCCTAGAC CACCTGGAGC CCTTTGCGAG 660 

CAGAAACGGG AGACCTGTGC AGATGGTGAG GCAGCAGGCT GCACTATCAA CAATAGCCTA 720 

TCCAACATCC AGTGGCTTCG AAAGATGAGT TCTGATGGAC TGGGCTCCCG CAGCATCAAG 780 

CAAGAGATGG AGGAAAAGGA GAATTGTCAC CTGGAGCAGC GACAGGTTAA GGTTGAGGAG 840 

CCTTCGAGAC CATCAGCGTC CTGGCAGAAC TCTGTGTCTG AGCGGCCACC CTACTCTTAC 900 

ATGGCCATGA TACAATTCGC CATCAACAGC ACTGAGAGGA AGCGCATGAC TTTGAAAGAC 960 

ATCTATACGT GGATTGAGGA CCACTTTCCC TACTTTAAGC ACATTGCCAA GCCAGGCTGG 1020 

AAGAACTCCA TCCGCCACAA CCTTTCCCTG CACGACATGT TTGTCCGGGA GACGTCTGCC 1080 

AATGGCAAGG TCTCCTTCTG GACCATTCAC CCCAGTGCCA ACCGCTACTT GACATTGGAC 1140 

CAGGTGTTTA AGCAGCAGAA ACGACCGAAT CCAGAGCTCC GCCGGAACAT GACCATCAAA 1200 

ACCGAACTCC CCCTGGGCGC ACGGCGGAAG ATGAAGCCAC TGCTACCACG GGTCAGCTCA 1260 

TACCTGGTAC CTATCCAGTT CCCGGTGAAC CAGTCACTGG TGTTGCAGCC CTCGGTGAAG 1320 

GTGCCATTGC CCCTGGCGGC TTCCCTCATG AGCTCAGAGC TTGCCCGCCA TAGCAAGCGA 1380 

GTCCGCATTG CCCCCAAGGT GCTGCTAGCT GAGGAGGGGA TAGCTCCTCT TTCTTCTGCA 1440 

GGACCAGGGA AAGAGGAGAA ACTCCTGTTT GGAGAAGGGT TTTCTCCTTT GCTTCCAGTT 1500 

CAGACTATCA AGGAGGAAGA AATCCAGCCT GGGGAGGAAA TGCCACACTT AGCGAGACCC 1560 

ATCAAAGTGG AGAGCCCTCC CTTGGAAGAG TGGCCCTCCC CGGCCCCATC TTTCAAAGAG 1620 

GAATCATCTC ACTCCTGGGA GGATTCGTCC CAATCTCCCA CCCCAAGACC CAAGAAGTCC 1680 

TACAGTGGGC TTAGGTCCCC AACCCGGTGT GTCTCGGAAA TGCTTGTGAT TCAACACAGG 1740 

GAGAGGAGGG AGAGGAGCCG GTCTCGGAGG AAACAGCATC TACTGCCTCC CTGTGTGGAT 1800 

GAGCCGGAGC TGCTCTTCTC AGAGGGGCCC AGTACTTCCC GCTGGGCCGC AGAGCTCCCG 1860 

TTCCCAGCAG ACTCCTCTGA CCCTGCCTCC CAGCTCAGCT ACTCCCAGGA AGTGGGAGGA 1920 

CCTTTTAAGA CACCCATTAA GGAAACGCTG CCCATCTCCT CCACCCCGAG CAAATCTGTC 1980 

CTCCCCAGAA CCCCTGAATC CTGGAGGCTC ACGCCCCCAG CCAAAGTAGG GGGACTGGAT 2040 

TTCAGCCCAG TACAAACCCC CCAGGGTGCC TCTGACCCCT TGCCTGACCC CCTGGGGCTG 2100 

ATGGATCTCA GCACCACTCC CTTGCAAAGT GCTCCCCCCC TTGAATCACC GCAAAGGCTC 2160 

CTCAGTTCAG AACCCTTAGA CCTCATCTCC GTCCCCTTTG GCAACTCTTC TCCCTCAGAT 2220 

ATAGACGTCC CCAAGCCAGG CTCCCCGGAG CCACAGGTTT CTGGCCTTGC AGCCAATCGT 2280 

TCTCTGACAG AAGGCCTGGT CCTGGACACA ATGAATGACA GCCTCAGCAA GATCCTGCTG 2340 

GACATCAGCT TTCCTGGCCT GGACGAGGAC CCACTGGGCC CTGACAACAT CAACTGGTCC 2400 

CAGTTTATTC CTGAGCTACA GTAGAGCCCT GCCCTTGCCC CTGTGCTCAA GCTGTCCACC 2460 

ATCCCGGGCA CTCCAAGGCT CAGTGCACCC CAAGCCTCTG AGTGAGGACA GCAGGCAGGG 2520 

ACTGTTCTGC TCCTCATAGC TCCCTGCTGC CTGATTATGC AAAAGTAGCA GTCACACCCT 2580 

AGCCACTGCT GGGACCTTGT GTTCCCCAAG AGTATCTGAT TCCTCTGCTG TCCCTGCCAG 2640 

GAGCTGAAGG GTGGGAACAA CAAAGGCAAT GGTGAAAAGA GATTAGGAAC CCCCCAGCCT 2700 

GTTTCCATTC TCTGCCCAGC AGTCTCTTAC CTTCCCTGAT CTTTGCAGGG TGGTCCGTGT 2760 

AAATAGTATA AATTCTCCAA ATTATCCTCT AATTATAAAT GTAAGCTTAT TTCCTTAGAT 2820 

CATTATCCAG AGACTGCCAG AAGGTGGGTA GGATGACCTG GGGTTTCAAT TGACTTCTGT 2880 

TCCTTGCTTT TAGTTTTGAT AGAAGGGAAG ACCTGCAGTG CACGGTTTCT TCCAGGCTGA 2940 

GGTACCTGGA TCTTGGGTTC TTCACTGCAG GGACCCAGAC AAGTGGATCT GCTTGCCAGA 3000 

GTCCTTTTTG CCCCTCCCTG CCACCTCCCC GTGTTTCCAA GTCAGCTTTC CTGCAAGAAG 3060 

AAATCCTGGT TAAAAAAGTC TTTTGTATTG GGTCAGGAGT TGAATTTGGG GTGGGAGGAT 3120 

GGATGCAACT GAAGCAGAGT GTGGGTGCCC AGATGTGCGC TATTAGATGT TTCTCTGATA 3180 

ATGTCCCCAA TCATACCAGG GAGACTGGCA TTGACGAGAA CTCAGGTGGA GGCTTGAGAA 3240 

GGCCGAAAGG GCCCCTGACC TGCCTGGCTT CCTTAGCTTG CCCCTCAGCT TTGCAAAGAG 3300 

CCACCCTAGG CCCCAGCTGA CCGCATGGGT GTGAGCCAGC TTGAGAACAC TAACTACTCA 3360 
ATAAAAGCGA AGGTGGAAAA AAAAAAAAAA AAAAAAA 

Seq ID NO: 71 Protein sequence: 
Protein Accession #: AAH06529.1 

1 11 21 31 41 51 

I I I I I I 

MKTSPRRPLI LKRRRLPLPV QNAPSETSEE EPKRSPAQQE SNQAEASKEV AESNSCKFPA 60 

GIKIINHPTM PNTQVVAIPN NANIHSIITA LTAKGKESGS SGPNKFILIS CGGAPTQPPG 120 

LRPQTQTSYD AKRTEVTLET LGPKPAARDV NLPRPPGALC EQKRETCADG EAAGCTINNS 180 

LSNIQWLRKM SSDGLGSRSI KQEMEEKENC HLEQRQVKVE EPSRPSASWQ NSVSERPPYS 240 

YMAMIQFAIN STERKRMTLK DIYTWIEDHF PYFKHIAKPG WKNSIRHNLS LHDMFVRETS 300 

ANGKVSFWTI HPSANRYLTL DQVFKQQKRP NPELRRNMTI KTELPLGARR KMKPLLPRVS 360 

SYLVPIQFPV NQSLVLQPSV KVPLPLAASL MSSELARHSK RVRIAPKVLL AEEGIAPLSS 420 

AGPGKEEKLL FGEGFSPLLP VQTIKEEEIQ PGEEMPHLAR PIKVESPPLE EWPSPAPSFK 480 

EESSHSWEDS SQSPTPRPKK SYSGLRSPTR CVSEMLVIQH RERRERSRSR RKQHLLPPCV 540 

DEPELLFSEG PSTSRWAAEL PFPADSSDPA SQLSYSQEVG GPFKTPIKET LPISSTPSKS 600 

VLPRTPESWR LTPPAKVGGL DFSPVQTPQG ASDPLPDPLG LMDLSTTPLQ SAPPLESPQR 660 

LLSSEPLDLI SVPFGNSSPS DIDVPKPGSP EPQVSGLAAN RSLTEGLVLD TMNDSLSKIL 720 
LDISFPGLDE DPLGPDNINW SQFIPELQ 



Seq ID NO: 72 DNA sequence 
Nucleic Acid Accession #: U74612.1 
Coding sequence: 178-2583 

1 11 21 31 41 51 

I I I I I I 

GGCACGAGGG GGACCCGGCC GGTCCGGCGC GAGCCCCCGT CCGGGGCCCT GGCTCGGCCC 60 

CCAGGTTGGA GGAGCCCGGA GCCCGCCTTC GGAGCTACGG CCTAACGGCG GCGGCGACTG 120 

CAGTCTGGAG GGTCCACACT TGTGATTCTC AATGGAGAGT GAAAACGCAG ATTCATAATG 180 

AAAACTAGCC CCCGTCGGCC ACTGATTCTC AAAAGACGGA GGCTGCCCCT TCCTGTTCAA 240 

AATGCCCCAA GTGAAACATC AGAGGAGGAA CCTAAGAGAT CCCCTGCCCA ACAGGAGTCT 300 

AATCAAGCAG AGGCCTCCAA GGAAGTGGCA GAGTCCAACT CTTGCAAGTT TCCAGCTGGG 360 

ATCAAGATTA TTAACCACCC CACCATGCCC AACACGCAAG TAGTGGCCAT CCCCAACAAT 420 

GCTAATATTC ACAGCATCAT CACAGCACTG ACTGCCAAGG GAAAAGAGAG TGGCAGTAGT 480 

GGGCCCAACA AATTCATCCT CATCAGCTGT GGGGGAGCCC CAACTCAGCC TCCAGGACTC 540 

CGGCCTCAAA CCCAAACCAG CTATGATGCC AAAAGGACAG AAGTGACCCT GGAGACCTTG 600 

GGACCAAAAC CTGCAGCTAG GGATGTGAAT CTTCCTAGAC CACCTGGAGC CCTTTGCGAG 660 

CAGAAACGGG AGACCTGTGC AGATGGTGAG GCAGCAGGCT GCACTATCAA CAATAGCCTA 720 

TCCAACATCC AGTGGCTTCG AAAGATGAGT TCTGATGGAC TGGGCTCCCG CAGCATCAAG 780 

CAAGAGATGG AGGAAAAGGA GAATTGTCAC CTGGAGCAGC GACAGGTTAA GGTTGAGGAG 840 

CCTTCGAGAC CATCAGCGTC CTGGCAGAAC TCTGTGTCTG AGCGGCCACC CTACTCTTAC 900 

ATGGCCATGA TACAATTCGC CATCAACAGC ACTGAGAGGA AGCGCATGAC TTTGAAAGAC 960 

ATCTATACGT GGATTGAGGA CCACTTTCCC TACTTTAAGC ACATTGCCAA GCCAGGCTGG 1020 

AAGAACTCCA TCCGCCACAA CCTTTCCCTG CACGACATGT TTGTCCGGGA GACGTCTGCC 1080 

AATGGCAAGG TCTCCTTCTG GACCATTCAC CCCAGTGCCA ACCGCTACTT GACATTGGAC 1140 

CAGGTGTTTA AGCCACTGGA CCCAGGGTCT CCACAATTGC CCGAGCACTT GGAATCACAG 1200 

CAGAAACGAC CGAATCCAGA GCTCCGCCGG AACATGACCA TCAAAACCGA ACTCCCCCTG 1260 

GGCGCACGGC GGAAGATGAA GCCACTGCTA CCACGGGTCA GCTCATACCT GGTACCTATC 1320 

CAGTTCCCGG TGAACCAGTC ACTGGTGTTG CAGCCCTCGG TGAAGGTGCC ATTGCCCCTG 1380 

GCGGCTTCCC TCATGAGCTC AGAGCTTGCC CGCCATAGCA AGCGAGTCCG CATTGCCCCC 1440 

AAGGTTTTTG GGGAACAGGT GGTGTTTGGT TACATGAGTA AGTTCTTTAG TGGCGATCTG 1500 

CGAGATTTTG GTACACCCAT CACCAGCTTG TTTAATTTTA TCTTTCTTTG TTTATCAGTG 1560 

CTGCTAGCTG AGGAGGGGAT AGCTCCTCTT TCTTCTGCAG GACCAGGGAA AGAGGAGAAA 1620 

CTCCTGTTTG GAGAAGGGTT TTCTCCTTTG CTTCCAGTTC AGACTATCAA GGAGGAAGAA 1680 

ATCCAGCCTG GGGAGGAAAT GCCACACTTA GCGAGACCCA TCAAAGTGGA GAGCCCTCCC 1740 

TTGGAAGAGT GGCCCTCCCC GGCCCCATCT TTCAAAGAGG AATCATCTCA CTCCTGGGAG 1800 

GATTCGTCCC AATCTCCCAC CCCAAGACCC AAGAAGTCCT ACAGTGGGCT TAGGTCCCCA 1860 

ACCCGGTGTG TCTCGGAAAT GCTTGTGATT CAACACAGGG AGAGGAGGGA GAGGAGCCGG 1920 

TCTCGGAGGA AACAGCATCT ACTGCCTCCC TGTGTGGATG AGCCGGAGCT GCTCTTCTCA 1980 

GAGGGGCCCA GTACTTCCCG CTGGGCCGCA GAGCTCCCGT TCCCAGCAGA CTCCTCTGAC 2040 

CCTGCCTCCC AGCTCAGCTA CTCCCAGGAA GTGGGAGGAC CTTTTAAGAC ACCCATTAAG 2100 

GAAACGCTGC CCATCTCCTC CACCCCGAGC AAATCTGTCC TCCCCAGAAC CCCTGAATCC 2160 

TGGAGGCTCA CGCCCCCAGC CAAAGTAGGG GGACTGGATT TCAGCCCAGT ACAAACCTCC 2220 

CAGGGTGCCT CTGACCCCTT GCCTGACCCC CTGGGGCTGA TGGATCTCAG CACCACTCCC 2280 

TTGCAAAGTG CTCCCCCCCT TGAATCACCG CAAAGGCTCC TCAGTTCAGA ACCCTTAGAC 2340 

CTCATCTCCG TCCCCTTTGG CAACTCTTCT CCCTCAGATA TAGACGTCCC CAAGCCAGGC 2400 

TCCCCGGAGC CACAGGTTTC TGGCCTTGCA GCCAATCGTT CTCTGACAGA AGGCCTGGTC 2460 

CTGGACACAA TGAATGACAG CCTCAGCAAG ATCCTGCTGG ACATCAGCTT TCCTGGCCTG 2520 

GACGAGGACC CACTGGGCCC TGACAACATC AACTGGTCCC AGTTTATTCC TGAGCTACAG 2580 

TAGAGCCCTG CCCTTGCCCC TGTGCTCAAG CTGTCCACCA TCCCGGGCAC TCCAAGGCTC 2640 

AGTGCACCCC AAGCCTCTGA GTGAGGACAG CAGGCAGGGA CTGTTCTGCT CCTCATAGCT 2700 

CCCTGCTGCC TGATTATGCA AAAGTAGCAG TCACACCCTA GCCACTGCTG GGACCTTGTG 2760 

TTCCCCAAGA GTATCTGATT CCTCTGCTGT CCCTGCCAGG AGCTGAAGGG TGGGAACAAC 2820 

AAAGGCAATG GTGAAAAGAG ATTAGGAACC CCCCAGCCTG TTTCCATTCT CTGCCCAGCA 2880 

GTCTCTTACC TTCCCTGATC TTTGCAGGGT GGTCCGTGTA AATAGTATAA ATTCTCCAAA 2940 

TTATCCTCTA ATTATAAATG TAAGCTTATT TCCTTAGATC ATTATCCAGA GACTGCCAGA 3000 

AGGTGGGTAG GATGACCTGG GGTTTCAATT GACTTCTGTT CCTTGCTTTT AGTTTTGATA 3060 

GAAGGGAAGA CCTGCAGTGC ACGGTTTCTT CCAGGCTGAG GTACCTGGAT CTTGGGTTCT 3120 

TCACTGCAGG GACCCAGACA AGTGGATCTG CTTGCCAGAG TCCTTTTTGC CCCTCCCTGC 3180 

CACCTCCCCG TGTTTCCAAG TCAGCTTTCC TGCAAGAAGA AATCCTGGTT AAAAAAGTCT 3240 

TTTGTATTGG GTCAGGAGTT GAATTTGGGG TGGGAGGATG GATGCAACTG AAGCAGAGTG 3300 

TGGGTGCCCA GATGTGCGCT ATTAGATGTT TCTCTGATAA TGTCCCCAAT CATACCAGGG 3360 

AGACTGGCAT TGACGAGAAC TCAGGTGGAG GCTTGAGAAG GCCGAAAGGG CCCCTGACCT 3420 

GCCTGGCTTC CTTAGCTTGC CCCTCAGCTT TGCAAAGAGC CACCCTAGGC CCCAGCTGAC 3480 

CGCATGGGTG TGAGCCAGCT TGAGAACACT AACTACTCAA TAAAAGCGAA GGTGGACAAA 3540 
AAAAAAAAAA AAAAA 

Seq ID NO: 73 Protein sequence: 
Protein Accession #: AAC51128.1 

1 11 21 31 41 51 

I I I I I I 

MKTSPRRPLI LKRRRLPLPV QNAPSETSEE EPKRSPAQQE SNQAEASKEV AESNSCKFPA 60 

GIKIINHPTM PNTQVVAIPN NANIHSIITA LTAKGKESGS SGPNKFILIS CGGAPTQPPG 120 

LRPQTQTSYD AKRTEVTLET LGPKPAARDV NLPRPPGALC EQKRETCADG EAAGCTINNS 180 

LSNIQWLRKM SSDGLGSRSI KQEMEEKENC HLEQRQVKVE EPSRPSASWQ NSVSERPPYS 240 

YMAMIQFAIN STERKRMTLK DIYTWIEDHF PYFKHIAKPG WKNSIRHNLS LHDMFVRETS 300 

ANGKVSFWTI HPSANRYLTL DQVFKPLDPG SPQLPEHLES QQKRPNPELR RNMTIKTELP 360 

LGARRKMKPL LPRVSSYLVP IQFPVNQSLV LQPSVKVPLP LAASLMSSEL ARHSKRVRIA 420 

PKVFGEQVVF GYMSKFFSGD LRDFGTPITS LFNFIFLCLS VLLAEEGIAP LSSAGPGKEE 480 

KLLFGEGFSP LLPVQTIKEE EIQPGEEMPH LARPIKVESP PLEEWPSPAP SFKEESSHSW 540 

EDSSQSPTPR PKKSYSGLRS PTRCVSEMLV IQHRERRERS RSRRKQHLLP PCVDEPELLF 600 

SEGPSTSRWA AELPFPADSS DPASQLSYSQ EVGGPFKTPI KETLPISSTP SKSVLPRTPE 660 

SWRLTPPAKV GGLDFSPVQT SQGASDPLPD PLGLMDLSTT PLQSAPPLES PQRLLSSEPL 720 

DLISVPFGNS SPSDIDVPKP GSPEPQVSGL AANRSLTEGL VLDTMNDSLS KILLDISFPG 780 
LDEDPLGPDN INWSQFIPEL Q 

Seq ID NO: 74 DNA sequence 
Nucleic Acid Accession #: Eos sequence 
Coding sequence: 111-416 

1 11 21 31 41 51 

I I I I I I 

GGGAAGAGCC AGGCTGAGCC TTATAAAGGA CTGCTCTTTG TCCAAACACA CACATCTCAC 60 
TCATCCTTCT ACTCGTGACG CTTCCCAGCT CTGGCTTTTT GAAAGCAAAG ATGAGCAACA 120 
CTCAAGCTGA GAGGTCCATA ATAGGCATGA TCGACATGTT TCACAAATAC ACCAGACGTG 180 
ATGACAAGAT TGAGAAGCCA AGCCTGCTGA CGATGATGAA GGAGAACTTC CCCAACTTCC 240 
TTAGTGCCTG TGACAAAAAG GGCACAAATT ACCTCGCCGA TGTCTTTGAG AAAAAGGACA 300 

AGAATGAGGA TAAGAAGATT GATTTTTCTG AGTTTCTGTC CTTGCTGGGA GACATAGCCA 360 
CAGACTACCA CAAGCAGAGC CATGGAGCAG CGCCCTGTTC CGGGGGCAGC CAGTGACCCA 420 
GCCCCACCAA TGGGCCTCCA GAGACCCCAG GAACAATAAA ATGTCTTCTC CCACCAGA 

Seq ID NO: 75 Protein sequence: 
Protein Accession #: Eos sequence 

1 11 21 31 41 51 

I I I I I I 

MSNTQAERSI IGMIDMFHKY TRRDDKIEKP SLLTMMKENF PNFLSACDKK GTNYLADVFE 60 
KKDKNEDKKI DFSEFLSLLG DIATDYHKQS HGAAPCSGGS Q 

Seq ID NO: 76 DNA sequence 
Nucleic Acid Accession #: Eos sequence 
Coding sequence: 111-416 

1 11 21 31 41 51 

I I I I I I 

GGGAAGAGCC AGGCTGAGCC TTATAAAGGA CTGCTCTTTG TCCAAACACA CACATCTCAC 60 

TCATCCTTCT ACTCGTGACA CTTCCCAGTT CTGGCTTTTT GAAAGCAAAG ATGAGCAACA 120 

CTCAAGCTGA GAGGTCCATA ATAGGCATGA TCGACATGTT TCACAAATAC ACCGGACGTG 180 

ATGGCAAGAT TGAGAAGCCA AGCCTGCTGA CGATGATGAA GGAGAACTTC CCCAATTTCC 240 

TCAGTGCCTG TGACAAAAAG GGCATACATT ACCTCGCCAC TGTCTTTGAG AAAAAGGACA 300 

AGAATGAGGA TAAGAAGATT GATTTTTCTG AGTTTCTGTC CTTGCTGGGA GACATAGCCG 360 

CAGACTACCA CAAGCAGAGC CATGGAGCGG CGCCCTGTTC TGGGGGAAGC CAGTGATCCA 420 
GCCCCACCAA GGGGCCTCCA GAGACCCCAG GAACAATAAG TGTCTCCTCC CACCAGA 






Seq ID NO: 77 Protein sequence: 
Protein Accession #: XP_048124.1 

1 11 21 31 41 51 

I I I I I I 

MSNTQAERSI IGMIDMFHKY TGRDGKIEKP SLLTMMKENF PNFLSACDKK GIHYLATVFE 60 
KKDKNEDKKI DFSEFLSLLG DIAADYHKQS HGAAPCSGGS Q 



Seq ID NO: 78 DNA sequence 
Nucleic Acid Accession #: Z73678.1 
Coding sequence: 253-2433 



1 11 21 31 41 51 

I I I I I I 

GGGGTGGTGC AGGGCAGGGG TGGTATATCC TGTCTGACGG AGGGCGGGCC TCGCCAGTGC 60 

CAGAGAGGGA CGAACCAGGG TGGAAGCGCC AGGAGCAGCT GCAGGGAGCC CTCACGCGGA 120 

CCTCGCACTC TATGGCCGTA GGGAGCCGCT GAGAGCGAGA AGAGCACGCT CCTGCCCGCC 180 

CGCTGCACCG CACCTCGCCT CGCCTCTCTG CTCTCCTAGG CCCCGGCCGC GCGCCACCCG 240 

CCTCCCGCCA CCATGAACCA CTCGCCGCTC AAGACCGCCT TGGCGTACGA ATGCTTCCAG 300 

GACCAGGACA ACTCCACGTT GGCTTTGCCG TCGGACCAAA AGATGAAAAC AGGCACGTCT 360 

GGCAGGCAGC GCGTGCAGGA GCAGGTGATG ATGACCGTCA AGCGGCAGAA GTCCAAGTCT 420 

TCCCAGTCGT CCACCCTGAG CCACTCCAAT CGAGGTTCCA TGTATGATGG CTTGGCTGAC 480 

AATTACAACT ATGGGACCAC CAGCAGGAGC AGCTACTACT CCAAGTTCCA GGCAGGGAAT 540 

GGCTCATGGG GATATCCGAT CTACAATGGA ACCCTCAAGC GGGAGCCTGA CAACAGGCGC 600 

TTCAGCTCCT ACAGCCAGAT GGAGAACTGG AGCCGGCACT ACCCCCGGGG CAGCTGTAAC 660 

ACCACCGGCG CAGGCAGCGA CATCTGCTTC ATGCAGAAAA TCAAGGCGAG CCGCAGTGAG 720 

CCCGACCTCT ACTGTGACCC ACGGGGCACC CTGCGCAAGG GCACGCTGGG CAGCAAGGGC 780 

CAGAAGACCA CCCAGAACCG CTACAGCTTT TACAGCACCT GCAGTGGTCA GAAGGCCATA 840 

AAGAAGTGCC CTGTGCGCCC GCCCTCTTGT GCCTCCAAGC AGGACCCTGT GTATATCCCG 900 
CCCATCTCCT GCAACAAGGA CCTGTCCTTT GGCCACTCTA GGGCCAGCTC CAAGATCTGC 960 

AGTGAGGACA TCGAGTGCAG TGGGCTGACC ATCCCCAAGG CTGTGCAGTA CCTGAGCTCC 1020 

CAGGATGAGA AGTACCAGGC CATTGGGGCC TATTACATCC AGCATACCTG CTTCCAGGAT 1080 

GAATCTGCCA AGCAACAGGT CTATCAGCTG GGAGGCATCT GCAAGCTGGT GGACCTCCTC 1140 

CGCAGCCCCA ACCAGAACGT CCAGCAGGCC GCGGCAGGGG CCCTGCGCAA CCTGGTGTTC 1200 

AGGAGCACCA CCAACAAGCT GGAGACCCGG AGGCAGAATG GGATCCGCGA GGCAGTCAGC 1260 

CTCCTGAGGA GAACCGGGAA CGCCGAGATC CAGAAGCAGC TGACTGGGCT GCTCTGGAAC 1320 

CTGTCTTCCA CTGACGAGCT GAAGGAGGAA CTCATTGCCG ACGCCCTGCC TGTTCTGGCC 1380 

GACCGCGTCA TCATTCCCTT CTCTGGCTGG TGCGATGGCA ATAGCAACAT GTCCCGGGAA 1440 

GTGGTGGACC CTGAGGTCTT CTTCAATGCC ACAGGCTGCT TGAGGAACCT GAGCTCGGCC 1500 

GATGCAGGCC GCCAGACCAT GCGTAACTAC TCAGGGCTCA TTGATTCCCT CATGGCCTAT 1560 

GTCCAGAACT GTGTAGCGGC CAGCCGCTGT GACGACAAGT CTGTGGAAAA CTGCATGTGT 1620 

GTTCTGCACA ACCTCTCCTA CCGCCTGGAC GCCGAGGTGC CCACCCGCTA CCGCCAGCTG 1680 

GAGTATAACG CCCGCAACGC CTACACCGAG AAGTCCTCCA CTGGCTGCTT CAGCAACAAG 1740 

AGCGACAAGA TGATGAACAA CAACTATGAC TGCCCCCTGC CTGAGGAAGA GACCAACCCC 1800 

AAGGGCAGCG GCTGGTTGTA CCATTCAGAT GCCATCCGCA CCTACCTGAA CCTCATGGGC 1860 

AAGAGCAAGA AAGATGCTAC CCTGGAGGCC TGTGCTGGTG CCCTGCAGAA CCTGACAGCC 1920 

AGCAAGGGGC TGATGTCCAG TGGCATGAGC CAGTTGATTG GGCTGAAGGA AAAGGGCCTG 1980 

CCACAAATTG CCCGCCTCCT GCAATCTGGC AACTCTGATG TGGTGCGGTC CGGAGCCTCC 2040 

CTCCTGAGCA ACATGTCCCG CCACCCTCTG CTGCACAGAG TGATGGGGAA CCAGGTGTTC 2100 

CCGGAGGTGA CCAGGCTCCT CACCAGCCAC ACTGGCAATA CCAGCAACTC CGAAGACATC 2160 

TTGTCCTCGG CCTGCTACAC TGTGAGGAAC CTGATGGCCT CGCAGCCACA ACTGGCCAAG 2220 

CAGTACTTCT CCAGCAGCAT GCTCAACAAC ATCATCAACC TGTGCCGAAG CAGTGCCTCA 2280 

CCCAAGGCCG CAGAAGCTGC CCGGCTTCTC CTGTCTGACA TGTGGTCCAG CAAGGAACTG 2340 

CAGGGTGTCC TCAGACAGCA AGGTTTCGAT AGGAACATGC TGGGAACCTT AGCTGGGGCC 2400 

AACAGCCTCA GGAACTTCAC CTCCCGATTC TAAGAAGAGA CTGTCCAAGC AAGTTAGGCT 2460 

TGCAGGAAGA TATGACCCAG CTGAGAAGCC CTCAGGCCTC GCTGGATGGG GTTTTCTGTC 2520 

CATCCTGTGC AGTATTTGGG AAAGTTCACA AGAAACTGAG AAGAAACCTA AAAACTGTGG 2580 

ATAGTGGAAA GATTTTTAGA TTTTTTTTTT CCTTGGGGAA ACTGGCAGGC AATGGGGGTT 2640 

AGGGAGGTTG GGGCGGGGGG GGCTTTCTTG AGTTAAAGGG GCTTATATGT GATGTCAATA 2700 

TTTCTTCCTC TGAGAAATGG TATATATATG TGTCTAATGT AAGTGTGTGC ATGCATGTGC 2760 

GCGTGCATGT GTGTGTGTGT GAGTGTCTTA AAGCATAACC ACAAACTGCA AAAAGCTAGG 2820 

TAAGCTATTT TGTTGCAGCT CATAAGGTGG TGAAAAGGAC TCTCCTGTGT TTCTTACTCA 2880 

TAGGCAAGGA CAACATGTGC TTTTTGGTGA GCTGCTCATA ATTCCTGAAA TGTGTGGTGC 2940 

CAGGGCAAGG GGGCCATCAC TGCAGTCAGG CCCTCAGAGG AGTCCTGCAG GCTTCCTACC 3000 

AGTGGTCTCC AAGGGTGCAG GAGTAACTGG GGCTGGGCCA GCCTCCCCCC TTACAAGGCT 3060 

GCTTTCCACG AAGGGAGGTC TGGTGTATCT CATGGGAGAA TCTGGGGTGT CTGTAGTGTC 3120 

ACCCCTCCAG CAGCGCCACA AGGACTGAGG TTGGGTAGGT GTGAGGTTCC AGAGGACAGC 3180 

AGGACACTCT CGCATACTTT GCCAAATGAG GCCTGCTCAG AGGAGTAGGA GCTGAAAGAT 3240 

GGTGCCTTCC ACCCTCTTGG GCTGTGTGCC CATCAGAGCA GGCTCAGCCT GCAAAGGCCC 3300 

TGCATTCAGA GGTCTTGTAA TCTACTTGTT GCAGGAGAAA GAAGGTAAAA AATGATTTTT 3360 

TTAAGAAAAG CTATTTTATT GCAGCTCTTT CCCAAGAGCT GTTCTGGGAA TGGCTGGTCT 3420 

TCATATTCCC AGTGGAGAGG GGAACAAGTG GGGCTGGGCA TATACCTATT CCGGCTTCTA 3480 

GTGGGATGGA GTTGGGGTAT AGAAATTAAC CAGGAAGATG TTTCCACCAA GCCTGCTGTG 3540 

AGTCAATTGA GGGAGTGTTT GGGTCCCAGG AGACTTGGAC GGGGGGAGTT TGGGTAGACT 3600 

AGGAAAGGAA AGTGCCATAT CAGGGTACCG GTACCGGCAA GCTCACATCT CAGCCAGGGG 3660 

CCATGCCCCA CTTCCCCTGA CCCCAGCTGT CTTGTCTCCA CTCTGTGAAA CCCACAGGGG 3720 

ATGTGATAAA CAGGGCTATT AGGGGTATCA GCCACGTCGA GCCCCCAGAC TCTGTGCACT 3780 

TCAGACCAGC AGCAGCAGGA GGGCTCCCGA GGGCCTTATG AGAAAACCTG TGTGGACATC 3840 

CCTTGGTGTA CACTAAGACA GAGCAGAGCC CAGCGCTCCC AAGCCTTCCT CCTTCCAGCT 3900 

TCTACCTCCA TGCTAGCATT GCTGGTGTTA GAGAGGAATT AACTTCCTGG TCTGTGCCCT 3960 

TCTCTAGAAG AATATAAGAT GCTCCTCCTC CTCACCCCTT CTCAGCCTCC TCCCAAGTCT 4020 

TCCTCTTCTG CACCACCCCC GAGTCCAAAC CCACCTCTTG CCCCAGCATT CAGGCTGGAA 4080 

AACACTGATG TGGACTCAGT ATGACAACTG AGATGGGGGA AGCCAGACAT GTGAGGACGC 4140 

TGTCCTCCGA GAGGTGTCCC CGGCTGTTAG CCAGCTGTGC TGTGGTGCTG TGGGTCTGTC 4200 

ATACCCTCCC TTGCTTCTGT TCACACTGGG AGGCCCACTC CTGGCTCACC TCTCCCTCTC 4260 

AGGGACCCAC GTGGGAGCCT GGATCCCTGG ACTGTCCTGG GCATAGGTTT CAGGGGCCTC 4320 

CTTTGTTGTC ATCAGAACCC AGAGGAATTC TTCTCCTAAA AAATACGTAT GGCATACCAA 4380 

TCTGTGCGGG GCAGTGTCCT AAGCACTTAG ACTACATCAG GGAAGAACAC AGACCACATC 4440 

CCCGTCCTCA TGCGGCTTAT GTTTTCTGGA GGAAAGTGGA GACACAAGTC CTTGGCTTTA 4500 

GGGCTCCCCC GGCTGGGGGC TGTGCAGTCC GGTCAGGGCG GGAGGGGAAA TGCACCGCTG 4560 

CATGTGAACC TTACCAGCCC AGGCGGATGC CCCTTCCCCT TAGCACTACC CTGGCCTCCT 4620 

GCATCCCCTC GCCTCATGTT CCTCCCACCT TCAAAGAATG AAGAGCCCCA TGGGCCCAGC 4680 

CCCTGCCCTG GGAACCAGGC AGCCTTCCAG ACCTCAGGGG CTGAGGCAGA CTATTAGGGC 4740 

AGGGCTGACT TTGGTGACAC TGCCCATTCC CTCTCAGGCC AGCTCAGGTC ACCCGGGCCT 4800 

CTGACCCAGG CCTGTCACTT TGAGAGGGGC AAAACTGAGA GGGGCTTTTC CTAGAGAAAG 4860 

AGAACAAGGA GCTTGCCAGG CTTCATGTAG CCGACACACG TCTCAGGATT TTAAGTCCAC 4920 

ATTGGCCTCA CACTAGCCTA GGCCAATGCC CAAAATAAGG AGTTCCAATT TGGGGCCAAA 4980 

TGAGGAAGGA CACAGACTCT GCCCTGGGAT CTCCTGTGCT AGCGGCCAAT GACAAATCCA 5040 

GTCATTGGCC ACCAGCCACC TCTGCAGTGG GGACCACACT AGCAGCCCTG ACTCCACACT 5100 

CCTCCTGGGG ACCCAAGAGG CAGTGTTGCT GTCTGCGTGT CCACCTTGGA ATCTGGCTGA 5160 

ACTGGCTGGG AGGACCAAGA CTGCGGCTGG GGTGGGCAGG GAAGGGAAGC CGGGGGCTGC 5220 

TGTGAGGGAT CTTGGAGCTT CCCTGTAGCC CACCTTCCCC TTGCTTCATG TTTGTAGAGG 5280 

AACCTTGTGC CGGCCAGGCC CAGTTTCCTT GTGTGATACA CTAATGTATT TGCTTTTTTT 5340 
GGAAATAGAG AAAATCAATA AATTGCTAGT GTTTCTTTGA AAAAAAAAA 

Seq ID NO: 79 Protein sequence: 
Protein Accession #: CAA98022.1 

1 11 21 31 41 51 

I I I I I I 

MNHSPLKTAL AYECFQDQDN STLALPSDQK MKTGTSGRQR VQEQVMMTVK RQKSKSSQSS 60 

TLSHSNRGSM YDGLADNYNY GTTSRSSYYS KFQAGNGSWG YPIYNGTLKR EPDNRRFSSY 120 

SQMENWSRHY PRGSCNTTGA GSDICFMQKI KASRSEPDLY CDPRGTLRKG TLGSKGQKTT 180 

QNRYSFYSTC SGQKAIKKCP VRPPSCASKQ DPVYIPPISC NKDLSFGHSR ASSKICSEDI 240 

ECSGLTIPKA VQYLSSQDEK YQAIGAYYIQ HTCFQDESAK QQVYQLGGIC KLVDLLRSPN 300 

QNVQQAAAGA LRNLVFRSTT NKLETRRQNG IREAVSLLRR TGNAEIQKQL TGLLWNLSST 360 
DELKEELIAD ALPVLADRVI IPFSGWCDGN SNMSREVVDP EVFFNATGCL RNLSSADAGR 420 

QTMRNYSGLI DSLMAYVQNC VAASRCDDKS VENCMCVLHN LSYRLDAEVP TRYRQLEYNA 480 

RNAYTEKSST GCFSNKSDKM MNNNYDCPLP EEETNPKGSG WLYHSDAIRT YLNLMGKSKK 540 

DATLEACAGA LQNLTASKGL MSSGMSQLIG LKEKGLPQIA RLLQSGNSDV VRSGASLLSN 600 

MSRHPLLHRV MGNQVFPEVT RLLTSHTGNT SNSEDILSSA CYTVRNLMAS QPQLAKQYFS 660 

SSMLNNIINL CRSSASPKAA EAARLLLSDM WSSKELQGVL RQQGFDRNML GTLAGANSLR 720 
NFTSRF 



Seq ID NO: 80 DNA sequence 

Nucleic Acid Accession #: NM_006516.1 

Coding sequence: 180-1658 

1 11 21 31 41 51 

I I I I I I 

TAGTCGCGGG TCCCCGAGTG AGCACGCCAG GGAGCAGGAG ACCAAACGAC GGGGGTCGGA 60 

GTCAGAGTCG CAGTGGGAGT CCCCGGACCG GAGCACGAGC CTGAGCGGGA GAGCGCCGCT 120 

CGCACGCCCG TCGCCACCCG CGTACCCGGC GCAGCCAGAG CCACCAGCGC AGCGCTGCCA 180 

TGGAGCCCAG CAGCAAGAAG CTGACGGGTC GCCTCATGCT GGCTGTGGGA GGAGCAGTGC 240 

TTGGCTCCCT GCAGTTTGGC TACAACACTG GAGTCATCAA TGCCCCCCAG AAGGTGATCG 300 

AGGAGTTCTA CAACCAGACA TGGGTCCACC GCTATGGGGA GAGCATCCTG CCCACCACGC 360 

TCACCACGCT CTGGTCCCTC TCAGTGGCCA TCTTTTCTGT TGGGGGCATG ATTGGCTCCT 420 

TCTCTGTGGG CCTTTTCGTT AACCGCTTTG GCCGGCGGAA TTCAATGCTG ATGATGAACC 480 

TGCTGGCCTT CGTGTCCGCC GTGCTCATGG GCTTCTCGAA ACTGGGCAAG TCCTTTGAGA 540 

TGCTGATCCT GGGCCGCTTC ATCATCGGTG TGTACTGCGG CCTGACCACA GGCTTCGTGC 600 

CCATGTATGT GGGTGAAGTG TCACCCACAG CCTTTCGTGG GGCCCTGGGC ACCCTGCACC 660 

AGCTGGGCAT CGTCGTCGGC ATCCTCATCG CCCAGGTGTT CGGCCTGGAC TCCATCATGG 720 

GCAACAAGGA CCTGTGGCCC CTGCTGCTGA GCATCATCTT CATCCCGGCC CTGCTGCAGT 780 

GCATCGTGCT GCCCTTCTGC CCCGAGAGTC CCCGCTTCCT GCTCATCAAC CGCAACGAGG 840 

AGAACCGGGC CAAGAGTGTG CTAAAGAAGC TGCGCGGGAC AGCTGACGTG ACCCATGACC 900 

TGCAGGAGAT GAAGGAAGAG AGTCGGCAGA TGATGCGGGA GAAGAAGGTC ACCATCCTGG 960 

AGCTGTTCCG CTCCCCCGCC TACCGCCAGC CCATCCTCAT CGCTGTGGTG CTGCAGCTGT 1020 

CCCAGCAGCT GTCTGGCATC AACGCTGTCT TCTATTACTC CACGAGCATC TTCGAGAAGG 1080 

CGGGGGTGCA GCAGCCTGTG TATGCCACCA TTGGCTCCGG TATCGTCAAC ACGGCCTTCA 1140 

CTGTCGTGTC GCTGTTTGTG GTGGAGCGAG CAGGCCGGCG GACCCTGCAC CTCATAGGCC 1200 

TCGCTGGCAT GGCGGGTTGT GCCATACTCA TGACCATCGC GCTAGCACTG CTGGAGCAGC 1260 

TACCCTGGAT GTCCTATCTG AGCATCGTGG CCATCTTTGG CTTTGTGGCC TTCTTTGAAG 1320 

TGGGTCCTGG CCCCATCCCA TGGTTCATCG TGGCTGAACT CTTCAGCCAG GGTCCACGTC 1380 

CAGCTGCCAT TGCCGTTGCA GGCTTCTCCA ACTGGACCTC AAATTTCATT GTGGGCATGT 1440 

GCTTCCAGTA TGTGGAGCAA CTGTGTGGTC CCTACGTCTT CATCATCTTC ACTGTGCTCC 1500 

TGGTTCTGTT CTTCATCTTC ACCTACTTCA AAGTTCCTGA GACTAAAGGC CGGACCTTCG 1560 

ATGAGATCGC TTCCGGCTTC CGGCAGGGGG GAGCCAGCCA AAGTGATAAG ACACCCGAGG 1620 

AGCTGTTCCA TCCCCTGGGG GCTGATTCCC AAGTGTGAGT CGCCCCAGAT CACCAGCCCG 1680 

GCCTGCTCCC AGCAGCCCTA AGGATCTCTC AGGAGCACAG GCAGCTGGAT GAGACTTCCA 1740 

AACCTGACAG ATGTCAGCCG AGCCGGGCCT GGGGCTCCTT TCTCCAGCCA GCAATGATGT 1800 

CCAGAAGAAT ATTCAGGACT TAACGGCTCC AGGATTTTAA CAAAAGCAAG ACTGTTGCTC 1860 

AAATCTATTC AGACAAGCAA CAGGTTTTAT AATTTTTTTA TTACTGATTT TGTTATTTTT 1920 

ATATCAGCCT GAGTCTCCTG TGCCCACATC CCAGGCTTCA CCCTGAATGG TTCCATGCCT 1980 

GAGGGTGGAG ACTAAGCCCT GTCGAGACAC TTGCCTTCTT CACCCAGCTA ATCTGTAGGG 2040 

CTGGACCTAT GTCCTAAGGA CACACTAATC GAACTATGAA CTACAAAGCT TCTATCCCAG 2100 

GAGGTGGCTA TGGCCACCCG TTCTGCTGGC CTGGATCTCC CCACTCTAGG GGTCAGGCTC 2160 

CATTAGGATT TGCCCCTTCC CATCTCTTCC TACCCAACCA CTCAAATTAA TCTTTCTTTA 2220 

CCTGAGACCA GTTGGGAGCA CTGGAGTGCA GGGAGGAGAG GGGAAGGGCC AGTCTGGGCT 2280 

GCCGGGTTCT AGTCTCCTTT GCACTGAGGG CCACACTATT ACCATGAGAA GAGGGCCTGT 2340 

GGGAGCCTGC AAACTCACTG CTCAAGAAGA CATGGAGACT CCTGCCCTGT TGTGTATAGA 2400 

TGCAAGATAT TTATATATAT TTTTGGTTGT CAATATTAAA TACAGACACT AAGTTATAGT 2460 

ATATCTGGAC AAGCCAACTT GTAAATACAC CACCTCACTC CTGTTACTTA CCTAAACAGA 2520 

TATAAATGGC TGGTTTTTAG AAACATGGTT TTGAAATGCT TGTGGATTGA GGGTAGGAGG 2580 

TTTGGATGGG AGTGAGACAG AAGTAAGTGG GGTTGCAACC ACTGCAACGG CTTAGACTTC 2640 

GACTCAGGAT CCAGTCCCTT ACACGTACCT CTCATCAGTG TCCTCTTGCT CAAAAATCTG 2700 

TTTGATCCCT GTTACCCAGA GAATATATAC ATTCTTTATC TTGACATTCA AGGCATTTCT 2760 

ATCACATATT TGATAGTTGG TGTTCAAAAA AACACTAGTT TTGTGCCAGC CGTGATGCTC 2820 
AGGCTTGAAA TCGCATTATT TTGAATGTGA AGGGAA 

Seq ID NO: 81 Protein sequence: 
Protein Accession #: NP_006507.1 

1 11 21 31 41 51 

I I I I I I 

MEPSSKKLTG RLMLAVGGAV LGSLQFGYNT GVINAPQKVI EEFYNQTWVH RYGESILPTT 60 

LTTLWSLSVA IFSVGGMIGS FSVGLFVNRF GRRNSMLMMN LLAFVSAVLM GFSKLGKSFE 120 

MLILGRFIIG VYCGLTTGFV PMYVGEVSPT AFRGALGTLH QLGIVVGILI AQVFGLDSIM 180 

GNKDLWPLLL SIIFIPALLQ CIVLPFCPES PRFLLINRNE ENRAKSVLKK LRGTADVTHD 240 

LQEMKEESRQ MMREKKVTIL ELFRSPAYRQ PILIAVVLQL SQQLSGINAV FYYSTSIFEK 300 

AGVQQPVYAT IGSGIVNTAF TVVSLFVVER AGRRTLHLIG LAGMAGCAIL MTIALALLEQ 360 

LPWMSYLSIV AIFGFVAFFE VGPGPIPWFI VAELFSQGPR PAAIAVAGFS NWTSNFIVGM 420 

CFQYVEQLCG PYVFIIFTVL LVLFFIFTYF KVPETKGRTF DEIASGFRQG GASQSDKTPE 480 
ELFHPLGADS QV 

Seq ID NO: 82 DNA sequence 
Nucleic Acid Accession #: BC001291 
Coding sequence: 44-541 

1 11 21 31 41 51 

I I I I I I 

GGGGGCGCCG CGCGCTGACC CTCCCTGGGC ACCGCTGGGG ACGATGGCGC TGCTCGCCTT 60 

GCTGCTGGTC GTGGCCCTAC CGCGGGTGTG GACAGACGCC AACCTGACTG CGAGACAACG 120 
AGATCCAGAG GACTCCCAGC GAACGGACGA GGGTGACAAT AGAGTGTGGT GTCATGTTTG 180 
TGAGAGAGAA AACACTTTCG AGTGCCAGAA CCCAAGGAGG TGCAAATGGA CAGAGCCATA 240 
CTGCGTTATA GCGGCCGTGA AAATATTTCC ACGTTTTTTC ATGGTTGCGA AGCAGTGCTC 300 
CGCTGGTTGT GCAGCGATGG AGAGACCCAA GCCAGAGGAG AAGCGGTTTC TCCTGGAAGA 360 
GCCCATGCCC TTCTTTTACC TCAAGTGTTG TAAAATTCGC TACTGCAATT TAGAGGGGCC 420 
ACCTATCAAC TCATCAGTGT TCAAAGAATA TGCTGGGAGC ATGGGTGAGA GCTGTGGTGG 480 
GCTGTGGCTG GCCATCCTCC TGCTGCTGGC CTCCATTGCA GCCGGCCTCA GCCTGTCTTG 540 
AGCCACGGGA CTGCCACAGA CTGAGCCTTC CGGAGCATGG ACTCGCTCCA GACCGTTGTC 600 
ACCTGTTGCA TTAAACTTGT TTTCTGTTGA TTACCTCTTG GTTTGACTTC CCAGGGTCTT 660 
GGGATGGGAG AGTGGGGATC AGGTGCAGTT GGCTCTTAAC CCTCAAGGGT TCTTTAACTC 720 
ACATTCAGAG GAAGTCCAGA TCTCCTGAGT AGTGATTTTG GTGACAAGTT TTTCTCTTTG 780 
AAATCAAACC TTGTAACTCA TTTATTGCTG ATGGCCACTC TTTTCCTTGA CTCCCCTCTG 840 
CCTCTGAGGG CTTCAGTATT GATGGGGAGG GAGGCCTAAG TACCACTCAT GGAGAGTATG 900 
TGCTGAGATG CTTCCGACCT TTCAGGTGAC GCAGGAACAC TGGGGGAGTC TGAATGATTG 960 
GGGTGAAGAC ATCCCTGGAG TGAAGGACTC CTCAGCATGG GGGGCAGTGG GGCACACGTT 1020 
AGGGCTGCCC CCATTCCAGT GGTGGAGGCG CTGTGGATGG CTGCTTTTCC TCAACCTTTC 1080 
CTACCAGATT CCAGGAGGCA GAAGATAACT AATTGTGTTG AAGAAACTTA GACTTCACCC 1140 
ACCAGCTGGC ACAGGTGCAC AGATTCATAA ATTCCCACAC GTGTGTGTTC AACATCTGAA 1200 
ACTTAGGCCA AGTAGAGAGC ATCAGGGTAA ATGGCGTTCA TTTCTCTGTT AAGATGCAGC 1260 
CATCCATGGG GAGCTGAGAA ATCAGACTCA AAGTTCCACC AAAAACAAAT ACAAGGGGAC 1320 
TTCAAAAGTT CACGAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAA 

Seq ID NO: 83 Protein sequence: 
Protein Accession #: AAH01291 

1 11 21 31 41 51 

I I I I I I 

MALLALLLVV ALPRVWTDAN LTARQRDPED SQRTDEGDNR VWCHVCEREN TFECQNPRRC 60 
KWTEPYCVIA AVKIFPRFFM VAKQCSAGCA AMERPKPEEK RFLLEEPMPF FYLKCCKIRY 120 
CNLEGPPINS SVFKEYAGSM GESCGGLWLA ILLLLASIAA GLSLS 

Seq ID NO: 84 DNA sequence 

Nucleic Acid Accession #: NM_022893.1 

Coding sequence: 229-2726 

1 11 21 31 41 51 

I I I I I I 

TTTTTTTTTT TTTTTTGCTT AAAAAAAAGC CATGACGGCT CTCCCACAAT TCATCTTCCC 60 
TGCGCCATCT TTGTATTATT TCTAATTTAT TTTGGATGTC AAAAGGCACT GATGAAGATA 120 
TTTTCTCTGG AGTCTCCTTC TTTCTAACCC GGCTCTCCCG ATGTGAACCG AGCCGTCGTC 180 
CGCCCGCCGC CGCCGCCGCC GCCGCCGCCG CCCGCCCCGC AGCCCACCAT GTCTCGCCGC 240 
AAGCAAGGCA AACCCCAGCA CTTAAGCAAA CGGGAATTCT CGCCCGAGCC TCTTGAAGCC 300 
ATTCTTACAG ATGATGAACC AGACCACGGC CCGTTGGGAG CTCCAGAAGG GGATCATGAC 360 
CTCCTCACCT GTGGGCAGTG CCAGATGAAC TTCCCATTGG GGGACATTCT TATTTTTATC 420 
GAGCACAAAC GGAAACAATG CAATGGCAGC CTCTGCTTAG AAAAAGCTGT GGATAAGCCA 480 
CCTTCCCCTT CACCAATCGA GATGAAAAAA GCATCCAATC CCGTGGAGGT TGGCATCCAG 540 
GTCACGCCAG AAGATGACGA TTGTTTATCA ACGTCATCTA GAAGAATTTG CCCCAAACAG 600 
GAACACATAG CAGATAAACT TCTGCACTGG AGGGGCCTCT CCTCCCCTCG TTCTGCACAT 660 
GGAGCTCTAA TCCCCACGCC TGGGATGAGT GCAGAATATG CCCCGCAGGG TATTTGTAAA 720 
GATGAGCCCA GCAGCTACAC ATGTACAACT TGCAAACAGC CATTCACCAG TGCATGGTTT 780 
CTCTTGCAAC ACGCACAGAA CACTCATGGA TTAAGAATCT ACTTAGAAAG CGAACACGGA 840 
AGTCCCCTGA CCCCGCGGGT TGGTATCCCT TCAGGACTAG GTGCAGAATG TCCTTCCCAG 900 
CCACCTCTCC ATGGGATTCA TATTGCAGAC AATAACCCCT TTAACCTGCT AAGAATACCA 960 
GGATCAGTAT CGAGAGAGGC TTCCGGCCTG GCAGAAGGGC GCTTTCCACC CACTCCCCCC 1020 
CTGTTTAGTC CACCACCGAG ACATCACTTG GACCCCCACC GCATAGAGCG CCTGGGGGCG 1080 
GAAGAAATGG CCCTGGCCAC CCATCACCCG AGTGCCTTTG ACAGGGTGCT GCGGTTGAAT 1140 
CCAATGGCTA TGGAGCCTCC CGCCATGGAT TTCTCTAGGA GACTTAGAGA GCTGGCAGGG 1200 
AACACGTCTA GCCCACCGCT GTCCCCAGGC CGGCCCAGCC CTATGCAAAG GTTACTGCAA 1260 
CCATTCCAGC CAGGTAGCAA GCCGCCCTTC CTGGCGACGC CCCCCCTCCC TCCTCTGCAA 1320 
TCCGCCCCTC CTCCCTCCCA GCCCCCGGTC AAGTCCAAGT CATGCGAGTT CTGCGGCAAG 1380 
ACGTTCAAAT TTCAGAGCAA CCTGGTGGTG CACCGGCGCA GCCACACGGG CGAGAAGCCC 1440 
TACAAGTGCA ACCTGTGCGA CCACGCGTGC ACCCAGGCCA GCAAGCTGAA GCGCCACATG 1500 
AAGACGCACA TGCACAAATC GTCCCCCATG ACGGTCAAGT CCGACGACGG TCTCTCCACC 1560 
GCCAGCTCCC CGGAACCCGG CACCAGCGAC TTGGTGGGCA GCGCCAGCAG CGCGCTCAAG 1620 
TCCGTGGTGG CCAAGTTCAA GAGCGAGAAC GACCCCAACC TGATCCCGGA GAACGGGGAC 1680 
GAGGAGGAAG AGGAGGACGA CGAGGAAGAG GAAGAAGAGG AGGAAGAGGA GGAGGAGGAG 1740 
CTGACGGAGA GCGAGAGGGT GGACTACGGC TTCGGGCTGA GCCTGGAGGC GGCGCGCCAC 1800 
CACGAGAACA GCTCGCGGGG CGCGGTCGTG GGCGTGGGCG ACGAGAGCCG CGCCCTGCCC 1860 
GACGTCATGC AGGGCATGGT GCTCAGCTCC ATGCAGCACT TCAGCGAGGC CTTCCACCAG 1920 
GTCCTGGGCG AGAAGCATAA GCGCGGCCAC CTGGCCGAGG CCGAGGGCCA CAGGGACACT 1980 
TGCGACGAAG ACTCGGTGGC CGGCGAGTCG GACCGCATAG ACGATGGCAC TGTTAATGGC 2040 
CGCGGCTGCT CCCCGGGCGA GTCGGCCTCG GGGGGCCTGT CCAAAAAGCT GCTGCTGGGC 2100 
AGCCCCAGCT CGCTGAGCCC CTTCTCTAAG CGCATCAAGC TCGAGAAGGA GTTCGACCTG 2160 
CCCCCGGCCA CGATGCCCAA CACGGAGAAC GTGTACTCGC AGTGGCTCGC CGGCTACGCG 2220 
GCCTCCAGGC AGCTCAAAGA TCCCTTCCTT AGCTTCGGAG ACTCCAGACA ATCGCCTTTT 2280 
GCCTCCTCGT CGGAGCACTC CTCGGAGAAC GGGAGCTTGC GCTTCTCCAC ACCGCCCGGG 2340 
GAGCTGGACG GAGGGATCTC GGGGCGCAGC GGCACGGGAA GTGGAGGGAG CACGCCCCAT 2400 
ATTAGTGGTC CGGGCACGGG CAGGCCCAGC TCAAAAGAGG GCAGACGCAG CGACACTTGT 2460 
GAGTACTGTG GGAAAGTCTT CAAGAACTGT AGCAATCTCA CTGTCCACAG GAGAAGCCAC 2520 
ACGGGCGAAA GGCCTTATAA ATGCGAGCTG TGCAACTATG CCTGTGCCCA GAGTAGCAAG 2580 
CTCACCAGGC ACATGAAAAC GCATGGCCAG GTGGGGAAGG ACGTTTACAA ATGTGAAATT 2640 
TGTAAGATGC CTTTTAGCGT GTACAGTACC CTGGAGAAAC ACATGAAAAA ATGGCACAGT 2700 
GATCGAGTGT TGAATAATGA TATAAAAACT GAATAGAGGT ATATTAATAC CCCTCCCTCA 2760 
CTCCCACCTG ACACCCCCTT TTTCACCACT CCCTTTCCCC ATCGCCCTCC AGCCCCACTC 2820 
CCTGTAGGAT TTTTTTCTAG TCCCATGTGA TTTAAACAAA CAAACAAACA AACAGAAGTA 2880 
ACGAAGCTAA GAATATGAGA GTGCTTGTCA CCAGCACACC TGTTTTTTTT CTTTTTCTTT 2940 
TTCTTTTTTC TTTTTCCTTT TTTTTTTTTT TCCTTTATGT TCTCACCGTT TGAATGCATG 3000 


ATCTGTATGG GGCAATACTA TTGCATTTTA CGCAAACTTT GAGCCTTTCT 
AATTTACATG TTGTGTATGT TTTTTTTTAA ACTTAGACAG CATGTATGGT 
TATTTTAAAT TGTCCCTAAT TCGTTGCTGA GCAAACATGT TGCTGTTTCC 
TGAGAGAAAA AGAGAGAGAG AGAGAAAAAG ACCATGCTGC ATACATTCTG 
CATGTACAGT TTTATTTTAT AACGTGAGGA GGAAAAACAG TCTTTGGATT 
AGACAGAATA GATAGCACTG AAAAAAAATC TCTATGAGCT AAATGTCTGT 
TTAAATGTAT CAATTGGAAA GGAAGAAAAA AGGCCTTGAA TTGACAAATT 
CAGAACAAGT TTATTCTATC ATTTGGTTTT AAAATATGAG TGCCTTGGAT 
CACATCGATG GTTCTTTCTA CTTGTTATAA ACTTGTAGCT TAATTCAGCA 
TAATAAACCT TAGGAACTAG CATATAATTC TATATTGTAT TTCTCACAAC 
TAAAAAGATG ACCCATTATG TCCTAGTTAA TCATCATTTT TCCTTTAGTT 
AACAAAACTG ATTATACCAG TATAAAAGCT ACTTTGCTCC TGGTGAGAGC 
TGGGCTGTTT TGCCCAAAGT TTTATTTTTT TTAAACAATG ATTAAATTGA 
TGCAAAAGCC CTGGAACGCA ATTAAATACA CTAGTAAGGA GTTCATTTTA 
TGCTTTAATA ATGTCTTTTT AAAAATACTG GCACCAAAAG AAATAGATCC 
GGTTGTCAAG TGGACAATCA AATGATAAAC TTTAAGACCT TGTATACCAT 
AGAGGCTGAC AATAAGGTTT GACAGAGGGG AACAGAAGAA AATAATATGA 
CAACGTGGTA CTATTTGCCA TTTAAAACTA GAACAGGTAT ATAAGCTAAT 
TGATGATTAA CTATGAATTC TTAAGACTTG CATTTAAATG TGACATTCTT 
GAGAAAGAAT TTTAAGAGTA GCAGTATATA TGTCTGTGCT CCCTAAAAGT 
TTCTTTTCCA TACACTGTGT GCTATTTGTG TTAACATGGA AGAGGATTCA 
TTTTATTTTT TTAATTTTTT CTTTTTTATT AAGCTAGCAT CTGCCCCAGT 
ATAGCACTTG ACTCTGCCTG TGATATCTGT ATCTTTTCTC TAATCAGAGA 
GAGTATAAAA TAAACCTGCT CAGATAGGAC AATTAAGTGC ACTGTACAAT 
TACAGGTCTA TACTTAAGGG AAAAGTTGCA AGAATGCTGA AAAAAAATTG 
CATTGAGGAG CATTTTTTAA AAACTAAAAA AAAAAAAACT TTGCCAGCCA 
TATTGAGCTT ACTTACTTGG ACGCAACATT GCAAGCGCTG TGAATGGAAA 
TTAACATAGA AATGAATGAT TGCTTTCGCT TCTACAGTGC AAGGATTTTT 
CTTTTTTAAA TATAAATGTT AAGAAAAATT TTTTTTAAAA AACACTTCAT 
GGGGAACTGC ATTTTAGGGT TCCATTGTCT TGGTGGTGTT ACAAGACTTG 
AAAAATGGTA GTGGAAATTC TATGCCTTGG ATACACACCG CTCTTCAGGT 
AAAAACATAC ATTGGGGAAA GGTTTAAGAT TATATAGTAC TTAAATATAG 
ACTCATGTTG ATTCCTATGC TAAAATACAT TTATGGTCTT TTTTCTGTAT 
GTATTTGAAT TAAATGTTCA TCTAGTGTTA GGCACTATAG TATTTATATT 
TTTTTAACTG TTGCTTGTTC TCTTAAAAGG TATCAATGTA CCTTTTTTGG 
AAAAAAGACA GGCTGCCACA GTATATTTTT TTAATTTGGC AGGATAATAT 
ATTTGTATGC TTCAAAAAAA AAAAAAAGAG AGAAACAAAA AAGTGTGACA 
GAAGCCATAT AATGGCGGTT TGGGGGAGCC TGCTAGAATG TCACATGGAT 
GGGGTTGTAC ATATCCTTTT TTGTTCCTTT TTCCTGCTGC CATACTGTAT 
AAGCTAATAA CGTTGGTTTG TTATGTAGTG TGCTTTTTGT CCCTTTCCTT 
ACATTCCAGC ATCTTACCTT CATATGCAGT AAAAGAAAGA AAGAAAAAAA 
AAAAAAAAAC CAATGTTTTG CAGTTTTTTT CATTGCCAAA AACTAAATGG 
TTAGATTGGA AAGAATTTCA TATGCAAAGC ATATTAAAGA GAAAGCCCGC 
ACTTTTTTGT AAATGGCAAT GCAGAATATT TTGTTATTGG CCTTTTCTAT 
AAAGCTGTTT GTCGTAACTT GAAATTTTAT CTTTTACTAT GGGAGTCACT 
GCTTATGTGC CCTGTTCAAA ACAGAGGCAC TTAATTTGAT CTTTTATTTT 
TATTTTTTTT TTTATTTAGA TGACCAAAGG TCATTACAAC CTGGCTTTTT 
TTTCTGGTCT TTGTTAAGTT CTATTGGAAA AACCACTGTC TGTGTTTTTT 
CTGCATTAAC CTGTTCATAC ACCCATTTTG TCCCTTTATT GAAAAAATAA 
A 

Seq ID NO: 85 Protein sequence: 
Protein Accession NP_075044.l 



PCT/US02/12476 



MSRRKQGKPQ 
LIFIEHKRKQ 
CPKQBHIADK 
SAWFLLGHAQ 
LRIPGSVSRE 
LRLNPMAMEP 
PPLQSAPPPS 
KRHMKTHMHK 
ENGDEEEEED 
RALPDVMQGM 
TVNGRGCSPG 
AGYAASRQLK 
STPHISGPGT 
QSSKLTRHMK 



11 
I 

HLSKREFSPE 
CNGSLCLEKA 
LLHWRGLSSP 
NTHGLRIYLE 
ASGLAEGRFP 
PAMDFSRRLR 
QPPVKSKSCE 
SSPMTVKSDD 
DEEEEEEEEE 
VLSSMQHFSE 
ESASGGLSKK 
DPFLSFGDSR 
GRPSSKEGRR 
THGQVGKDVY 



21 
I 

PLEAILTDDE 
VDKPPSPSPI 
RSAHGALIPT 
SEHGSPLTPR 
PTPPLFSPPP 
ELAGNTSSPP 
FCGKTFKFQS 
GLSTASSPEP 
EEEELTESER 
AFHQVLGEKH 
LLLGSPSSLS 
QSPFASSSEH 
SDTCEYCGKV 
KCEICKMPFS 



31 
I 

PDHGPLGAPE 
EMKKASNPVE 
PGMSAEYAPQ 
VGIPSGLGAE 
RHHLDPHRIE 
LSPGRPSPMQ 
NLWHRRSHT 
GTSDLVGSAS 
VDYGFGLSLE 
KRGHLAEAEG 
PFSKRIKLEK 
SSENGSLRFS 
FKNCSNLTVH 
VYSTLEKHMK 



41 

I 

GDHDLLTCGQ 
VGIQVTPEDD 
GICKDEPSSY 
CPSQPPLHGI 
RLGAEEMALA 
RLLQPFQPGS 
GEKPYKCNLC 
SALKSWAKF 
AARHHENSSR 
HRDTCDEDSV 
EFDLPPATMP 
TPPGELDGGI 
RRSHTGERPY 
KWHSDRVLNN 



CTTGTGCAAT 
ATGTTATGGC 
AGTTCCGTTC 
TAATACATAT 
AACCCTCTAT 
CTCTAAAGGG 
AACAGAAAAA 
CTATTAAAAC 
TTGGGTGAGG 
AATGGCTACC 
TAATTTTATA 
TTAAAAGAAA 
ATGTGTAATG 
TGAAGATATT 
AGATCTACTT 
ATTGAAAGGA 
TTTATTAGCA 
ATTGATACAA 
AAAAAAAGAA 
TGTACTTCAT 
TTGTTTTTAT 
TGGTGTTCAA 
TACAGAGGTT 
TTTCCCAGTT 
AACACAATCT 
TTTACTTGAC 
CAGAATACAC 
TTGTACAAAA 
TATGTTTAGG 
TTATCCATTT 
TGTAAAAAAA 
GAAAATGCAC 
TTCTAGAATG 
GAAGCTTGTA 
TAGTGGAAAA 
AGTGCAAATT 
TTACAGATGA 
GGCTGTCATA 
GCAGTACTGC 
CTATCACCCT 
AAGGAAAAAA 
TGCTTTATAT 
TTTAGTCAAT 
TCCTGTAATG 
ATTTATTATT 
TCTTTGTTTT 
ATTGTATTTG 
TGGCAGTTGT 
AAAAAATTAA 



51 
I 

CQMNFPLGDI 
DCLSTSSRRI 
TCTTCKQPFT 
HIADNNPFNXj 
THHPSAFDRV 
KPPFLATPPL 
DHACTQASKL 
KSENDPNLIP 
GAWGVGDES 
AGESDRIDDG 
NTENVYSQWL 
SGRSGTGSGG 
KCELCNYACA 
DIKTE 



3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
5820 
5680 
5940 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 



75 



80 



85 



Seq ID NO: 86 DNA sequence 

Nucleic Acid Accession #: XM_035292.2 

Coding sequence: 53-1576 



1 11 21 31 41 51
| | | | | |
GCTCGCTGGG CCGCGGCTCC CGGGTGTCCC AGGCCCGGCC GGTGCGCAGA GCATGGCGGG 60
TGCGGGCCCG AAGCGGCGCG CGCTAGCGGC GCCGGCGGCC GAGGAGAAGG AAGAGGCGCG 120
GGAGAAGATG CTGGCCGCCA AGAGCGCGGA CGGCTCGGCG CCGGCAGGCG AGGGCGAGGG 180
CGTGACCCTG CAGCGGAACA TCACGCTGCT CAACGGCGTG GCCATCATCG TGGGGACCAT 240
TATCGGCTCG GGCATCTTCG TGACGCCCAC GGGCGTGCTC AAGGAGGCAG GCTCGCCGGG 300
GCTGGCGCTG GTGGTGTGGG CCGCGTGCGG CGTCTTCTCC ATCGTGGGCG CGCTCTGCTA 360
CGCGGAGCTC GGCACCACCA TCTCCAAATC GGGCGGCGAC TACGCCTACA TGCTGGAGGT 420
CTACGGCTCG CTGCCCGCCT TCCTCAAGCT CTGGATCGAG CTGCTCATCA TCCGGCCTTC 480
ATCGCAGTAC ATCGTGGCCC TGGTCTTCGC CACCTACCTG CTCAAGCCGC TCTTCCCCAC 540
CTGCCCGGTG CCCGAGGAGG CAGCCAAGCT CGTGGCCTGC CTCTGCGTGC TGCTGCTCAC 600
GGCCGTGAAC TGCTACAGCG TGAAGGCCGC CACCCGGGTC CAGGATGCCT TTGCCGCCGC 660
CAAGCTCCTG GCCCTGGCCC TGATCATCCT GCTGGGCTTC GTCCAGATCG GGAAGGGTGA 720
TGTGTCCAAT CTAGATCCCA ACTTCTCATT TGAAGGCACC AAACTGGATG TGGGGAACAT 780
TGTGCTGGCA TTATACAGCG GCCTCTTTGC CTATGGAGGA TGGAATTACT TGAATTTCGT 840
CACAGAGGAA ATGATCAACC CCTACAGAAA CCTGCCCCTG GCCATCATCA TCTCCCTGCC 900
CATCGTGACG CTGGTGTACG TGCTGACCAA CCTGGCCTAC TTCACCACCC TGTCCACCGA 960
GCAGATGCTG TCGTCCGAGG CCGTGGCCGT GGACTTCGGG AACTATCACC TGGGCGTCAT 1020
GTCCTGGATC ATCCCCGTCT TCGTGGGCCT GTCCTGCTTC GGCTCCGTCA ATGGGTCCCT 1080
GTTCACATCC TCCAGGCTCT TCTTCGTGGG GTCCCGGGAA GGCCACCTGC CCTCCATCCT 1140
CTCCATGATC CACCCACAGC TCCTCACCCC CGTGCCGTCC CTCGTGTTCA CGTGTGTGAT 1200
GACGCTGCTC TACGCCTTCT CCAAGGACAT CTTCTCCGTC ATCAACTTCT TCAGCTTCTT 1260
CAACTGGCTC TGCGTGGCCC TGGCCATCAT CGGCATGATC TGGCTGCGCC ACAGAAAGCC 1320
TGAGCTTGAG CGGCCCATCA AGGTGAACCT GGCCCTGCCT GTGTTCTTCA TCCTGGCCTG 1380
CCTCTTCCTG ATCGCCGTCT CCTTCTGGAA GACACCCGTG GAGTGTGGCA TCGGCTTCAC 1440
CATCATCCTC AGCGGGCTGC CCGTCTACTT CTTCGGGGTC TGGTGGAAAA ACAAGCCCAA 1500
GTGGCTCCTC CAGGGCATCT TCTCCACGAC CGTCCTGTGT CAGAAGCTCA TGCAGGTGGT 1560
CCCCCAGGAG ACATAGCCAG GAGGCCGAGT GGCTGCCGGA GGAGCATGC

Seq ID NO: 87 Protein sequence:
Protein Accession #: XP_035292.2



1 11 21 31 41 51
| | | | | |
MAGAGPKRRA LAAPAAEEKE EAREKMLAAK SADGSAPAGE GEGVTLQRNI TLLNGVAIIV 60
GTIIGSGIFV TPTGVLKEAG SPGLALVVWA ACGVFSIVGA LCYAELGTTI SKSGGDYAYM 120
LEVYGSLPAF LKLWIELLII RPSSQYIVAL VFATYLLKPL FPTCPVPEEA AKLVACLCVL 180
LLTAVNCYSV KAATRVQDAF AAAKLLALAL IILLGFVQIG KGDVSNLDPN FSFEGTKLDV 240
GNIVLALYSG LFAYGGWNYL NFVTEEMINP YRNLPLAIII SLPIVTLVYV LTNLAYFTTL 300
STEQMLSSEA VAVDFGNYHL GVMSWIIPVF VGLSCFGSVN GSLFTSSRLF FVGSREGHLP 360
SILSMIHPQL LTPVPSLVFT CVMTLLYAFS KDIFSVINFF SFFNWLCVAL AIIGMIWLRH 420
RKPELERPIK VNLALPVFFI LACLFLIAVS FWKTPVECGI GFTIILSGLP VYFFGVWWKN 480
KPKWLLQGIF STTVLCQKLM QVVPQET



Seq ID NO: 88 DNA sequence 

Nucleic Acid Accession #: NM_005268.1 

Coding sequence: 168-989 



1 11 21 31 41 51
| | | | | |
TAAAAAGCAA AAGAATTCGC GGCCGCGTCG ACACGGGCTT CCCCGAAAAC CTTCCCCGCT 60
TCTGGATATG AAATTCAAGC TGCTTGCTGA GTCCTATTGC CGGCTGCTGG GAGCCAGGAG 120
AGCCCTGAGG AGTAGTCACT CAGTAGCAGC TGACGCGTGG GTCCACCATG AACTGGAGTA 180
TCTTTGAGGG ACTCCTGAGT GGGGTCAACA AGTACTCCAC AGCCTTTGGG CGCATCTGGC 240
TGTCTCTGGT CTTCATCTTC CGCGTGCTGG TGTACCTGGT GACGGCCGAG CGTGTGTGGA 300
GTGATGACCA CAAGGACTTC GACTGCAATA CTCGCCAGCC CGGCTGCTCC AACGTCTGCT 360
TTGATGAGTT CTTCCCTGTG TCCCATGTGC GCCTCTGGGC CCTGCAGCTT ATCCTGGTGA 420
CATGCCCCTC ACTGCTCGTG GTCATGCACG TGGCCTACCG GGAGGTTCAG GAGAAGAGGC 480
ACCGAGAAGC CCATGGGGAG AACAGTGGGC GCCTCTACCT GAACCCCGGC AAGAAGCGGG 540
GTGGGCTCTG GTGGACATAT GTCTGCAGCC TAGTGTTCAA GGCGAGCGTG GACATCGCCT 600
TTCTCTATGT GTTCCACTCA TTCTACCCCA AATATATCCT CCCTCCTGTG GTCAAGTGCC 660
ACGCAGATCC ATGTCCCAAT ATAGTGGACT GCTTCATCTC CAAGCCCTCA GAGAAGAACA 720
TTTTCACCCT CTTCATGGTG GCCACAGCTG CCATCTGCAT CCTGCTCAAC CTCGTGGAGC 780
TCATCTACCT GGTGAGCAAG AGATGCCACG AGTGCCTGGC AGCAAGGAAA GCTCAAGCCA 840
TGTGCACAGG TCATCACCCC CACGGTACCA CCTCTTCCTG CAAACAAGAC GACCTCCTTT 900
CGGGTGACCT CATCTTTCTG GGCTCAGACA GTCATCCTCC TCTCTTACCA GACCGCCCCC 960
GAGACCATGT GAAGAAAACC ATCTTGTGAG GGGCTGCCTG GACTGGTCTG GCAGGTTGGG 1020
CCTGGATGGG GAGGCTCTAG CATCTCTCAT AGGTGCAACC TGAGAGTGGG GGAGCTAAGC 1080
CATGAGGTAG GGGCAGGCAA GAGAGAGGAT TCAGACGCTC TGGGAGCCAG TTCCTAGTCC 1140
TCAACTCCAG CCACCTGCCC CAGCTCGACG GCACTGGGCC AGTTCCCCCT CTGCTCTGCA 1200
GCTCGGTTTC CTTTTCTAGA ATGGAAATAG TGAGGGCCAA TGC


Seq ID NO: 89 Protein sequence: 
Protein Accession #: NP_005259.1

1 11 21 31 41 51
| | | | | |
MNWSIFEGLL SGVNKYSTAF GRIWLSLVFI FRVLVYLVTA ERVWSDDHKD FDCNTRQPGC 60
SNVCFDEFFP VSHVRLWALQ LILVTCPSLL VVMHVAYREV QEKRHREAHG ENSGRLYLNP 120
GKKRGGLWWT YVCSLVFKAS VDIAFLYVFH SFYPKYILPP VVKCHADPCP NIVDCFISKP 180
SEKNIFTLFM VATAAICILL NLVELIYLVS KRCHECLAAR KAQAMCTGHH PHGTTSSCKQ 240
DDLLSGDLIF LGSDSHPPLL PDRPRDHVKK TIL






Seq ID NO: 90 DNA sequence 

Nucleic Acid Accession #: NM_002391.1

Coding sequence: 26-457

1 11 21 31 41 51
| | | | | |
CGGGCGAAGC AGCGCGGGCA GCGAGATGCA GCACCGAGGC TTCCTCCTCC TCACCCTCCT 60
CGCCCTGCTG GCGCTCACCT CCGCGGTCGC CAAAAAGAAA GATAAGGTGA AGAAGGGCGG 120
CCCGGGGAGC GAGTGCGCTG AGTGGGCCTG GGGGCCCTGC ACCCCCAGCA GCAAGGATTG 180
CGGCGTGGGT TTCCGCGAGG GCACCTGCGG GGCCCAGACC CAGCGCATCC GGTGCAGGGT 240
GCCCTGCAAC TGGAAGAAGG AGTTTGGAGC CGACTGCAAG TACAAGTTTG AGAACTGGGG 300
TGCGTGTGAT GGGGGCACAG GCACCAAAGT CCGCCAAGGC ACCCTGAAGA AGGCGCGCTA 360
CAATGCTCAG TGCCAGGAGA CCATCCGCGT CACCAAGCCC TGCACCCCCA AGACCAAAGC 420
AAAGGCCAAA GCCAAGAAAG GGAAGGGAAA GGACTAGACG CCAAGCCTGG ATGCCAAGGA 480
GCCCCTGGTG TCACATGGGG CCTGGCCACG CCCTCCCTCT CCCAGGCCCG AGATGTGACC 540
CACCAGTGCC TTCTGTCTGC TCGTTAGCTT TAATCAATCA TGCCCTGCCT TGTCCCTCTC 600
ACTCCCCAGC CCCACCCCTA AGTGCCCAAA GTGGGGAGGG ACAAGGGATT CTGGGAAGCT 660
TGAGCCTCCC CCAAAGCAAT GTGAGTCCCA GAGCCOGCTT TTGTTCTTCC CCACAATTCC 720
ATTACTAAGA AACACATCAA ATAAACTGAC TTTTTCCCCC CAATAAAAGC TCTTCTTTTT 780
TAATAT

Seq ID NO: 91 Protein sequence: 
Protein Accession #: NP_002382.1 



1 11 21 31 41 51 

| | | | | |

MQHRGFLLLT LLALLALTSA VAKKKDKVKK GGPGSECAEW AWGPCTPSSK DCGVGFREGT 60
CGAQTQRIRC RVPCNWKKEF GADCKYKFEN WGACDGGTGT KVRQGTLKKA RYNAQCQETI 120
RVTKPCTPKT KAKAKAKKGK GKD



Seq ID NO: 92 DNA sequence

Nucleic Acid Accession #: NM_005130.1

Coding sequence: 98-802

1 11 21 31 41 51
| | | | | |
CTCTACCTGA CACAGCTGCA GCCTGCAATT CACTCCCACT GCCTGGGATT GCACTGGATC 60
CGTGTGCTCA GAACAAGGTG AACGCCCAGC TGCAGCCATG AAGATCTGTA GCCTCACCCT 120
GCTCTCCTTC CTCCTACTGG CTGCTCAGGT GCTCCTGGTG GAGGGGAAAA AAAAAGTGAA 180
GAATGGACTT CACAGCAAAG TGGTCTCAGA ACAAAAGGAC ACTCTGGGCA ACACCCAGAT 240
TAAGCAGAAA AGCAGGCCCG GGAACAAAGG CAAGTTTGTC ACCAAAGACC AAGCCAACTG 300
CAGATGGGCT GCTACTGAGC AGGAGGAGGG CATCTCTCTC AAGGTTGAGT GCACTCAATT 360
GGACCATGAA TTTTCCTGTG TCTTTGCTGG CAATCCAACC TCATGCCTAA AGCTCAAGGA 420
TGAGAGAGTC TATTGGAAAC AAGTTGCCCG GAATCTGCGC TCACAGAAAG ACATCTGTAG 480
ATATTCCAAG ACAGCTGTGA AAACCAGAGT GTGCAGAAAG GATTTTCCAG AATCCAGTCT 540
TAAGCTAGTC AGCTCCACTC TATTTGGGAA CACAAAGCCC AGGAAGGAGA AAACAGAGAT 600
GTCCCCCAGG GAGCACATCA AGGGCAAAGA GACCACCCCC TCTAGCCTAG CAGTGACCCA 660
GACCATGGCC ACCAAAGCTC CCGAGTGTGT GGAGGACCCA GATATGGCAA ACCAGAGGAA 720
GACTGCCCTG GAGTTCTGTG GAGAGACTTG GAGCTCTCTC TGCACATTCT TCCTCAGCAT 780
AGTGCAGGAC ACGTCATGCT AATGAGGTCA AAAGAGAACG GGTTCCTTTA AGAGATGTCA 840
TGTCGTAAGT CCCTCTGTAT ACTTTAAAGC TCTCTACAGT CCCCCCAAAA TATGAACTTT 900
TGTGCTTAGT GAGTGCAACG AAATATTTAA ACAAGTTTTG TATTTTTTGC TTTTGTGTTT 960
TGGAATTTGC CTTATTTTTC TTGGATGCGA TGTTCAGAGG CTGTTTCCTG CAGCATGTAT 1020
TTCCATGGCC CACACAGCTA TGTGTTTGAG CAGCGAAGAG TCTTTGAGCT GAATGAGCCA 1080
GAGTGATAAT TTCAGTGCAA CGAACTTTCT GCTGAATTAA TGGTAATAAA ACTCTGGGTG 1140
TTTTTCAAAA AAAAAAAAAA AAA



Seq ID NO: 93 Protein sequence: 
Protein Accession #: NP_005121.1

1 11 21 31 41 51
| | | | | |
MKICSLTLLS FLLLAAQVLL VEGKKKVKNG LHSKVVSEQK DTLGNTQIKQ KSRPGNKGKF 60
VTKDQANCRW AATEQEEGIS LKVECTQLDH EFSCVFAGNP TSCLKLKDER VYWKQVARNL 120
RSQKDICRYS KTAVKTRVCR KDFPESSLKL VSSTLFGNTK PRKEKTEMSP REHIKGKETT 180
PSSLAVTQTM ATKAPECVED PDMANQRKTA LEFCGETWSS LCTFFLSIVQ DTSC

Seq ID NO: 94 DNA sequence 

Nucleic Acid Accession #: NM_012101

Coding sequence: 125-1891

1 11 21 31 41 51
| | | | | |
CTCCTCACAG GTGTGTCTCT AGTCCTCGTG GTTGCCTGCC CCACTCCCTG CCGAGACGCC 60
TGCCAGAAAG GTCACCTATC CTGAACCCCA GCAAGCCTGA AACAGCTCAG CCAAGCACCC 120
TGCGATGGAA GCTGCAGATG CCTCCAGGAG CAACGGGTCG AGCCCAGAAG CCAGGGATGC 180
CCGGAGCCCG TCGGGCCCCA GTGGCAGCCT GGAGAATGGC ACCAAGGCTG ACGGCAAGGA 240
TGCCAAGACC ACCAACGGGC ACGGCGGGGA GGCAGCTGAG GGCAAGAGCC TGGGCAGCGC 300
CCTGAAGCCA GGGGAAGGTA GGAGCGCCCT GTTCGCGGGC AATGAGTGGC GGCGACCCAT 360
CATCCAGTTT GTCGAGTCCG GGGACGACAA GAACTCCAAC TACTTCAGCA TGGACTCTAT 420
GGAAGGCAAG AGGTCGCCGT ACGCAGGGCT CCAGCTGGGG GCTGCCAAGA AGCCACCCGT 480
TACCTTTGCC GAAAAGGGCG ACGTGCGCAA GTCCATTTTC TCGGAGTCCC GGAAGCCCAC 540
GGTGTCCATC ATGGAGCCCG GGGAGACCCG GCGGAACAGC TACCCCCGGG CCGACACGGG 600
CCTTTTTTCA CGGTCCAAGT CCGGCTCCGA GGAGGTGCTG TGCGACTCCT GCATCGGCAA 660
CAAGCAGAAG GCGGTCAAGT CCTGCCTGGT GTGCCAGGCC TCCTTCTGCG AGCTGCATCT 720
CAAGCCCCAC CTGGAGGGCG CCGCCTTCCG AGACCACCAG CTGCTCGAGC CCATCCGGGA 780
CTTTGAGGCC CGCAAGTGTC CCGTGCATGG CAAGACGATG GAGCTCTTCT GCCAGACCGA 840
CCAGACCTGC ATCTGCTACC TTTGCATGTT CCAGGAGCAC AAGAATCATA GCACCGTCAC 900
AGTGGAGGAG GCCAAGGCCG AGAAGGAGAC GGAGCTGTCA CTGCAAAAGG AGCAGCTGCA 960
GCTCAAGATC ATTGAGATTG AGGATGAAGC TGAGAAGTGG CAGAAGGAGA AGGACCGCAT 1020
CAAGAGCTTC ACCACCAATG AGAAGGCCAT CCTGGAGCAG AACTTCCGGG ACCTGGTGCG 1080
GGACCTGGAG AAGCAAAAGG AGGAAGTGAG GGCTGCGCTG GAGCAGCGGG AGCAGGATGC 1140
TGTGGACCAA GTGAAGGTGA TCATGGATGC TCTGGATGAG AGAGCCAAGG TGCTGCATGA 1200
GGACAAGCAG ACCCGGGAGC AGCTGCATAG CATCAGCGAC TCTGTGTTGT TTCTGCAGGA 1260
ATTTGGTGCA TTGATGAGCA ATTACTCTCT CCCCCCACCC CTGCCCACCT ATCATGTCCT 1320
GCTGGAGGGG GAGGGCCTGG GACAGTCACT AGGCAACTTC AAGGACGACC TGCTCAATGT 1380
ATGCATGCGC CACGTTGAGA AGATGTGCAA GGCGGACCTG AGCCGTAACT TCATTGAGAG 1440
GAACCACATG GAGAACGGTG GTGACCATCG CTATGTGAAC AACTACACGA ACAGCTTCGG 1500

GGGTGAGTGG AGTGCACCGG ACACCATGAA GAGATACTCC ATGTACCTGA CACCCAAAGG 1560 

TGGGGTCCGG ACATCATACC AGCCCTCGTC TCCTGGCCGC TTCACCAAGG AGACCACCCA 1620 

GAAGAATTTC AACAATCTCT ATGGCACCAA AGGTAACTAC ACCTCCCGGG TCTGGGAGTA 1680 

CTCCTCCAGC ATTCAGAACT CTGACAATGA CCTGCCCGTC GTCCAAGGCA GCTCCTCCTT 1740 

CTCCCTGAAA GGCTATCCCT CCCTCATGCG GAGCCAAAGC CCCAAGGCCC AGCCCCAGAC 1800 

TTGGAAATCT GGCAAGCAGA CTATGCTGTC TCACTACCGG CCATTCTACG TCAACAAAGG 1860

CAACGGGATT GGGTCCAACG AAGCCCCATG AGCTCCTGGC GGAAGGAACG AGGCGCCACA 1920 

CCCCTGCTCT TCCTCCTGAC CCTGCTGCTC TTGCCTTCTA AGCTACTGTG CTTGTCTGGG 1980 

TGGGAGGGAG CCTGGTCCTG CACCTGCCCT CTGCAGCCCT CTGCCAGCCT CTTGGGGGCA 2040 

GTTCCGGCCT CTCCGACTTC CCCACTGGCC ACACTCCATT CAGACTCCTT TCCTGCCTTG 2100 

TGACCTCAGA TGGTCACCAT CATTCCTGTG CTCAGAGGCC AACCCATCAC AGGGGTGAGA 2160 

TAGGTTGGGG CCTGCCCTAA CCCGCCAGCC TCCTCCTCTC GGGCTGGATC TGGGGGCTAG 2220 

CAGTGAGTAC CCGCATGGTA TCAGCCTGCC TCTCCCGCCC ACGCCCTGCT GTCTCCAGGC 2280 

CTATAGACGT TTCTCTCCAA GGCCCTATCC CCCAATGTTG TCAGCAGATG CCTGGACAGC 2340 

ACAGCCACCC ATCTCCCATT CACATGGCCC ACCTCCTGCT TCCCAGAGGA CTGGCCCTAC 2400 

GTGCTCTCTC TCGTCCTACC TATCAATGCC CAGCATGGCA GAACCTGCAG TGGCCAAGGG 2460 

CTGCAGATGG AAACCTCTCA GTGTCTTGAC ATCACCCTAC CCAGGCGGTG GGTCTCCACC 2520 

ACAGCCACTT TGAGTCTGTG GTCCCTGGAG GGTGGCTTCT CCTGACTGGC AGGATGACCT 2580 

TAGCCAAGAT ATTCCTCTGT TCCCTCTGCT GAGATAAAGA ATTCCCTTAA CATGATATAA 2640 

TCCACCCATG CAAATAGCTA CTGGCCCAGC TACCATTTAC CATTTGCCTA CAGAATTTCA 2700 

TTCAGTCTAC ACTTTGGCAT TCTCTCTGGC GATGGAGTGT GGCTGGGCTG ACCGCAAAAG 2760 

GTGCCTTACA CACTGCCCCC ACCCTCAGCC GTTGCCCCAT CAGAGGCTGC CTCCTCCTTC 2820 

TGATTACCCC CCATGTTGCA TATCAGGGTG CTCAAGGATT GGAGAGGAGA CAAAACCAGG 2880 

AGCAGCACAG TGGGGACATC TCCCGTCTCA ACAGCCCCAG GCCTATGGGG GCTCTGGAAG 2940 

GATGGGCCAG CTTGCAGGGG TTGGGGAGGG AGACATCCAG CTTGGGCTTT CCCCTTTGGA 3000 
ATAAACCATT GGTCTGTC 

Seq ID NO: 95 Protein sequence: 
Protein Accession #: NP_036233.1 



1 11 21 31 41 51 

MEAADASRSN GSSPEARDAR SPSGPSGSLE NGTKADGKDA KTTNGHGGEA AEGKSLGSAL 60 

KPGEGRSALF AGNEWRRPII QFVESGDDKN SNYFSMDSME GKRSPYAGLQ LGAAKKPPVT 120 

FAEKGDVRKS IFSESRKPTV SIMEPGETRR NSYPRADTGL FSRSKSGSEE VLCDSCIGNK 180 

QKAVKSCLVC QASFCELHLK PHLEGAAFRD HQLLEPIRDF EARKCPVHGK TMELFCQTDQ 240

TCICYLCMFQ EHKNHSTVTV EEAKAEKETE LSLQKEQLQL KIIEIEDEAE KWQKEKDRIK 300 

SFTTNEKAIL EQNFRDLVRD LEKQKEEVRA ALEQREQDAV DQVKVIMDAL DERAKVLHED 360 

KQTREQLHSI SDSVLFLQEF GALMSNYSLP PPLPTYHVLL EGEGLGQSLG NFKDDLLNVC 420

MRHVEKMCKA DLSRNFIERN HMENGGDHRY VNNYTNSFGG EWSAPDTMKR YSMYLTPKGG 480 

VRTSYQPSSP GRFTKETTQK NFNNLYGTKG NYTSRVWEYS SSIQNSDNDL PVVQGSSSFS 540
LKGYPSLMRS QSPKAQPQTW KSGKQTMLSH YRPFYVNKGN GIGSNEAP 

Seq ID NO: 96 DNA sequence 

Nucleic Acid Accession #: NM_080668.1

Coding sequence: 83-841 

1 11 21 31 41 51 

GGCACGAGGG CAGCGAGTGG CCTTCCCGGT TGGCGCGCGC CCGGGGCX3GC GGCGCTGGAG 60 

GAGCTCGAGA CGGAGCCTAG TTATGTCTGG GAGGCGAACG CGGTCCGGAG GAGCCGCTCA 120 

GCGCTCCGGG CCAAGGGCCC CATCTCCTAC TAAGCCTCTG CGGAGGTCCC AGCGGAAATC 180 

AGGCTCTGAA CTCCCGAGCA TCCTCCCTGA AATCTGGCCG AAGACACCCA GTGCGGCTGC 240 

AGTCAGAAAG CCCATCGTCT TAAAGAGGAT CGTGGCCCAT GCTGTAGAGG TCCCAGCTGT 300 

CCAATCACCT CGCAGGAGCC CTAGGATTTC CTTTTTCTTG GAGAAAGAAA ACGAGCCCCC 360 

TGGCAGGGAG CTTACTAAGG AGGACCTTTT CAAGACACAC AGCGTCCCTG CCACCCCCAC 420 

CAGCACTCCT GTGCCGAACC CTGAGGCCGA GTCCAGCTCC AAGGAAGGAG AGCTGGACGC 480 

CAGAGACTTG GAAATGTCTA AGAAAGTCAG GCGTTCCTAC AGCCGGCTGG AGACCCTGGG 540 

CTCTGCCTCT ACCTCCACCC CAGGCCGCCG GTCCTGCTTT GGCTTCGAGG GGCTGCTGGG 600 

GGCAGAAGAC TTGTCCGGAG TCTCGCCAGT GGTGTGCTCC AAACTCACCG AGGTCCCCAG 660 

GGTTTGTGCA AAGCCCTGGG CCCCAGACAT GACTCTCCCT GGAATCTCCC CACCACCCGA 720 

GAAACAGAAA CGTAAGAAGA AGAAAATGCC AGAGATCTTG AAAACGGAGC TGGATGAGTG 780 

GGCTGCGGCC ATGAATGCCG AGTTTGAAGC TGCTGAGCAG TTTGATCTCC TGGTTGAATG 840 

AGATGCAGTG GGGGGTGCAC CTGGCCAGAC TCTCCCTCCT GTCCTGTACA TAGCCACCTC 900 

CCTGTGGAGA GGACACTTAG GGTCCCCTCC CCTGGTCTTG TTACCTGTGT GTGTGCTGGT 960 

GCTGCGCATG AGGACTGTCT GCCTTTGAGG GCTTGGGCAG CAGCGGCAGC CATCTTGGTT 1020 

TTAGGAAATG GGGCCGCCTG GCCCAGCCAC TCACTGGTGT CCTGTCTCTT GTCGTCCTGT 1080

CCTTCCTATC TCCCCAAAGT ACCATAGCCA GTTTCCAGAT GGGCCACAGA CTGGGGAGGA 1140 

GAATCAGTGG CCCAGCCAGA AGTTAAAGGG CTGAGGGTTG AGGTGAGAGG CACCTCTGCT 1200 

CTTGTTGGGA GGGGTGGCTG CTTGGAAATA GGCCCAGGGG CTCTGCCAGC CTCGGCCTCT 1260 

CCCTCCTGAG TTGCCTTCTG TTGGTGGCTT TCTTCTTGAA CCCACCTGTG TAAAGAGGTT 1320 

TTCAGTTCCG TGGGTTTCCC CTTTGATTCT GTAAATAGTC CCAGAGAGAA TTCGTGGGCT 1380 

GAGGGCAATT CTGTCTTGGA GGAAGAAGCT GGACATTCAG CCTGTGGAGT CTGAGTTTTG 1440 

AAGGATGTAG GGAGCCTTAG TTGGGTCTCA GACCATAAGT GTGTACTACA CAGAAGCTGT 1500 

GTTTTCTAGT TCTGGTCTGC TGTTGAGATG TTTGGTAAAT GCCAGGTTGA TAGGGCGCTG 1560 

GCTGCTTGGA GCAAAGGGTG CATTTCAGGG TGTGGCCACC AGGTGCTGTG AGTTTCTGTG 1620 

GCTCATGGCC TCTGGGCTGG TCCCTTGCAC AGGGCCCACG CTGGAGTCTT ACCACTCTGC 1680 

TGCAGGGGTG GAAGGTGGCC CCTCTTGTCA CCCATACCCA TTTCTTACAA AATAAGTTAC 1740 

ACCGAGTCTA CTTGGCCCTA GAAGAGAAAG TTGAAGAGTC CCAGACCTAC TAGCATTTTG 1800 

CAACTATGCT TGTAAAGTCC TCGGAAAGTT TCCTCGCGTA CCAGACAGCG GCGGGGGCTG 1860 

ATAGCAATTT TAGTTTTTGG CCTCCCTATC CTCTCACATG AGAACACTGC CTGGATGCAT 1920 

CTCATGATCT CTGGAGAATT TCCCCATCTT TCTCTTCTTT CCATCGTGTG GATTCAATAG 1980 

TTTGGATTTG AAGGCTGCCC TGCCCCCGAC TCTCCTGCCG CACCCCTGGC CATTGTACCT 2040 

TTTGATGTTT AGAAGTTCGT GGAAGTAGAC GCTGAGGTGT GCAGAGGAGC TGGTGGATAA 2100 

CAGAGAATGC CAGGGAAGAT GAGTGCTGGG TCAGGGTACT TGGATGAAAC GGTGCAGGCC 2160 

AGGCGGGCCC TAATAAAACC CTCTGCCAGG TCTGGGAGTC CCAGGCCATC TGCTCAACGC 2220 



TCTGTGGTTT GTCAGACCTG CAAGCAAGCC CCCTGCTGGG GAAGCCTAGG TGTCCTTGAG 2280
CTGAACOGCA CTGAAGAACT CTTGTCCTCA CTGGCTGATG CAGCAGAACT CTTGGGAAAT 2340
GTCTTAGTCC TGCAGAATCA GGAGTCACCA GATGATGCAG AGTTGAGATC ATCATTGCAA 2400
AGTTCTCTGT TCCTGAGGAA CTAAATTTAA GGAAAAAATG GGATTTTGTT TTAGAGTTGG 2460
AAAAAAAGCC TGATTAAAGA GTTTCTGCCT GTTAAAAAAA AAAAAAAAAA AAAAAA

Seq ID NO: 97 Protein sequence: 
Protein Accession #: NP_542399.1 

1 11 21 31 41 51

| | | | | |

MSGRRTRSGG AAQRSGPRAP SPTKPLRRSQ RKSGSELPSI LPEIWPKTPS AAAVRKPIVL 60
KRIVAHAVEV PAVQSPRRSP RISFFLEKEN EPPGRELTKE DLFKTHSVPA TPTSTPVPNP 120
EAESSSKEGE LDARDLEMSK KVRRSYSRLE TLGSASTSTP GRRSCFGFEG LLGAEDLSGV 180
SPVVCSKLTE VPRVCAKPWA PDMTLPGISP PPEKQKRKKK KMPEILKTEL DEWAAAMNAE 240
FEAAEQFDLL VE

Seq ID NO: 98 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 58-12444 






1 
I 

GGGGCATTTC 
GCGGGCTCCG 
GCGGACCGCT 
TGCGTCCTGA 
GATTTCGGTT 
AGAGAAGAAA 
CCTTACTCTG 
AAATGTAAAA 
AGACTCATGG 
TTGAAAAAAA 
GGTGAAGTTC 
GGTGAACTTA 
GGATGTCTGA 
CCCCAGACTT 
CTGAAGAGAT 
TTTAGCACCT 
GCCCACACAA 
CAGGTTTCTA 
ATGGAGCAGT 
GCTATCCGTG 
GACTTCATGT 
ACTGGTGACG 
CTGTACCTTG 
CAGATAGACA 
GTGAAGGTGT 
GTGGTGCATC 
GAGTCTGAAT 
CCCACATACA 
GATTCTATTT 
CATTTACTTT 
ACACTTGAAA 
ATGATCCCAA 
GCTTTCATTA 
TTTGAACCAT 
CTCATCAGTG 
TATTTCGAGG 
TCTTGCTTTG 
AAAGATGAAC 
GAACTCGATG 
TATACCCCCT 
AGACATGTAA 
ACTTCAGCCT 
GCCCAGAAAG 
TCAAACGAAG 
CTAGGAGGAC 
AGCTATGTGG 
AAACCTGTCA 
AGTGACAGAC 
TTGGGCAAAG 
TATAAGCGGA 
CAACTGTATG 
GAAAGTCAGG 
GACAGTACTT 
AAGCAAATAA 
CGACTTTATA 
TTTAATAATA 
GAAGCCTTGG 
GGTACAATTC 
CATGTTTCTT 
TCATTGTGTT 
GAATGTCGAC 
AGATCCCCTA 
AACACCTTTG 
TTGTACCTTC 



11 
I 

CGGGTCCGGG 
GAGCCGGTGT 
GCGGTGCTGC 
GCAGCAGCCC 
TGCTTGTATT 
TCCTAAAGTT 
TTGAAATTAA 
TTCCAGCCCT 
ATGAATTTAA 
AAATACCAGA 
ATCCTAGTGA 
AGACCCAGAT 
AGGGGTTGTC 
CAAGGGAGAT 
ATGCTGTGCC 
GCCTTCTGGA 
ATGTAGAATT 
ATATGGTGGC 
TTTATGGAAT 
GATATGGACT 
ACGTTGAGCT 
ACCGTGTTTA 
ACACAGTTCC 
GTTTCCCACA 
TCCTAGCTTT 
AGGGTTTAAT 
CTGAAGACCA 
AAGACTACGT 
TAGCAGATGA 
ATGATGAATT 
TACAGACTGT 
CTTCAGATCC 
ACCTGGTGGA 
GGGTGTACTC 
GTTTCTACAA 
GAGTTAGTCC 
CTTTATTTGT 
TTTTGGCCTC 
TTAGAGCCTA 
TGGCAGAAGT 
TGCAGCCTTA 
TGTCAGATGA 
GATTTAATAA 
CAATATCCTT 
AAATAAACAA 
CCTGGGACAG 
TTTTCCTGGA 
AAACTAAAGT 
CCACGCAGAT 
CGTTTCCTGT 
AGCCACTAGT 
ATACTGTTGC 
TAAGAGATTT 
CACCACAGCA 
GCCTTGCGCT 
TCTACAGGGA 
TGATATACAT 
AACAGTGTTG 
TAAATAAAGC 
TATTGGATCT 
ACAAATCCAT 
ATTTGTGGCT 
AGGGGGGTGG 
GGGGGCCATT 



21 
I 

CCGAGCGGGC 
GCGTTGCTCC 
CCTGGCCGGT 
CGCGGTGCTG 
TGTCCGGAAG 
TTTATGTATT 
GAACACTTGT 
GGACCTTCTT 
AATTGGAGAA 
TACAGTTTTA 
GATGATAAAT 
GACATCAGCA 
CTCACTTCTG 
TTTTAATTTT 
CTCAGCTGGC 
CAACTACGTG 
GAAAAAAGCT 
GAAAAATGCA 
CATCAGAAAT 
TTTTGCAGGA 
CATTCAGCGC 
TCAGATGCCA 
TGAGGTGTAT 
GTACAGTCCA 
GGCAGCAAAA 
CAGAATATGT 
CCGTGCTTCA 
GGATCTCTTC 
AGCATTTTTC 
TGTAAAATCC 
TGGGGAACAA 
AGCGGCTAAC 
ATTTTGCAGA 
ATTTTCATAT 
ATTGCTTTCT 
AAAGAGTCTG 
GAAATTTGGC 
TTGTTTGACC 
CGTTCCTGCA 
AGGCCTGAAT 
TTACAAAGAC 
GACCAAGAAT 
AGTGGTGTTA 
AGAAGAAATA 
AAATCTTCTG 
AGAGAAGCGG 
TGTGTTCCTG 
TGCAGCCTGT 
GCCAGAAGGG 
GCTGCTTCGA 
TATGCAGCTG 
CTTACTAGAA 
TTGTGGTCGG 
GCAGGAGAAG 
TCACCCCAAT 
ATTCAGGGAA 
GGAGAGTCTG 
TGATGCCATT 
AAAGAAACGA 
GGTCAAGTGG 
TGAACTCTTT 
GAAAGATGTT 
CTGTGGCCAG 
CAGCCTGCAG 



31 
I 

GCACGCGCGG 
CTGCTGCGGC 
CATCAACTGA 
GCATTACAGA 
TCACTCAACA 
TTCTTAGAAA 
ACCAGTGTTT 
ATTAAGTTAC 
TTATTTAGTA 
GAAAAAGTAT 
AATGCAGAAA 
GTAAGAGAGC 
TGCAACTTCA 
GTACTAAAGG 
TTGCGCCTAT 
TCTCTATTTG 
GCACTTTCAG 
GAAATGCATA 
GTGGATTCGA 
CCGTGCAAGG 
TGCAAGCAGA 
AGCTTCCTCC 
ACTCCAGTTC 
AAAATGCAGC 
GGGCCAGTTC 
TCTAAACCAG 
GGGGAAGTCA 
AGACATCTCC 
TCTGTGAATT 
GTTTTGAAGA 
GAGAATGGAG 
TTGCATCCAG 
GAGATTCTCC 
GAATTAATTT 
ATTACAGTAA 
AAACACTCTC 
AAAGAGGTGG 
TTTCTTCTGT 
CTGCAGATGG 
GCTCTAGAAG 
ATTCTCCCCT 
AACTGGGAAG 
AAGCATCTGA 
AGAATTAGAG 
ACAGTCACGT 
CTGAGCTTTG 
CCTCGAGTCA 
GAACTTTTAC 
GGACAGGGAG 
CTTGCGTGTG 
ATTCACTGGT 
GCTATATTGG 
TGTATTCGAG 
AGTCCAGTAA 
GCTTTCAAGA 
GAAGAGTCTC 
GCCTTAGCAC 
GATCACCTAT 
CGTTTGCCGC 
CTTTTAGCTC 
TATAAATTCG 
CTCAAGGAAG 
CCCTCGGGCA 
GCCACGCTAT 



41 
I 

GAGCGGGACT 
TGCAGGAGAC 
TCCGCGGCCT 
CATCTTTAGT 
GTATTGAATT 
AAATGGGCCA 
ATACAAAAGA 
TTCAGACTTT 
AATTCTATGG 
ATGAGCTCCT 
ACCTGTTCCG 
CCAAACTACC 
CTAAGTCCAT 
CAATTCGTCC 
TTGCCCTGCA 
AAGTCTTGTT 
CCCTGGAATC 
AAAATAAACT 
ACAACAAGGA 
TTATAAACGC 
TGTTCCTCAC 
AGTCTGTTGC 
TGGAGCACCT 
TGGTGTGTTG 
TCAGGAATTG 
TGGTCCTTCC 
GAACTGGCAA 
TGAGCTCTGA 
CCTCCAGTGA 
TTGTTGAGAA 
ATGAGGCGCC 
CTAAACCTAA 
CTGAGAAACA 
TGCAATCTAC 
GAAATGCCAA 
CTGAAGACCC 
CAGTTAAAAT 
CCTTGCCACA 
CTTTCAAACT 
AATGGTCAAT 
GCCTGGATGG 
TGTCAGCTCT 
AGAAGACAAA 
TAGTACAAAT 
CCTCAGATGA 
CAGTGCCCTT 
CAGAATTAGC 
ATAGCATGGT 
CCCCACCCAT 
ATGTTGATCA 
TCACTAACAA 
ATGGAATTGT 
AATTCCTTAA 
ACACCAAATC 
GGCTGGGAGC 
TGGTGGAACA 
ATGCAGATGA 
GCCGCATCAT 
GAGGATTTCC 
ATTGTGGGAG 
TTCCTTTATT 
AAGGTGTCTC 
TCCTGGCCCA 
GCTGGCTGGA 



51 
I 

CGGCGGCATG 
CTTGTCCGCT 
GGGGCAGGAA 
TTTTTCCAGA 
TCGTGAATGT 
GAAGATCGCA 
TAGAGCTGCT 
TAGAAGTTCT 
AGAACTTGCA 
AGGATTATTG 
CGCTTTTCTG 
TGTTCTGGCA 
GGAAGAAGAT 
TCAGATTGAT 
TGCATCTCAG 
AAAGTGGTGT 
CTTTCTGAAA 
GCAGTACTTT 
GTTATCTATT 
AAAAGATGTT 
CCAGACAGAC 
AAGCGTCTTG 
CGTGGTGATG 
CAGAGCCATA 
CATTAGTACT 
AAAGGGCCCT 
ATGGAAGGTG 
CCAGATGATG 
AAGTCTGAAT 
ATTGGATCTT 
TGGTGTTTGG 
AGATTTTTCG 
AGCAGAATTT 
AAGGTTGCCC 
GAAAATAAAA 
AGAAAAGTAT 
GAAGCAGTAC 
CAACATCATT 
GGGCCTGAGC 
TTATATTGAC 
ATACCTGAAG 
TTCTCGGGCT 
GAACCTTTCA 
GCTTGGATCT 
GATGATGAAG 
TAGAGAGATG 
GCTCACAGCC 
TATGTTTATG 
GTACCAGCTC 
GGTGACAAGG 
CAAGAAATTT 
GGACCCTGTT 
ATGGTCCATT 
GCTTTTCAAG 
ATCACTTGCC 
GTTTGTGTTT 
GAAGTCCTTA 
TGAAAAGAAG 
ACCTTCCGCA 
GCCCCAGACA 
GCCAGGCAAC 
TTTTCTCATC 
GCCCACCCTC 
CCTGCTCCTG 



60 
120 
180 
240 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
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GCCGCGTTGG AGTGCTACAA CACGTTCATT 
CTAGGTACTG AAGCCCAGTC TTCACTTTTG 
GCCATGCATG ACATTATAGC AGCAGAAAAG 
ACAAGCCCAC AAGAGGGAGA AAGGTACAAC 
ATGGAGTTTA CCACGACTCT GCTAAACACC 
GACTTGTGTA ATACACACCT GATGAGAGTC 
ATAGGTTTCA ACATCGGAGA CGTCCAGGTT 
CTGATGAAAG CTCTAAAGAT GTCCCCATAC 
AAAATAACAG CACAGAGCAT TGAGGAGCTT 
CAAGTGGACA GGAGCAGGCT GGCTGCTGTT 
GGGCTTCTGC ATAATATATT ACCGTCTCAG 
GAACTTCTTT CCCTGGTTTA TAAAGGCATT 
TCTCTAGACC TCAGTTGTAA GCAGCTGGCC 
GGAGGACTGT GTGAGCGCCT TGTGAGTCTT 
TCCTTGGGCA GCTCACAGGG CAGCGTCATC 
TTGTTCTCAG AAACGATCAA CACGGAATTA 
CTCATGCAGT CTTCAGTGGA TAATACCAAA 
GACCAGAGCT TCAGGGAGCG AGCAAACCAG 
ATTCTGCAAC ACTGGAAGAA GTGTGATTCA 
AAAATGGCAG TGCTGGCCTT ACTGGCAAAA 
AATACAAGTC ATGGTTCATT CCCTGAAGTC 
ACAAAGCTGG ATCTACATTT AAAGGGCCAA 
CTCACTGGAG GCAGTCTGGA GGAACTTAGA 
TTCCCCATGC AGTCCAGGGA ATTTCCTCCA 
TGCATGAAAA AGTTTCTAGA TGCATTGGAA 
ATGACAGAAG TTCTTTGTCG GGAACAGCAG 
TTCAGGAGGA TTGCCAGAAG GGGTTCATGT 
TATG AAATGT TCAGGAAGGA TGACCCCCGC 
CGCTCCCTCC TCACTCTGCT GTGGCACTGT 
ACAATTGTGG TGGATGCCAT TGATGTGTTG 
ACCTTTGATA CTCAAATCAC CAAGAAGATG 
TCTCGGCTTC CCAAAGATGA TGTTCATGCT 
GGCTCGTGTA TTACAGAAGG AAATGAACTT 
GCATTTACAG AGAACATGGC AGGAGAGAAT 
TGTGCAGCAT ACAACTGCGC CATATCTGTC 
TACCAAGGTT TTCTGTTTAG TGAAAAACCA 
ATCGACCTGA AGCGCCGCTA TAATTTTCCT 
AAAAAGTACA TTGAAATTAG GAAAGAAGCC 
CCTTCCTATA TGTCTTCCCT GTCATATTTG 
CAATTTGATT TCTCAACCGG AGTTCAGAGC 
GCCACTGGTC GTTTTCGGAG ACGGGAGCAG 
GAGCTGGAGA TGGACGAGCT CAATCGGCAT 
AAGCACATGC ACAGAAGCCT GGGCCCGCCT 
CTTCCTTCTT GGATGAAATT CCTCCATGGC 
ATCCGTCTCT TCTTAGCCAA GCTTGTTATT 
AAGCACTGGC TTAGCCCCTT GCTGCAGCTG 
ATTCACTACA TGGTGGTTGA GATAGTGGCC 
CCAACAGGGG TCCCTAAAGA TGAAGTGTTA 
CATGTCTTTC ATCCAAAAAG AGCTGTGTTT 
GTCGAGTGCT GGAAGGATTG TTTATCCATC 
GGTAAAGATC CTAATTCTAA AGACAACTCA 
GCCAATGACC TGCCTCCCTA TGACCCACAG 
GCTTTGGTGA ATAATATGTC CTTTGTAAGA 
GTTCTAGGAC TTATACTTCG ATATGTTATG 
TGTGAACTGG TTGCGAAACA ATTGAAGCAA 
GTGTGCTTGA ACAAAGTGAC CAAGAGCTTC 
GTGTTCTTTC TGCTGCCAAA ATTTCATGGA 
CTTTGTCGTG TGGAGGGAAT GACAGAGCTG 
CAAGTCATGA GACATAGAGA TGATGAAAGA 
ATGATGCCAA AGTTAAAACC AGTAGAACTC 
GTTTCCCATC CTTCTACAAC ATGTAGGGAA 
GATAATTACA GAGATCCAGA AAGTGAGACA 
GCAAAAGATG TGCTGATTCA AGGATTGATC 
CGAAATTTCT GGAGCCATGA AACTAGGTTA 
CTAAATTCCT TATATTCTCC TAAGATAGAA 
CTGCTCGAAA TGACCAGCAT GAGCCCAGAT 
TCAGAATGCG AATTTCAGGA ATATACCATT 
CTCACTCCGA TGTTTGTGGA GACCCAGGCC 
GAAGGGTCCC TCTCAGCTCG CTGGCCAGTG 
CATGACTTCA CACTGACACA GACTGCAGAT 
AGCAGCACTG ACCCGCTGGT CGACCACACC 
CACAAGAGGA GTGAAAGGTT ACAGAGAGCA 
AAAAAAAGGC TGGGCCTTCC AGGGGACGAG 
CGGACGGACC TACTACGACT GCGCAGACGG 
ATGTATGCCA GAAAAGGCGT TGCTGAGCAA 
AAAATGAAGC AGGATGCCCA GGTCGTTCTG 
GACATTCAGA TCAAGCACAG CAGCCTCATC 
CCAATAATTG CAAAACAGCT CTTTAGCAGC 
AAATTTAAGA CACTGTCTGA AAAAAACAAC 
CGTTTTCTTA ATACCACCTT CTCTTTCTTT 
AGCTGTCAGC ACGCAGCCCT GCTGAGCCTC 
GCCAGCCTAC AGCAGCCCGT GGGCATCCGC 
CCTGCTGAGC TGCCTGCCAA GCGAGTCCGT 
AGATGGGTGG AGCTTGCTAA GCTGTATAGA 
ATTTTTACCA GTGAGATAGG AACAAAGCAA 
AGAAGTGATT ATTCTGAAGC TGCTAAGCAG 
GTAGATGGTG AGCCCACAGA AGCCGAGAAG 



GGCGAGAGAA CTGTAGGAGC GCTCCAGGTC 3900 

AAAGCAGTGG CTTTCTTCTT AGAAAGCATT 3960 

TGCTTTGGCA CTGGGGCAGC AGGTAACAGA 4020 

TACAGCAAAT GCACCGTTGT GGTCCGGATT 4080 

TCCCCGGAAG GATGGAAGCT CCTGAAGAAG 4140 

CTGGTGCAGA CGCTGTGTGA GCCCGCAAGC 4200 

ATGGCTCATC TTCCTGATGT TTGTGTGAAT 4260 

AAAGATATCC TAGAGACCCA TCTGAGAGAG 4320 

TGTGCCGTCA ACTTGTATGG CCCTGACGCG 4380 

GTGTCTGCCT GTAAACAGCT TCACAGAGCT 4440 

TCCACAGATT TGCATCATTC TGTTGGCACA 4500 

GCCCCTGGAG ATGAGAGACA GTGTC TGCCT 4560 

AGCGGACTTC TGGAGTTAGC CTTTGCTTTT 4620 

CTCCTGAACC CAGCGGTGCT GTCCACGGCG 4680. 

CACTTCTCCC ATGGGGAGTA TTTCTATAGC 4740 

TTGAAAAATC TGGATCTTGC TGTATTGGAG 4800 

ATGGTGAGTG CCGTTTTGAA CGGCATGTTA 4860 

AAACACCAAG GACTGAAACT TGCGACTACA 4920 

TGGTGGGCCA AAGATTCCCC TCTCGAAACT 4980 

ATTTTACAGA TTGATTCATC TGTATCTTTT 5040 

TTTACAACAT ATATTAGTCT ACTTGCTGAC 5100 

GCTGTCACTC TTCTTCCATT CTTCACCAGC 5160 

CGTGTTCTGG AGCAGCTCAT CGTTGCTCAC 5220 

GGAACTCCGC GGTTCAATAA TTATGTGGAC 5280 

TTATCTCAAA GCCCTATGTT GTTGGAATTG 5340 

CATGTCATGG AAGAATTATT TCAATCCAGT 5400 

GTCACACAAG TAGGCCTTCT GGAAAGCGTG 5460 

CTAAGTTTCA CACGCCAGTC CTTTGTGGAC 5520 

AGCCTGGATG CTTTGAGAGA ATTCTTCAGC 5580 

AAGTCCAGGT TTACAAAGCT AAATGAATCT 5640 

GGCTACTATA AGATTCTAGA CGTGATGTAT 5700 

AAGGAATCAA AAATTAATCA AGTTTTCCAT 5760 

ACAAAGACAT TGATTAAATT GTGCTACGAT 5820 

CAGCTGCTGG AGAGGAGAAG ACTTTACCAT 5880 

ATCTGCTGTG TCTTCAATGA GTTAAAATTT 5940 

GAAAAGAACT TGCTTATTTT TGAAAATCTG 6000 

GTAGAAGTTG AGGTTCCTAT GGAAAGAAAG 6060 

AGAGAAGCAG CAAATGGGGA TTCAGATGGT 6120 

GCAGACAGTA CCCTGAGTGA GGAAATGAGT 6180 

TATTCATACA GCTCCCAAGA CCCTAGACCT 6240 

CGGGACCCCA CGGTGCATGA TGATGTGCTG 6300 

GAGTGCATGG CGCCCCTGAC GGCCCTGGTC 6360 

CAAGGAGAAG AGGATTCAGT GCCAAGAGAT 6420 

AAACTGGGAA ATCCAATAGT ACCATTAAAT 6480 

AATACAGAAG AGGTCTTTCG CCCTTACGCG 6540 

GCTGCTTCTG AAAACAATGG AGGAGAAGGA 6600 

ACTATTCTTT CATGGACAGG CTTGGCCACT 6660 

GCAAATCGAT TGCTTAATTT CCTAATGAAA 6720 

AGACACAACC TTGAAATTAT AAAGACCCTT 6780 

CCTTATAGGT TAATATTTGA AAAGTTTTCC 6840 

GTAGGGATTC AATTGCTAGG CATCGTGATG 6900 

TGTGGCATCC AGAGTAGCGA ATACTTCCAG 6960 

TATAAAGAAG TGTATGCCGC TGCAGCAGAA 7020 

GAGAGAAAAA ACATACTGGA GGAGTCTCTG 7080 

CATCAGAATA CTATGGAGGA CAAGTTTATT 7140 

CCTCCTCTTG CAGACAGGTT CATGAATGCT 7200 

GTGTTGAAAA CACTCTGTCT GGAGGTGGTA 7260 

TACTTCCAGT TAAAGAGCAA GGACTTCGTT 7320 

CAAAAAGTAT GTTTGGACAT AATTTATAAG 7380 

CGAGAACTTC TGAACCCCGT TGTGGAATTC 7440 

CAAATGTATA ATATTCTCAT GTGGATTCAT 7500 

GATAATGACT CCCAGGAAAT ATTTAAGTTG 7560 

GATGAGAACC CTGGACTTCA ATTAATTATT 7620 

CCTTCAAATA CCTTGGACCG GTTGCTGGCA 7680 

GTGCACTTTT TAAGTTTAGC AACAAATTTT 7740 

TATCCAAACC CCATGTTCGA GCATCCTCTG 7800 

GATTCTGATT GGCGTTTCCG AAGTACTGTT 7860 

TCCCAGGGCA CTCTCCAGAC CCGTACCCAG 7920 

GCAGGGCAGA TAAGGGCCAC CCAGCAGCAG 7980 

GGAAGAAGCT CATTTGATTG GCTGACCGGG 8040 

AGTCCCTCAT CTGACTCCTT GCTGTTTGCC 8100 

CCCTTGAAGT CAGTGGGGCC TGATTTTGGG 8160 

GTGGATAACA AAGTGAAAGG TGCGGCCGGC 8220 

TTTATGAGGG ACCAGGAGAA GCTCAGTTTG 8280 

AAACGAGAGA AGGAAATCAA GAGTGAGTTA 8340 

TACAGAAGCT ACCGGCACGG AGACCTTCCT 8400 

ACCCCGTTAC AGGCCGTGGC CCAGAGGGAC 8460 

TTGTTTTCTG GAATTTTGAA AGAGATGGAT 8520 

ATCACTCAAA AGTTGCTTCA AGACTTCAAT 8580 

CCACCCTTTG TCTCTTGTAT TCAGGACATT 8640 

GACCCAGCGG CTGTTAGCGC TGGTTGCCTG 8700 

CTGCTAGAGG AGGCTCTGCT CCGCCTGCTG 8760 

GGGAAGGCCC GCCTCCCTCC TGATGTCCTC 8820 

TCAATTGGAG AATACGACGT CCTCCGTGGG 8880 

ATCACTCAGA GTGCATTATT AGCAGAAGCC 8940 

TATGATGAGG CTCTCAATAA ACAAGACTGG 9000 

GATTTTTGGG AACTTGCATC CCTTGACTGT 9060 
TACAACCACC TTGCTGAGTG GAAATCACTT GAATACTGTT CTACAGCCAG TATAGACAGT 9120
GAGAACCCCC CAGACCTAAA TAAAATCTGG AGTGAACCAT TTTATCAGGA AACATATCTA 9180 
CCTTACATGA TCCGCAGCAA GCTGAAGCTG CTGCTCCAGG GAGAGGCTGA CCAGTCCCTG 9240 
CTGACATTTA TTGACAAAGC TATGCACGGG GAGCTCCAGA AGGCGATTCT AGAGCTTCAT 9300 
TACAGTCAAG AGCTGAGTCT GCTTTACCTC CTGCAAGATG ATGTTGACAG AGCCAAATAT 9360 
TACATTCAAA ATGGCATTCA GAGTTTTATG CAGAATTATT CTAGTATTGA TGTCCTCTTA 9420 
CACCAAAGTA GACTCACCAA ATTGCAGTCT GTACAGGCTT TAACAGAAAT TCAGGAGTTC 9480 
ATCAGCTTTA TAAGCAAACA AGGCAATTTA TCATCTCAAG TTCCCCTTAA GAGACTTCTG 9540 
AACACCTGGA CAAACAGATA TCCAGATGCT AAAATGGACC CAATGAACAT CTGGGATGAC 9600 
ATCATCACAA ATCGATGTTT CTTTCTCAGC AAAATAGAGG AGAAGCTTAC CCCTCTTCCA 9660 
GAAGATAATA GTATGAATGT GGATCAAGAT GGAGACCCCA GTGACAGGAT GGAAGTGCAA 9720 
GAGCAGGAAG AAGATATCAG CTCCCTGATC AGGAGTTGCA AGTTTTCCAT GAAAATGAAG 9780 
ATGATAGACA GTGCCCGGAA GCAGAACAAT TTCTCACTTG CTATGAAACT ACTGAAGGAG 9840 
CTGCATAAAG AGTCAAAAAC CAGAGACGAT TGGCTGGTGA GCTGGGTGCA GAGCTACTGC 9900 
CGCCTGAGCC ACTGCCGGAG CCGGTCCCAG GGCTGCTCTG AGCAGGTGCT CACTGTGCTG 9960 
AAAACAGTCT CTTTGTTGGA TGAGAACAAC GTGTCAAGCT ACTTAAGCAA AAATATTCTG 10020 
GCTTTCCGTG ACCAGAACAT TCTCTTGGGT ACAACTTACA GGATCATAGC GAATGCTCTC 10080 
AGCAGTGAGC CAGCCTGCCT TGCTGAAATC GAGGAGGACA AGGCTAGAAG AATCTTAGAG 10140 
CTTTCTGGAT CCAGTTCAGA GGATTCAGAG AAGGTGATCG CGGGTCTGTA CCAGAGAGCA 10200 
TTCCAGCACC TCTCTGAGGC TGTGCAGGCG GCTGAGGAGG AGGCCCAGCC TCCCTCCTGG 10260 
AGCTGTGGGC CTGCAGCTGG GGTGATTGAT GCTTACATGA CGCTGGCAGA TTTCTGTGAC 10320 
CAACAGCTGC GCAAGGAGGA AGAGAATGCA TCAGTTATTG ATTCTGCAGA ACTGCAGGCG 10380 
TATCCAGCAC TTGTGGTGGA GAAAATGTTG AAAGCTTTAA AATTAAATTC CAATGAAGCC 10440 
AGATTGAAGT TTCCTAGATT ACTTCAGATT ATAGAACGGT ATCCAGAGGA GACTTTGAGC 10500 
CTCATGACAA AAGAGATCTC TTCCGTTCCC TGCTGGCAGT TCATCAGCTG GATCAGCCAC 10560 
ATGGTGGCCT TACTGGACAA AGACCAAGCC GTTGCTGTTC AGCACTCTGT GGAAGAAATC 10620 
ACTGATAACT ACCCGCAGGC TATTGTTTAT CCCTTCATCA TAAGCAGCGA AAGCTATTCC 10680 
TTCAAGGATA CTTCTACTGG TCATAAGAAT AAGGAGTTTG TGGCAAGGAT TAAAAGTAAG 10740 
TTGGATCAAG GAGGAGTGAT TCAAGATTTT ATTAATGCCT TAGATCAGCT CTCTAATCCT 10800 
GAACTGCTCT TTAAGGATTG GAGCAATGAT GTAAGAGCTG AACTAGCAAA AACCCCTGTA 10860 
AATAAAAAAA ACATTGAAAA AATGTATGAA AGAATGTATG CAGCCTTGGG TGACCCAAAG 10920 
GCTCCAGGCC TGGGGGCCTT TAGAAGGAAG TTTATTCAGA CTTTTGGAAA AGAATTTGAT 10980 
AAACATTTTG GGAAAGGAGG TTCTAAACTA CTGAGAATGA AGCTCAGTGA CTTCAACGAC 11040 
ATTACCAACA TGCTACTTTT AAAAATGAAC AAAGACTCAA AGCCCCCTGG GAATCTGAAA 11100 
GAATGTTCAC CCTGGATGAG CGACTTCAAA GTGGAGTTCC TGAGAAATGA GCTGGAGATT 11160 
CCCGGTCAGT ATGACGGTAG GGGAAAGCCA TTGCCAGAGT ACCACGTGCG AATCGCCGGG 11220 
TTTGATGAGC GGGTGACAGT CATGGCGTCT CTGCGAAGGC CCAAGCGCAT CATCATCCGT 11280 
GGCCATGACG AGAGGGAACA CCCTTTCCTG GTGAAGGGTG GCGAGGACCT GCGGCAGGAC 11340 
CAGCGCGTGG AGCAGCTCTT CCAGGTCATG AATGGGATCC TGGCCCAAGA CTCCGCCTGC 11400
AGCCAGAGGG CCCTGCAGCT GAGGACCTAT AGCGTTGTGC CCATGACCTC CAGGTTAGGA 11460 
TTAATTGAGT GGCTTGAAAA TACTGTTACC TTGAAGGACC TTCTTTTGAA CACCATGTCC 11520 
CAAGAGGAGA AGGCGGCTTA CCTGAGTGAT CCCAGGGCAC CGCCGTGTGA ATATAAAGAT 11580 
TGGCTGACAA AAATGTCAGG AAAACATGAT GTTGGAGCTT ACATGCTAAT GTATAAGGGC 11640 
GCTAATCGTA CTGAAACAGT CACGTCTTTT AGAAAACGAG AAAGTAAAGT GCCTGCTGAT 11700 
CTCTTAAAGC GGGCCTTCGT GAGGATGAGT ACAAGCCCTG AGGCTTTCCT GGCGCTCCGC 11760 
TCCCACTTCG CCAGCTCTCA CGCTCTGATA TGCATCAGCC ACTGGATCCT CGGGATTGGA 11820 
GACAGACATC TGAACAACTT TATGGTGGCC ATGGAGACTG GCGGCGTGAT CGGGATCGAC 11880 
TTTGGGCATG CGTTTGGATC CGCTACACAG TTTCTGCCAG TCCCTGAGTT GATGCCTTTT 11940 
CGGCTAACTC GCCAGTTTAT CAATCTGATG TTACCAATGA AAGAAACGGG CCTTATGTAC 12000 
AGCATCATGG TACACGCACT CCGGGCCTTC CGCTCAGACC CTGGCCTGCT CACCAACACC 12060 
ATGGATGTGT TTGTCAAGGA GCCCTCCTTT GATTGGAAAA ATTTTGAACA GAAAATGCTG 12120 
AAAAAAGGAG GGTCATGGAT TCAAGAAATA AATGTTGCTG AAAAAAATTG GTACCCCCGA 12180 
CAGAAAATAT GTTACGCTAA GAGAAAGTTA GCAGGTGCCA ATCCAGCAGT CATTACTTGT 12240 
GATGAGCTAC TCCTGGGTCA TGAGAAGGCC CCTGCCTTCA GAGACTATGT GGCTGTGGCA 12300 
CGAGGAAGCA AAGATCACAA CATTCGTGCC CAAGAACCAG AGAGTGGGCT TTCAGAAGAG 12360 
ACTCAAGTGA AGTGCCTGAT GGACCAGGCA ACAGACCCCA ACATCCTTGG CAGAACCTGG 12420 
GAAGGATGGG AGCCCTGGAT GTGAGGTCTG TGGGAGTCTG CAGATAGAAA GCATTACATT 12480 
GTTTAAAGAA TCTACTATAC TTTGGTTGGC AGCATTCCAT GAGCTGATTT TCCTGAAACA 12540 
CTAAAGAGAA ATGTCTTTTG TGCTACAGTT TCGTAGCATG AGTTTAAATC AAGATTATGA 12600 
TGAGTAAATG TGTATGGGTT AAATCAAAGA TAAGGTTATA GTAACATCAA AGATTAGGTG 12660 
AGGTTTATAG AAAGATAGAT ATCCAGGCTT ACCAAAGTAT TAAGTCAAGA ATATAATATG 12720 
TGATCAGCTT TCAAAGCATT TACAAGTGCT GCAAGTTAGT GAAACAGCTG TCTCCGTAAA 12780 
TGGAGGAAAT GTGGGGAAGC CTTGGAATGC CCTTCTGGTT CTGGCACATT GGAAAGCACA 12840 
CTCAGAAGGC TTCATCACCA AGATTTTGGG AGAGTAAAGC TAAGTATAGT TGATGTAACA 12900 
TTGTAGAAGC AGCATAGGAA CAATAAGAAC AATAGGTAAA GCTATAATTA TGGCTTATAT 12960 
TTAGAAATGA CTGCATTTGA TATTTTAGGA TATTTTTCTA GGTTTTTTCC TTTCATTTTA 13020 
TTCTCTTCTA GTTTTGACAT TTTATGATAG ATTTGCTCTC TAGAAGGAAA CGTCTTTATT 13080 
TAGGAGGGCA AAAATTTTGG TCATAGCATT CACTTTTGCT ATTCCAATCT ACAACTGGAA 13140 
GATACATAAA AGTGCTTTGC ATTGAATTTG GGATAACTTC AAAAATCCCA TGGTTGTTGT 13200 
TAGGGATAGT ACTAAGCATT TCAGTTCCAG GAGAATAAAA GAAATTCCTA TTTGAAATGA 13260 
ATTCCTCATT TGGAGGAAAA AAAGCATGCA TTCTAGCACA ACAAGATGAA ATTATGGAAT 13320 
ACAAAAGTGG CTCCTTCCCA TGTGCAGTCC CTGTCCCCCC CCGCCAGTCC TCCACACCCA 13380 
AACTGTTTCT GATTGGCTTT TAGCTTTTTG TTGTTTTTTT TTTTCCTTCT AACACTTGTA 13440 
TTTGGAGGCT CTTCTGTGAT TTTGAGAAGT ATACTCTTGA GTGTTTAATA AAGTTTTTTT 13500 
CCAAAAGTA 

Seq ID NO: 99 Protein sequence: 
Protein Accession NP_008835.S 

1 11 21 31 41 51 

I I I I I I 

MAGSGAGVRC SLLRLQETLS AADRCGAALA GHQLIRGLGQ ECVLSSSPAV LALQTSLVPS 60 
RDFGLLVFVR KSLNSIEFRE CREEILKFLC IFLEKMGQKI APYSVEIKNT CTSVYTKDRA 120 
AKCKIPALDL LIKLLQTFRS SRLMDEFKIG ELFSKFYGEL ALKKKIPDTV LEKVYEKUGL 180 
LGEVHPSEMI NNAENLPRAF LGELKTQMTS AVREPKLPVL AGCLKGLSSL LCNFTKSMEB 240 
DPQTSREIFN FVLKAIRPQI DLKRYAVPSA GLRLFALHAS QFSTCLLDNY VSLFEVLLKW 300 
CAHTNVELKK AALSALESFL KQVSNMVAKN AEMHKNKLQY FMEQFYGIIR NVDSNNKELS 360 




IAIRGYGLFA GPCKVINAKD VDFMYVELIQ RCKQMFLTQT DTGDDRVYQM PSFLQSVASV 420 

LLYLDTVPEV YTPVLEHLW MQIDSPPQYS PKMQLVCCRA IVKVFLALAA KGPVLRNCIS 480 

TWHQGLIRI CSKPWLPKG PESESEDHRA SGBVRTGKWK VPTYKDYVDL FRHLLSSDQM 540 

MDSILADEAF FSVNSSSESL NHLLYDEFVK SVLKIVEKLD LTLEIQTVGE QENGDEAPGV 600 

WMIPTSDPAA NLHPAKPKDF SAPINLVEFC REILPEKQAE FFEPWVYSFS YELILQSTRL 660 

PLISGFYKLL SITVRNAKKI KYFEGVSPKS LKHSPEDPEK YSCFALPVKF GKEVAVKMKQ 720 

YKDELLASCL TFLLSLPHNI IELDVRAYVP ALQMAFKLGL SYTPLAEVGL NALEEWSIYI 780 

DRHVMQPYYK DILPCLDGYL KTSALSDETK NNWEVSALSR AAQKGFNKW LKHLKKTKNL 840 

SSNEAISLEE IRIRWQMLG SLGGQINKNL LTVTSSDEMM KSYVAWDREK RLSFAVPFRE 900 

MKPVIFLDVF LPRVTELALT ASDRQTKVAA CELLHSMVMF MLGKATQMPE GGQGAPPMYQ 960 

LYKRTPPVLL RLACDVDQVT RQLYEPLVMQ LIHWFTNNKK FESQDTVALL EAILDGIVDP 1020 

VDSTLRDFCG RCIREFLKWS IKQITPQQQE KSPVNTKSLF KRLYSLALHP NAFKRLGASL 1080 

AFNNIYREFR EEESLVEQFV FEAIiVIYMES LALAHADEKS LGTIQQCCDA IDHLCRIIEK 1140 

KHVSLNKAKK RRLPRGFPPS ASLCLLDLVK WLLAHCGRPQ TECRHKSIEL FYKFVPLLPG 1200 

NRSPNLWLKD VLKEEGVSFL INTFEGGGCG QPSGILAQPT LLYLRGPFSL QATLCWLDLL 1260 

LAALECYNTF IGERTVGALQ VLGTEAQSSL LKAVAFFLES IAMHDIIAAE KCFGTGAAGN 1320 

RTSPQEGERY NYSKCTVWR IMEFTTTLLN TSPEGWKLLK KDLCNTHLMR VLVQTLCEPA 1380 

SIGFNIGDVQ VMAHLPDVCV NLMKALKMSP YKDILETHLR EKITAQSIEE LCAVNLYGPD 1440 

AQVDRSRLAA WSACKQLHR AGLLHNILPS QSTDLHHSVG TELLSLVYKG IAPGDERQCL 1500 

PSLDLSCKQL ASGLLELAFA FGGLCERLVS LtiLNPAVLST ASLGSSQGSV IHFSHGEYFY 1560 

SLFSETINTE LLKNLDLAVL ELMQSSVT5NT KMVSAVLNGM LDQSFRERAN QKHQGLKLAT 1620 

TILQHWKKCD SWWAKDSPLE TKMAVLALLA KILQIDSSVS FNTSHGSFPE VFTTYISLLA 1680 

DTKLDLHLKG QAVTLLPFFT SI/TGGSLEEL RRVLEQLIVA HFPMQSREFP PGTPRFNNYV 1740 

DCMKKFLDAL ELSOSPMLLE LMTEVLCREQ QHVMEELFQS SFRRIARRGS CVTQVGLLES 1800 

VYEMFRKDDP RLSFTRQSFV DRSLLTLLWH CSLDALREFP STIWDAIDV LKSRFTKLNE 1860 

STFDTQITKK MGYYKILDVM YSRLPKDDVH AKESKINQVF HGSCITEGNE LTKTLIKLCY 1920 

DAFTENMAGE NQLLERRRLY HCAAYNCAIS VICCVFNELK FYQGFLFSEK PEKNLLIFEN 1980 

LIDLKRRYNF PVEVEVPMER KKKYIEIRKE AREAANGDSD GPSYMSSLSY LADSTLSEEM 2040 

SQFDFSTGVQ SYSYSSQDPR PATGRFRRRE QRDPTVHDDV LELEMDELNR HECMAPLTAL 2100 

VKHMHRSLGP PQGEEDSVPR DLPSWMKFLH GKLGNPIVPL NIRLFLAKLV INTEEVFRPY 2160 

AKHWLSPLLQ LAASENNGGE GIHYMWEIV ATILSWTGLA TPTGVPKDEV LANRLLNFLM 2220 

KHVFHPKRAV FRHNLEIIKT LVECWKDCLS IPYRLIFEKF SGKDPNSKDN SVGIQLLGIV 2280 

MANDLPPYDP QCGIQSSEYF QALVNNMSFV RYKEVYAAAA EVLGLILRYV MERKNILEES 2340 

LCELVAKQLK QHQNTMEDKF IVCLNKVTKS FPPLADRFMN AVFFLLPKFH GVLKTLCLEV 2400 

VLCRVEGMTE LYFQLKSKDF VQVMRHRDDE RQKVCLDIIY KMMPKLKPVE LRELLNPVVE 2460

FVSHPSTTCR EQMYNILMWI HDNYRDPESE TDNDSQEIFK LAKDVLIQGL IDENPGLQLI 2520 

IRNFWSHETR LPSNTLDRLL ALMSLYSPKI EVHFLSLATN FLLEMTSMSP DYPNPMFEHP 2580

LSECEFQEYT IDSDWRFRST VLTPMFVETQ ASQGTLQTRT QEGSLSARWP VAGQIRATQQ 2640 

QHDFTLTQTA DGRSSFDWLT GSSTDPLVDH TSPSSDSLLF AHKRSERLQR APLKSVGPDF 2700 

GKKRLGLPGD EVDNKVKGAA GRTDLLRLRR RFMRDQEKLS LMYARKGVAE QKREKEIKSE 2760 

LKMKQDAQW LYRSYRHGDL PDIQIKHSSL ITPLQAVAQR DPIIAKQLFS SLFSGILKEM 2820 

DKFKTLSEKN NITQKLLQDF NRFLNTTFSF FPPFVSCIQD ISCQHAALLS LDPAAVSAGC 2880 

LASLQQPVGI RLLEEALLRL LPAELPAKRV RGKARLPPDV LRWVELAKLY RSIGEYDVLR 2940 

GIFTSE1GTK QITQSALLAE ARSDYSEAAK QYDEALNKQD WVDGEPTEAE KDFWELASLD 3000 

CYNHLAEWKS LEYCSTASID SENPPDLNKI WSEPFYQETY LPYMIRSKLK LLLQGEADQS 3060 

LLTFIDKAMH GELQKAILEL HYSQELSLLY LLQDDVDRAK YYIQNGIQSF MQNYSSIDVL 3120 

LHQSRLTKLQ SVQALTEIQE FISFISKQGN LSSQVPLKRL LNTWTNRYPD AKMDPMNIWD 3180 

DIITNRCFFL SKIEEKLTPL PEDNSMNVDQ DGDPSDRMEV QEQEEDISSL IRSCKFSMKM 3240 

KMIDSARKQN NFSLAMKLLK ELHKESKTRD DWLVSWVQSY CRLSHCRSRS QGCSEQVLTV 3300 

LKTVSLLDEN NVSSYLSKNI LAFRDQNILL GTTYRIIANA LSSEPACLAE IEEDKARRIL 3360

ELSGSSSEDS EKVIAGLYQR AFQHLSEAVQ AAEEEAQPPS WSCGPAAGVI DAYMTLADFC 3420 

DQQLRKEEEN ASVIDSAELQ AYPALVVEKM LKALKLNSNE ARLKFPRLLQ IIERYPEETL 3480

SLMTKEISSV PCWQFISWIS HMVALLDKDQ AVAVQHSVEE ITDNYPQAIV YPFIISSESY 3540 

SFKDTSTGHK NKEFVARIKS KLDQGGVIQD FINALDQLSN PELLFKDWSN DVRAELAKTP 3600 

VNKKNIEKMY ERMYAALGDP KAPGLGAFRR KFIQTFGKEF DKHFGKGGSK LLRMKLSDFN 3660 

DITNMLLLKM NKDSKPPGNL KECSPWMSDF KVEFLRNELE IPGQYDGRGK PLPEYHVRIA 3720

GFDERVTVMA SLRRPKRIII RGHDEREHPF LVKGGEDLRQ DQRVEQLFQV MNGILAQDSA 3780 

CSQRALQLRT YSVVPMTSRL GLIEWLENTV TLKDLLLNTM SQEEKAAYLS DPRAPPCEYK 3840

DWLTKMSGKH DVGAYMLMYK GANRTETVTS FRKRESKVPA DLLKRAFVRM STSPEAFLAL 3900 

RSHFASSHAL ICISHWILGI GDRHLNNFMV AMETGGVIGI DFGHAFGSAT QFLPVPELMP 3960 

FRLTRQFINL MLPMKETGLM YSIMVHALRA FRSDPGLLTN TMDVFVKEPS FDWKNFEQKM 4020

LKKGGSWIQE INVAEKNWYP RQKICYAKRK LAGANPAVIT CDELLLGHEK APAFRDYVAV 4080 
ARGSKDHNIR AQEPESGLSE ETQVKCLMDQ ATDPNILGRT WEGWEPWM 

Seq ID NO: 100 DNA sequence 
Nucleic Acid Accession #: NM_000673 
Coding sequence: 101-1225 

1 11 21 31 41 51

I I I I I I

ATGTGAAGGC ACAAGCTGCT GTTATATACA ACAGAGTGAA CTGAGCATCA GTCAGAAAAA 60 

GTCTATGTTT GCAGAAATAC AGATCCAAGA CAAAGACAGG ATGGGCACTG CTGGAAAAGT 120 

TATTAAATGC AAAGCAGCTG TGCTTTGGGA GCAGAAGCAA CCCTTCTCCA TTGAGGAAAT 180 

AGAAGTTGCC CCACCAAAGA CTAAAGAAGT TCGCATTAAG ATTTTGGCCA CAGGAATCTG 240 

TCGCACAGAT GACCATGTGA TAAAAGGAAC AATGGTGTCC AAGTTTCCAG TGATTGTGGG 300 

ACATGAGGCA ACTGGGATTG TAGAGAGCAT TGGAGAAGGA GTGACTACAG TGAAACCAGG 360 

TGACAAAGTC ATCCCTCTCT TTCTGCCACA ATGTAGAGAA TGCAATGCTT GTCGCAACCC 420 

AGATGGCAAC CTTTGCATTA GGAGCGATAT TACTGGTCGT GGAGTACTGG CTGATGGCAC 480 

CACCAGATTT ACATGCAAGG GCAAACCAGT ACACCACTTC ATGAACACCA GTACATTTAC 540 

CGAGTACACA GTGGTGGATG AATCTTCTGT TGCTAAGATT GATGATGCAG CTCCTCCTGA 600 

GAAAGTCTGT TTAATTGGCT GTGGGTTTTC CACTGGATAT GGCGCTGCTG TTAAAACTGG 660 

CAAGGTCAAA CCTGGTTCCA CTTGCGTCGT CTTTGGCCTG GGAGGAGTTG GCCTGTCAGT 720 

CATCATGGGC TGTAAGTCAG CTGGTGCATC TAGGATCATT GGGATTGACC TCAACAAAGA 780 

CAAATTTGAG AAGGCCATGG CTGTAGGTGC CACTGAGTGT ATCAGTCCCA AGGACTCTAC 840 

CAAACCCATC AGTGAGGTGC TGTCAGAAAT GACAGGCAAC AACGTGGGAT ACACCTTTGA 900 

AGTTATTGGG CATCTTGAAA CCATGATTGA TGCCCTGGCA TCCTGCCACA TGAACTATGG 960 

GACCAGCGTG GTTGTAGGAG TTCCTCCATC AGCCAAGATG CTCACCTATG ACCCGATGTT 1020 
GCTCTTCACT GGACGCACAT GGAAGGGATG TGTCTTTGGA GGTTTGAAAA GCAGAGATGA 1080

TGTCCCAAAA CTAGTGACTG AGTTCCTGGC AAAGAAATTT GACCTGGACC AGTTGATAAC 1140

TCATGTTTTA CCATTTAAAA AAATCAGTGA AGGATTTGAG CTGCTCAATT CAGGACAAAG 1200

CATTCGAACG GTCCTGACGT TTTGAGATCC AAAGTGGCAG GAGGTCTGTG TTGTCATGGT 1260

GAACTGGAGT TTCTCTTGTG AGAGTTCCCT CATCTGAAAT CATGTATCTG TCTCACAAAT 1320

ACAAGCATAA GTAGAAGATT TGTTGAAGAC ATAGAACCCT TATAAAGAAT TATTAACCTT 1380

TATAAACATT TAAAGTCTTG TGAGCACCTG GGAATTAGTA TAATAACAAT GTTAATATTT 1440

TTGATTTACA TTTTGTAAGG CTATAATTGT ATCTTTTAAG AAAACATACA CTTGGATTTC 1500

TATGTTGAAA TGGAGATTTT TAAGAGTTTT AACCAGCTGC TGCAGATATA TAACTCAAAA 1560

CAGATATAGC GTATAAAGAT ATAGTAAATG CATCTCCCAG AGTAATATTC ACTTAACACA 1620

TTGAAACTAT TATTTTTTAG ATTTGAATAT AAATGTATTT TTTAAACACT TGTTATGAGT 1680

TAACTTGGAT TACATTTTGA AATCAGTTCA TTCCATGATG CATATTACTG GATTAGATTA 1740

AGAAAGACAG AAAAGATTAA GGGACGGGCA CATTTTTCAA CGATTAAGAA TCATCATTAC 1800

ATAACTTGGT GAAACTGAAA AAGTATATCA TATGGGTACA CAAGGCTATT TGCCAGCATA 1860

TATTAATATT TTAGAAAATA TTCCTTTTGT AATACTGAAT ATAAACATAG AGCTAGAGTC 1920

ATATTATCAT ACTTATCATA ATGTTCAATT TGATACAGTA GAATTGCAAG TCCCTAAGTC 1980

CCTATTCACT GTGCTTAGTA GTGACTCCAT TTAATAAAAA GTGTTTTTAG TTTTTAACAA 2040
CTAAACCG



Seq ID NO: 101 Protein sequence: 
Protein Accession #: NP_000664 

1 11 21 31 41 51 

I I I I I I 

MGTAGKVIKC KAAVLWEQKQ PFSIEEIEVA PPKTKEVRIK ILATGICRTD DHVIKGTMVS 60 
KFPVIVGHEA TGIVESIGEG VTTVKPGDKV IPLFLPQCRE CNACRNPDGN LCIRSDITGR 120 
GVLADGTTRF TCKGKPVHHF MNTSTFTEYT VVDESSVAKI DDAAPPEKVC LIGCGFSTGY 180
GAAVKTGKVK PGSTCVVFGL GGVGLSVIMG CKSAGASRII GIDLNKDKFE KAMAVGATEC 240
ISPKDSTKPI SEVLSEMTGN NVGYTFEVIG HLETMIDALA SCHMNYGTSV VVGVPPSAKM 300
LTYDPMLLFT GRTWKGCVFG GLKSRDDVPK LVTEFLAKKF DLDQLITHVL PFKKISEGFE 360
LLNSGQSIRT VLTF 

Seq ID NO: 102 DNA sequence 

Nucleic Acid Accession #: NM_006783.1

Coding sequence: 1-786

1 11 21 31 41 51 

I I I I I I 

ATGGATTGGG GGACGCTGCA CACTTTCATC GGGGGTGTCA ACAAACACTC CACCAGCATC 60 

GGGAAGGTGT GGATCACAGT CATCTTTATT TTCCGAGTCA TGATCCTAGT GGTGGCTGCC 120 

CAGGAAGTGT GGGGTGACGA GCAAGAGGAC TTCGTCTGCA ACACACTGCA ACCGGGATGC 180 

AAAAATGTGT GCTATGACCA CTTTTTCCCG GTGTCCCACA TCCGGCTGTG GGCCCTCCAG 240 

CTGATCTTCG TCTCCACCCC AGCGCTGCTG GTGGCCATGC ATGTGGCCTA CTACAGGCAC 300 

GAAACCACTC GCAAGTTCAG GCGAGGAGAG AAGAGGAATG ATTTCAAAGA CATAGAGGAC 360 

ATTAAAAAGC ACAAGGTTCG GATAGAGGGG TCGCTGTGGT GGACGTACAC CAGCAGCATC 420 

TTTTTCCGAA TCATCTTTGA AGCAGCCTTT ATGTATGTGT TTTACTTCCT TTACAATGGG 480 

TACCACCTGC CCTGGGTGTT GAAATGTGGG ATTGACCCCT GCCCCAACCT TGTTGACTGC 540 

TTTATTTCTA GGCCAACAGA GAAGACCGTG TTTACCATTT TTATGATTTC TGCGTCTGTG 600 

ATTTGCATGC TGCTTAACGT GGCAGAGTTG TGCTACCTGC TGCTGAAAGT GTGTTTTAGG 660 

AGATCAAAGA GAGCACAGAC GCAAAAAAAT CACCCCAATC ATGCCCTAAA GGAGAGTAAG 720 

CAGAATGAAA TGAATGAGCT GATTTCAGAT AGTGGTCAAA ATGCAATCAC AGGTTTCCCA 780 
AGCTAA 



Seq ID NO: 103 Protein sequence: 
Protein Accession #: NP_006774.1 



1 11 21 31 41 51 

I I I I I I

MDWGTLHTFI GGVNKHSTSI GKVWITVIFI FRVMILVVAA QEVWGDEQED FVCNTLQPGC 60
KNVCYDHFFP VSHIRLWALQ LIFVSTPALL VAMHVAYYRH ETTRKFRRGE KRNDFKDIED 120
IKKHKVRIEG SLWWTYTSSI FFRIIFEAAF MYVFYFLYNG YHLPWVLKCG IDPCPNLVDC 180
FISRPTEKTV FTIFMISASV ICMLLNVAEL CYLLLKVCFR RSKRAQTQKN HPNHALKESK 240
QNEMNELISD SGQNAITGFP S 



Seq ID NO: 104 DNA sequence 
Nucleic Acid Accession #: NM_020411 
Coding sequence: 86-526 



1 11 21 31 41 51 

I I I I I I

GGACCTGGGA AGGAGCATAG GACAGGGCAA GGCGGGATAA GGAGGGGCAC CACAGCCCTT 60 

AAGGCACGAG GGAACCTCAC TGCGCATGCT CCTTTGGTGC CCACCTCAGT GCGCATGTTC 120 

ACTGGGCGTC TTCCCATCGG CCCCTTCGCC AGTGTGGGGA ACGCGGCGGA GCTGTGAGCC 180 

GGCGACTCGG GTCCCTGAGG TCTGGATTCT TTCTCCGCTA CTGAGACACG GCGGACACAC 240 

ACAAACACAG AACCACACAG CCAGTCCCAG GAGCCCAGTA ATGGAGAGCC CCAAAAAGAA 300 

GAACCAGCAG CTGAAAGTCG GGATCCTACA CCTGGGCAGC AGACAGAAGA AGATCAGGAT 360 

ACAGCTGAGA TCCCAGTGCG CGACATGGAA GGTGATCTGC AAGAGCTGCA TCAGTCAAAC 420 

ACCGGGGATA AATCTGGATT TGGGTTCCGG CGTCAAGGTG AAGATAATAC CTAAAGAGGA 480 

ACACTGTAAA ATGCCAGAAG CAGGTGAAGA GCAACCACAA GTTTAAATGA AGACAAGCTG 540 

AAACAACGCA AGCTGGTTTT ATATTAGATA TTTGACTTAA ACTATCTCAA TAAAGTTTTG 600 
CAGCTTTCAC CAAAAAAAAA AAAAAA 



Seq ID NO: 105 Protein sequence: 
Protein Accession #: NP_065144.1 

1 11 21 31 41 51 



I I I I I I 

MLLWCPPQCA CSLGVFPSAP SPVWGTRRSC EPATRVPEVW ILSPLLRHGG HTQTQNHTAS 60
PRSPVMESPK KKNQQLKVGI LHLGSRQKKI RIQLRSQCAT WKVICKSCIS QTPGINLDLG 120
SGVKVKIIPK EEHCKMPEAG EEQPQV 

Seq ID NO: 106 DNA sequence 
Nucleic Acid Accession #: J04129
Coding sequence: 99-587 



1 11 21 31 41 51

I I I I I I

CATCCCTCTG GCTCCAGAGC TCAGAGCCAC CCACAGCCGC AGCCATGCTG TGCCTCCTGC 60

TCACCCTGGG CGTGGCCCTG GTCTGTGGTG TCCCGGCCAT GGACATCCCC CAGACCAAGC 120

AGGACCTGGA GCTCCCAAAG TTGGCAGGGA CCTGGCACTC CATGGCCATG GCGACCAACA 180

ACATCTCCCT CATGGCGACA CTGAAGGCCC CTCTGAGGGT CCACATCACC TCACTGTTGC 240

CCACCCCCGA GGACAACCTG GAGATCGTTC TGCACAGATG GGAGAACAAC AGCTGTGTTG 300

AGAAGAAGGT CCTTGGAGAG AAGACTGGGA ATCCAAAGAA GTTCAAGATC AACTATACGG 360

TGGCGAACGA GGCCACGCTG CTCGATACTG ACTACGACAA TTTCCTGTTT CTCTGCCTAC 420

AGGACACCAC CACCCCCATC CAGAGCATGA TGTGCCAGTA CCTGGCCAGA GTCCTGGTGG 480

AGGACGATGA GATCATGCAG GGATTCATCA GGGCTTTCAG GCCCCTGCCC AGGCACCTAT 540

GGTACTTGCT GGACTTGAAA CAGATGGAAG AGCCGTGCCG TTTCTAGCTC ACCTCCGCCT 600

CCAGGAAGAC CAGACTCCCA CCCTTCCACA CCTCCAGAGC AGTGGGACTT CCTCCTGCCC 660

TTTCAAAGAA TAACCACAGC TCAGAAGACG ATGACGTGGT CATCTGTGTC GCCATCCCCT 720

TCCTGCTGCA CACCTGCACC ATTGCCATGG GGAGGCTGCT CCCTGGGGGC AGAGTCTCTG 780
GCAGAGGTTA TTAATAAACC CTTGGAGCAT G



Seq ID NO: 107 Protein sequence: 
Protein Accession #: AAA60147



1 11 21 31 41 51

I I I I I I

MDIPQTKQDL ELPKLAGTWH SMAMATNNIS LMATLKAPLR VHITSLLPTP EDNLEIVLHR 60
WENNSCVEKK VLGEKTGNPK KFKINYTVAN EATLLDTDYD NFLFLCLQDT TTPIQSMMCQ 120
YLARVLVEDD EIMQGFIRAF RPLPRHLWYL LDLKQMEEPC RF 

Seq ID NO: 108 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 48-794 



1 11 21 31 41 51

I I I I I I

TCCCAGGCAG CAGTTAGCCC GCCGCCCGCC TGTGTGTCCC CAGAGCCATG GAGAGAGCCA 60

GTCTGATCCA GAAGGCCAAG CTGGCAGAGC AGGCCGAACG CTATGAGGAC ATGGCAGCCT 120

TCATGAAAGG CGCCGTGGAG AAGGGCGAGG AGCTCTCCTG CGAAGAGCGA AACCTGCTCT 180

CAGTAGCCTA TAAGAACGTG GTGGGCGGCC AGAGGGCTGC CTGGAGGGTG CTGTCCAGTA 240

TTGAGCAGAA AAGCAACGAG GAGGGCTCGG AGGAGAAGGG GCCCGAGGTG CGTGAGTACC 300

GGGAGAAGGT GGAGACTGAG CTCCAGGGCG TGTGCGACAC CGTGCTGGGC CTGCTGGACA 360

GCCACCTCAT CAAGGAGGCC GGGGACGCCG AGAGCCGGGT CTTCTACCTG AAGATGAAGG 420

GTGACTACTA CCGCTACCTG GCCGAGGTGG CCACCGGTGA CGACAAGAAG CGCATCATTG 480

ACTCAGCCCG GTCAGCCTAC CAGGAGGCCA TGGACATCAG CAAGAAGGAG ATGCCGCCCA 540

CCAACCCCAT CCGCCTGGGC CTGGCCCTGA ACTTTTCCGT CTTCCACTAC GAGATCGCCA 600

ACAGCCCCGA GGAGGCCATC TCTCTGGCCA AGACCACTTT CGACGAGGCC ATGGCTGATC 660

TGCACACCCT CAGCGAGGAC TCCTACAAAG ACAGCACCCT CATCATGCAG CTGCTGCGAG 720

ACAACCTGAC ACTGTGGACG GCCGACAACG CCGGGGAAGA GGGGGGCGAG GCTCCCCAGG 780

AGCCCCAGAG CTGAGTGTTG CCCGCCACCG CCCCGCCCTG CCCCCTCCAG TCCCCCACCC 840

TGCCGAGAGG ACTAGTATGG GGTGGGAGGC CCCACCCTTC TCCCCTAGGC GCTGTTCTTG 900

CTCCAAAGGG CTCCGTGGAG AGGGACTGGC AGAGCTGAGG CCACCTGGGG CTGGGGATCC 960

CACTCTTCTT GCAGCTGTTG AGCGCACCTA ACCACTGGTC ATGCCCCCAC CCCTGCTCTC 1020

CGCACCCGCT TCCTCCCGAC CCCAGGACCA GGCTACTTCT CCCCTCCTCT TGCCTCCCTC 1080

CTGCCCCTGC TGCCTCTGAT CGTAGGAATT GAGGAGTGTC CCGCCTTGTG GCTGAGAACT 1140

GGACAGTGGC AGGGGCTGGA GATGGGTGTG TGTGTGTGTG TGTGTGTGTG TGTGTGTGTG 1200

CGCGCGCGCC AGTGCAAGAC CGAGATTGAG GGAAAGCATG TCTGCTGGGT GTGACCATGT 1260
TTCCTCTCAA TAAAGTTCCC CTGTGACACT C



Seq ID NO: 109 Protein sequence: 
Protein Accession #: NP_006133.1

1 11 21 31 41 51

I I I I I I

MERASLIQKA KLAEQAERYE DMAAFMKGAV EKGEELSCEE RNLLSVAYKN VVGGQRAAWR 60
VLSSIEQKSN EEGSEEKGPE VREYREKVET ELQGVCDTVL GLLDSHLIKE AGDAESRVFY 120
LKMKGDYYRY LAEVATGDDK KRIIDSARSA YQEAMDISKK EMPPTNPIRL GLALNFSVFH 180
YEIANSPEEA ISLAKTTFDE AMADLHTLSE DSYKDSTLIM QLLRDNLTLW TADNAGEEGG 240
EAPQEPQS 

Seq ID NO: 110 DNA sequence 
Nucleic Acid Accession #: NM_000695
Coding sequence: 407-1564



1 11 21 31 41 51

I I I I I I

CACGAGTTGG TTTGGGAGCT GCCAGTCTCC TGGGAGGATC GCAGTCAGCA GAGCAGGGCT 60

GAGGCCTGGG GGTAGGAGCA GAGCCTGCGC ATCTGGAGGC AGCATGTCCA AGAAAGGGAG 120

TGGAGGTGCA GCGAAGGACC CAGGGGCAGA GCCCACGCTG GGGATGGACC CCTTCGAGGA 180

CACACTGCGG CGGCTGCGTG AGGCCTTCAA CTGAGGGCGC ACGCGGCCGG CCGAGTTCCG 240

GGCTGCGCAG CTCCAGGGCC TGGGCCACTT CCTTCAAGAA AACAAGCAGC TTCTGCGCGA 300

CGTGCTGGCC CAGGACCTGC ATAAGCCAGC TTTCGAGGCA GACATATCTG AGCTCATCCT 360

TTGCCAGAAC GAGGTTGACT ACGCTCTCAA GAACCTTCAG GCCTGGATGA AGGATGAACC 420 

ACGGTCCACG AACCTGTTCA TGAAGCTGGA CTCGGTCTTC ATCTGGAAGG AACCCTTTGG 480 

CCTGGTCCTC ATCATCGCAC CCTGGAACTA CCCATTGAAC CTGACCCTGG TGCTCCTGGT 540 

GGGCACCCTC CCCGCAGGGA ATTGCGTGGT GCTGAAGCCG TCAGAAATCA GCCAGGGCAC 600 

AGAGAAGGTC CTGGCTGAGG TGCTGCCCCA GTACCTGGAC CAGAGCTGCT TTGCCGTGGT 660 

GCTGGGCGGA CCCCAGGAGA CAGGGCAGCT GCTAGAGCAC AAGTTGGACT ACATCTTCTT 720 

CACAGGGAGC CCTCGTGTGG GCAAGATTGT CATGACTGCT GCCACCAAGC ACCTGACGCC 780 

TGTCACCCTG GAGCTGGGGG GCAAGAACCC CTGCTACGTG GACGACAACT GCGACCCCCA 840 

GACCGTGGCC AACCGCGTGG CCTGGTTCTG CTACTTCAAT GCCGGCCAGA CCTGCGTGGC 900 

CCCTGACTAC GTCCTGTGCA GCCCCGAGAT GCAGGAGAGG CTGCTGCCCG CCCTGCAGAG 960 

CACCATCACC CGTTTCTATG GCGACGACCC CCAGAGCTCC CCAAACCTGG GCCGCATCAT 1020 

CAACCAGAAA CAGTTCCAGC GGCTGCGGGC ATTGCTGGGC TGCGGCCGCG TGGCCATTGG 1080 

GGGCCAGAGC AACGAGAGCG ATCGCTACAT CGCCCCCACG GTGCTGGTGG ACGTGCAGGA 1140 

GACGGAGCCT GTGATGCAGG AGGAGATCTT CGGGCCCATC CTGCCCATCG TGAACGTGCA 1200 

GAGCGTGGAC GAGGCCATCA AGTTCATCAA CCGGCAGGAG AAGCCCCTGG CCCTGTACGC 1260 

CTTCTCCAAC AGCAGACAGG TTGTGAACCA GATGCTGGAG CGGACCAGCA GCGGCAGCTT 1320 

TGGAGGCAAT GAGGGCTTCA CCTACATATC TCTGCTGTCC GTGCCATTCG GGGGAGTCGG 1380 

CCACAGTGGG ATGGGCCGGT ACCACGGCAA GTTCACCTTC GACACCTTCT CCCACCACCG 1440 

CACCTGCCTG CTCGCCCCCT CCGGCCTGGA GAAATTAAAG GAGATCCGCT ACCCACCCTA 1500 

TACCGACTGG AACCAGCAGC TGTTACGCTG GGGCATGGGC TCCCAGAGCT GCACCCTCCT 1560 

GTGAGCGTCC CACCCGCCTC CAACGGGTCA CACAGAGAAA CCTGAGTCTA GCCATGAGGG 1620 

GCTTATGCTC CCAACTCACA TTGTTCCTCC AGACCGCAGG CTCCCCCAGC CTCAGGTTGC 1680 

TGGAGCTGTC ACATGACTGC ATCCTGCCTG CCAGGGCTGC AAAGCAAGGT CTTGCTTCTA 1740 

TCTGGGGGAC GCTGCTCGAG AGAGGCCGAG AGGCCGCAGA ACATGCCAGG TGTCCTCACT 1800 

CACCCCACCC TCCCCAATTC CAGCCCTTTG CCCTCTCGGT CAGGGTTGGC CAGGCCCAGT 1860 

CACAGGGGCA GTGTCACCCT GGAAAATACA GTGCCCTGCC TTCTTAGGGG CATCAGCCCT 1920 

GAACGGTTGA GAGCGTGGAG CCCTCCAGGC CTTTGCTCTC CCCTCTAGGC ACACGCGCAC 1980 

TTCCACCTCT GCCCCATCCC AACTGCACCA GCACTGCCTC CCCCAGGGAT CCTCTCACAT 2040 

CCCACACTGG TCTCTGCACC ACCCCTCTGG TTCACACCGC ACCCTGCACT CACCCACAGC 2100 

AGCTCCATCC ACTGGGAAAA CTGGGGTTTG CATCACTCCA CTGCACAGTG TTAGTGGGAC 2160 

CTGGGGGCAA GTCCCTTGAC TTCTCTGAGC CTCAGTTTCC TTATGTGAAA GTTGCTGGAA 2220 

CCAAAATGGA GTCACTTATG CCAAACTCTA ATAAAATGGA GTCGGGGGGG CACATAGAAG 2280 

CCCTCACACA CACATGCCCG TAACAGGATT TATCACCAAG ACACGCCTGC ATGTAAGACC 2340 

AGACACAGGG CGTATGGAAA AGCACGTCCT CAAAGACTGT AGTATTCCAG ATGAGCTGCA 2400 

GATGCTTACC TACCACGGCC GTCTCCACCA GAAAACCATC GCCAACTCCT GCGATCAGCT 2460 

TGTGACTTAC AAACCTTGTT TAAAAGCTGC TTACATGGAC TTCTGTCCTT TAAAACGTTC 2520 

CCCTTGGCTG TGGCCCTCTG TGTATGCCTG GGATCCTTCC AAGCACTCAT AGCCCAGATA 2580 
GGAATCCTCT GCTCCTCCCA AATAAATTCA TCTGTTC 



Seq ID NO: 111 Protein sequence: 
Protein Accession #: NP_000686 

1 11 21 31 41 51 

I I I I I I 

MKDEPRSTNL FMKLDSVFIW KEPFGLVLII APWNYPLNLT LVLLVGTLPA GNCVVLKPSE 60

ISQGTEKVLA EVLPQYLDQS CFAVVLGGPQ ETGQLLEHKL DYIFFTGSPR VGKIVMTAAT 120

KHLTPVTLEL GGKNPCYVDD NCDPQTVANR VAWFCYFNAG QTCVAPDYVL CSPEMQERLL 180

PALQSTITRF YGDDPQSSPN LGRIINQKQF QRLRALLGCG RVAIGGQSNE SDRYIAPTVL 240

VDVQETEPVM QEEIFGPILP IVNVQSVDEA IKFINRQEKP LALYAFSNSR QVVNQMLERT 300

SSGSFGGNEG FTYISLLSVP FGGVGHSGMG RYHGKFTFDT FSHHRTCLLA PSGLEKLKEI 360
RYPPYTDWNQ QLLRWGMGSQ SCTLL

Seq ID NO: 112 DNA sequence 
Nucleic Acid Accession #: NM_004456
Coding sequence: 58-2298

1 11 21 31 41 51

I I I I I I

GAATTCCGGG CGACGCGCGG GAACAACGCG AGTCGGCGCG CGGGACGAAG AATAATCATG 60 

GGCCAGACTG GGAAGAAATC TGAGAAGGGA CCAGTTTGTT GGCGGAAGCG TGTAAAATCA 120 

GAGTACATGC GACTGAGACA GCTCAAGAGG TTCAGACGAG CTGATGAAGT AAAGAGTATG 180 

TTTAGTTCCA ATCGTCAGAA AATTTTGGAA AGAACGGAAA TCTTAAACCA AGAATGGAAA 240 

CAGCGAAGGA TACAGCCTGT GCACATCCTG ACTTCTGTGA GCTCATTGCG CGGGACTAGG 300 

GAGTGTTCGG TGACCAGTGA CTTGGATTTT CCAACACAAG TCATCCCATT AAAGACTCTG 360 

AATGCAGTTG CTTCAGTACC CATAATGTAT TCTTGGTCTC CCCTACAGCA GAATTTTATG 420 

GTGGAAGATG AAACTGTTTT ACATAACATT CCTTATATGG GAGATGAAGT TTTAGATCAG 480 

GATGGTACTT TCATTGAAGA ACTAATAAAA AATTATGATG GGAAAGTACA CGGGGATAGA 540 

GAATGTGGGT TTATAAATGA TGAAATTTTT GTGGAGTTGG TGAATGCCCT TGGTCAATAT 600 

AATGATGATG ACGATGATGA TGATGGAGAC GATCCTGAAG AAAGAGAAGA AAAGCAGAAA 660 

GATCTGGAGG ATCACCGAGA TGATAAAGAA AGCCGCCCAC CTCGGAAATT TCCTTCTGAT 720 

AAAATTTTGG AGGCCATTTC CTCAATGTTT CCAGATAAGG GCACAGCAGA AGAACTAAAG 780 

GAAAAATATA AAGAACTCAC CGAACAGCAG CTCCCAGGCG CACTTCCTCC TGAATGTACC 840 

CCCAACATAG ATGGACCAAA TGCTAAATCT GTTCAGAGAG AGCAAAGCTT ACACTCCTTT 900 

CATACGCTTT TCTGTAGGCG ATGTTTTAAA TATGACTGCT TCCTACATCC TTTTCATGCA 960 

ACACCCAACA CTTATAAGCG GAAGAACACA GAAACAGCTC TAGACAACAA ACCTTGTGGA 1020 

CCACAGTGTT ACCAGCATTT GGAGGGAGCA AAGGAGTTTG CTGCTGCTCT CACCGCTGAG 1080 

CGGATAAAGA CCCCACCAAA ACGTCCAGGA GGCCGCAGAA GAGGACGGCT TCCCAATAAC 1140 

AGTAGCAGGC CCAGCACCCC CACCATTAAT GTGCTGGAAT CAAAGGATAC AGACAGTGAT 1200 

AGGGAAGCAG GGACTGAAAC GGGGGGAGAG AACAATGATA AAGAAGAAGA AGAGAAGAAA 1260 

GATGAAACTT CGAGCTCCTC TGAAGCAAAT TCTCGGTGTC AAACACCAAT AAAGATGAAG 1320 

CCAAATATTG AACCTCCTGA GAATGTGGAG TGGAGTGGTG CTGAAGCCTC AATGTTTAGA 1380 

GTCCTCATTG GCACTTACTA TGACAATTTC TGTGCCATTG CTAGGTTAAT TGGGACCAAA 1440 

ACATGTAGAC AGGTGTATGA GTTTAGAGTC AAAGAATCTA GCATCATAGC TCCAGCTCCC 1500 

GCTGAGGATG TGGATACTCC TCCAAGGAAA AAGAAGAGGA AACACCGGTT GTGGGCTGCA 1560 

CACTGCAGAA AGATACAGCT GAAAAAGGAC GGCTCCTCTA ACCATGTTTA CAACTATCAA 1620 
CCCTGTGATC ATCCACGGCA GCCTTGTGAC AGTTCGTGCC CTTGTGTGAT AGCACAAAAT 1680

TTTTGTGAAA AGTTTTGTCA ATGTAGTTCA GAGTGTCAAA ACCGCTTTCC GGGATGCCGC 1740

TGCAAAGCAC AGTGCAACAC CAAGCAGTGC CCGTGCTACC TGGCTGTCCG AGAGTGTGAC 1800

CCTGACCTCT GTCTTACTTG TGGAGCCGCT GACCATTGGG ACAGTAAAAA TGTGTCCTGC 1860

AAGAACTGCA GTATTCAGCG GGGCTCCAAA AAGCATCTAT TGCTGGCACC ATCTGACGTG 1920

GCAGGCTGGG GGATTTTTAT CAAAGATCCT GTGCAGAAAA ATGAATTCAT CTCAGAATAC 1980

TGTGGAGAGA TTATTTCTCA AGATGAAGCT GACAGAAGAG GGAAAGTGTA TGATAAATAC 2040

ATGTGCAGCT TTCTGTTCAA CTTGAACAAT GATTTTGTGG TGGATGCAAC CCGCAAGGGT 2100

AACAAAATTC GTTTTGCAAA TCATTCGGTA AATCCAAACT GCTATGCAAA AGTTATGATG 2160

GTTAACGGTG ATCACAGGAT AGGTATTTTT GCCAAGAGAG CCATCCAGAC TGGCGAAGAG 2220

CTGTTTGTTG ATTACAGATA CAGCCAGGCT GATGCCCTGA AGTATGTCGG CATCGAAAGA 2280

GAAATGGAAA TCCCTTGACA TCTGCTACCT CCTCCCCCTC CTCTGAAACA GCTGCCTTAG 2340

CTTCAGGAAC CTCGAGTACT GTGGGCAATT TAGAAAAAGA ACATGCAGTT TGAAATTCTG 2400

AATTTGCAAA GTACTGTAAG AATAATTTAT AGTAATGAGT TTAAAAATCA ACTTTTTATT 2460

GCCTTCTCAC CAGCTGCAAA GTGTTTTGTA CCAGTGAATT TTTGCAATAA TGCAGTATGG 2520
TACATTTTTC AACTTTGAAT AAAGAATACT TGAACTTGAA AAAAAAAAAA AAAAAA



Seq ID NO: 113 Protein sequence:
Protein Accession #: NP_004447

1 11 21 31 41 51

I I I I I I

MGQTGKKSEK GPVCWRKRVK SEYMRLRQLK RFRRADEVKS MFSSNRQKIL ERTEILNQEW 60

KQRRIQPVHI LTSVSSLRGT RECSVTSDLD FPTQVIPLKT LNAVASVPIM YSWSPLQQNF 120

MVEDETVLHN IPYMGDEVLD QDGTFIEELI KNYDGKVHGD RECGFINDEI FVELVNALGQ 180

YNDDDDDDDG DDPEEREEKQ KDLEDHRDDK ESRPPRKFPS DKILEAISSM FPDKGTAEEL 240

KEKYKELTEQ QLPGALPPEC TPNIDGPNAK SVQREQSLHS FHTLFCRRCF KYDCFLHPFH 300

ATPNTYKRKN TETALDNKPC GPQCYQHLEG AKEFAAALTA ERIKTPPKRP GGRRRGRLPN 360

NSSRPSTPTI NVLESKDTDS DREAGTETGG ENNDKEEEEK KDETSSSSEA NSRCQTPIKM 420

KPNIEPPENV EWSGAEASMF RVLIGTYYDN FCAIARLIGT KTCRQVYEFR VKESSIIAPA 480

PAEDVDTPPR KKKRKHRLWA AHCRKIQLKK DGSSNHVYNY QPCDHPRQPC DSSCPCVIAQ 540

NFCEKFCQCS SECQNRFPGC RCKAQCNTKQ CPCYLAVREC DPDLCLTCGA ADHWDSKNVS 600

CKNCSIQRGS KKHLLLAPSD VAGWGIFIKD PVQKNEFISE YCGEIISQDE ADRRGKVYDK 660

YMCSFLFNLN NDFVVDATRK GNKIRFANHS VNPNCYAKVM MVNGDHRIGI FAKRAIQTGE 720
ELFVDYRYSQ ADALKYVGIE REMEIP



Seq ID NO: 114 DNA sequence 
Nucleic Acid Accession #: NM_001827 
Coding sequence: 96-335 



1 11 21 31 41 51

I I I I I I

AGTCTCCGGC GAGTTGTTGC CTGGGCTGGA CGTGGTTTTG TCTGCTGCGC CCGCTCTTCG 60

CGCTCTCGTT TCATTTTCTG CAGCGCGCCA CGAGGATGGC CCACAAGCAG ATCTACTACT 120

CGGACAAGTA CTTCGACGAA CACTACGAGT ACCGGCATGT TATGTTACCC AGAGAACTTT 180

CCAAACAAGT ACCTAAAACT CATCTGATGT CTGAAGAGGA GTGGAGGAGA CTTGGTGTCC 240

AACAGAGTCT AGGCTGGGTT CATTACATGA TTCATGAGCC AGAACCACAT ATTCTTCTCT 300

TTAGACGACC TCTTCCAAAA GATCAACAAA AATGAAGTTT ATCTGGGGAT CGTCAAATCT 360

TTTTCAAATT TAATGTATAT GTGTATATAA GGTAGTATTC AGTGAATACT TGAGAAATGT 420

ACAAATCTTT CATCCATACC TGTGCATGAG CTGTATTCTT CACAGCAACA GAGCTCAGTT 480

AAATGCAACT GCAAGTAGGT TACTGTAAGA TGTTTAAGAT AAAAGTTCTT CCAGTCAGTT 540

TTTCTCTTAA GTGCCTGTTT GAGTTTACTG AAACAGTTTA CTTTTGTTCA ATAAAGTTTG 600
TATGTTGCAT TTAAAAAAAA AAAAAAA



Seq ID NO: 115 Protein sequence: 
Protein Accession #: NP_001818

1 11 21 31 41 51

I I I I I I

MAHKQIYYSD KYFDEHYEYR HVMLPRELSK QVPKTHLMSE EEWRRLGVQQ SLGWVHYMIH 60
EPEPHILLFR RPLPKDQQK 

Seq ID NO: 116 DNA sequence 

Nucleic Acid Accession #: CAT cluster 



1 11 21 31 41 51

I I I I I I

TCAGACCTCA TGAGTCACTT GGACTCTTGA GCCACCTCTG GGGGTGGAGT CTCTCTCCTG 60

GCATCTGGAC CCTTGGTGCT ATCGACGAAG CTTGGGTGGG GCTCTTAGCT GCTATGTGCA 120

AGAGGTGTGT TCCAGGGAAA GCCCCTATCT CTCTGCAGAG GTCAAGTGAA AGCGACGGCC 180

GCAGCCAACA GAGTTCAAAA TGCAGGCTTG GAAAGTACAG GGGGCTCTGT GGAGGATGGG 240

AAGGACTGAT CCACATTCCC ACCAGGAAGT TTAGCAGAAC CCCCGCGTGC CAACTGGACC 300

CCTTGGAAGG ACCTGGCTCA GGCTGGACCA CCTCTTGAGA GGGAGGAGCT CTGGATTTGA 360

TCAAGAATTC TTTGCTGAGC ATGGTGCCTC ATGCCTATAA TACCAACACT TTGGGAGGCC 420

AGTGTGGGAG GATCTCTTGA GCCCAGGAGT TCAAGACTAG CCTGGGCAAC ACAGAGAGAA 480

CCCATCTCTA AAATAATAAT AATAATAAAA TAAAAAATTA GCAGGGCATG GTGGCATGTG 540

CCTGTAGTTC CAGCTACCCA GGAGGCTGAG GCAAGAGGAT GGCTGGAGCC TGGGATGTTG 600

AGGCTGCAAT GAACTGTGAT TACCCCACTG CACTCCAGCC TGGGCAAAAG AGCGAGAGAA 660

CCTGTCTCAA ATAATAATAA TAATAATAAT CTTATTTTGG AGAATAAAGA GACCTCTGGA 720

TTTGAGGTGC CATTTGGGTA GAAAGAAAAG ACGTTTACAC CGAGAAATAG TCTGTGTTGC 780

CCTGAAGGAG CAGAGGGATG CATCGCTGGA GGTGACCTAC AGTTGAAGAA GACTCATTAT 840

GACAGACCTT ---------- CTTGTGGAAA GTGTTTCCTC TGCTGCTACT GCTCATGAGA 900

CTCTTCCCCC TCCCTGTCCC AGGGAACCAA AGGGCTTTCT ACCACACCCT TTCTTGCCCC 960

CCGCCTCCCA TGTCTGCTGT GCCTTTGTAC TCAGCAATTC TTGTTTGCTC CATTATCTTC 1020

CAGCCGGATA CAGAGTGAAT AGTTAACCAC ACTTAGGTCA AATAGGATCT AAATTTTTGT 1080

TCCTGCTCCG TGTAAAGAGG CCAGTGTTTG TGTGTTGCAA GCAGCCTTGG AATAGTAACT 1140

CTTCTCATTT GTTTGGGATC TGGCCACCAA GTTCCAGAAT GATACACGGA TCAGTGCAGA 1200

AGTTCATCAG GCTCTCGGAC CTTAGGGCTG TTGGAGAAGG CTTCAGCAGC AGAACTGATG 1260 

GTGAAGGCTC GTGTTCTCCA TCCTCAACTT TCTTTGCTTC GATCATACAC AAGAATACAT 1320

TTGGAAGGGC AAAAAATGAA CACTGTCGTT CATTGCAGCC GTGTTTTGTG ACACAGATGC 1380 

ACAGTCTGCT GTGAAGACCT TCTCTCAAGT GGCATTTGGG AGTCCATGCC AGATCATGGT 1440 

GCTTCATGAG AGACTGACAG CTATCAGGGG TTGTGGCACT TAGTGAGGAC TCTCCTCCCC 1500 

CAGTGTGTGC TGATGACACA TACACACCTG ACAATAGCTT GAGTCTTCTC TGTTCCTTTT 1560 

ACTCTGTAGC CAACATACAC ATGATTTAAA ACCCTTTCTA AATATCTATC ATGGTTCATC 1620 

CTTGTCCAAA TGCAGAGTCA GAGCTATTTG TACTTCATTA TTATTTCCAA GGCGAATAGT 1680 

TGGCTTTCTT TTTGCAAAAA TAATTAAAGT TTTTGTATGT TGCAAAAAAA AAAAAAAAAA 1740 
AAACAAAAAA 

Seq ID NO: 117 DNA sequence 

Nucleic Acid Accession #: BC012178.1 

Coding sequence: 204-2285 

1 11 21 31 41 51 

I I I I I I

CTTCTCTCCC GCGGCGCTGG GGCCCGCGCT CCGCTGCTGT TGCTCCATTC GGCGCTTTTC 60 

TGGCGGCTGG CTCCTCTCCG CTGCCGGCTG CTCCTCGACC AGGCCTCCTT CTCAACCTCA 120 

GCCCGCGGCG CCGACCCTTC CGGCACCCTC CCGCCCCGTC TCGTACTGTC GCCGTCACCG 180 

CCGCGGCTCC GGCCCTGGCC CCGATGGCTC TGTGCAACGG AGACTCCAAG CTGGAGAATG 240 

CTGGAGGAGA CCTTAAGGAT GGCCACCACC ACTATGAAGG AGCTGTTGTC ATTCTGGATG 300 

CTGGTGCTCA GTACGGGAAA GTCATAGACC GAAGAGTGAG GGAACTGTTC GTGCAGTCTG 360 

AAATTTTCCC CTTGGAAACA CCAGCATTTG CTATAAAGGA ACAAGGATTC CGTGCTATTA 420 

TCATCTCTGG AGGACCTAAT TCTGTGTATG CTGAAGATGC TCCCTGGTTT GATCCAGCAA 480 

TATTCACTAT TGGCAAGCCT GTTCTTGGAA TTTGCTATGG TATGCAGATG ATGAATAAGG 540 

TATTTGGAGG TACTGTGCAC AAAAAAAGTG TCAGAGAAGA TGGAGTTTTC AACATTAGTG 600 

TGGATAATAC ATGTTCATTA TTCAGGGGCC TTCAGAAGGA AGAAGTTGTT TTGCTTACAC 660 

ATGGAGATAG TGTAGACAAA GTAGCTGATG GATTCAAGGT TGTGGCACGT TCTGGAAACA 720 

TAGTAGCAGG CATAGCAAAT GAATCTAAAA AGTTATATGG AGCACAGTTC CACCCTGAAG 780 

TTGGCCTTAC AGAAAATGGA AAAGTAATAC TGAAGAATTT CCTTTATGAT ATAGCTGGAT 840 

GCAGTGGAAC CTTCACCGTG CAGAACAGAG AACTTGAGTG TATTCGAGAG ATCAAAGAGA 900 

GAGTAGGCAC GTCAAAAGTT TTGGTTTTAC TCAGTGGTGG AGTAGACTCA ACAGTTTGTA 960 

CAGCTTTGCT AAATCGTGCT TTGAACCAAG AACAAGTCAT TGCTGTGCAC ATTGATAATG 1020 

GCTTTATGAG AAAACGAGAA AGCCAGTCTG TTGAAGAGGC CCTCAAAAAG CTTGGAATTC 1080 

AGGTCAAAGT GATAAATGCT GCTCATTCTT TCTACAATGG AACAACAACC CTACCAATAT 1140 

CAGATGAAGA TAGAACCCCA CGGAAAAGAA TTAGCAAAAC GTTAAATATG ACCACAAGTC 1200 

CTGAAGAGAA AAGAAAAATC ATTGGGGATA CTTTTGTTAA GATTGCCAAT GAAGTAATTG 1260 

GAGAAATGAA CTTGAAACCA GAGGAGGTTT TCCTTGCCCA AGGTACTTTA CGGCCTGATC 1320 

TAATTGAAAG TGCATCCCTT GTTGCAAGTG GCAAAGCTGA ACTCATCAAA ACCCATCACA 1380 

ATGACACAGA GCTCATCAGA AAGTTGAGAG AGGAGGGAAA AGTAATAGAA CCTCTGAAAG 1440 

ATTTTCATAA AGATGAAGTG AGAATTTTGG GCAGAGAACT TGGACTTCCA GAAGAGTTAG 1500 

TTTCCAGGCA TCCATTTCCA GGTCCTGGCC TGGCAATCAG AGTAATATGT GCTGAAGAAC 1560 

CTTATATTTG TAAGGACTTT CCTGAAACCA ACAATATTTT GAAAATAGTA GCTGATTTTT 1620 

CTGCAAGTGT TAAAAAGCCA CATACCCTAT TACAGAGAGT CAAAGCCTGC ACAACAGAAG 1680 

AGGATCAGGA GAAGCTGATG CAAATTACCA GTCTGCATTC ACTGAATGCC TTCTTGCTGC 1740 

CAATTAAAAC TGTAGGTGTG CAGGGTGACT GTCGTTCCTA CAGTTACGTG TGTGGAATCT 1800 

CCAGTAAAGA TGAACCTGAC TGGGAATCAC TTATTTTTCT GGCTAGGCTT ATACCTCGCA 1860 

TGTGTCACAA CGTTAACAGA GTTGTTTATA TATTTGGCCC ACCAGTTAAA GAACCTCCTA 1920 

CAGATGTTAC TCCCACTTTC TTGACAACAG GGGTGCTCAG TACTTTACGC CAAGCTGATT 1980 

TTGAGGCCCA TAACATTCTC AGGGAGTCTG GGTATGCTGG GAAAATCAGC CAGATGCCGG 2040 

TGATTTTGAC ACCATTACAT TTTGATCGGG ACCCACTTCA AAAGCAGCCT TCATGCCAGA 2100 

GATCTGTGGT TATTCGAACC TTTATTACTA GTGACTTCAT GACTGGTATA CCTGCAACAC 2160 

CTGGCAATGA GATCCCTGTA GAGGTGGTAT TAAAGATGGT CACTGAGATT AAGAAGATTC 2220 

CTGGTATTTC TCGAATTATG TATGACTTAA CATCAAAGCC CCCAGGAACT ACTGAGTGGG 2280 
AGTAATAAAC TTCTTGTTCT ATTAAAA 



Seq ID NO: 118 Protein sequence: 
Protein Accession #: AAH12178.1 

1 11 21 31 41 51 

I I I I I I 

MALCNGDSKL ENAGGDLKDG HHHYEGAVVI LDAGAQYGKV IDRRVRELFV QSEIFPLETP 60

AFAIKEQGFR AIIISGGPNS VYAEDAPWFD PAIFTIGKPV LGICYGMQMM NKVFGGTVHK 120 

KSVREDGVFN ISVDNTCSLF RGLQKEEVVL LTHGDSVDKV ADGFKVVARS GNIVAGIANE 180

SKKLYGAQFH PEVGLTENGK VILKNFLYDI AGCSGTFTVQ NRELECIREI KERVGTSKVL 240 

VLLSGGVDST VCTALLNRAL NQEQVIAVHI DNGFMRKRES QSVEEALKKL GIQVKVINAA 300 

HSFYNGTTTL PISDEDRTPR KRISKTLNMT TSPEEKRKII GDTFVKIANE VIGEMNLKPE 360

EVFLAQGTLR PDLIESASLV ASGKAELIKT HHNDTELIRK LREEGKVIEP LKDFHKDEVR 420 

ILGRELGLPE ELVSRHPFPG PGLAIRVICA EEPYICKDFP ETNNILKIVA DFSASVKKPH 480

TLLQRVKACT TEEDQEKLMQ ITSLHSLNAF LLPIKTVGVQ GDCRSYSYVC GISSKDEPDW 540 

ESLIFLARLI PRMCHNVNRV VYIFGPPVKE PPTDVTPTFL TTGVLSTLRQ ADFEAHNILR 600 

ESGYAGKISQ MPVILTPLHF DRDPLQKQPS CQRSVVIRTF ITSDFMTGIP ATPGNEIPVE 660

VVLKMVTEIK KIPGISRIMY DLTSKPPGTT EWE 

Seq ID NO: 119 DNA sequence 

Nucleic Acid Accession #: NM_006500.1 

Coding sequence: 27..1967

1 11 21 31 41 51 

I I I I I I 

ACTTGCGTCT CGCCCTCCGG CCAAGCATGG GGCTTCCCAG GCTGGTCTGC GCCTTCTTGC 60 

TCGCCGCCTG CTGCTGCTGT CCTCGCGTCG CGGGTGTGCC CGGAGAGGCT GAGCAGCCTG 120 

CGCCTGAGCT GGTGGAGGTG GAAGTGGGCA GCACAGCCCT TCTGAAGTGC GGCCTCTCCC 180 

AGTCCCAAGG CAACCTCAGC CATGTCGACT GGTTTTCTGT CCACAAGGAG AAGCGGACGC 240 
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TCATCTTCCG TGTGCGCCAG GGCCAGGGCC AGAGCGAACC TGGGGAGTAC GAGCAGCGGC 300 

TCAGCCTCCA GGACAGAGGG GCTACTCTGG CCCTGACTCA AGTCACCCCC CAAGACGAGC 360 

GCATCTTCTT GTGCCAGGGC AAGCGCCCTC GGTCCCAGGA GTACCGCATC CAGCTCCGCG 420 

TCTACAAAGC TCCGGAGGAG CCAAACATCC AGGTCAACCC CCTGGGCATC CCTGTGAACA 480 

GTAAGGAGCC TGAGGAGGTC GCTACCTGTG TAGGGAGGAA CGGGTACCCC ATTCCTCAAG 540 

TCATCTGGTA CAAGAATGGC CGGCCTCTGA AGGAGGAGAA GAACCGGGTC CACATTCAGT 600 

CGTCCCAGAC TGTGGAGTCG AGTGGTTTGT ACACCTTGCA GAGTATTCTG AAGGCACAGC 660 

TGGTTAAAGA AGACAAAGAT GCCCAGTTTT ACTGTGAGCT CAACTACCGG CTGCCCAGTG 720 

GGAACCACAT GAAGGAGTCC AGGGAAGTCA CCGTCCCTGT TTTCTACCCG ACAGAAAAAG 780 

TGTGGCTGGA AGTGGAGCCC GTGGGAATGC TGAAGGAAGG GGACCGCGTG GAAATCAGGT 840 

GTTTGGCTGA TGGCAACCCT CCACCACACT TCAGCATCAG CAAGCAGAAC CCCAGCACCA 900 

GGGAGGCAGA GGAAGAGACA ACCAACGACA ACGGGGTCCT GGTGCTGGAG CCTGCCCGGA 960 

AGGAACACAG TGGGCGCTAT GAATGTCAGG CCTGGAACTT GGACACCATG ATATCGCTGC 1020 

TGAGTGAACC ACAGGAACTA CTGGTGAACT ATGTGTCTGA CGTCCGAGTG AGTCCCGCAG 1080 

CCCCTGAGAG ACAGGAAGGC AGCAGCCTCA CCCTGACCTG TGAGGCAGAG AGTAGCCAGG 1140 

ACCTCGAGTT CCAGTGGCTG AGAGAAGAGA CAGACCAGGT GCTGGAAAGG GGGCCTGTGC 1200 

TTCAGTTGCA TGACCTGAAA CGGGAGGCAG GAGGCGGCTA TCGCTGCGTG GCGTCTGTGC 1260 

CCAGCATACC CGGCCTGAAC CGCACACAGC TGGTCAAGCT GGCCATTTTT GGCCCCCCTT 1320 

GGATGGCATT CAAGGAGAGG AAGGTGTGGG TGAAAGAGAA TATGGTGTTG AATCTGTCTT 1380

GTGAAGCGTC AGGGCACCCC CGGCCCACCA TCTCCTGGAA CGTCAACGGC ACGGCAAGTG 1440 

AACAAGACCA AGATCCACAG CGAGTCCTGA GCACCCTGAA TGTCCTCGTG ACCCCGGAGC 1500 

TGTTGGAGAC AGGTGTTGAA TGCACGGCCT CCAACGACCT GGGCAAAAAC ACCAGCATCC 1560 

TCTTCCTGGA GCTGGTCAAT TTAACCACCC TCACACCAGA CTCCAACACA ACCACTGGCC 1620 

TCAGCACTTC CACTGCCAGT CCTCATACCA GAGCCAACAG CACCTCCACA GAGAGAAAGC 1680 

TGCCGGAGCC GGAGAGCCGG GGCGTGGTCA TCGTGGCTGT GATTGTGTGC ATCCTGGTCC 1740

TGGCGGTGCT GGGCGCTGTC CTCTATTTCC TCTATAAGAA GGGCAAGCTG CCGTGCAGGC 1800

GCTCAGGGAA GCAGGAGATC ACGCTGCCCC CGTCTCGTAA GACCGAACTT GTAGTTGAAG 1860

TTAAGTCAGA TAAGCTCCCA GAAGAGATGG GCCTCCTGCA GGGCAGCAGC GGTGACAAGA 1920 

GGGCTCCGGG AGACCAGGGA GAGAAATACA TCGATCTGAG GCATTAGCCC CGAATCACTT 1980 

CAGCTCCCTT CCCTGCCTGG ACCATTCCCA GCTCCCTGCT CACTCTTCTC TCAGCCAAAG 2040 

CCTCCAAAGG GACTAGAGAG AAGCCTCCTG CTCCCCTCAC CTGCACACCC CCTTTCAGAG 2100 

GGCCACTGGG TTAGGACCTG AGGACCTCAC TTGGCCCTGC AAGCCGCTTT TCAGGGACCA 2160 

GTCCACCACC ATCTCCTCCA CGTTGAGTGA AGCTCATCCC AAGCAAGGAG CCCCAGTCTC 2220 

CCGAGCGGGT AGGAGAGTTT CTTGCAGAAC GTGTTTTTTC TTTACACACA TTATGGCTGT 2280 

AAATACCTGG CTCCTGCCAG CAGCTGAGCT GGGTAGCCTC TCTGAGCTGG TTTCCTGCCC 2340 

CAAAGGCTGG CTTCCACCAT CCAGGTGCAC CACTGAAGTG AGGACACACC GGAGCCAGGC 2400 

GCCTGCTCAT GTTGAAGTGC GCTGTTCACA CCCGCTCCGG AGAGCACCCC AGCGGCATCC 2460 

AGAAGCAGCT GCAGTGTTGC TGCCACCACC CTCCTGCTCG CCTCTTCAAA GTCTCCTGTG 2520 

ACATTTTTTC TTTGGTCAGA AGCCAGGAAC TGGTGTCATT CCTTAAAAGA TACGTGCCGG 2580 

GGCCAGGTGT GGTGGCTCAC GCCTGTAATC CCAGCACTTT GGGAGGCCGA GGCGGGCGGA 2640 

TCACAAAGTC AGGACGAGAC CATCCTGGCT AACACGGTGA AACCCTGTCT CTACTAAAAA 2700 

TACAAAAAAA AATTAGCTAG GCGTAGTGGT TGGCACCTAT AGTCCCAGCT ACTCGGAAGG 2760 

CTGAAGCAGG AGAATGGTAT GAATCCAGGA GGTGGAGCTT GCAGTGAGCC GAGACCGTGC 2820 

CACTGCACTC CAGCCTGGGC AACACAGCGA GACTCCGTCT CGAGGAAAAA AAAAGAAAAG 2880 

ACGCGTACCT GCGGTGAGGA AGCTGGGCGC TGTTTTCGAG TTCAGGTGAA TTAGCCTCAA 2940 

TCCCCGTGTT CACTTGCTCC CATAGCCCTC TTGATGGATC ACGTAAAACT GAAAGGCAGC 3000 

GGGGAGCAGA CAAAGATGAG GTCTACACTG TCCTTCATGG GGATTAAAGC TATGGTTATA 3060 

TTAGCACCAA ACTTCTACAA ACCAAGCTCA GGGCCCCAAC CCTAGAAGGG CCCAAATGAG 3120 

AGAATGGTAC TTAGGGATGG AAAACGGGGC CTGGCTAGAG CTTCGGGTGT GTGTGTCTGT 3180 

CTGTGTGTAT GCATACATAT GTGTGTATAT ATGGTTTTGT CAGGTGTGTA AATTTGCAAA 3240 

TTGTTTCCTT TATATATGTA TGTATATATA TATATGAAAA TATATATATA TATGAAAAAT 3300 

AAAGCTTAAT TGTCCCAGAA AATCATACAT TGCTTTTTTA TTCTACATGG GTACCACAGG 3360 

AACCTGGGGG CCTGTGAAAC TACAACCAAA AGGCACACAA AACCGTTTCC AGTTGGCAGC 3420 

AGAGATCAGG GGTTACCTCT GCTTCTGAGC AAATGGCTCA AGCTCTACCA GAGCAGACAG 3480 

CTACCCTACT TTTCAGCAGC AAAACGTCCC GTATGACGCA GCACGAAGGG CCTGGCAGGC 3540 
TGTTAGCAGG AGCTATGTCC CTTCCTATCG TTTCCGTCCA CTT 



Seq ID NO: 120 Protein sequence: 
Protein Accession #: NP_006491.1



1 11 21 31 41 51 

I I I I I I 

MGLPRLVCAF LLAACCCCPR VAGVPGEAEQ PAPELVEVEV GSTALLKCGL SQSQGNLSHV 60

DWFSVHKEKR TLIFRVRQGQ GQSEPGEYEQ RLSLQDRGAT LALTQVTPQD ERIFLCQGKR 120 

PRSQEYRIQL RVYKAPEEPN IQVNPLGIPV NSKEPEEVAT CVGRNGYPIP QVIWYKNGRP 180 

LKEEKNRVHI QSSQTVESSG LYTLQSILKA QLVKEDKDAQ FYCELNYRLP SGNHMKESRE 240 

VTVPVFYPTE KVWLEVEPVG MLKEGDRVEI RCLADGNPPP HFSISKQNPS TREAEEETTN 300

DNGVLVLEPA RKEHSGRYEC QAWNLDTMIS LLSEPQELLV NYVSDVRVSP AAPERQEGSS 360 

LTLTCEAESS QDLEFQWLRE ETDQVLERGP VLQLHDLKRE AGGGYRCVAS VPSIPGLNRT 420 

QLVKLAIFGP PWMAFKERKV WVKENMVLNL SCEASGHPRP TISWNVNGTA SEQDQDPQRV 480 

LSTLNVLVTP ELLETGVECT ASNDLGKNTS ILFLELVNLT TLTPDSNTTT GLSTSTASPH 540 

TRANSTSTER KLPEPESRGV VIVAVIVCIL VLAVLGAVLY FLYKKGKLPC RRSGKQEITL 600 
PPSRKTELVV EVKSDKLPEE MGLLQGSSGD KRAPGDQGEK YIDLRH

Seq ID NO: 121 DNA sequence 
Nucleic Acid Accession #: NM_018306
Coding sequence: 60-671 

1 11 21 31 41 51 

I I I I I I 

ATAGTCTACA CAGAGCTCCC CTTGCTGCCC AGACAAGCTG AAGGACCACA GGAAAAGCCA 60 
TGGAGACTTC AGCATCCTCC TCCCAGCCTC AGGACAACAG TCAAGTCCAC AGAGAAACAG 120 
AAGATGTAGA CTATGGAGAG ACAGATTTCC ACAAGCAAGA CGGGAAGGCT GGACTCTTTT 180 
CCCAAGAACA ATATGAGAGA AACAAGTCTT CTTCCTCCTC CTTCTCTTCC TCCTCATCCT 240 
CCTCATCTTC TTCATCCTCC TCCTCCTCAG GTCCTGGGCA TGGGGAGCCT GACGTTTTGA 300
AGGATGAGCT TCAACTCTAT GGAGATGCTC CTGGAGAGGT GGTACCCTCT GGGGAATCAG 360
GACTCCGAAG GAGAGGCTCT GACCCAGCAA GTGGAGAAGT GGAGGCCTCT CAGTTAAGAA 420
GACTGAATAT AAAGAAAGAT GATGAGTTTT TCCATTTCGT CCTCCTGTGC TTTGCCATCG 480
GGGCCTTGCT GGTGTGTTAT CACTATTACG CAGACTGGTT CATGTCTCTT GGGGTCGGCC 540
TGCTCACCTT CGCCTCCCTG GAAACCGTTG GCATCTACTT CGGACTAGTG TACCGTATCC 600
ACAGCGTCCT CCAAGGCTTC ATCCCCCTCT TCCAGAAGTT TAGGCTGACA GGGTTCAGGA 660
AGACTGACTG AGGCCACTTC CAGGTGGGCA GCAGAGGCAG GCCCCAGTGT GACCACCACT 720
GCGACCCCTG AGCCCACAAG GGCAGAGCAG CATTCTGAGA GACGCACAGG AGACCAAGCC 780
AGACCAATAA ACAGAACACT TTTCCTTCCA TGTGGTCTGA ATGTTGGCAC CAGCCCGGGC 840
AGGGGCATCT CATTTGGGCA GTACTGCTGT GCAACCCAGC TGCAAGGATG GAAGGCAGAG 900
GGTGGGTGTG GGGCCTGAGG CTTCACAGTA CCTGGACCAG CAGGAAGATT CTGGGAGGTC 960
ACTGCTCTCA GAGGACAGCA AGGGACCCTG AGCTCTGCAA GCTGTGATCT GTCTGGGTTC 1020
ATGGTTTTTC TCAAATCCCA GGCTATCTGC ATGCGCTCTC AGGTGCTACC GAGCCATCCT 1080
GGGAGAGATG GATGGTCCAC TGCTTTGAGG CAGGGAGCCA TCGGGCTGGG GCCCCTTGGT 1140
GAACCTGATG CAGGTAAGAT GCTGAGGACT AAAACCATTT TTTTTGCACC CAAAAAAAAA 1200
GGCAGGAAAA TGATCATCAG AAACTAAATG GCAGCCAGGC ATGGGGGCTC ACGACTGTAA 1260
TCCTCGCACT TTGGGAGGCT CAGGCTAAGG GTCGCTTGAA GCTGAGAGTT CAAGACCAAC 1320
CTGGGCAACA TAGTGAGACC CCCATCTCTA CAATTTTTTT TTAATGACCA AATGTGGCGG 1380
TACATACCTG TACATACCTG CGGTTCCAGC TACTCAAGAG GCTGAGGCAG GAGGACTGCT 1440
TGAGCCCAGG AGTTCAGGGC TGCAGTGAGG TACGATCAAG CCACTGCACT CCAGCCTGGG 1500
CGACAGAGCA AGATCGTTTC TCTAAAATT



Seq ID NO: 122 Protein sequence:
Protein Accession #: NP_060776

1 11 21 31 41 51 

I I I I I I 

METSASSSQP QDNSQVHRET EDVDYGETDF HKQDGKAGLF SQEQYERNKS SSSSFSSSSS 60

SSSSSSSSSS GPGHGEPDVL KDELQLYGDA PGEVVPSGES GLRRRGSDPA SGEVEASQLR 120

RLNIKKDDEF FHFVLLCFAI GALLVCYHYY ADWFMSLGVG LLTFASLETV GIYFGLVYRI 180 
HSVLQGFIPL FQKFRLTGFR KTD 

Seq ID NO: 123 DNA sequence
Nucleic Acid Accession #: BC022542
Coding sequence: 243..896

1 11 21 31 41 51 

I I I I I I 

ACTTGGTCCC AGCCGATAAA TCTGGGGCAG CGCGCGGTAG GAGCTGCGGG CGGCCAGGCC 60 

CCTTCCTGCG TCCGCACCTG GCCCCGCGCG CCCCTCTCGG GCGTCCGGCT TCCGGCGTCC 120 

TGGCGGCTCG GGTGGCGGCG GTTCGGGCGG CCGCCTGGCT GCTCCTCGGG GCGGCGACGG 180 

GGCTCACGCG CGGGCCCGCC ACGGCCTTCA CCGCCGCGCG CTCTGACGCC GGCATAAGGG 240 

CCATGTGTTC TGAAATTATT TTGAGGCAAG AAGTTTTGAA AGATGGTTTC CACAGAGACC 300 

TTTTAATCAA AGTGAAGTTT GGGGAAAGCA TTGAGGACTT GCACACGTGC CGTCTCTTAA 360 

TTAAACAGGA CATTCCTGCA GGACTTTATG TGGATCCGTA TGAGTTGGCT TCATTACGAG 420 

AGAGAAACAT AACAGAGGCA GTGATGGTTT CAGAAAATTT TGATATAGAG GCCCCTAACT 480 

ATTTGTCCAA GGAGTCTGAA GTTCTCATTT ATGCCAGACG AGATTCACAG TGCATTGACT 540 

GTTTTCAAGC CTTTTTGCCT GTGCACTGCC GCTATCATCG GCCGCACAGT GAAGATGGAG 600 

AAGCCTCGAT TGTGGTCAAT AACCCAGATT TGTTGATGTT TTGTGACCAA GAGTTCCCGA 660 

TTTTGAAATG CTGGGCTCAC TCAGAAGTGG CAGCCCCTTG TGCTTTGGAT AATGAGGATA 720

TATGCCAATG GAACAAGATG AAGTATAAAT CAGTATATAA GAATGTGATT CTACAAGTTC 780 

CAGTGGGACT GACTGTACAT ACCTCTCTAG TATGTTCTGT GACTCTGCTC ATTACAATCC 840 

TGTGCTCTAC ATTGATCCTT GTAGCAGTTT TCAAATATGG CCATTTTTCC CTATAAGTTT 900 

TATGTAGTTA AATGCTTCCT AGAAACCTAA ATAAGATCTA TTAATTTCTG ACGAGAGGTG 960 

TTCTTCTAGA ATTAATTACT TTTATCTTTT GTCTTCATTT GTGGCCAAAA TTATGTTTAC 1020 

TAGAGGAAAT TTGGGATCAT TCTCAGCTAA TTCCAAAATG TAGTGCTCTA TTGCATGGAT 1080 

CCTTGGTAAT CCTCAAGCAT CAGATGCCAT AAGGGGAAAC TTAATTCTGC TAAATTAATG 1140 

TTTATTTTGT GAGAAGTGAC TTTATCTTCA TTTGGGGTAG AAAAATTATT TCTTTATGTA 1200 

GTAGAGACAA ATTATTCTCA TTTTGCAAGT ACTTTCAATT TAAGCTACAA ATTGAGAAAA 1260 

CCGTTATAAA TAAGAATAAA ATAGGCCAGG CACAGTGGCT CACACCTGTA ATCCCAGCAC 1320 

TTTGGGAGGC CGAGGTGGGC GGATCACCAG AGGTCAAGAG TTTGAGACCA GCTTGGTGAA 1380 

ACCCTGTCTC TACTAAAAAT ACAAAAGTTA GCTGGGGCTG GTGGTGGGCA TCTGTAGTCC 1440 

CAGCTAATTG GAAGGGTGAG GCGGGAGGAT CGCTTGAACC TGGGAGGCGG AGGTTCCAGA 1500 

GAGCCAAGAT CGCACCACTG CACTACAGCC TGGGCGACAG AACGAGACCC TGTCTCCAAA 1560 

GGAAAAACAA AAAAGAAGAA TAAAATAATT TGGATGAAAA TCATGTTTAT TTAAATAGTA 1620 

ATGTCATGAG ACTATTAAAG ATGTGCCAGA GTTTCAATGA AAATCATTAA AGTAGGACAG 1680 

CTAAGAAATT AATATTAATA TAAAAATTAT TGATAATCTT AAATTATTGA TTATTCCTTA 1740 

ACGCACTCCA TTCTCCTTTT ACATTTTATC ATGTTTCTTT TGAATATATG AATTGGCAAA 1800 

GGACTTGATG AAACTGAGTA CTAAGATTTG GTACAGAGTA TGTCAGGAAG ACAACTCAGA 1860 
TTGCCATTTT AAATAAAGTT GTACATGAAC AAAAAAAAAA AAAAAA 



Seq ID NO: 124 Protein sequence: 
Protein Accession #: AAH22542 

1 11 21 31 41 51

I I I I I I 

MCSEIILRQE VLKDGFHRDL LIKVKFGESI EDLHTCRLLI KQDIPAGLYV DPYELASLRE 60 
RNITEAVMVS ENFDIEAPNY LSKESEVLIY ARRDSQCIDC FQAFLPVHCR YHRPHSEDGE 120
ASIVVNNPDL LMFCDQAGSR RMIRFRFDSF DKTIEFPILK CWAHSEVAAP CALENEDICQ 180
WNKMKYKSVY KNVILQVPVG LTVHTSLVCS VTLLITILCS KKKKK 

Seq ID NO: 125 DNA sequence 

Nucleic Acid Accession #: NM_004994.1
Coding sequence: 20..2143




1 11 21 31 41 51 

I I I I I I 

AGACACCTCT GCCCTCACCA TGAGCCTCTG GCAGCCCCTG GTCCTGGTGC TCCTGGTGCT 60 

GGGCTGCTGC TTTGCTGCCC CCAGACAGCG CCAGTCCACC CTTGTGCTCT TCCCTGGAGA 120 

CCTGAGAACC AATCTCACCG ACAGGCAGCT GGCAGAGGAA TACCTGTACC GCTATGGTTA 180

CACTCGGGTG GCAGAGATGC GTGGAGAGTC GAAATCTCTG GGGCCTGCGC TGCTGCTTCT 240 

CCAGAAGCAA CTGTCCCTGC CCGAGACCGG TGAGCTGGAT AGCGCCACGC TGAAGGCCAT 300 

GCGAACCCCA CGGTGCGGGG TCCCAGACCT GGGCAGATTC CAAACCTTTG AGGGCGACCT 360 

CAAGTGGCAC CACCACAACA TCACCTATTG GATCCAAAAC TACTCGGAAG ACTTGCCGCG 420 

GGCGGTGATT GACGACGCCT TTGCCCGCGC CTTCGCACTG TGGAGCGCGG TGACGCCGCT 480

CACCTTCACT CGCGTGTACA GCCGGGACGC AGACATCGTC ATCCAGTTTG GTGTCGCGGA 540

GCACGGAGAC GGGTATCCCT TCGACGGGAA GGACGGGCTC CTGGCACACG CCTTTCCTCC 600 

TGGCCCCGGC ATTCAGGGAG ACGCCCATTT CGACGATGAC GAGTTGTGGT CCCTGGGCAA 660 

GGGCGTCGTG GTTCCAACTC GGTTTGGAAA CGCAGATGGC GCGGCCTGCC ACTTCCCCTT 720 

CATCTTCGAG GGCCGCTCCT ACTCTGCCTG CACCACCGAC GGTCGCTCCG ACGGCTTGCC 780

CTGGTGCAGT ACCACGGCCA ACTACGACAC CGACGACCGG TTTGGCTTCT GCCCCAGCGA 840 

GAGACTCTAC ACCCGGGACG GCAATGCTGA TGGGAAACCC TGCCAGTTTC CATTCATCTT 900 

CCAAGGCCAA TCCTACTCCG CCTGCACCAC GGACGGTCGC TCCGACGGCT ACCGCTGGTG 960

CGCCACCACC GCCAACTACG ACCGGGACAA GCTCTTCGGC TTCTGCCCGA CCCGAGCTGA 1020 

CTCGACGGTG ATGGGGGGCA ACTCGGCGGG GGAGCTGTGC GTCTTCCCCT TCACTTTCCT 1080

GGGTAAGGAG TACTCGACCT GTACCAGCGA GGGCCGCGGA GATGGGCGCC TCTGGTGCGC 1140 

TACCACCTCG AACTTTGACA GCGACAAGAA GTGGGGCTTC TGCCCGGACC AAGGATACAG 1200 

TTTGTTCCTC GTGGCGGCGC ATGAGTTCGG CCACGCGCTG GGCTTAGATC ATTCCTCAGT 1260 

GCCGGAGGCG CTCATGTACC CTATGTACCG CTTCACTGAG GGGCCCCCCT TGCATAAGGA 1320 

CGACGTGAAT GGCATCCGGC ACCTCTATGG TCCTCGCCCT GAACCTGAGC CACGGCCTCC 1380

AACCACCACC ACACCGCAGC CCACGGCTCC CCCGACGGTC TGCCCCACCG GACCCCCCAC 1440 

TGTCCACCCC TCAGAGCGCC CCACAGCTGG CCCCACAGGT CCCCCCTCAG CTGGCCCCAC 1500 

AGGTCCCCCC ACTGCTGGCC CTTCTACGGC CACTACTGTG CCTTTGAGTC CGGTGGACGA 1560 

TGCCTGCAAC GTGAACATCT TCGACGCCAT CGCGGAGATT GGGAACCAGC TGTATTTGTT 1620 

CAAGGATGGG AAGTACTGGC GATTCTCTGA GGGCAGGGGG AGCCGGCCGC AGGGCCCCTT 1680

CCTTATCGCC GACAAGTGGC CCGCGCTGCC CCGCAAGCTG GACTCGGTCT TTGAGGAGCC 1740 

GCTCTCCAAG AAGCTTTTCT TCTTCTCTGG GCGCCAGGTG TGGGTGTACA CAGGCGCGTC 1800 

GGTGCTGGGC CCGAGGCGTC TGGACAAGCT GGGCCTGGGA GCCGACGTGG CCCAGGTGAC 1860

CGGGGCCCTC CGGAGTGGCA GGGGGAAGAT GCTGCTGTTC AGCGGGCGGC GCCTCTGGAG 1920 

GTTCGACGTG AAGGCGCAGA TGGTGGATCC CCGGAGCGCC AGCGAGGTGG ACCGGATGTT 1980

CCCCGGGGTG CCTTTGGACA CGCACGACGT CTTCCAGTAC CGAGAGAAAG CCTATTTCTG 2040 

CCAGGACCGC TTCTACTGGC GCGTGAGTTC CCGGAGTGAG TTGAACCAGG TGGACCAAGT 2100 

GGGCTACGTG ACCTATGACA TCCTGCAGTG CCCTGAGGAC TAGGGCTCCC GTCCTGCTTT 2160 

GCAGTGCCAT GTAAATCCCC ACTGGGACCA ACCCTGGGGA AGGAGCCAGT TTGCCGGATA 2220 

CAAACTGGTA TTCTGTTCTG GAGGAAAGGG AGGAGTGGAG GTGGGCTGGG CCCTCTCTTC 2280
TCACCTTTGT TTTTTGTTGG AGTGTTTCTA ATAAACTTGG ATTCTCTAAC CTTT 



Seq ID NO: 126 Protein sequence:
Protein Accession #: NP_004985.1



1 11 21 31 41 51

I I I I I I

MSLWQPLVLV LLVLGCCFAA PRQRQSTLVL FPGDLRTNLT DRQLAEEYLY RYGYTRVAEM 60
RGESKSLGPA LLLLQKQLSL PETGELDSAT LKAMRTPRCG VPDLGRFQTF EGDLKWHHHN 120
ITYWIQNYSE DLPRAVIDDA FARAFALWSA VTPLTFTRVY SRDADIVIQF GVAEHGDGYP 180
FDGKDGLLAH AFPPGPGIQG DAHFDDDELW SLGKGVVVPT RFGNADGAAC HFPFIFEGRS 240
YSACTTDGRS DGLPWCSTTA NYDTDDRFGF CPSERLYTRD GNADGKPCQF PFIFQGQSYS 300
ACTTDGRSDG YRWCATTANY DRDKLFGFCP TRADSTVMGG NSAGELCVFP FTFLGKEYST 360
CTSEGRGDGR LWCATTSNFD SDKKWGFCPD QGYSLFLVAA HEFGHALGLD HSSVPEALMY 420
PMYRFTEGPP LHKDDVNGIR HLYGPRPEPE PRPPTTTTPQ PTAPPTVCPT GPPTVHPSER 480
PTAGPTGPPS AGPTGPPTAG PSTATTVPLS PVDDACNVNI FDAIAEIGNQ LYLFKDGKYW 540
RFSEGRGSRP QGPFLIADKW PALPRKLDSV FEEPLSKKLF FFSGRQVWVY TGASVLGPRR 600
LDKLGLGADV AQVTGALRSG RGKMLLFSGR RLWRFDVKAQ MVDPRSASEV DRMFPGVPLD 660
THDVFQYREK AYFCQDRFYW RVSSRSELNQ VDQVGYVTYD ILQCPED

Seq ID NO: 127 DNA sequence
Nucleic Acid Accession #: NM_004181
Coding sequence: 32-670

1 11 21 31 41 51

I I I I I I

GCAGAAATAG CCTAGGGAGA TCAACCCCGA GATGCTGAAC AAAGTGCTGT CCCGGCTGGG 60
GGTCGCCGGC CAGTGGCGCT TCGTGGACGT GCTGGGGCTG GAAGAGGAGT CTCTGGGCTC 120
GGTGCCAGCG CCTGCCTGCG CGCTGCTGCT GCTGTTTCCC CTCACGGCCC AGCATGAGAA 180
CTTCAGGAAA AAGCAGATTG AAGAGCTGAA GGGACAAGAA GTTAGTCCTA AAGTGTACTT 240
CATGAAGCAG ACCATTGGGA ATTCCTGTGG CACAATCGGA CTTATTCACG CAGTGGCCAA 300
TAATCAAGAC AAACTGGGAT TTGAGGATGG ATCAGTTCTG AAACAGTTTC TTTCTGAAAC 360
AGAGAAAATG TCCCCTGAAG ACAGAGCAAA ATGCTTTGAA AAGAATGAGG CCATACAGGC 420
AGCCCATGAT GCCGTGGCAC AGGAAGGCCA ATGTCGGGTA GATGACAAGG TGAATTTCCA 480
TTTTATTCTG TTTAACAACG TGGATGGCCA CCTCTATGAA CTTGATGGAC GAATGCCTTT 540
TCCGGTGAAC CATGGCGCCA GTTCAGAGGA CACCCTGCTG AAGGACGCTG CCAAGGTGTG 600
CAGAGAATTC ACCGAGCGTG AGCAAGGAGA AGTCCGCTTC TCTGCCGTGG CTCTCTGCAA 660
GGCAGCCTAA TGCTCTGTGG GAGGGACTTT GCTGATTTCC CCTCTTCCCT TCAACATGAA 720
AATATATACC CCCCATGCAG TCTAAAATGC TTCAGTACTT GTGAAACACA GCTGTTCTTC 780
TGTTCTGCAG ACACGCCTTC CCCTCAGCCA CACCCAGGCA CTTAAGCACA AGCAGAGTGC 840
ACAGCTGTCC ACTGGGCCAT TGTGGTGTGA GCTTCAGATG GTGAAGCATT CTCCCCAGTG 900
TATGTCTTGT ATCCGATATC TAACGCTTTA AATGGCTACT TTGGTTTCTG TCTGTAAGTT 960
AAGACCTTGG ATGTGGTTAT GTTGTCCTAA AGAATAAATT TTGCTGATAG TAGC



Seq ID NO: 128 Protein sequence: 
Protein Accession #: NP_004172 



1 11 21 31 41 51

I I I I I I

MLNKVLSRLG VAGQWRFVDV LGLEEESLGS VPAPACALLL LFPLTAQHEN FRKKQIEELK 60

GQEVSPKVYF MKQTIGNSCG TIGLIHAVAN NQDKLGFEDG SVLKQFLSET EKMSPEDRAK 120

CFEKNEAIQA AHDAVAQEGQ CRVDDKVNFH FILFNNVDGH LYELDGRMPF PVNHGASSED 180
TLLKDAAKVC REFTEREQGE VRFSAVALCK AA






Seq ID NO: 129 DNA sequence 
Nucleic Acid Accession #: NM_000213 
Coding sequence: 127-5385 



1 11 21 31 41 51

I I I I I I

CGCCCGCGCG CTGCAGCCCC ATCTCCTAGC GGCAGCCCAG GCGCGGAGGG AGCGAGTCCG 60

CCCCGAGGTA GGTCCAGGAC GGGCGCACAG CAGCAGCCGA GGCTGGCCGG GAGAGGGAGG 120

AAGAGGATGG CAGGGCCACG CCCCAGCCCA TGGGCCAGGC TGCTCCTGGC AGCCTTGATC 180

AGCGTCAGCC TCTCTGGGAC CTTGGCAAAC CGCTGCAAGA AGGCCCCAGT GAAGAGCTGC 240

ACGGAGTGTG TCCGTGTGGA TAAGGACTGC GCCTACTGCA CAGACGAGAT GTTCAGGGAC 300

CGGCGCTGCA ACACCCAGGC GGAGCTGCTG GCCGCGGGCT GCCAGCGGGA GAGCATCGTG 360

GTCATGGAGA GCAGCTTCCA AATCACAGAG GAGACCCAGA TTGACACCAC CCTGCGGCGC 420

AGCCAGATGT CCCCCCAAGG CCTGCGGGTC CGTCTGCGGC CCGGTGAGGA GCGGCATTTT 480

GAGCTGGAGG TGTTTGAGCC ACTGGAGAGC CCCGTGGACC TGTACATCCT CATGGACTTC 540

TCCAACTCCA TGTCCGATGA TCTGGACAAC CTCAAGAAGA TGGGGCAGAA CCTGGCTCGG 600

GTCCTGAGCC AGCTCACCAG CGACTACACT ATTGGATTTG GCAAGTTTGT GGACAAAGTC 660

AGCGTCCCGC AGACGGACAT GAGGCCTGAG AAGCTGAAGG AGCCCTGGCC CAACAGTGAC 720

CCCCCCTTCT CCTTCAAGAA CGTCATCAGC CTGACAGAAG ATGTGGATGA GTTCCGGAAT 780

AAACTGCAGG GAGAGCGGAT CTCAGGCAAC CTGGATGCTC CTGAGGGCGG CTTCGATGCC 840

ATCCTGCAGA CAGCTGTGTG CACGAGGGAC ATTGGCTGGC GCCCGGACAG CACCCACCTG 900

CTGGTCTTCT CCACCGAGTC AGCCTTCCAC TATGAGGCTG ATGGCGCCAA CGTGCTGGCT 960

GGCATCATGA GCCGCAACGA TGAACGGTGC CACCTGGACA CCACGGGCAC CTACACCCAG 1020

TACAGGACAC AGGACTACCC GTCGGTGCCC ACCCTGGTGC GCCTGCTCGC CAAGCACAAC 1080

ATCATCCCCA TCTTTGCTGT CACCAACTAC TCCTATAGCT ACTACGAGAA GCTTCACACC 1140

TATTTCCCTG TCTCCTCACT GGGGGTGCTG CAGGAGGACT CGTCCAACAT CGTGGAGCTG 1200

CTGGAGGAGG CCTTCAATCG GATCCGCTCC AACCTGGACA TCCGGGCCCT AGACAGCCCC 1260

CGAGGCCTTC GGACAGAGGT CACCTCCAAG ATGTTCCAGA AGACGAGGAC TGGGTCCTTT 1320

CACATCCGGC GGGGGGAAGT GGGTATATAC CAGGTGCAGC TGCGGGCCCT TGAGCACGTG 1380

GATGGGACGC ACGTGTGCCA GCTGCCGGAG GACCAGAAGG GCAACATCCA TCTGAAACCT 1440

TCCTTCTCCG ACGGCCTCAA GATGGACGCG GGCATCATCT GTGATGTGTG CACCTGCGAG 1500

CTGCAAAAAG AGGTGCGGTC AGCTCGCTGC AGCTTCAACG GAGACTTCGT GTGCGGACAG 1560

TGTGTGTGCA GCGAGGGCTG GAGTGGCCAG ACCTGCAACT GCTCCACCGG CTCTCTGAGT 1620

GACATTCAGC CCTGCCTGCG GGAGGGCGAG GACAAGCCGT GCTCCGGCCG TGGGGAGTGC 1680

CAGTGCGGGC ACTGTGTGTG CTACGGCGAA GGCCGCTACG AGGGTCAGTT CTGCGAGTAT 1740

GACAACTTCC AGTGTCCCCG CACTTCCGGG TTCCTCTGCA ATGACCGAGG ACGCTGCTCC 1800

ATGGGCCAGT GTGTGTGTGA GCCTGGTTGG ACAGGCCCAA GCTGTGACTG TCCCCTCAGC 1860

AATGCCACCT GCATCGACAG CAATGGGGGC ATCTGTAATG GACGTGGCCA CTGTGAGTGT 1920

GGCCGCTGCC ACTGCCACCA GCAGTCGCTC TACACGGACA CCATCTGCGA GATCAACTAC 1980

TCGGCGATCC ACCCGGGCCT CTGCGAGGAC CTACGCTCCT GCGTGCAGTG CCAGGCGTGG 2040

GGCACCGGCG AGAAGAAGGG GCGCACGTGT GAGGAATGCA ACTTCAAGGT CAAGATGGTG 2100

GACGAGCTTA AGAGAGCCGA GGAGGTGGTG GTGCGCTGCT CCTTCCGGGA CGAGGATGAC 2160

GACTGCACCT ACAGCTACAC CATGGAAGGT GACGGCGCCC CTGGGCCCAA CAGCACTGTC 2220

CTGGTGCACA AGAAGAAGGA CTGCCCTCCG GGCTCCTTCT GGTGGCTCAT CCCCCTGCTC 2280

CTCCTCCTCC TGCCGCTCCT GGCCCTGCTA CTGCTGCTAT GCTGGAAGTA CTGTGCCTGC 2340

TGCAAGGCCT GCCTGGCACT TCTCCCGTGC TGCAACCGAG GTCACATGGT GGGCTTTAAG 2400

GAAGACCACT ACATGCTGCG GGAGAACCTG ATGGCCTCTG ACCACTTGGA CACGCCCATG 2460

CTGCGCAGCG GGAACCTCAA GGGCCGTGAC GTGGTCCGCT GGAAGGTCAC CAACAACATG 2520

CAGCGGCCTG GCTTTGCCAC TCATGCCGCC AGCATCAACC CCACAGAGCT GGTGCCCTAC 2580

GGGCTGTCCT TGCGCCTGGC CCGCCTTTGC ACCGAGAACC TGCTGAAGCC TGACACTCGG 2640

GAGTGCGCCC AGCTGCGCCA GGAGGTGGAG GAGAACCTGA ACGAGGTCTA CAGGCAGATC 2700

TCCGGTGTAC ACAAGCTCCA GCAGACCAAG TTCCGGCAGC AGCCCAATGC CGGGAAAAAG 2760

CAAGACCACA CCATTGTGGA CACAGTGCTG ATGGCGCCCC GCTCGGCCAA GCCGGCCCTG 2820

CTGAAGCTTA CAGAGAAGCA GGTGGAACAG AGGGCCTTCC ACGACCTCAA GGTGGCCCCC 2880

GGCTACTACA CCCTCACTGC AGACCAGGAC GCCCGGGGCA TGGTGGAGTT CCAGGAGGGC 2940

GTGGAGCTGG TGGACGTACG GGTGCCCCTC TTTATCCGGC CTGAGGATGA CGACGAGAAG 3000

CAGCTGCTGG TGGAGGCCAT CGACGTGCCC GCAGGCACTG CCACCCTCGG CCGCCGCCTG 3060

GTAAACATCA CCATCATCAA GGAGCAAGCC AGAGACGTGG TGTCCTTTGA GCAGCCTGAG 3120

TTCTCGGTCA GCCGCGGGGA CCAGGTGGCC CGCATCCCTG TCATCCGGCG TGTCCTGGAC 3180

GGCGGGAAGT CCCAGGTCTC CTACCGCACA CAGGATGGCA CCGCGCAGGG CAACCGGGAC 3240

TACATCCCCG TGGAGGGTGA GCTGCTGTTC CAGCCTGGGG AGGCCTGGAA AGAGCTGCAG 3300

GTGAAGCTCC TGGAGCTGCA AGAAGTTGAC TCCCTCCTGC GGGGCCGCCA GGTCCGCCGT 3360

TTCCACGTCC AGCTCAGCAA CCCTAAGTTT GGGGCCCACC TGGGCCAGCC CCACTCCACC 3420

ACCATCATCA TCAGGGACCC AGATGAACTG GACCGGAGCT TCACGAGTCA GATGTTGTCA 3480

TCACAGCCAC CCCCTCACGG CGACCTGGGC GCCCCGCAGA AGCCCAATGC TAAGGCCGCT 3540

GGGTCCAGGA AGATCCATTT CAACTGGCTG CCCCCTTCTG GCAAGCCAAT GGGGTACAGG 3600

GTAAAGTACT GGATTCAGGG TGACTCCGAA TCCGAAGCCC ACCTGCTCGA CAGCAAGGTG 3660

CCCTCAGTGG AGCTCACCAA CCTGTACCCG TATTGCGACT ATGAGATGAA GGTGTGCGCC 3720

TACGGGGCTC AGGGCGAGGG ACCCTACAGC TCCCTGGTGT CCTGCCGCAC CCACCAGGAA 3780

GTGCCCAGCG AGCCAGGGCG TCTGGCCTTC AATGTCGTCT CCTCCACGGT GACCCAGCTG 3840

AGCTGGGCTG AGCCGGCTGA GACCAACGGT GAGATCACAG CCTACGAGGT CTGCTATGGC 3900

CTGGTCAACG ATGACAACCG ACCTATTGGG CCCATGAAGA AAGTGCTGGT TGACAACCCT 3960

AAGAACCGGA TGCTGCTTAT TGAGAACCTT CGGGAGTCCC AGCCCTACCG CTACACGGTG 4020

AAGGCGCGCA ACGGGGCCGG CTGGGGGCCT GAGCGGGAGG CCATCATCAA CCTGGCCACC 4080

CAGCCCAAGA GGCCCATGTC CATCCCCATC ATCCCTGACA TCCCTATCGT GGACGCCCAG 4140

AGCGGGGAGG ACTACGACAG CTTCCTTATG TACAGCGATG ACGTTCTACG CTCTCCATCG 4200

GGCAGCCAGA GGCCCAGCGT CTCCGATGAC ACTGAGCACC TGGTGAATGG CCGGATGGAC 4260

TTTGCCTTCC CGGGCAGCAC CAACTCCCTG CACAGGATGA CCACGACCAG TGCTGCTGCC 4320

TATGGCACCC ACCTGAGCCC ACACGTGCCC CACCGCGTGC TAAGCACATC CTCCACCCTC 4380

ACACGGGACT ACAACTCACT GACCCGCTCA GAACACTCAC ACTCGACCAC ACTGCCGAGG 4440 

GACTACTCCA CCCTCACCTC CGTCTCCTCC CACGACTCTC GCCTGACTGC TGGTGTGCCC 4500 

GACACGCCCA CCCGCCTGGT GTTCTCTGCC CTGGGGCCCA CATCTCTCAG AGTGAGCTGG 4560 

CAGGAGCCGC GGTGCGAGCG GCCGCTGCAG GGCTACAGTG TGGAGTACCA GCTGCTGAAC 4620 

GGCGGTGAGC TGCATCGGCT CAACATCCCC AACCCTGCCC AGACCTCGGT GGTGGTGGAA 4680 

GACCTCCTGC CCAACCACTC CTACGTGTTC CGCGTGCGGG CCCAGAGCCA GGAAGGCTGG 4740 

GGCCGAGAGC GTGAGGGTGT CATCACCATT GAATCCCAGG TGCACCCGCA GAGCCCACTG 4800 

TGTCCCCTGC CAGGCTCCGC CTTCACTTTG AGCACTCCCA GTGCCCCAGG CCCGCTGGTG 4860 

TTCACTGCCC TGAGCCCAGA CTCGCTGCAG CTGAGCTGGG AGCGGCCACG GAGGCCCAAT 4920 

GGGGATATCG TCGGCTACCT GGTGACCTGT GAGATGGCCC AAGGAGGAGG GCCAGCCACC 4980 

GCATTCCGGG TGGATGGAGA CAGCCCCGAG AGCCGGCTGA CCGTGCCGGG CCTCAGCGAG 5040 

AACGTGCCCT ACAAGTTCAA GGTGCAGGCC AGGACCACTG AGGGCTTCGG GCCAGAGCGC 5100 

GAGGGCATCA TCACCATAGA GTCCCAGGAT GGAGGACCCT TCCCGCAGCT GGGCAGCCGT 5160 

GCCGGGCTCT TCCAGCACCC GCTGCAAAGC GAGTACAGCA GCATCACCAC CACCCACACC 5220 

AGCGCCACCG AGCCCTTCCT AGTGGATGGG CCGACCCTGG GGGCCCAGCA CCTGGAGGCA 5280 

GGCGGCTCCC TCACCCGGCA TGTGACCCAG GAGTTTGTGA GCCGGACACT GACCACCAGC 5340

GGAACCCTTA GCACCCACAT GGACCAACAG TTCTTCCAAA CTTGACCGCA CCCTGCCCCA 5400 

CCCCCGCCAT GTCCCACTAG GCGTCCTCCC GACTCCTCTC CCGGAGCCTC CTCAGCTACT 5460 

CCATCCTTGC ACCCCTGGGG GCCCAGCCCA CCCGCATGCA CAGAGCAGGG GCTAGGTGTC 5520 

TCCTGGGAGG CATGAAGGGG GCAAGGTCCG TCCTCTGTGG GCCCAAACCT ATTTGTAACC 5580 

AAAGAGCTGG GAGCAGCACA AGGACCCAGC CTTTGTTCTG CACTTAATAA ATGGTTTTGC 5640 
ACTG 



Seq ID NO: 130 Protein sequence: 
Protein Accession #: NP_000204

1 11 21 31 41 51 

I I I I I I 

MAGPRPSPWA RLLLAALISV SLSGTLANRC KKAPVKSCTE CVRVDKDCAY CTDEMFRDRR 60 

CNTQAELLAA GCQRESIVVM ESSFQITEET QIDTTLRRSQ MSPQGLRVRL RPGEERHFEL 120

EVFEPLESPV DLYILMDFSN SMSDDLDNLK KMGQNLARVL SQLTSDYTIG FGKFVDKVSV 180 

PQTDMRPEKL KEPWPNSDPP FSFKNVISLT EDVDEFRNKL QGERISGNLD APEGGFDAIL 240 

QTAVCTRDIG WRPDSTHLLV FSTESAFHYE ADGANVLAGI MSRNDERCHL DTTGTYTQYR 300 

TQDYPSVPTL VRLLAKHNII PIFAVTNYSY SYYEKLHTYF PVSSLGVLQE DSSNIVELLE 360

EAFNRIRSNL DIRALDSPRG LRTEVTSKMF QKTRTGSFHI RRGEVGIYQV QLRALEHVDG 420 

THVCQLPEDQ KGNIHLKPSF SDGLKMDAGI ICDVCTCELQ KEVRSARCSF NGDFVCGQCV 480 

CSEGWSGQTC NCSTGSLSDI QPCLREGEDK PCSGRGECQC GHCVCYGEGR YEGQFCEYDN 540

FQCPRTSGFL CNDRGRCSMG QCVCEPGWTG PSCDCPLSNA TCIDSNGGIC NGRGHCECGR 600

CHCHQQSLYT DTICEINYSA IHPGLCEDLR SCVQCQAWGT GEKKGRTCEE CNFKVKMVDE 660 

LKRAEEVVVR CSFRDEDDDC TYSYTMEGDG APGPNSTVLV HKKKDCPPGS FWWLIPLLLL 720

LLPLLALLLL LCWKYCACCK ACLALLPCCN RGHMVGFKED HYMLRENLMA SDHLDTPMLR 780

SGNLKGRDVV RWKVTNNMQR PGFATHAASI NPTELVPYGL SLRLARLCTE NLLKPDTREC 840

AQLRQEVEEN LNEVYRQISG VHKLQQTKFR QQPNAGKKQD HTIVDTVLMA PRSAKPALLK 900 

LTEKQVEQRA FHDLKVAPGY YTLTADQDAR GMVEFQEGVE LVDVRVPLFI RPEDDDEKQL 960 

LVEAIDVPAG TATLGRRLVN ITIIKEQARD VVSFEQPEFS VSRGDQVARI PVIRRVLDGG 1020

KSQVSYRTQD GTAQGNRDYI PVEGELLFQP GEAWKELQVK LLELQEVDSL LRGRQVRRFH 1080 

VQLSNPKFGA HLGQPHSTTI IIRDPDELDR SFTSQMLSSQ PPPHGDLGAP QNPNAKAAGS 1140 

RKIHFNWLPP SGKPMGYRVK YWIQGDSESE AHLLDSKVPS VELTNLYPYC DYEMKVCAYG 1200 

AQGEGPYSSL VSCRTHQEVP SEPGRLAFNV VSSTVTQLSW AEPAETNGEI TAYEVCYGLV 1260 

NDDNRPIGPM KKVLVDNPKN RMLLIENLRE SQPYRYTVKA RNGAGWGPER EAIINLATQP 1320 

KRPMSIPIIP DIPIVDAQSG EDYDSFLMYS DDVLRSPSGS QRPSVSDDTE HLVNGRMDFA 1380

FPGSTNSLHR MTTTSAAAYG THLSPHVPHR VLSTSSTLTR DYNSLTRSEH SHSTTLPRDY 1440

STLTSVSSHD SRLTAGVPDT PTRLVFSALG PTSLRVSWQE PRCERPLQGY SVEYQLLNGG 1500 

ELHRLNIPNP AQTSVVVEDL LPNHSYVFRV RAQSQEGWGR EREGVITIES QVHPQSPLCP 1560

LPGSAFTLST PSAPGPLVFT ALSPDSLQLS WERPRRPNGD IVGYLVTCEM AQGGGPATAF 1620 

RVDGDSPESR LTVPGLSENV PYKFKVQART TEGFGPEREG IITIESQDGG PFPQLGSRAG 1680 

LFQHPLQSEY SSITTTHTSA TEPFLVDGPT LGAQHLEAGG SLTRHVTQEF VSRTLTTSGT 1740
LSTHMDQQFF QT 

Seq ID NO: 131 DNA sequence 
Nucleic Acid Accession #: BC004372 
Coding sequence: 132..2231

1 11 21 31 41 51

I I I I I I

CCTCGTGCCG CGGACCCCAG CCTCTGCCAG GTTCGGTCCG CCATCCTCGT CCCGTCCTCC 60 

GCCGGCCCCT GCCCCGCGCC CAGGGATCCT CCAGCTCCTT TCGCCCGCGC CCTCCGTTCG 120 

CTCCGGACAC CATGGACAAG TTTTGGTGGC ACGCAGCCTG GGGACTCTGC CTCGTGCCGC 180 

TGAGCCTGGC GCAGATCGAT TTGAATATAA CCTGCCGCTT TGCAGGTGTA TTCCACGTGG 240 

AGAAAAATGG TCGCTACAGC ATCTCTCGGA CGGAGGCCGC TGACCTCTGC AAGGCTTTCA 300 

ATAGCACCTT GCCCACAATG GCCCAGATGG AGAAAGCTCT GAGCATCGGA TTTGAGACCT 360 

GCAGGTATGG GTTCATAGAA GGGCATGTGG TGATTCCCCG GATCCACCCC AACTCCATCT 420 

GTGCAGCAAA CAACACAGGG GTGTACATCC TCACATCCAA CACCTCCCAG TATGACACAT 480 

ATTGCTTCAA TGCTTCAGCT CCACCTGAAG AAGATTGTAC ATCAGTCACA GACCTGCCCA 540 

ATGCCTTTGA TGGACCAATT ACCATAACTA TTGTTAACCG TGATGGCACC CGCTATGTCC 600 

AGAAAGGAGA ATACAGAACG AATCCTGAAG ACATCTACCC CAGCAACCCT ACTGATGATG 660 

ACGTGAGCAG CGGCTCCTCC AGTGAAAGGA GCAGCACTTC AGGAGGTTAC ATCTTTTACA 720 

CCTTTTCTAC TGTACACCCC ATCCCAGACG AAGACAGTCC CTGGATCACC GACAGCACAG 780 

ACAGAATCCC TGCTACCAGT ACGTCTTCAA ATACCATCTC AGCAGGCTGG GAGCCAAATG 840 

AAGAAAATGA AGATGAAAGA GACAGACACC TCAGTTTTTC TGGATCAGGC ATTGATGATG 900 

ATGAAGATTT TATCTCCAGC ACCATTTCAA CCACACCACG GGCTTTTGAC CACACAAAAC 960 

AGAACCAGGA CTGGACCCAG TGGAACCCAA GCCATTCAAA TCCGGAAGTG CTACTTCAGA 1020 

CAACCACAAG GATGACTGAT GTAGACAGAA ATGGCACCAC TGCTTATGAA GGAAACTGGA 1080 

ACCCAGAAGC ACACCCTCCC CTCATTCACC ATGAGCATCA TGAGGAAGAA GAGACCCCAC 1140 

ATTCTACAAG CACAATCCAG GCAACTCCTA GTAGTACAAC GGAAGAAACA GCTACCCAGA 1200 

AGGAACAGTG GTTTGGCAAC AGATGGCATG AGGGATATCG CCAAACACCC AGAGAAGACT 1260 



CCCATTCGAC AACAGGGACA GCTGCAGCCT CAGCTCATAC CAGCCATCCA ATGCAAGGAA 1320
GGACAACACC AAGCCCAGAG GACAGTTCCT GGACTGATTT CTTCAACCCA ATCTCACACC 1380
CCATGGGACG AGGTCATCAA GCAGGAAGAA GGATGGATAT GGACTCCAGT CATAGTACAA 1440
CGCTTCAGCC TACTGCAAAT CCAAACACAG GTTTGGTGGA AGATTTGGAC AGGACAGGAC 1500
CTCTTTCAAT GACAACGCAG CAGAGTAATT CTCAGAGCTT CTCTACATCA CATGAAGGCT 1560
TGGAAGAAGA TAAAGACCAT CCAACAACTT CTACTCTGAC ATCAAGCAAT AGGAATGATG 1620
TCACAGGTGG AAGAAGAGAC CCAAATCATT CTGAAGGCTC AACTACTTTA CTGGAAGGTT 1680
ATACCTCTCA TTACCCACAC ACGAAGGAAA GCAGGACCTT CATCCCAGTG ACCTCAGCTA 1740
AGACTGGGTC CTTTGGAGTT ACTGCAGTTA CTGTTGGAGA TTCCAACTCT AATGTCAATC 1800
GTTCCTTATC AGGAGACCAA GACACATTCC ACCCCAGTGG GGGGTCCCAT ACCACTCATG 1860
GATCTGAATC AGATGGACAC TCACATGGGA GTCAAGAAGG TGGAGCAAAC ACAACCTCTG 1920
GTCCTATAAG GACACCCCAA ATTCCAGAAT GGCTGATCAT CTTGGCATCC CTCTTGGCCT 1980
TGGCTTTGAT TCTTGCAGTT TGCATTGCAG TCAACAGTCG AAGAAGGTGT GGGCAGAAGA 2040
AAAAGCTAGT GATCAACAGT GGCAATGGAG CTGTGGAGGA CAGAAAGCCA AGTGGACTCA 2100
ACGGAGAGGC CAGCAAGTCT CAGGAAATGG TGCATTTGGT GAACAAGGAG TCGTCAGAAA 2160
CTCCAGACCA GTTTATGACA GCTGATGAGA CAAGGAACCT GCAGAATGTG GACATGAAGA 2220
TTGGGGTGTA ACACCTACAC CATTATCTTG GAAAGAAACA ACCGTTGGAA ACATAACCAT 2280
TACAGGGAGC TGGGACACTT AACAGATGCA ATGTGCTACT GATTGTTTCA TTGCGAATCT 2340
TTTTTAGCAT AAAATTTTCT ACTCTTAAAA AAAAAAAAAA AAAAAAA

Seq ID NO: 132 Protein sequence:
Protein Accession #: AAH04372

1 11 21 31 41 51

I I I I I I

MDKFWWHAAW GLCLVPLSLA QIDLNITCRF AGVFHVEKNG RYSISRTEAA DLCKAFNSTL 60
PTMAQMEKAL SIGFETCRYG FIEGHVVIPR IHPNSICAAN NTGVYILTSN TSQYDTYCFN 120
ASAPPEEDCT SVTDLPNAFD GPITITIVNR DGTRYVQKGE YRTNPEDIYP SNPTDDDVSS 180
GSSSERSSTS GGYIFYTFST VHPIPDEDSP WITDSTDRIP ATSTSSNTIS AGWEPNEENE 240
DERDRHLSFS GSGIDDDEDF ISSTISTTPR AFDHTKQNQD WTQWNPSHSN PEVLLQTTTR 300
MTDVDRNGTT AYEGNWNPEA HPPLIHHEHH EEEETPHSTS TIQATPSSTT EETATQKEQW 360
FGNRWHEGYR QTPREDSHST TGTAAASAHT SHPMQGRTTP SPEDSSWTDF FNPISHPMGR 420
GHQAGRRMDM DSSHSTTLQP TANPNTGLVE DLDRTGPLSM TTQQSNSQSF STSHEGLEED 480
KDHPTTSTLT SSNRNDVTGG RRDPNHSEGS TTLLEGYTSH YPHTKESRTF IPVTSAKTGS 540
FGVTAVTVGD SNSNVNRSLS GDQDTFHPSG GSHTTHGSES DGHSHGSQEG GANTTSGPIR 600
TPQIPEWLII LASLLALALI LAVCIAVNSR RRCGQKKKLV INSGNGAVED RKPSGLNGEA 660
SKSQEMVHLV NKESSETPDQ FMTADETRNL QNVDMKIGV



Seq ID NO: 133 DNA sequence 
Nucleic Acid Accession #: NM_002882
Coding sequence: 150-755

1 11 21 31 41 51

I I I I I I

CGAGGTTCGG GTCGTGGGGC GGAGGGAAGA GCGGGCGGGC GGGAGGCGCC GGCGCCAGAC 60
GCGGAGGGAA GGAGCTACGA GTAGCCGCCG AGAGGCCGCG GAGCCAGCGA CGACCGACCC 120
AGCCGAGCCG CCGCCGCCGC CGCGCCCCCA TGGCGGCCGC CAAGGACACT CATGAGGACC 180
ATGATACTTC CACTGAGAAT ACAGACGAGT CCAACCATGA CCCTCAGTTT GAGCCAATAG 240
TTTCTCTTCC TGAGCAAGAA ATTAAAACAC TGGAAGAAGA TGAAGAGGAA CTTTTTAAAA 300
TGCGGGCAAA ACTGTTCCGA            AGAACGATCT CCCAGAATGG AAGGAGCGAG 360
GCACTGGTGA CGTCAAGCTC CTGAAGCACA AGGAGAAAGG GGCCATCCGC CTCCTCATGC 420
GGAGGGACAA GACCCTGAAG ATCTGTGCCA ACCACTACAT CACGCCGATG ATGGAGCTGA 480
AGCCCAACGC AGGTAGCGAC CGTGCCTGGG TCTGGAACAC CCACGCTGAC TTCGCCGACG 540
AGTGCCCCAA GCCAGAGCTG CTGGCCATCC GCTTCCTGAA TGCTGAGAAT GCACAGAAAT 600
TCAAAACAAA GTTTGAAGAA TGCAGGAAAG AGATCGAAGA GAGAGAAAAG AAAGCAGGAT 660
CAGGCAAAAA TGATCATGCC GAAAAAGTGG CGGAAAAGCT AGAAGCTCTC TCGGTGAAGG 720
AGGAGACCAA GGAGGATGCT GAGGAGAAGC AATAAATCGT CTTATTTTAT TTTCTTTTCC 780
TCTCTTTCCT TTCCTTTTTT TAAAAAATTT TACCCTGCCC CTCTTTTTCG GTTTGTTTTT 840
ATTCTTTCAT TTTTACAAGG GACGTTATAT AAAGAACTGA ACTC



Seq ID NO: 134 Protein sequence: 
Protein Accession #: NP_002873 

1 11 21 31 41 51

I I I I I I

MAAAKDTHED HDTSTENTDE SNHDPQFEPI VSLPEQEIKT LEEDEEELFK MRAKLFRFAS 60
ENDLPEWKER GTGDVKLLKH KEKGAIRLLM RRDKTLKICA NHYITPMMEL KPNAGSDRAW 120
VWNTHADFAD ECPKPELLAI RFLNAENAQK FKTKFEECRK EIEEREKKAG SGKNDHAEKV 180
AEKLEALSVK EETKEDAEEK Q

Seq ID NO: 135 DNA sequence 

Nucleic Acid Accession #: NM_000077.2 

Coding sequence: 277-742 



1 11 21 31 41 51

I I I I I I

CCCAACCTGG GGCGACTTCA GGTGTGCCAC ATTCGCTAAG TGCTCGGAGT TAATAGCACC 60
TCCTCCGAGC ACTCGCTCAC GGCGTCCCCT TGCCTGGAAA GATACCGCGG TCCCTCCAGA 120
GGATTTGAGG GACAGGGTCG GAGGGGGCTC TTCCGCCAGC ACCGGAGGAA GAAAGAGGAG 180
GGGCTGGCTG GTCACCAGAG GGTGGGGCGG ACCGCGTGCG CTCGGCGGCT GCGGAGAGGG 240
GGAGAGCAGG CAGCGGGCGG CGGGGAGCAG CATGGAGCCG GCGGCGGGGA GCAGCATGGA 300
GCCTTCGGCT GACTGGCTGG CCACGGCCGC GGCCCGGGGT CGGGTAGAGG AGGTGCGGGC 360
GCTGCTGGAG GCGGGGGCGC TGCCCAACGC ACCGAATAGT TACGGTCGGA GGCCGATCCA 420
GGTCATGATG ATGGGCAGCG CCCGAGTGGC GGAGCTGCTG CTGCTCCACG GCGCGGAGCC 480
CAACTGCGCC GACCCCGCCA CTCTCACCCG ACCCGTGCAC GACGCTGCCC GGGAGGGCTT 540
CCTGGACACG CTGGTGGTGC TGCACCGGGC CGGGGCGCGG CTGGACGTGC GCGATGCCTG 600
GGGCCGTCTG CCCGTGGACC TGGCTGAGGA GCTGGGCCAT CGCGATGTCG CACGGTACCT 660
GCGCGCGGCT GCGGGGGGCA CCAGAGGCAG TAACCATGCC CGCATAGATG CCGCGGAAGG 720
TCCCTCAGAC ATCCCCGATT GAAAGAACCA GAGAGGCTCT GAGAAACCTC GGGAAACTTA 780
GATCATCAGT CACCGAAGGT CCTACAGGGC CACAACTGCC CCCGCCACAA CCCACCCCGC 840
TTTCGTAGTT TTCATTTAGA AAATAGAGCT TTTAAAAATG TCCTGCCTTT TAACGTAGAT 900
ATATGCCTTC CCCCACTACC GTAAATGTCC ATTTATATCA TTTTTTATAT ATTCTTATAA 960
AAATGTAAAA AAGAAAAACA CCGCTTCTGC CTTTTCACTG TGTTGGAGTT TTCTGGAGTG 1020
AGCACTCACG CCCTAAGCGC ACATTCATGT GGGCATTTCT TGCGAGCCTC GCAGCCTCCG 1080
GAAGCTGTCG ACTTCATGAC AAGCATTTTG TGAACTAGGG AAGCTCAGGG GGGTTACTGG 1140
CTTCTCTTGA GTCACACTGC TAGCAAATGG CAGAACCAAA GCTCAAATAA AAATAAAATA 1200
ATTTTCATTC ATTCACTC

Seq ID NO: 136 Protein sequence:
Protein Accession #: NP_000068.1

1 11 21 31 41 51

I I I I I I

MEPAAGSSME PSADWLATAA ARGRVEEVRA LLEAGALPNA PNSYGRRPIQ VMMMGSARVA 60
ELLLLHGAEP NCADPATLTR PVHDAAREGF LDTLVVLHRA GARLDVRDAW GRLPVDLAEE 120
LGHRDVARYL RAAAGGTRGS NHARIDAAEG PSDIPD



Seq ID NO: 137 DNA sequence

Nucleic Acid Accession #: NM_058196.1
Coding sequence: 104-421



1 11 21 31 41 51

I I I I I I

TGTGTGGGGG TCTGCTTGGC GGTGAGGGGG CTCTACACAA GCTTCCTTTC CGTCATGCCG 60 

GCCCCCACCC TGGCTCTGAC CATTCTGTTC TCTCTGGCAG GTCATGATGA TGGGCAGCGC 120 

CCGAGTGGCG GAGCTGCTGC TGCTCCACGG CGCGGAGCCC AACTGCGCCG ACCCCGCCAC 180 

TCTCACCCGA CCCGTGCACG ACGCTGCCCG GGAGGGCTTC CTGGACACGC TGGTGGTGCT 240

GCACCGGGCC GGGGCGCGGC TGGACGTGCG CGATGCCTGG GGCCGTCTGC CCGTGGACCT 300 

GGCTGAGGAG CTGGGCCATC GCGATGTCGC ACGGTACCTG CGCGCGGCTG CGGGGGGCAC 360 

CAGAGGCAGT AACCATGCCC GCATAGATGC CGCGGAAGGT CCCTCAGACA TCCCCGATTG 420 

AAAGAACCAG AGAGGCTCTG AGAAACCTCG GGAAACTTAG ATCATCAGTC ACCGAAGGTC 480 

CTACAGGGCC ACAACTGCCC CCGCCACAAC CCACCCCGCT TTCGTAGTTT TCATTTAGAA 540

AATAGAGCTT TTAAAAATGT CCTGCCTTTT AACGTAGATA TAAGCCTTCC CCCACTACCG 600 

TAAATGTCCA TTTATATCAT TTTTTATATA TTCTTATAAA AATGTAAAAA AGAAAAACAC 660 

CGCTTCTGCC TTTTCACTGT GTTGGAGTTT TCTGGAGTGA GCACTCACGC CCTAAGCGCA 720 

CATTCATGTG GGCATTTCTT GCGAGCCTCG CAGCCTCCGG AAGCTGTCGA CTTCATGACA 780 

AGCATTTTGT GAACTAGGGA AGCTCAGGGG GGTTACTGGC TTCTCTTGAG TCACACTGCT 840
AGCAAATGGC AGAACCAAAG CTCAAATAAA AATAAAATAA TTTTCATTCA TTCACTC 



Seq ID NO: 138 Protein sequence: 
Protein Accession #: NP_478103.1

1 11 21 31 41 51

I I I I I I

MMMGSARVAE LLLLHGAEPN CADPATLTRP VHDAAREGFL DTLVVLHRAG ARLDVRDAWG 60
RLPVDLAEEL GHRDVARYLR AAAGGTRGSN HARIDAAEGP SDIPD



Seq ID NO: 139 DNA sequence 

Nucleic Acid Accession #: NM_058197.1 

Coding sequence: 272-684 



1 11 21 31 41 51 

I I I I I I

CCCAACCTGG GGCGACTTCA GGTGTGCCAC ATTCGCTAAG TGCTCGGAGT TAATAGCACC 60 

TCCTCCGAGC ACTCGCTCAC GGCGTCCCCT TGCCTGGAAA GATACCGCGG TCCCTCCAGA 120 

GGATTTGAGG GACAGGGTCG GAGGGGGCTC TTCCGCCAGC ACCGGAGGAA GAAAGAGGAG 180

GGGCTGGCTG GTCACCAGAG GGTGGGGCGG ACCGCGTGCG CTCGGCGGCT GCGGAGAGGG 240 

GGAGAGCAGG CAGCGGGCGG CGGGGAGCAG CATGGAGCCG GCGGCGGGGA GCAGCATGGA 300 

GCCGGCGGCG GGGAGCAGCA TGGAGCCTTC GGCTGACTGG CTGGCCACGG CCGCGGCCCG 360

GGGTCGGGTA GAGGAGGTGC GGGCGCTGCT GGAGGCGGGG GCGCTGCCCA ACGCACCGAA 420

TAGTTACGGT CGGAGGCCGA TCCAGGTGGG TAGAAGGTCT GCAGCGGGAG CAGGGGATGG 480

CGGGCGACTC TGGAGGACGA AGTTTGCAGG GGAATTGGAA TCAGGTAGCG CTTCGATTCT 540 

CCGGAAAAAG GGGAGGCTTC CTGGGGAGTT TTCAGAAGGG GTTTGTAATC ACAGACCTCC 600 

TCCTGGCGAC GCCCTGGGGG CTTGGGAAAC CAAGGAAGAG GAATGAGGAG CCACGCGCGT 660 

ACAGATCTCT CGAATGCTGA GAAGATCTGA AGGGGGGAAC ATATTTGTAT TAGATGGAAG 720 

TCATGATGAT GGGCAGCGCC CGAGTGGCGG AGCTGCTGCT GCTCCACGGC GCGGAGCCCA 780 

ACTGCGCCGA CCCCGCCACT CTCACCCGAC CCGTGCACGA CGCTGCCCGG GAGGGCTTCC 840 

TGGACACGCT GGTGGTGCTG CACCGGGCCG GGGCGCGGCT GGACGTGCGC GATGCCTGGG 900 

GCCGTCTGCC CGTGGACCTG GCTGAGGAGC TGGGCCATCG CGATGTCGCA CGGTACCTGC 960 

GCGCGGCTGC GGGGGGCACC AGAGGCAGTA ACCATGCCCG CATAGATGCC GCGGAAGGTC 1020

CCTCAGACAT CCCCGATTGA AAGAACCAGA GAGGCTCTGA GAAACCTCGG GAACTTAGAT 1080 

CATCAGTCAC CGAAGGTCCT ACAGGGCCAC AACTGCCCCC GCCACAACCC ACCCCGCTTT 1140 

CGTAGTTTTC ATTTAGAAAA TAGAGCTTTT AAAAATGTCC TGCCTTTTAA CGTAGATATA 1200 

TGCCTTCCCC CACTACCGTA AATGTCCATT TATATCATTT TTTATATATT CTTATAAAAA 1260 

TGTAAAAAAG AAAAACACCG CTTCTGCCTT TTCACTGTGT TGGAGTTTTC TGGAGTGAGC 1320 

ACTCACGCCC TAAGCGCACA TTCATGTGGG CATTTCTTGC GAGCCTCGCA GCCTCCGGAA 1380 

GCTGTCGACT TCATGACAAG CATTTTGTGA ACTAGGGAAG CTCAGGGGGG TTACTGGCTT 1440 

CTCTTGAGTC ACACTGCTAG CAAATGGCAG AACCAAAGCT CAAATAAAAA TAAAATAATT 1500 



TTCATTCATT CACTC

Seq ID NO: 140 Protein sequence:
Protein Accession #: NP_478104.1

1 11 21 31 41 51 

I I I I I I 

MEPAAGSSME PAAGSSMEPS ADWLATAAAR GRVEEVRALL EAGALPNAPN SYGRRPIQVG 60 
RRSAAGAGDG GRLWRTKFAG ELESGSASIL RKKGRLPGEF SEGVCNHRPP PGDALGAWET 120 
KEEE 

Seq ID NO: 141 DNA sequence 

Nucleic Acid Accession #: NM_058195.1 

Coding sequence: 163-684 

1 11 21 31 41 51 



CCTCCCTACG GGCGCCTCCG GCAGCCCTTC CCGCGTGCGC AGGGCTCAGA GCCGTTCCGA 60 

GATCTTGGAG GTCCGGGTGG GAGTGGGGGT GGGGTGGGGG TGGGGGTGAA GGTGGGGGGC 120 

GGGCGCGCTC AGGGAAGGCG GGTGCGCGCC TGCGGGGCGG AGATGGGCAG GGGGCGGTGC 180

GTGGGTCCCA GTCTGCAGTT AAGGGGGCAG GAGTGGCGCT GCTCACCTCT GGTGCCAAAG 240 

GGCGGCGCAG CGGCTGCCGA GCTCGGCCCT GGAGGCGGCG AGAACATGGT GCGCAGGTTC 300 

TTGGTGACCC TCCGGATTCG GCGCGCGTGC GGCCCGCCGC GAGTGAGGGT TTTCGTGGTT 360 

CACATCCCGC GGCTCACGGG GGAGTGGGCA GCGCCAGGGG GGCCCGCCGC TGTGGCCCTC 420 

GTGCTGATGC TACTGAGGAG CCAGCGTCTA GGGCAGCAGC CGCTTCCTAG AAGACCAGGT 480

CATGATGATG GGCAGCGCCC GAGTGGCGGA GCTGCTGCTG CTCCACGGCG CGGAGCCCAA 540

CTGCGCCGAC CCCGCCACTC TCACCCGACC CGTGCACGAC GCTGCCCGGG AGGGCTTCCT 600 

GGACACGCTG GTGGTGCTGC ACCGGGCCGG GGCGCGGCTG GACGTGCGCG ATGCCTGGGG 660 

CCGTCTGCCC GTGGACCTGG CTGAGGAGCT GGGCCATCGC GATGTCGCAC GGTACCTGCG 720 

CGCGGCTGCG GGGGGCACCA GAGGCAGTAA CCATGCCCGC ATAGATGCCG CGGAAGGTCC 780

CTCAGACATC CCCGATTGAA AGAACCAGAG AGGCTCTGAG AAACCTCGGG AAACTTAGAT 840 

CATCAGTCAC CGAAGGTCCT ACAGGGCCAC AACTGCCCCC GCCACAACCC ACCCCGCTTT 900 

CGTAGTTTTC ATTTAGAAAA TAGAGCTTTT AAAAATGTCC TGCCTTTTAA CGTAGATATA 960 

TGCCTTCCCC CACTACCGTA AATGTCCATT TATATCATTT TTTATATATT CTTATAAAAA 1020 

TGTAAAAAAG AAAAACACCG CTTCTGCCTT TTCACTGTGT TGGAGTTTTC TGGAGTGAGC 1080

ACTCACGCCC TAAGCGCACA TTCATGTGGG CATTTCTTGC GAGCCTCGCA GCCTCCGGAA 1140 

GCTGTCGACT TCATGACAAG CATTTTGTGA ACTAGGGAAG CTCAGGGGGG TTACTGGCTT 1200 

CTCTTGAGTC ACACTGCTAG CAAATGGCAG AACCAAAGCT CAAATAAAAA TAAAATAATT 1260 
TTCATTCATT CACTC 



Seq ID NO: 142 Protein sequence: 
Protein Accession #: NP_478102.1



1 11 21 31 41 51

I I I I I I

MGRGRCVGPS LQLRGQEWRC SPLVPKGGAA AAELGPGGGE NMVRRFLVTL RIRRACGPPR 60
VRVFVVHIPR LTGEWAAPGA PAAVALVLML LRSQRLGQQP LPRRPGHDDG QRPSGGAAAA 120
PRRGAQLRRP RHSHPTRARR CPGGLPGHAG GAAPGRGAAG RARCLGPSAR GPG 



Seq ID NO: 143 DNA sequence 
Nucleic Acid Accession #: NM_018131
Coding sequence: 412.. 1107 



1 11 21 31 41 51

I I I I I I

GAAATTGCAC ACTTAAAGAC ATCAGTGGAT GAAATCACAA GTGGGAAAGG AAAGCTGACT 60

GATAAAGAGA GACAGAGACT TTTGGAGAAA ATTCGAGTCC TTGAGGCTGA GAAGGAGAAG 120

AATGCTTATC AACTCACAGA GAAGGACAAA GAAATACAGC GACTGAGAGA CCAACTGAAG 180

GCCAGATATA GTACTACCGC ATTGCTTGAA CAGCTGGAAG AGACAACGAG AGAAGGAGAA 240 

AGGAGGGAGC AGGTGTTGAA AGCCTTATCT GAAGAGAAAG ACGTATTGAA ACAACAGTTG 300 

TCTGCTGCAA CCTCACGAAT TGCTGAACTT GAAAGCAAAA CCAATACACT CCGTTTATCA 360 

CAGACTGTGG CTCCAAACTG CTTCAACTCA TCAATAAATA ATATTCATGA AATGGAAATA 420 

CAGCTGAAAG ATGCTCTGGA GAAAAATCAG CAGTGGCTCG TGTATGATCA GCAGCGGGAA 480

GTCTATGTAA AAGGACTTTT AGCAAAGATC TTTGAGTTGG AAAAGAAAAC GGAAACAGCT 540 

GCTCATTCAC TCCCACAGCA GACAAAAAAG CCTGAATCAG AAGGTTATCT TCAAGAAGAG 600 

AAGCAGAAAT GTTACAACGA TCTCTTGGCA AGTGCAAAAA AAGATCTTGA GGTTGAACGA 660 

CAAACCATAA CTCAGCTGAG TTTTGAACTG AGTGAATTTC GAAGAAAATA TGAAGAAACC 720 

CAAAAAGAAG TTCACAATTT AAATCAGCTG TTGTATTCAC AAAGAAGGGC AGATGTGCAA 780

CATCTGGAAG ATGATAGGCA TAAAACAGAG AAGATACAAA AACTCAGGGA AGAGAATGAT 840 

ATTGCTAGGG GAAAACTTGA AGAAGAGAAG AAGAGATCCG AAGAGCTCTT ATCTCAGGTC 900 

CAGTCTCTTT ACACATCTCT GCTAAAGCAG CAAGAAGAAC AAACAAGGGT AGCTCTGTTG 960 

GAACAACAGA TGCAGGCATG TACTTTAGAC TTTGAAAATG AAAAACTCGA CCGTCAACAT 1020 

GTGCAGCATC AATTGCATGT AATTCTTAAG GAGCTCCGAA AAGCAAGAAA AAATAACACA 1080 

GTTGGAATCC TTGAAACAGC TTCATGAGTT TGCCATCACA GAGCCATTAG TCACTTTCCA 1140 

AGGAGAGACT GAAAACAGAG AAAAAGTTGC CGCCTCACCA AAAAGTCCCA CTGCTGCACT 1200 

CAATGGAAGC CTGGTGGAAT GTCCCAAGTG CAATATACAG TATCCAGCCA CTGAGCATCG 1260 

CGATCTGCTT GTCCATGTGG AATACTGTTC AAAGTAGCAA AATAAGTATT TGTTTTGATA 1320

TTAAAAGATT CAATACTGTA TTTTCTGTTA GCTTGTGGGC ATTTTGAATT ATATATTTCA 1380

CATTTTGCAT AAAACTGCCT ATCTACCTTT GACACTCCAG CATGCTAGTG AATCATGTAT 1440 

CTTTTAGGCT GCTGTGCATT TCTCTTGGCA GTGATACCTC CCTGACATGG TTCATCATCA 1500 

GGCTGCAATG ACAGAATGTG GTGAGCAGCG TCTACTGAGA TACTAACATT TTGCACTGTC 1560

AAAATACTTG GTGAGGAAAA GATAGCTCAG GTTATTGCTA ATGGGTTAAT GCACCAGCAA 1620

GCAAAATATT TTATGTTTCG GGGGTTTTGA AAAATCAAAG ATAATTAACC AAGGATCTTA 1680

ACTGTGTTCG CATTTTTTAT CCAAGCACTT AGAAAACCTA CAATCCTAAT TTTGATGTCC 1740 

ATTGTTAAGA GGTGGTGATA GATACTATTT TTTTTTCATA TTGTATAGCG GTTATTAGAA 1800 
AAGTTGGGGA TTTTCTTGAT CTTTATTGCT GCTTACCATT GAAACTTAAC CCAGCTGTGT 1860
TCCCCAACTC TGTTCTGCGC ACGAAACAGT ATCTGTTTGA GGCATAATCT TAAGTGGCCA 1920
CACACAATGT TTTCTCTTAT GTTATCTGGC AGTAACTGTA ACTTGAATTA CATTAGCACA 1980
TTCTGCTTAG CTAAAATTGT TAAAATAAAC TTTAATAAAC CCATGTAGCC CTCTCATTTG 2040
ATTGACAGTA TTTTAGTTAT TTTTGGCATT CTTAAAGCTG GGCAATGTAA TGATCAGATC 2100
TTTGTTTGTC TGAACAGGTA TTTTTATACA TGCTTTTTGT AAACCAAAAA CTTTTAAATT 2160
TCTTCAGGTT TTCTAACATG CTTACCACTG GGCTACTGTA AATGAGAAAA GAATAAAATT 2220
ATTTAATGTT TT

Seq ID NO: 144 Protein sequence:
Protein Accession #: NP_060601

1 11 21 31 41 51

I I I I I I

MEIQLKDALE KNQQWLVYDQ QREVYVKGLL AKIFELEKKT ETAAHSLPQQ TKKPESEGYL 60
QEEKQKCYND LLASAKKDLE VERQTITQLS FELSEFRRKY EETQKEVHNL NQLLYSQRRA 120
DVQHLEDDRH KTEKIQKLRE ENDIARGKLE EEKKRSEELL SQVQSLYTSL LKQQEEQTRV 180
ALLEQQMQAC TLDFENEKLD RQHVQHQLHV ILKELRKARK NNTVGILETA S

Seq ID NO: 145 DNA sequence
Nucleic Acid Accession #: NM_001168
Coding sequence: 50.. 478

1 11 21 31 41 51

I I I I I I

CCGCCAGATT TGAATCGCGG GACCCGTTGG CAGAGGTGGC GGCGGCGGCA TGGGTGCCCC 60
GACGTTGCCC CCTGCCTGGC AGCCCTTTCT CAAGGACCAC CGCATCTCTA CATTCAAGAA 120
CTGGCCCTTC TTGGAGGGCT GCGCCTGCAC CCCGGAGCGG ATGGCCGAGG CTGGCTTCAT 180
CCACTGCCCC ACTGAGAACG AGCCAGACTT GGCCCAGTGT TTCTTCTGCT TCAAGGAGCT 240
GGAAGGCTGG GAGCCAGATG ACGACCCCAT AGAGGAACAT AAAAAGCATT CGTCCGGTTG 300
CGCTTTCCTT TCTGTCAAGA AGCAGTTTGA AGAATTAACC CTTGGTGAAT TTTTGAAACT 360
GGACAGAGAA AGAGCCAAGA ACAAAATTGC AAAGGAAACC AACAATAAGA AGAAAGAATT 420
TGAGGAAACT GCGAAGAAAG TGCGCCGTGC CATCGAGCAG CTGGCTGCCA TGGATTGAGG 480
CCTCTGGCCG GAGCTGCCTG GTCCCAGAGT GGCTGCACCA CTTCCAGGGT TTATTCCCTG 540
GTGCCACCAG CCTTCCTGTG GGCCCCTTAG CAATGTCTTA GGAAAGGAGA TCAACATTTT 600
CAAATTAGAT GTTTCAACTG TGCTCCTGTT TTGTCTTGAA AGTGGCACCA GAGGTGCTTC 660
TGCCTGTGCA GCGGGTGCTG CTGGTAACAG TGGCTGCTTC TCTCTCTCTC TCTCTTTTTT 720
GGGGGCTCAT TTTTGCTGTT TTGATTCCCG GGCTTACCAG GTGAGAAGTG AGGGAGGAAG 780
AAGGCAGTGT CCCTTTTGCT AGAGCTGACA GCTTTGTTCG CGTGGGCAGA GCCTTCCACA 840
GTGAATGTGT CTGGACCTCA TGTTGTTGAG GCTGTCACAG TCCTGAGTGT GGACTTGGCA 900
GGTGCCTGTT GAATCTGAGC TGCAGGTTCC TTATCTGTCA CACCTGTGCC TCCTCAGAGG 960
ACAGTTTTTT TGTTGTTGTG TTTTTTTGTT TTTTTTTTTT GGTAGATGCA TGACTTGTGT 1020
GTGATGAGAG AATGGAGACA GAGTCCCTGG CTCCTCTACT GTTTAACAAC ATGGCTTTCT 1080
TATTTTGTTT GAATTGTTAA TTCACAGAAT AGCACAAACT ACAATTAAAA CTAAGCACAA 1140
AGCCATTCTA AGTCATTGGG GAAACGGGGT GAACTTCAGG TGGATGAGGA GACAGAATAG 1200
AGTGATAGGA AGCGTCTGGC AGATACTCCT TTTGCCACTG CTGTGTGATT AGACAGGCCC 1260
AGTGAGCCGC GGGGCACATG CTGGCCGCTC CTCCCTCAGA AAAAGGCAGT GGCCTAAATC 1320
CTTTTTAAAT GACTTGGCTC GATGCTGTGG GGGACTGGCT GGGCTGCTGC AGGCCGTGTG 1380
TCTGTCAGCC CAACCTTCAC ATCTGTCACG TTCTCCACAC GGGGGAGAGA CGCAGTCCGC 1440
CCAGGTCCCC GCTTTCTTTG GAGGCAGCAG CTCCCGCAGG GCTGAAGTCT GGCGTAAGAT 1500
GATGGATTTG ATTCGCCCTC CTCCCTGTCA TAGAGCTGCA GGGTGGATTG TTACAGCTTC 1560
GCTGGAAACC TCTGGAGGTC ATCTCGGCTG TTCCTGAGAA ATAAAAAGCC TGTCATTTC

Seq ID NO: 146 Protein sequence:
Protein Accession #: NP_001159

1 11 21 31 41 51

I I I I I I

MGAPTLPPAW QPFLKDHRIS TFKNWPFLEG CACTPERMAE AGFIHCPTEN EPDLAQCFFC 60
FKELEGWEPD DDPIEEHKKH SSGCAFLSVK KQFEELTLGE FLKLDRERAK NKIAKETNNK 120
KKEFEETAKK VRRAIEQLAA MD

Seq ID NO: 147 DNA sequence

Nucleic Acid Accession #: NM_014176.1

Coding sequence: 127-720

1 11 21 31 41 51

I I I I I I

GCGCGCAGCG CTGGTACCCC GTTGGTCCGC GCGTTGCTGC GTTGTGAGGG GTGTCAGCTC 60
AGTGCATCCC AGGCAGCTCT TAGTGTGGAG CAGTGAACTG TGTGTGGTTC CTTCTACTTG 120
GGGATCATGC AGAGAGCTTC ACGTCTGAAG AGAGAGCTGC ACATGTTAGC CACAGAGCCA 180
CCCCCAGGCA TCACATGTTG GCAAGATAAA GACCAAATGG ATGACCTGCG AGCTCAAATA 240
TTAGGTGGAG CCAACACACC TTATGAGAAA GGTGTTTTTA AGCTAGAAGT TATCATTCCT 300
GAGAGGTACC CATTTGAACC TCCTCAGATC CGATTTCTCA CTCCAATTTA TCATCCAAAC 360
ATTGATTCTG CTGGAAGGAT TTGTCTGGAT GTTCTCAAAT TGCCACCAAA AGGTGCTTGG 420
AGACCATCCC TCAACATCGC AACTGTGTTG ACCTCTATTC AGCTGCTCAT GTCAGAACCC 480
AACCCTGATG ACCCGCTCAT GGCTGACATA TCCTCAGAAT TTAAATATAA TAAGCCAGCC 540
TTCCTCAAGA ATGCCAGACA GTGGACAGAG AAGCATGCAA GACAGAAACA AAAGGCTGAT 600
GAGGAAGAGA TGCTTGATAA TCTACCAGAG GCTGGTGACT CCAGAGTACA CAACTCAACA 660
CAGAAAAGGA AGGCCAGTCA GCTAGTAGGC ATAGAAAAGA AATTTCATCC TGATGTTTAG 720
GGGACTTGTC CTGGTTCATC TTAGTTAATG TGTTCTTTGC CAAGGTGATC TAAGTTGCCT 780
ACCTTGAATT TTTTTTTAAA TATATTTGAT GACATAATTT TTGTGTAGTT TATTTATCTT 840
GTACATATGT ATTTTGAAAT CTTTTAAACC TGAAAAATAA ATAGTCATTT AATGTTGAAA 900
AAAAAAAAAA AAAAAAAAAA AAAAAAAA

Seq ID NO: 148 Protein sequence: 
Protein Accession #: NP_054895.1 

1 11 21 31 41 51 

I I I I I I

MQRASRLKRE LHMLATEPPP GITCWQDKDQ MDDLRAQILG GANTPYEKGV FKLEVIIPER 60

YPFEPPQIRF LTPIYHPNID SAGRICLDVL KLPPKGAWRP SLNIATVLTS IQLLMSEPNP 120 

DDPLMADISS EFKYNKPAFL KNARQWTEKH ARQKQKADEE EMLDNLPEAG DSRVHNSTQK 180 
RKASQLVGIE KKFHPDV 

Seq ID NO: 149 DNA sequence 
Nucleic Acid Accession #: NM_003812
Coding sequence: 224-2722 

1 11 21 31 41 51 

I I I I I I 

TCCTCTGCGT CCCGCCCCGG GAGTGGCTGC GAGGCTAGGC GAGCCGGGAA AGGGGGCGCC 60 

GCCCAGCCCC GAGCCCCGCG CCCCGTGCCC CGAGCCCGGA GCCCCCTGCC CGCGGCGGCA 120 

CCATGCGCGC CGAGCCGGCG TGACCGGCTC CGCCCGCGGC CGCCCCGCAG CTAGCCCGGC 180 

GCTCTCGCCG GCCACACGGA GCGGCGCCCG GGAGCTATGA GCCATGAAGC CGCCCGGCAG 240 

CAGCTCGCGG CAGCCGCCCC TGGCGGGCTG CAGCCTTGCC GGCGCTTCCT GCGGCCCCCA 300 

ACGCGGCCCC GCCGGCTCGG TGCCTGCCAG CGCCCCGGCC CGCACGCCGC CCTGCCGCCT 360

GCTTCTCGTC CTTCTCCTGC TGCCTCCGCT CGCCGCCTCG TCCCGGCCCC GCGCCTGGGG 420

GGCTGCTGCG CCCAGCGCTC CGCATTGGAA TGAAACTGCA GAAAAAAATT TGGGAGTCCT 480 

GGCAGATGAA GACAATACAT TGCAACAGAA TAGCAGCAGT AATATCAGTT ACAGCAATGC 540 

AATGCAGAAA GAAATCACAC TGCCTTCAAG ACTCATATAT TACATCAACC AAGACTCGGA 600 

AAGCCCTTAT CACGTTCTTG ACACAAAGGC AAGACACCAG CAAAAACATA ATAAGGCTGT 660 

CCATCTGGCC CAGGCAAGCT TCCAGATTGA AGCCTTCGGC TCCAAATTCA TTCTTGACCT 720 

CATACTGAAC AATGGTTTGT TGTCTTCTGA TTATGTGGAG ATTCACTACG AAAATGGGAA 780 

ACCACAGTAC TCTAAGGGTG GAGAGCACTG TTACTACCAT GGAAGCATCA GAGGCGTCAA 840 

AGACTCCAAG GTGGCTCTGT CAACCTGCAA TGGACTTCAT GGCATGTTTG AAGATGATAC 900 

CTTCGTGTAT ATGATAGAGC CACTAGAGCT GGTTCATGAT GAGAAAAGCA CAGGTCGACC 960 

ACATATAATC CAGAAAACCT TGGCAGGACA GTATTCTAAG CAAATGAAGA ATCTCACTAT 1020 

GGAAAGAGGT GACCAGTGGC CCTTTCTCTC TGAATTACAG TGGTTGAAAA GAAGGAAGAG 1080 

AGCAGTGAAT CCATCACGTG GTATATTTGA AGAAATGAAA TATTTGGAAC TTATGATTGT 1140 

TAATGATCAC AAAACGTATA AGAAGCATCG CTCTTCTCAT GCACATACCA ACAACTTTGC 1200 

AAAGTCCGTG GTCAACCTTG TGGATTCTAT TTACAAGGAG CAGCTCAACA CCAGGGTTGT 1260 

CCTGGTGGCT GTAGAGACCT GGACTGAGAA GGATCAGATT GACATCACCA CCAACCCTGT 1320 

GCAGATGCTC CATGAGTTCT CAAAATACCG GCAGCGCATT AAGCAGCATG CTGATGCTGT 1380 

GCACCTCATC TCGCGGGTGA CATTTCACTA TAAGAGAAGC AGTCTGAGTT ACTTTGGAGG 1440 

TGTCTGTTCT CGCACAAGAG GAGTTGGTGT GAATGAGTAT GGTCTTCCAA TGGCAGTGGC 1500 

ACAAGTATTA TCGCAGAGCC TGGCTCAAAA CCTTGGAATC CAATGGGAAC CTTCTAGCAG 1560 

AAAGCCAAAA TGTGACTGCA CAGAATCCTG GGGTGGCTGC ATCATGGAGG AAACAGGGGT 1620 

GTCCCATTCT CGAAAATTTT CAAAGTGCAG CATTTTGGAG TATAGAGACT TTTTACAGAG 1680 

AGGAGGTGGA GCCTGCCTTT TCAACAGGCC AACAAAGCTA TTTGAGCCCA CGGAATGTGG 1740 

AAATGGATAC GTGGAAGCTG GGGAGGAGTG TGATTGTGGT TTTCATGTGG AATGCTATGG 1800 

ATTATGCTGT AAGAAATGTT CCCTCTCCAA CGGGGCTCAC TGCAGCGACG GGCCCTGCTG 1860 

TAACAATACC TCATGTCTTT TTCAGCCACG AGGGTATGAA TGCCGGGATG CTGTGAACGA 1920 

GTGTGATATT ACTGAATATT GTACTGGAGA CTCTGGTCAG TGCCCACCAA ATCTTCATAA 1980 

GCAAGACGGA TATGCATGCA ATCAAAATCA GGGCCGCTGC TACAATGGCG AGTGCAAGAC 2040 

CAGAGACAAC CAGTGTCAGT ACATCTGGGG AACAAAGGCT GCAGGGTCTG ACAAGTTCTG 2100 

CTATGAAAAG CTGAATACAG AAGGCACTGA GAAGGGAAAC TGCGGGAAGG ATGGAGACCG 2160 

GTGGATTCAG TGCAGCAAAC ATGATGTGTT CTGTGGATTC TTACTCTGTA CCAATCTTAC 2220 

TCGAGCTCCA CGTATTGGTC AACTTCAGGG TGAGATCATT CCAACTTCCT TCTACCATCA 2280 

AGGCCGGGTG ATTGACTGCA GTGGTGCCCA TGTAGTTTTA GATGATGATA CGGATGTGGG 2340 

CTATGTAGAA GATGGAACGC CATGTGGCCC GTCTATGATG TGTTTAGATC GGAAGTGCCT 2400 

ACAAATTCAA GCCCTAAATA TGAGCAGCTG TCCACTCGAT TCCAAGGGTA AAGTCTGTTC 2460 

GGGCCATGGG GTGTGTAGTA ATGAAGCCAC CTGCATTTGT GATTTCACCT GGGCAGGGAC 2520 

AGATTGCAGT ATCCGGGATC CAGTTAGGAA CCTTCACCCC CCCAAGGATG AAGGACCCAA 2580 

GGGTCCTAGT GCCACCAATC TCATAATAGG CTCCATCGCT GGTGCCATCC TGGTAGCAGC 2640 

TATTGTCCTT GGGGGCACAG GCTGGGGATT TAAAAATGTC AAGAAGAGAA GGTTCGATCC 2700 

TACTCAGCAA GGCCCCATCT GAATCAGCTG CGCTGGATGG ACACCGCCTT GCACTGTTGG 2760 

ATTCTGGGTA TGACATACTC GCAGCAGTGT TACTGGAACT ATTAAGTTTG TAAACAAAAC 2820 

CTTTGGGTGG TAATGACTAC GGAGCTAAAG TTGGGGTGAC AAGGATGGGG TAAAAGAAAA 2880 

CTGTCTCTTT TGGAAATAAT GTCAAAGAAC ACCTTTCACC ACCTGTCAGT AAACGGGGGA 2940 

GGGGGCAAAA GACCATGCTA TAAAAAGAAC TGTTCCAGAA TCTTTTTTTT TCCCTAATGG 3000 
ACGAAGGAAC AACACACACA CAAAAATTAA ATGCAATAAA GGAATCATTA AAAA 



Seq ID NO: 150 Protein sequence: 
Protein Accession #: NP_003803 

1 11 21 31 41 51 

I I I I I I

MKPPGSSSRQ PPLAGCSLAG ASCGPQRGPA GSVPASAPAR TPPCRLLLVL LLLPPLAASS 60

RPRAWGAAAP SAPHWNETAE KNLGVLADED NTLQQNSSSN ISYSNAMQKE ITLPSRLIYY 120 

INQDSESPYH VLDTKARHQQ KHNKAVHLAQ ASFQIEAFGS KFILDLILNN GLLSSDYVEI 180 

HYENGKPQYS KGGEHCYYHG SIRGVKDSKV ALSTCNGLHG MFEDDTFVYM IEPLELVHDE 240 

KSTGRPHIIQ KTLAGQYSKQ MKNLTMERGD QWPFLSELQW LKRRKRAVNP SRGIFEEMKY 300 

LELMIVNDHK TYKKHRSSHA HTNNFAKSVV NLVDSIYKEQ LNTRVVLVAV ETWTEKDQID 360

ITTNPVQMLH EFSKYRQRIK QHADAVHLIS RVTFHYKRSS LSYFGGVCSR TRGVGVNEYG 420 

LPMAVAQVLS QSLAQNLGIQ WEPSSRKPKC DCTESWGGCI MEETGVSHSR KFSKCSILEY 480 

RDFLQRGGGA CLFNRPTKLF EPTECGNGYV EAGEECDCGF HVECYGLCCK KCSLSNGAHC 540 

SDGPCCNNTS CLFQPRGYEC RDAVNECDIT EYCTGDSGQC PPNLHKQDGY ACNQNQGRCY 600 

NGECKTRDNQ CQYIWGTKAA GSDKFCYEKL NTEGTEKGNC GKDGDRWIQC SKHDVFCGFL 660 
LCTNLTRAPR IGQLQGEIIP TSFYHQGRVI DCSGAHVVLD DDTDVGYVED GTPCGPSMMC 720

LDRKCLQIQA LNMSSCPLDS KGKVCSGHGV CSNEATCICD FTWAGTDCSI RDPVRNLHPP 780
KDEGPKGPSA TNLIIGSIAG AILVAAIVLG GTGWGFKNVK KRRFDPTQQG PI

Seq ID NO: 151 DNA sequence 
Nucleic Acid Accession #: NM_023915
Coding sequence: 250-1326 

1 11 21 31 41 51 

I I I I I I 

GGCACGAGGG TTTCGTTTTC ATGCTTTACC AGAAAATCCA CTTCCCTGCC GACCTTAGTT 60 

TCAAAGCTTA TTCTTAATTA GAGACAAGAA ACCTGTTTCA ACTTGAAGAC ACCGTATGAG 120 

GTGAATGGAC AGCCAGCCAC CACAATGAAA GAAATCAAAC CAGGAATAAC CTATGCTGAA 180 

CCCACGCCTC AATCGTCCCC AAGTGTTTCC TGACACGCAT CTTTGCTTAC AGTGCATCAC 240 

AACTGAAGAA TGGGGTTCAA CTTGACGCTT GCAAAATTAC CAAATAACGA GCTGCACGGC 300

CAAGAGAGTC ACAATTCAGG CAACAGGAGC GACGGGCCAG GAAAGAACAC CACCCTTCAC 360 

AATGAATTTG ACACAATTGT CTTGCCGGTG CTTTATCTCA TTATATTTGT GGCAAGCATC 420 

TTGCTGAATG GTTTAGCAGT GTGGATCTTC TTCCACATTA GGAATAAAAC CAGCTTCATA 480 

TTCTATCTCA AAAACATAGT GGTTGCAGAC CTCATAATGA CGCTGACATT TCCATTTCGA 540 

ATAGTCCATG ATGCAGGATT TGGACCTTGG TACTTCAAGT TTATTCTCTG CAGATACACT 600 

TCAGTTTTGT TTTATGCAAA CATGTATACT TCCATCGTGT TCCTTGGGCT GATAAGCATT 660 

GATCGCTATC TGAAGGTGGT CAAGCCATTT GGGGACTCTC GGATGTACAG CATAACCTTC 720 

ACGAAGGTTT TATCTGTTTG TGTTTGGGTG ATCATGGCTG TTTTGTCTTT GCCAAACATC 780 

ATCCTGACAA ATGGTCAGCC AACAGAGGAC AATATCCATG ACTGCTCAAA ACTTAAAAGT 840 

CCTTTGGGGG TCAAATGGCA TACGGCAGTC ACCTATGTGA ACAGCTGCTT GTTTGTGGCC 900 

GTGCTGGTGA TTCTGATCGG ATGTTACATA GCCATATCCA GGTACATCCA CAAATCCAGC 960 

AGGCAATTCA TAAGTCAGTC AAGCCGAAAG CGAAAACATA ACCAGAGCAT CAGGGTTGTT 1020 

GTGGCTGTGT TTTTTACCTG CTTTCTACCA TATCACTTGT GCAGAATTCC TTTTACTTTT 1080 

AGTCACTTAG ACAGGCTTTT AGATGAATCT GCACAAAAAA TCCTATATTA CTGCAAAGAA 1140 

ATTACACTTT TCTTGTCTGC GTGTAATGTT TGCCTGGATC CAATAATTTA CTTTTTCATG 1200 

TGTAGGTCAT TTTCAAGAAG GCTGTTCAAA AAATCAAATA TCAGAACCAG GAGTGAAAGC 1260 

ATCAGATCAC TGCAAAGTGT GAGAAGATCG GAAGTTCGCA TATATTATGA TTACACTGAT 1320 

GTGTAGGCCT TTTATTGTTT GTTGGAATCG ATATGTACAA AGTGTAAATA AATGTTTCTT 1380 
TTCATTATCC TTAAAAAAAA AA 



Seq ID NO: 152 Protein sequence: 
Protein Accession #: NP_076404 

1 11 21 31 41 51 

I I I I I I

MGFNLTLAKL PNNELHGQES HNSGNRSDGP GKNTTLHNEF DTIVLPVLYL IIFVASILLN 60

GLAVWIFFHI RNKTSFIFYL KNIVVADLIM TLTFPFRIVH DAGFGPWYFK FILCRYTSVL 120

FYANMYTSIV FLGLISIDRY LKVVKPFGDS RMYSITFTKV LSVCVWVIMA VLSLPNIILT 180

NGQPTEDNIH DCSKLKSPLG VKWHTAVTYV NSCLFVAVLV ILIGCYIAIS RYIHKSSRQF 240

ISQSSRKRKH NQSIRVVVAV FFTCFLPYHL CRIPFTFSHL DRLLDESAQK ILYYCKEITL 300
FLSACNVCLD PIIYFFMCRS FSRRLFKKSN IRTRSESIRS LQSVRRSEVR IYYDYTDV 

Seq ID NO: 153 DNA sequence 
Nucleic Acid Accession #: D80008.1 
Coding sequence: 149-739 

1 11 21 31 41 51

I I I I I I

GTTCGGCGCC AAAGCGCGGA GCGGAGGCCG AGGCGAGAGC CTGGCGCTGT AGGACTAGAA 60 

CGAAAGGAGT GAGGCGCCGA GAGCCCAGAT ACCATTTTGG CGTGAGAGCT GGTGGTTGGC 120 

AAGGCCGCGG GAGTGGGAAG CGTCCGCCAT GTTCTGCGAA AAAGCCATGG AACTGATCCG 180 

CGAGCTGCAT CGCGCGCCCG AAGGGCAACT GCCTGCCTTC AACGAGGATG GACTCAGACA 240 

AGTTCTGGAG GAGATGAAAG CTTTGTATGA ACAAAACCAG TCTGATGTGA ATGAAGCAAA 300 

GTCAGGTGGA CGAAGTGATT TGATACCAAC TATCAAATTT CGACACTGTT CTCTGTTAAG 360 

AAATCGACGC TGCACTGTAG CATACCTGTA TGACCGCTTG CTTCGGATCA GAGCACTCAG 420 

ATGGGAATAT GGTAGCGTCT TGCCAAATGC ATTACGATTT CACATGGCTG CTGAAGAAAT 480 

GGAGTGGTTT AATAATTATA AAAGATCTCT TGCTACTTAT ATGAGGTCAC TGGGAGGAGA 540 

TGAAGGTTTG GACATTACAC AGGATATGAA ACCACCAAAA AGCCTATATA TTGAAGTCCG 600 

GTGTCTAAAA GACTATGGAG AATTTGAAGT TGATGATGGC ACTTCAGTCC TATTAAAAAA 660 

AAATAGCCAG CACTTTTTAC CTCGATGGAA ATGTGAGCAG CTGATCAGAC AAGGAGTCCT 720 

GGAGCACATC CTGTCATGAC CATGCGCCGA GGCACTTCCA GGCTTCACTC AACTCATGGA 780 

CTCCTCTGTA CTCACTCTCT CCACCACTCC CTTCACCTCC CTCTTTGATT TTAGAAGCTA 840 

TAGACATTGT TTAAGATAAC TAAGAATACT TGGCTAAGAA GTATAATTTG CTAACTATTA 900 

AGGACTTTCT TTTTTTAATG TTGTACACTA TTCTTCCTAC TCTTTTTTGG TTTTGGTTTT 960 

GTTTTGTAGA GACTGTCTCA CTATGTTGCC CAAGCTGGTC TCAAACTCCT GGCCTCAAGC 1020 

AGTCCTCCCA CCTTAGCTTC TCAAAGTGTT GAGATCACAG GCGTGAGCCA CTGCACCCGG 1080 

CCCCTACTCC TTTTTCTAAT AAGCTGTATC TGTAATCACA GCATTCCTAC AGTTGTTACA 1140 

GTGTGTTTTT TAAATGAAAG TAAACATGGT TACATTTGAA TCTCTTAAAT AAGCAGTCAC 1200 

TTGGCTGGAC AGGAAGAAGG TAGATCCTGT GTGTCTTGTT TTCTGGTCAT GTGTATTGTA 1260 

CAAGCTAGAG AGCTGAATTT CTGAGATACA CATTTTCAAA TCACATGCAA GTGAAGATGA 1320 

TGGTCTGTAG AAATTTTCAG TATATATAAT GTTTAATGAC ATACTAATTT ATCATCTGGC 1380 

TATTTGGGAA GGAAGGACAC ACATGGATTT TGCACATTTC CACCATGGTG GCTGGTGTGG 1440 

CTTGTGGCTA TGGGGTGATC ACCAGTATCA CCACTTTGGA AGGGGACAGT GAAATTGGGG 1500 

CTAGAGAAGG AACTTTGTAC AGTTTTCCCT GAGATTCAGA TTGACTGAAA AGTCACATGA 1560 

AGAGTTGATT GTCTTTTAAT GGTATGTTTT AAACAGCTGA CATTTTAAAT TTTGATGAAA 1620 

TCCAGTTTAT TCGTTTGTTC TTTTATGCTT TGGGTGTTGC ATCCGAGAAA TCTTTTCCCA 1680 

TCCCAAGATC ACAATTTTTT TTCCTTTTTA CTTCTAGAAG TGTTATAATT TTAAGCTTTA 1740 

TACTTTGGTC TATGACCCGT TTTTTTTTTT GTTTTGTTTT GTTTTTTCGT TTGTTTCTTT 1800 

GTTTTGAGAT GGAGTCTTGT TCTGTCACCC AGGCTGGGGT GCAGTGGCGT GATCTTGGCT 1860 

CACTGCAATC TCTATCCCCT GGGTTCAAGT GATTCTCTTG TCTCAGCCTC CCAAGTAGCT 1920 

GGGATTACAG GCACAGGCCG CCACGCCTGG CTAATTTTTG TATTTTTAGT AGAGACAGAG 1980 
TTTTACCATG TTGGCCAGGC TGGTTTCAAA CTCCTGACCT CAAGTGACCC ACCTTGGCCT 2040
CCCAAAGTTT TGGGATTACA AGTGTGGGCC ACCGCGGCCA GCCTATGATC CATTTTGAAT 2100
GAATTTTTTA TATGGTGCAA GGTGTCAATC CACCTTCACT TTTTCTTGGG AATATAGATA 2160
TCCAGCTGTT TCACTACCAT TTTTTGAAAG GACTGCCCTT TGCTCTATCA CCTTTGCATT 2220
TTTGTTAAAA AGTAGTTGTC AATGTATATG TGGGTTTATT TCAGGACTCT GTTTTGTTCC 2280
ATTGACCTGT TTTTCTCTCC TGAATGCCAA TACCATATTT GTATGTAGTG TATGTAATTT 2340
TCTAATAATT CTTGAAACAG ATAGTATTAA TGTGTCATAT TTTTGCTGTT GTTTGTATTT 2400
TTTGTAGAGA TGGGGTTTCA CCGTGTTGGC CAGGCTGTGT TGAACTCCTG AGCTAAAGCA 2460
ATACACTTGC CTCGTCCTCC CCATGTGCTG GGATTACAGG CGTGAGCCTT GGTGCTGGCC 2520
CAGTGTACCA CATTTCTTTT TGAGATTTGT TTTGGCTATG TTAAGTCCTT TGCTTTTGAT 2580
GTGAAATTTG GGAACAGGCA GGGTGTGGTG GCTTATGCCT GTAATCCTAG AACTTTGGGA 2640
GGCCTAGATG GGTGGATCAC TTGAGCTCAG GAGTTCCAGA CCAGCCCGGG CCTATGGCAA 2700
AACTCCGTCT CTACAAAAAA TAGAAAAAAT TAGCCAGGTG TGGTGGTGCA TGCCTGTAGT 2760
CACAGTTACA CGGCAGGCTG AGGTGGGAGG ATCACTTGAA CCCCAGAGGT CAAGACTGCA 2820
GTGAGCTGAG ATCACACCAC TGTACTCCAG CCTGGGTGAC AAAGTGAGAC TCTATCTCAA 2880
AAAGAAATTA GGATCAATTT GTCAATTTCT ACAACAACAA CAACAAAAAC CCCTGTTGGG 2940
CACCTTGATT GAGATTGCAT TGAATTTATA TAAAACTGTT GGGAGAATTG ACATCTTAAT 3000
AATATTGAGT CTTCTGGCCT ATAAACAAGG TCTGTCTTCC TAGGTATTAA TGTTTTGTCT 3060
TCTATTTCTC TTAATAATCT TTTGTAGTTT TCAGTGTACA GGTCTACCAT GTCAGCATTT 3120
CATAGTTTTG ATGCTAAATG GTATTTTAAA ATTTCAAATT CTAACCACTT GTTGCTAGTA 3180
AATAGAAATA CAATTGATGT TGAACTTGTA TCCTTCAGCC TTGCTAAACT GTGAGTTCTC 3240
ATGGTGTTTT TGTAAATTAC ATCAACAGTC ATGTGTTCTA TGAATAAAGA GTTTTACTCC 3300
TTC

Seq ID NO: 154 Protein sequence:
Protein Accession #: BAA11503.1

1 11 21 31 41 51

I I I I I I

MFCEKAMELI RELHRAPEGQ LPAFNEDGLR QVLEEMKALY EQNQSDVNEA KSGGRSDLIP 
TIKFRHCSLL RNRRCTVAYL YDRLLRIRAL RWEYGSVLPN ALRFHMAAEE MEWFNNYKRS 
LATYMRSLGG DEGLDITQDM KPPKSLYIEV RCLKDYGEFE VDDGTSVLLK KNSQHFLPRW 
KCEQLIRQGV LEHILS 

Seq ID NO: 155 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 149-709 



1 11 21 31 41 51

I I I I I I

GTTCGGCGCC AAAGCGCGGA GCGGAGGCCG AGGCGAGAGC CTGGCGCTGT AGGACTAGAA 60

CGAAAGGAGT GAGGCGCCGA GAGCCCAGAT ACCATTTTGG CGTGAGAGCT GGTGGTTGGC 120

AAGGCCGCGG GAGTGGGAAG CGTCCGCCAT GTTCTGCGAA AAAGCCATGG AACTGATCCG 180

CGAGCTGCAT CGCGCGCCCG AAGGGCAACT GCCTGCCTTC AACGAGGATG GACTCAGACA 240

AGTTCTGGAG GAGATGAAAG CTTTGTATGA ACAAAACCAG TCTGATGTGA ATGAAGCAAA 300

GTCAGGTGGA CGAAGTGATT TGATACCAAC TATCAAATTT CGACACTGTT CTCTGTTAAG 360

AAATCGACGC TGCACTGTAG CATACCTGTA TGACCGCTTG CTTCGGATCA GAGCACTCAG 420

ATGGGAATAT GGTAGCGTCT TGCCAAATGC ATTACGATTT CACATGGCTG CTGAAGAAAT 480

GGAGTGGTTT AATAATTATA AAAGATCTCT TGCTACTTAT ATGAGGTCAC TGGGAGGAGA 540

TGAAGGTTTG GACATTACAC AGGATATGAA ACCACCAAAA AGCCTATATA TTGAAGCTGG 600

ATGCAGTGGC GCGATCTCGG CTCAACCTGC AACCTCCACC TCCCAGGTTC ACCTCAACTG 660

CAACCTCCAC CTCCCAGGTC CGGTGTCTAA AAGACTATGG AGAATTTGAA GTTGATGATG 720

GCACTTCAGT CCTATTAAAA AAAAATAGCC AGCACTTTTT ACCTCGATGG AAATGTGAGC 780

AGCTGATCAG ACAAGGAGTC CTGGAGCACA TCCTGTCATG ACCATGCGCC GAGGCACTTC 840

CAGGCTTCAC TCAACTCATG GACTCCTCTG TACTCACTCT CTCCACCACT CCCTTCACCT 900

CCCTCTTTGA TTTTAGAAGC TATAGACATT GTTTAAGATA ACTAAGAATA CTTGGCTAAG 960

AAGTATAATT TGCTAACTAT TAAGGACTTT CTTTTTTTAA TGTTGTACAC TATTCTTCCT 1020

ACTCTTTTTT GGTTTTGGTT TTGTTTTGTA GAGACTGTCT CACTATGTTG CCCAAGCTGG 1080

TCTCAAACTC CTGGCCTCAA GCAGTCCTCC CACCTTAGCT TCTCAAAGTG TTGAGATCAC 1140

AGGCGTGAGC CACTGCACCC GGCCCCTACT CCTTTTTCTA ATAAGCTGTA TCTGTAATCA 1200

CAGCATTCCT ACAGTTGTTA CAGTGTGTTT TTTAAATGAA AGTAAACATG GTTACATTTG 1260

AATCTCTTAA ATAAGCAGTC ACTTGGCTGG ACAGGAAGAA GGTAGATCCT GTGTGTCTTG 1320

TTTTCTGGTC ATGTGTATTG TACAAGCTAG AGAGCTGAAT TTCTGAGATA CACATTTTCA 1380

AATCACATGC AAGTGAAGAT GATGGTCTGT AGAAATTTTC AGTATATATA ATGTTTAATG 1440

ACATACTAAT TTATCATCTG GCTATTTGGG AAGGAAGGAC ACACATGGAT TTTGCACATT 1500

TCCACCATGG TGGCTGGTGT GGCTTGTGGC TATGGGGTGA TCACCAGTAT CACCACTTTG 1560

GAAGGGGACA GTGAAATTGG GGCTAGAGAA GGAACTTTGT ACAGTTTTCC CTGAGATTCA 1620

GATTGACTGA AAAGTCACAT GAAGAGTTGA TTGTCTTTTA ATGGTATGTT TTAAACAGCT 1680

GACATTTTAA ATTTTGATGA AATCCAGTTT ATTCGTTTGT TCTTTTATGC TTTGGGTGTT 1740

GCATCCGAGA AATCTTTTCC CATCCCAAGA TCACAATTTT TTTTCCTTTT TACTTCTAGA 1800

AGTGTTATAA TTTTAAGCTT TATACTTTGG TCTATGACCC GTTTTTTTTT TTGTTTTGTT 1860

TTGTTTTTTC GTTTGTTTCT TTGTTTTGAG ATGGAGTCTT GTTCTGTCAC CCAGGCTGGG 1920

GTGCAGTGGC GTGATCTTGG CTCACTGCAA TCTCTATCCC CTGGGTTCAA GTGATTCTCT 1980

TGTCTCAGCC TCCCAAGTAG CTGGGATTAC AGGCACAGGC CGCCACGCCT GGCTAATTTT 2040

TGTATTTTTA GTAGAGACAG AGTTTTACCA TGTTGGCCAG GCTGGTTTCA AACTCCTGAC 2100

CTCAAGTGAC CCACCTTGGC CTCCCAAAGT TTTGGGATTA CAAGTGTGGG CCACCGCGGC 2160

CAGCCTATGA TCCATTTTGA ATGAATTTTT TATATGGTGC AAGGTGTCAA TCCACCTTCA 2220

CTTTTTCTTG GGAATATAGA TATCCAGCTG TTTCACTACC ATTTTTTGAA AGGACTGCCC 2280

TTTGCTCTAT CACCTTTGCA TTTTTGTTAA AAAGTAGTTG TCAATGTATA TGTGGGTTTA 2340

TTTCAGGACT CTGTTTTGTT CCATTGACCT GTTTTTCTCT CCTGAATGCC AATACCATAT 2400

TTGTATGTAG TGTATGTAAT TTTCTAATAA TTCTTGAAAC AGATAGTATT AATGTGTCAT 2460

ATTTTTGCTG TTGTTTGTAT TTTTTGTAGA GATGGGGTTT CACCGTGTTG GCCAGGCTGT 2520

GTTGAACTCC TGAGCTAAAG CAATACACTT GCCTCGTCCT CCCCATGTGC TGGGATTACA 2580

GGCGTGAGCC TTGGTGCTGG CCCAGTGTAC CACATTTCTT TTTGAGATTT GTTTTGGCTA 2640

TGTTAAGTCC TTTGCTTTTG ATGTGAAATT TGGGAACAGG CAGGGTGTGG TGGCTTATGC 2700

CTGTAATCCT AGAACTTTGG GAGGCCTAGA TGGGTGGATC ACTTGAGCTC AGGAGTTCCA 2760

GACCAGCCCG GGCCTATGGC AAAACTCCGT CTCTACAAAA AATAGAAAAA ATTAGCCAGG 2820

TGTGGTGGTG CATGCCTGTA GTCACAGTTA CACGGCAGGC TGAGGTGGGA GGATCACTTG 2880

AACCCCAGAG GTCAAGACTG CAGTGAGCTG AGATCACACC ACTGTACTCC AGCCTGGGTG 2940 

ACAAAGTGAG ACTCTATCTC AAAAAGAAAT TAGGATCAAT TTGTCAATTT CTACAACAAC 3000 

AACAACAAAA ACCCCTGTTG GGCACCTTGA TTGAGATTGC ATTGAATTTA TATAAAACTG 3060 

TTGGGAGAAT TGACATCTTA ATAATATTGA GTCTTCTGGC CTATAAACAA GGTCTGTCTT 3120 

CCTAGGTATT AATGTTTTGT CTTCTATTTC TCTTAATAAT CTTTTGTAGT TTTCAGTGTA 3180 

CAGGTCTACC ATGTCAGCAT TTCATAGTTT TGATGCTAAA TGGTATTTTA AAATTTCAAA 3240 

TTCTAACCAC TTGTTGCTAG TAAATAGAAA TACAATTGAT GTTGAACTTG TATCCTTCAG 3300 

CCTTGCTAAA CTGTGAGTTC TCATGGTGTT TTTGTAAATT ACATCAACAG TCATGTGTTC 3360 
TATGAATAAA GAGTTTTACT CCTTC 

Seq ID NO: 156 Protein sequence: 
Protein Accession #: Eos sequence 

1 11 21 31 41 51 

I I I I I I

MFCEKAMELI RELHRAPEGQ LPAFNEDGLR QVLEEMKALY EQNQSDVNEA KSGGRSDLIP 60 

TIKFRHCSLL RNRRCTVAYL YDRLLRIRAL RWEYGSVLPN ALRFHMAAEE MEWFNNYKRS 120 

LATYMRSLGG DEGLDITQDM KPPKSLYIEA GCSGAISAQP ATSTSQVHLN CNLHLPGPVS 180 
KRLWRI

Seq ID NO: 157 DNA sequence 

Nucleic Acid Accession #: Eos sequence

Coding sequence: 148-621 

1 11 21 31 41 51 

I I I I I I 

TTCGGCGCCA AAGCGCGGAG CGGAGGCCGA GGCGAGAGCC TGGCGCTGTA GGACTAGAAC 60

GAAAGGAGTG AGGCGCCGAG AGCCCAGATA CCATTTTGGC GTGAGAGCTG GTGGTTGGCA 120 

AGGCCGCGGG AGTGGGAAGC GTCCGCCATG TTCTGCGAAA AAGCCATGGA ACTGATCCGC 180 

GAGCTGCATC GCGCGCCCGA AGGGCAACTG CCTGCCTTCA ACGAGGATGG ACTCAGACAA 240

GTTCTGGAGG AGATGAAAGC TTTGTATGAA CAAAACCAGT CTGATGTGAA TGAAGCAAAG 300 

TCAGGTGGAC GAAGTGATTT GATACCAACT ATCAAATTTC GACACTGTTC TCTGTTAAGA 360 

AATCGACGCT GCACTGTAGC ATACCTGTAT GACCGCTTGC TTCGGATCAG AGCACTCAGA 420 

TGGGAATATG GTAGCGTCTT GCCAAATGCA TTACGATTTC ACATGGCTGC TGAAGAAGTC 480 

CGGTGTCTAA AAGACTATGG AGAATTTGAA GTTGATGATG GCACTTCAGT CCTATTAAAA 540 

AAAAATAGCC AGCACTTTTT ACCTCGATGG AAATGTGAGC AGCTGATCAG ACAAGGAGTC 600 

CTGGAGCACA TCCTGTCATG ACCATGCGCC GAGGCACTTC CAGGCTTCAC TCAACTCATG 660 

GACTCCTCTG TACTCACTCT CTCCACCACT CCCTTCACCT CCCTCTTTGA TTTTAGAAGC 720 

TATAGACATT GTTTAAGATA ACTAAGAATA CTTGGCTAAG AAGTATAATT TGCTAACTAT 780 

TAAGGACTTT CTTTTTTTAA TGTTGTACAC TATTCTTCCT ACTCTTTTTT GGTTTTGGTT 840 

TTGTTTTGTA GAGACTGTCT CACTATGTTG CCCAAGCTGG TCTCAAACTC CTGGCCTCAA 900 

GCAGTCCTCC CACCTTAGCT TCTCAAAGTG TTGAGATCAC AGGCGTGAGC CACTGCACCC 960 

GGCCCCTACT CCTTTTTCTA ATAAGCTGTA TCTGTAATCA CAGCATTCCT ACAGTTGTTA 1020 

CAGTGTGTTT TTTAAATGAA AGTAAACATG GTTACATTTG AATCTCTTAA ATAAGCAGTC 1080 

ACTTGGCTGG ACAGGAAGAA GGTAGATCCT GTGTGTCTTG TTTTCTGGTC ATGTGTATTG 1140 

TACAAGCTAG AGAGCTGAAT TTCTGAGATA CACATTTTCA AATCACATGC AAGTGAAGAT 1200 

GATGGTCTGT AGAAATTTTC AGTATATATA ATGTTTAATG ACATACTAAT TTATCATCTG 1260 

GCTATTTGGG AAGGAAGGAC ACACATGGAT TTTGCACATT TCCACCATGG TGGCTGGTGT 1320 

GGCTTGTGGC TATGGGGTGA TCACCAGTAT CACCACTTTG GAAGGGGACA GTGAAATTGG 1380 

GGCTAGAGAA GGAACTTTGT ACAGTTTTCC CTGAGATTCA GATTGACTGA AAAGTCACAT 1440 

GAAGAGTTGA TTGTCTTTTA ATGGTATGTT TTAAACAGCT GACATTTTAA ATTTTGATGA 1500 

AATCCAGTTT ATTCGTTTGT TCTTTTATGC TTTGGGTGTT GCATCCGAGA AATCTTTTCC 1560 

CATCCCAAGA TCACAATTTT TTTTCCTTTT TACTTCTAGA AGTGTTATAA TTTTAAGCTT 1620 

TATACTTTGG TCTATGACCC GTTTTTTTTT TTGTTTTGTT TTGTTTTTTC GTTTGTTTCT 1680 

TTGTTTTGAG ATGGAGTCTT GTTCTGTCAC CCAGGCTGGG GTGCAGTGGC GTGATCTTGG 1740 

CTCACTGCAA TCTCTATCCC CTGGGTTCAA GTGATTCTCT TGTCTCAGCC TCCCAAGTAG 1800 

CTGGGATTAC AGGCACAGGC CGCCACGCCT GGCTAATTTT TGTATTTTTA GTAGAGACAG 1860

AGTTTTACCA TGTTGGCCAG GCTGGTTTCA AACTCCTGAC CTCAAGTGAC CCACCTTGGC 1920 

CTCCCAAAGT TTTGGGATTA CAAGTGTGGG CCACCGCGGC CAGCCTATGA TCCATTTTGA 1980 

ATGAATTTTT TATATGGTGC AAGGTGTCAA TCCACCTTCA CTTTTTCTTG GGAATATAGA 2040 

TATCCAGCTG TTTCACTACC ATTTTTTGAA AGGACTGCCC TTTGCTCTAT CACCTTTGCA 2100 

TTTTTGTTAA AAAGTAGTTG TCAATGTATA TGTGGGTTTA TTTCAGGACT CTGTTTTGTT 2160 

CCATTGACCT GTTTTTCTCT CCTGAATGCC AATACCATAT TTGTATGTAG TGTATGTAAT 2220 

TTTCTAATAA TTCTTGAAAC AGATAGTATT AATGTGTCAT ATTTTTGCTG TTGTTTGTAT 2280 

TTTTTGTAGA GATGGGGTTT CACCGTGTTG GCCAGGCTGT GTTGAACTCC TGAGCTAAAG 2340 

CAATACACTT GCCTCGTCCT CCCCATGTGC TGGGATTACA GGCGTGAGCC TTGGTGCTGG 2400 

CCCAGTGTAC CACATTTCTT TTTGAGATTT GTTTTGGCTA TGTTAAGTCC TTTGCTTTTG 2460 

ATGTGAAATT TGGGAACAGG CAGGGTGTGG TGGCTTATGC CTGTAATCCT AGAACTTTGG 2520 

GAGGCCTAGA TGGGTGGATC ACTTGAGCTC AGGAGTTCCA GACCAGCCCG GGCCTATGGC 2580 

AAAACTCCGT CTCTACAAAA AATAGAAAAA ATTAGCCAGG TGTGGTGGTG CATGCCTGTA 2640 

GTCACAGTTA CACGGCAGGC TGAGGTGGGA GGATCACTTG AACCCCAGAG GTCAAGACTG 2700 

CAGTGAGCTG AGATCACACC ACTGTACTCC AGCCTGGGTG ACAAAGTGAG ACTCTATCTC 2760 

AAAAAGAAAT TAGGATCAAT TTGTCAATTT CTACAACAAC AACAACAAAA ACCCCTGTTG 2820 

GGCACCTTGA TTGAGATTGC ATTGAATTTA TATAAAACTG TTGGGAGAAT TGACATCTTA 2880 

ATAATATTGA GTCTTCTGGC CTATAAACAA GGTCTGTCTT CCTAGGTATT AATGTTTTGT 2940 

CTTCTATTTC TCTTAATAAT CTTTTGTAGT TTTCAGTGTA CAGGTCTACC ATGTCAGCAT 3000 

TTCATAGTTT TGATGCTAAA TGGTATTTTA AAATTTCAAA TTCTAACCAC TTGTTGCTAG 3060 

TAAATAGAAA TACAATTGAT GTTGAACTTG TATCCTTCAG CCTTGCTAAA CTGTGAGTTC 3120 

TCATGGTGTT TTTGTAAATT ACATCAACAG TCATGTGTTC TATGAATAAA GAGTTTTACT 3180 
CCTTC 

Seq ID NO: 158 Protein sequence:
Protein Accession #: Eos sequence

1 11 21 31 41 51 

I I I I I I 
MFCEKAMELI RELHRAPEGQ LPAFNEDGLR QVLEEMKALY EQNQSDVNEA KSGGRSDLIP 60

TIKFRHCSLL RNRRCTVAYL YDRLLRIRAL RWEYGSVLPN ALRFHMAAEE VRCLKDYGEF 120
EVDDGTSVLL KKNSQHFLPR WKCEQLIRQG VLEHILS 

Seq ID NO: 159 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 149-229 

1 11 21 31 41 51 

I I I I I I

GTTCGGCGCC AAAGCGCGGA GCGGAGGCCG AGGCGAGAGC CTGGCGCTGT AGGACTAGAA 60

CGAAAGGAGT GAGGCGCCGA GAGCCCAGAT ACCATTTTGG CGTGAGAGCT GGTGGTTGGC 120 

AAGGCCGCGG GAGTGGGAAG CGTCCGCCAT GTTCTGCGAA AAAGCCATGG AACTGATCCG 180 

CGAGCTGCAT CGCGCGCCCG AAGGGCAACT GCCTGCCTTC AACAATTAGC TGGGTGTGGT 240 

GGCACACACC TGTAGTCCCA GCAACTTAGG AGGCTGAAGT GAGAGGATTG CATGGCTCCA 300 

GGAAGTTGAA ACTGCAGTGA ACTGTGGTCA CGCTATTACA CTCCAGCCTG GGTGACAGAC 360 

TGAATCCCTG TCTCAAAAAG GAAAAGGAGG ATGGACTCAG ACAAGTTCTG GAGGAGATGA 420 

AAGCTTTGTA TGAACAAAAC CAGTCTGATG TGTTCTCTGT TAAGAAATCG ACGCTGCACT 480 
GTAGCATACC TGTATGACCG CTTGCTTCGG ATCAGAGCAC TCAGATGG 

Seq ID NO: 160 Protein sequence: 
Protein Accession #: Eos sequence 

1 11 21 31 41 51 

I I I I I I 

ATGTTCTGCG AAAAAGCCAT GGAACTGATC CGCGAGCTGC ATCGCGCGCC CGAAGGGCAA 60 
CTGCCTGCCT TCAACAATTA G 

Seq ID NO: 161 DNA sequence 
Nucleic Acid Accession #: U10694
Coding sequence: 1333-2280 

1 11 21 31 41 51 

I I I I I I

GGATCCGGCC GGATCTCAGG GAGGTGAGGA CTTTGTTCTC AGAGGGTGTG TGTGGACAAA 60 

ACAGGGAGGC CCTGTGTTCG ACAGACACAG TGGTCCCAGG ATTGGAGAGC AGTCCAGGTG 120 

AGGAACCTAA GGGAGGATCG AGGGTACCTC CAGGCCAGAG AAACTCTCAG ATCAAGAGAG 180 

TTTGCCCTGC CCCTACTGTC ACCCCAGAGA GCCCGGGCAG GGCTGTCTGC TGAGGTCCCT 240 

CCTTTATCCT GGGATCACTG GTGTCGGGGA GGGCTGGCCT TGGTCTGAGG GGGCTGCACT 300 

CACGTCAGCA GAGGGAGGGT CCCAGGCCCT GCCAGGAGTC CAGGTGCAGA CTGAGGGGAC 360 

CCCACTCACC AAACACAGAG GACCTAGCCC CACCCTGCCC CTTGTGTCAG CTGAGGGAAG 420 

CCGCTGGGTG GATGGACTCC CCTCACTTCC TCTTCAGGTG TCTCCTGGAG ATAGGGCCTC 480 

AGGTCAACAG AGGGAGGGTT CCAGACCCTG CAGGCATCAA GATGAGGACC AGGCAGTATC 540 

CTCACCCCAG GACACATGGA CCCCATTGAA TTTAGACATC TCTTACTGTA CTTCCGAGGA 600 

AACCCTGGGC AGGTGTGGGC AGATGTTGGT TGGGGCATGT CCTTCTGTTC CATATCAGGG 660 

ATGTGAGCTC CTGATCTGAG AGACTCTCAG GCAAGTAGAG GAGTAGAGTC CAGTCCCTGC 720 

CAGGAGAAAG GTCAGGGCCC TGAGTGAGCG CAGAGGGGAC CATCCACCCC AAAAGTGTGT 780 

AGAACTCAAG AGTGTCCAGC CCGCCCTCTT GACAGCACTG AGGGACCGGG GCTCTGCCTG 840 

CAGTCTGCAG CCTAAGGGCC CCTCGATTCC TCTTCCAGGA GCTCCAGGAA GCAGGCAGGC 900 

CTTGGTCTGA GACAGTGTCC TCAGGTCGCA GAGCAGAGGA GACCCAGGCA GTGTCAGCAG 960 

TGAAGGTGAA GTGTTCACCC TGAATGTGCA CCAAGGGCCC CACCTGCCCC AGCACACATG 1020 

GGACCCCATA GCACCTGGCC CCATTCCCCC TACTGTCACT CATAGAGCCT TGATCTCTGC 1080 

AGGCTAGCTG CACGCTGAGT AGCCCTCTCA CTTCCTCCCT CAGGTTCTCG GGACAGGCTA 1140 

ACCAGGAGGA CAGGAGCCCC AAGAGGCCCC AGAGCAGCAC TGACGAAGAC GTGTAAGTCA 1200 

GCCTTTGTTA GAACCTCCAA GGTTCGGTTC TCAGCTGAAG TCTCTCACAC ACTCCCTCTC 1260 

TCCCCAGGCC TGTGGGTCTC CATCGCCCAG CTCCTGCCCA CGCTCCTGAC TGCTGCCCTG 1320 

ACCAGAGTCA TCATGTCTCT CGAGCAGAGG AGTCCGCACT GCAAGCCTGA TGAAGACCTT 1380 

GAAGCCCAAG GAGAGGACTT GGGCCTGATG GGTGCACAGG AACCCACAGG CGAGGAGGAG 1440 

GAGACTACCT CCTCCTCTGA CAGCAAGGAG GAGGAGGTGT CTGCTGCTGG GTCATCAAGT 1500 

CCTCCCCAGA GTCCTCAGGG AGGCGCTTCC TCCTCCATTT CCGTCTACTA CACTTTATGG 1560 

AGCCAATTCG ATGAGGGCTC CAGCAGTCAA GAAGAGGAAG AGCCAAGCTC CTCGGTCGAC 1620 

CCAGCTCAGC TGGAGTTCAT GTTCCAAGAA GCACTGAAAT TGAAGGTGGC TGAGTTGGTT 1680 

CATTTCCTGC TCCACAAATA TCGAGTCAAG GAGCCGGTCA CAAAGGCAGA AATGCTGGAG 1740 

AGCGTCATCA AAAATTACAA GCGCTACTTT CCTGTGATCT TCGGCAAAGC CTCCGAGTTC 1800 

ATGCAGGTGA TCTTTGGCAC TGATGTGAAG GAGGTGGACC CCGCCGGCCA CTCCTACATC 1860 

CTTGTCACTG CTCTTGGCCT CTCGTGCGAT AGCATGCTGG GTGATGGTCA TAGCATGCCC 1920 

AAGGCCGCCC TCCTGATCAT TGTCCTGGGT GTGATCCTAA CCAAAGACAA CTGCGCCCCT 1980 

GAAGAGGTTA TCTGGGAAGC GTTGAGTGTG ATGGGGGTGT ATGTTGGGAA GGAGCACATG 2040 

TTCTACGGGG AGCCCAGGAA GCTGCTCACC CAAGATTGGG TGCAGGAAAA CTACCTGGAG 2100 

TACCGGCAGG TGCCCGGCAG TGATCCTGCG CACTACGAGT TCCTGTGGGG TTCCAAGGCC 2160 

CACGCTGAAA CCAGCTATGA GAAGGTCATA AATTATTTGG TCATGCTCAA TGCAAGAGAG 2220 

CCCATCTGCT ACCCATCCCT TTATGAAGAG GTTTTGGGAG AGGAGCAAGA GGGAGTCTGA 2280 

GCACCAGCCG CAGCCGGGGC CAAAGTTTGT GGGGTCAGGG CCCCATCCAG CAGCTGCCCT 2340 

GCCCCATGTG ACATGAGGCC CATTCTTCGC TCTGTGTTTG AAGAGAGCAA TCAGTGTTCT 2400 

CAGTGGCAGT GGGTGGAAGT GAGCACACTG TATGTCATCT CTGGGTTCCT TGTCTATTGG 2460 

GTGATTTGGA GATTTATCCT TGCTCCCTTT TGGAATTGTT CAAATGTTCT TTTAATGGTC 2520 

AGTTTAATGA ACTTCACCAT CGAAGTTAAT GAATGACAGT AGTCACACAT ATTGCTGTTT 2580

ATGTTATTTA GGAGTAAGAT TCTTGCTTTT GAGTCACATG GGGAAATCCC TGTTATTTTG 2640 

TGAATTGGGA CAAGATAACA TAGCAGAGGA ATTAATAATT TTTTTGAAAC TTGAACTTAG 2700 

CAGCAAAATA GAGCTCATAA AGAAATAGTG AAATGAAAAT GTAGTTAATT CTTGCCTTAT 2760 

ACCTCTTTCT CTCTCCTGTA AAATTAAAAC ATATACATGT ATACCTGGAT TTGCTTGGCT 2820 

TCTTTGAGCA TGTAAGAGAA ATAAAAATTG AAAGAATAAT TTTTCCTGTT CACTGGCTCA 2880 
TTTTTTCTTC AGACACGCAC TGAACATCTG TTATTCGGAA CACCCTGGGT T 



Seq ID NO: 162 Protein sequence: 
Protein Accession #: AAA68877.1

1 11 21 31 41 51

I I I I I I

MSLEQRSPHC KPDEDLEAQG EDLGLMGAQE PTGEEEETTS SSDSKEEEVS AAGSSSPPQS 60 

PQGGASSSIS VYYTLWSQFD EGSSSQEEEE PSSSVDPAQL EFMFQEALKL KVAELVHFLL 120

HKYRVKEPVT KAEMLESVIK NYKRYFPVIF GKASEFMQVI FGTDVKEVDP AGHSYILVTA 180

LGLSCDSMLG DGHSMPKAAL LIIVLGVILT KDNCAPEEVI WEALSVMGVY VGKEHMFYGE 240

PRKLLTQDWV QENYLEYRQV PGSDPAHYEF LWGSKAHAET SYEKVINYLV MLNAREPICY 300
PSLYEEVLGE EQEGV 

Seq ID NO: 163 DNA sequence
Nucleic Acid Accession #: AF292100
Coding sequence: 30-809 

1 11 21 31 41 51

I I I I I I 

GGGGGGGGAG AGGCCTGGAG GACACCAACA TGAACAAGTT GAAATCATCG CAGAAGGATA 60 

AAGTTCGTCA GTTTATGATC TTCACACAAT CTAGTGAAAA AACAGCAGTA AGTTGTCTTT 120 

CTCAAAATGA CTGGAAGTTA GATGTTGCAA CAGATAATTT TTTCCAAAAT CCTGAACTTT 180 

ATATACGAGA GAGTGTAAAA GGATCATTGG ACAGGAAGAA GTTAGAACAG CTGTACAATA 240 

GATACAAAGA CCCTCAAGAT GAGAATAAAA TTGGAATAGA TGGCATACAG CAGTTCTGTG 300 

ATGACCTGGC ACTCGATCCA GCCAGCATTA GTGTGTTGAT TATTGCGTGG AAGTTCAGAG 360 

CAGCAACACA GTGCGAGTTC TCCAAACAGG AGTTCATGGA TGGCATGACA GAATTAGGAT 420 

GTGACAGCAT AGAACAACTA AAGGCCCAGA TACCCAAGAT GGAACAAGAA TTGAAAGAAC 480 

CAGGACGATT TAAGGATTTT TACCAGTTTA CTTTTAATTT TGCAAAGAAT CCAGGACAAA 540 

AAGGATTAGA TCTAGAAATG GCCATTGCCT ACTGGAACTT AGTGCTTAAT GGAAGATTTA 600

AATTCTTAGA CTTATGGAAT AAATTTTTGT TGGAACATCA TAAACGATCA ATACCAAAAG 660 

ACACTTGGAA TCTTCTTTTA GACTTCAGTA CGATGATTGC AGATGACATG TCTAATTATG 720 

ATGAAGAAGG AGCATGGCCT GTTCTTATTG ATGACTTTGT GGAATTTGCA CGCCCTCAAA 780 

TTGCTGGGAC AAAAAGTACA ACAGTGTAGC ACTAAAGGAA CCTTTTAGAA TGTACATAGT 840 

CTGTACAATA AATACAACAG AAAATTGCAC AGTCAATTTC TGCTGGCTGG ACTGAACTGA 900 

AGATCAATCC TCACAATTCA GACTGAGGGT TGAGACAAAA CTTTAAGGAT ACATCTTGGA 960 

CCATATCGTA TTTCATTCTT CTAATGGTGG TTTGGGCTTG TCTTCTAGTC TGGGCCGCTC 1020 

TAAACATTTA TAATTCCAAC ATTGTGGATT TCATCTTATA TCTGTGGACC ATCCTAGTTT 1080 

ATTCTCCCAT AAGTCTTAGA AGCTTTATGG TGATTATTTT GAGGTTTTCA TTCTCGCATA 1140 

AAGCACAATG CTGTCTTCAT CAGAAAACAG TTGGCATAAG AATTAAACAT ATGAACATCA 1200 

CAAAACAATT TATAAAAACT TCTTAAATAT ACGCTTTGGG CTAGTTGCAA AGACTATGCT 1260 

AATAGCACTT CCAGTGAGAG TGATATATTT AAGTGTACTG GATCTGGAAT GGTGTTTTGG 1320

TTTGGGGGGA ATTTTTTTTT TTTCCTGGCA AATCACATAT GTTGTTGATG TGAGTATCTG 1380 

ATGAAAAAAC AATGTCAGAA TAACCGACAT GAAAATTTTT TAGGATAACT TGGTGCCTAC 1440 

CTGAAAAATG TATTGTGTTT TAGACTCTTG ATTTCAAAAG GTTCCACAGA ACTAGTCTGC 1500 

GCTTACCTTA CCCATGTTTA TATATAGCTG TCCTACAGGG AGCTTTTATT TAGAAAATGT 1560 

CTGCATAATG TTAGATTCTT CTCCTGTCTA CATTATGCAC TACATAATTG GACTTCATTA 1620 

TGCTTTTGAA ATGCTTATCT GCCTGTCACA TAAGTTAAAC TATTTAATTT GTTTTGAATG 1680 

TTTTGGATTG CTACACAATA CAATATTCTA AATTTAGGCA TGAGGGTTTT TTTGTTTTAT 1740 

TTTTACTTTT TTTTTGTCAT TGCACTATGG AACACAAATG AAATTCTCTT AATTTATAAG 1800 

AAGATAGTAG GAGTTAAATT TTGAAAATGG TTGTGATGAG CCACGAAATT CAATCTTTAT 1860 

AATATAGGTA CTGCTCTTTC AGACAAACAG TCCATTTTTA ATGACTTCTT ATTTTGTTGA 1920 

AATTACTTTA ACTGCTAATC ACTGTGGTTG CCAAATATTT ACTTCAGAAG CAAAGATTTT 1980 

CAAACAAGCA TACACGATGC AAAATACCAG TCTGGCTTCT AGTCTATTTA CTGTTTTGTT 2040 

TCACTCAGAT TAGCTCAGTT TTCTCATCAA AGCAGAATGC TATCTTGCGT GTGTGTGTGT 2100 

GTGTGTGTGT GTGTGTGTGT GTATGTGTGT ATATATATAT ATATATATAT ATATATATTT 2160 

TTTTTTTTTT TTTTTTTTAA ATTACAAAAG CCATGAGCTG CTTTTATGCT GAAAATGGTC 2220 

ATTTCCCTGT TCACTTACTG ACATGTGAAG AAGGGTTTCT TGCTTTCTTA AACATTTCCG 2280 

TAAGGCAGGC TAGAAATGTA ATACTTCAAA TGTTTGATGA TTATGGTCTT TTGATAGGAA 2340 

TAGATTCTGC TTGGGATATA TATCCAGGCA CTCTCTAAGG TCTAGGGTTG ATATTAACAA 2400 

AGGAATGTAC TTAGAATAGC AGTACATTTT ATGCAAATAT GGAAATTATT TTAAGAAACA 2460 

ATGACATATC AAAACTGCTT TTTACATGAT TTTGAAATAG ACTAGAAAGC TTTCCCTATA 2520 

GACATATTAA TATTCCAATC ATAACTTTAA TTCAAGAATG CAGTTTTACC AAAAGAAAAA 2580 

TTTGAAAATT TCTATTCAGG CTACTGGAAT TGGTTATTAA AAGAAAAAGG AAAAAGAAGA 2640 

ATCTTGCTGC TTTCAGTATT TCCTGATTTT TTTGTAAATA TAAAGAGGAA CTTCAATTAT 2700 

GAAAAATTTT TAAAAGATAT ATATATCTAT ATATCTATAT ATATGTACTG TTTTGTTTCC 2760 

TGTCTTGAAG ATTTTGAGTT ATGGTTATTG GTTTCAGATT GATTAATTCA CATATGCTGT 2820 

GTTTTCTTTA AAAGTCATAT GGGTTCGTGG CCTAATGCCT TGGATTTTAC ATATTTTTCT 2880 

TTTTAAATGC AAAACCTTTT CAACAAAATA GTGTTTGTCA TCAGGTTGGT ACTAAACATT 2940 

TATAATTACT GTGTAATTAT AAACAAAAAT ACATAAAGCT TTGAATATAA TTATGTAGCA 3000 

TAAAAGTTAA GGTTGTTCAC TATGATGGCA TCTTAGAATT AAACAAAACT TTTACTAGGG 3060 

CTGAAAAGAG AAGACTGATT TAATGTGGTG TGATTATTCT GAAGATAAAT GTCTGGCTAC 3120 

AGGGAATATT TTGTACTAAA AAATGATTAC ACATATGGCT GTGTGTGTTT GAGTCTGTGT 3180 

CTGTGAGAGA GCCAGAGAGA GTGAGAGAGA TTGACAGAGA AAGGGAGAGA CACACACACG 3240 

CCCCTTGAAT TGCTTTAACT CCTAAGTGTT TCAGTCCTCA TTCCGGTAAA CTCCCCATGC 3300 

TGATTCTTTG TTTTAAACTG AACCATAGGT ACAGTTTCCT TTTTGCCAAA TGTCAAAACA 3360 

GGTACAAATT TTAAAATGTA ATGCTTTTTA AATAGAAAAA TGTATAAAAT TAGAAGTGCC 3420 

CACATATAAA AAATACTTGA GATGAAGATT ATCTTTAGTG AATATCATCT GCATATCTCT 3480 

GTAAGTTCAA TTGTGTTTCT TACAGTCCCT GTCATATTAC CAACAGAGGC AATAAAAGCT 3540 
GCAGTGAAAT TG 

Seq ID NO: 164 Protein sequence: 
Protein Accession #: AAG00606 

1 11 21 31 41 51 

I I I I I I 

MNKLKSSQKD KVRQFMIFTQ SSEKTAVSCL SQNDWKLDVA TDNFFQNPEL YIRESVKGSL 60 

DRKKLEQLYN RYKDPQDENK IGIDGIQQFC DDLALDPASI SVLIIAWKFR AATQCEFSKQ 120 

EFMDGMTELG CDSIEQLKAQ IPKMEQELKE PGRFKDFYQF TFNFAKNPGQ KGLDLEMAIA 180 

YWNLVLNGRF KFLDLWNKFL LEHHKRSIPK DTWNLLLDFS TMIADDMSNY DEEGAWPVLI 240 
DDFVEFARPQ IAGTKSTTV 

Seq ID NO: 165 DNA sequence
Nucleic Acid Accession #: AF256215
Coding sequence: 220-2028 



1 11 21 31 41 51

I I I I I I

CTCCAGTCCG CATGCTCAGT AGCTGCTGCC GGCCGGGCTG CGGGGCGGCG TCCGCTGCGC 60

GCCTACGGGC TGCGGTGGCG GCCGCCGCGG CACCCGGCAG GGCCCGCCAG TCCCCGCTTC 120

CCTGCTCCAG AGCCGCCGCC TGGGCCGGGG CAGGGCGGGC CCGGGGCTCC TCCATGCTGC 180

CAGCCGCCGG GCTGCGGAGC CGACCAAGTG GCTCCTGCGA TGGCGGCGGA AGAGGAGGCT 240

GCGGCGGGAG GTAAAGTGTT GAGAGAGGAG AACCAGTGCA TTGCTCCTGT GGTTTCCAGC 300

CGCGTGAGTC CAGGGACAAG ACCAACAGCT ATGGGGTCTT TCAGCTCACA CATGACAGAG 360

TTTCCACGAA AACGCAAAGG AAGTGATTCA GACCCATCCC AAGTGGAAGA TGGTGAACAC 420

CAAGTTAAAA TGAAGGCCTT CAGAGAAGCT CATAGCCAAA CTGAAAAGCG GAGGAGAGAT 480

AAAATGAATA ACCTGATTGA AGAACTGTCT GCAATGATCC CTCAGTGCAA CCCCATGGCG 540

CGTAAACTGG ACAAACTTAC AGTTTTAAGA ATGGCTGTTC AACACTTGAG ATCTTTAAAA 600

GGCTTGACAA ATTCTTATGT GGGAAGTAAT TATAGACCAT CATTTCTTCA GGATAATGAG 660

CTCAGACATT TAATCCTTAA GACTGCAGAA GGCTTCTTAT TTGTGGTTGG ATGTGAAAGA 720

GGAAAAATTC TCTTCGTTTC TAAGTCAGTC TCCAAAATAC TTAATTATGA TCAGGCTAGT 780

TTGACTGGAC AAAGCTTATT TGACTTCTTA CATCCAAAAG ATGTTGCCAA AGTAAAGGAA 840

CAACTTTCTT CTTTTGATAT TTCACCAAGA GAAAAGCTAA TAGATGCCAA AACTGGTTTG 900

CAAGTTCACA GTAATCTCCA CGCTGGAAGG ACACGTGTGT ATTCTGGCTC AAGACGATCT 960

TTTTTCTGTC GGATAAAGAG TTGTAAAATC TCTGTCAAAG AAGAGCATGG ATGCTTACCC 1020

AACTCAAAGA AGAAAGAGCA CAGAAAATTC TATACTATCC ATTGCACTGG TTACTTGAGA 1080

AGCTGGCCTC CAAATATTGT TGGAATGGAA GAAGAAAGGA ACAGTAAGAA AGACAACAGT 1140

AATTTTACCT GCCTTGTGGC CATTGGAAGA TTACAGCCAT ATATTGTTCC ACAGAACAGT 1200

GGAGAGATTA ATGTGAAACC AACTGAATTT ATAACCCGGT TTGCAGTGAA TGGAAAATTT 1260

GTCTATGTAG ATCAAAGGGC AACAGCGATT TTAGGATATC TGCCTCAGGA ACTTTTGGGA 1320

ACTTCTTGTT ATGAATATTT TCATCAAGAT GACCACAATA ATTTGACTGA CAAGCACAAA 1380

GCAGTTCTAC AGAGTAAGGA GAAAATACTT ACAGATTCCT ACAAATTCAG AGCAAAAGAT 1440

GGCTCTTTTG TAACTTTAAA AAGCCAATGG TTTAGTTTCA CAAATCCTTG GACAAAAGAA 1500

CTGGAATATA TTGTATCTGT CAACACTTTA GTTTTGGGAC ATAGTGAGCC TGGAGAAGCA 1560

TCATTTTTAC CTTGTAGCTC TCAATCATCA GAAGAATCCT CTAGACAGTC CTGTATGAGT 1620

GTACCTGGAA TGTCTACTGG AACAGTACTT GGTGCTGGTA GTATTGGAAC AGATATTGCA 1680

AATGAAATTC TGGATTTACA GAGGTTACAG TCTTCTTCAT ACCTTGATGA TTCGAGTCCA 1740

ACAGGTTTAA TGAAAGATAC TCATACTGTA AACTGCAGGA GTATGTCAAA TAAGGAGTTG 1800

TTTCGACCAA GTCCTTCTGA AATGGGGGAG CTAGAGGCTA CCAGGCAAAA CCAGAGTACT 1860

GTTGCTGTCC ACAGCCATGA GCCACTCCTC AGTGATGGTG CACAGTTGGA TTTCGATGCC 1920

CTATGTGACA ATGATGACAC AGCCATGGCT GCATTTATGA ATTACTTAGA AGCAGAGGGG 1980

GGCCTGGGAG ACCCTGGGGA CTTCAGTGAC ATCCAGTGGA CCCTCTAGCC TTTGATTTTT 2040

AACTCCAAAA ATGAGAAACA TTTTAAAGCA TTATTTACGA AAAAACTGTC TCAACTATTC 2100

TTAAGTACTG TATTGATATT GTTTGTATCT TTTATTAATG TTCTACCACT TTTTATAGAT 2160

TTGCATCTTC CTGTCACAGG GATGTGGGGA AATACX3TTTT CCTCCCAAGA GAACCAAGTT 2220

TATTATAGAC TCCTTTATTC AGTGAAATGG CTTATAATCC ACTAGTTGCC ATATTTTTGC 2280

TAAAATATTT CTAACCAAGA ATACTACTTA CATATTGTTT TGGCTTTGTT TTATTTTTGA 2340

TGCAGTTTTT TTTAGTTGAG GTAATGTAAT ATATTGATGT TTTCCTTTGT GTCTAAGATT 2400

GATTTATAAT AGTAGGTTTG TATAATTTGG AACATTTTCC ATGCCTTGCG AATTTCCTTA 2460

ATTGAGGATA GGGCTTACAC ACTTTAAGAA AACAGTGAGT ACTTGAACAT TTAAAGGGAC 2520

AGTGCAATTT ATAGTCATAA TCACATTGAA TACTGTATTT GATCTTTGGA GACTTAGGCA 2580

AGCACAGAGC TGGGATATTT ATGCTCAGTT GAGCACTTTA AGATGAATTT TAAGTGAGAT 2640

GATTTCTTGC TTAAAACTCA GAAAGTCAAA AGAGTTTCAG CTTTCCTTAC AGAAAAGGAA 2700

GGATCTTGGG CCCTAGATCT TGGGGATTAA CCTCTGCATA TAAGATTTAC TCTTAATAGG 2760

CCAGACGTGG TGCTCACGCC TGTAATCCCA GTACTTTGGG AGGCTGAGAC GGGCAGATCA 2820

CTTGAGGTCA GGAGTTCAAG ACCAGCCTGG CCAATATGGT GAAACCCCGT TTCTACTAAA 2880

AATACAAAAA AAATTACCCA GGCACTCACT CTTGAGGTAA CTAACCAACT CCCACGATAA 2940

TGACAGTCCA TTCATGAGCG CAAAGGCCTC ATGACCTAAT GGCACACACC TGTAATCCCA 3000

ACTGCTTGGG AGGCTGAGGC GAGAGGATTG CTTGAACCTG GGAGGCAGAG GTTGCAGTGA 3060

GCCGAGATCG CACCACTGCA CTCCAGTCTG GGCAACAGAG TGAGACTTCA TCTCAAAAAA 3120

AGTAAAAAAA AAGATTTAAT ATAATCACTG AAGATCTCTA TTATAGATAG ATTAGGTTTT 3180

TGACATTGGA AACATACTTA GGGATAGATT TGTCCTAAAG GAAAAAAGTA GGCCCGGGCA 3240

GATTAAATGT CTTGTGTAAA GTCACACATT AAATTCAGTC ACACATTAAA TTCATAGAGT 3300

TTTAAATGTT TAATGTATAT AAACCAGTTT CTTTATACAC ATTTGGGAAA ACATTGGTCT 3360

CACAGATTAA ATGATTAACT AACTGACCCA GGAACTAGTT GTAGCTTTCT AAGTAATTAG 3420

GCAATTACAG TTATTGCCTG TAACCAAAGG TAATAAAACA AAATGACAAG TACATGTTTA 3480

AAATTATGAG GCAATGAGAA ATAATTTAAA AACCAATTTT CTAGTTATAA TTTAAAATTT 3540

GGAGAGCATT TTTAACAGTA ATTAATCCAG AGGTGGCTCA AATTGAGTAT AAGAATTAAG 3600

ATTATTTAAA ATACTGCATG TCTACCTTCT CGGGGATCAT ACTTTATAAC ACTTTCTGCT 3660

TCAGTAGCTC TTCATAGCTT GCCAAGTATG CTCCCATATT TTCTCTCTCG TGCCTCGCAA 3720

ATGAAAGTCA GATAGGCTGG GAACTCATGG GGCAGCCCTC AGACTTCAAT GTGGGCTTCA 3780

AATCCAGTTT CCTGTTCTAT ATGGTGCTAC ATCTTTCCAG AAAATTTCCC TCAGAGCCCC 3840

TCGCCAAAAC AAAGCATTAT TTTGACCCTG CATGCTATTT CTTTAGCTGT AGGTGATAGA 3900

TTAGAACTTC TGTCAGACAT GTTAATGACA AACATACCAA CAGACAATAA CCAAAGCAAA 3960

TGTTTCCTTC AAGTGTGAAA TGTGCAGGGG CTCGTGGGCA AGGATGTATT GGCACACTGT 4020

CCTCTTGAAC TGATAGTGTC CCAGCAATGT TGGAGGTTGG CACCATTCCT GGTCCGACAC 4080

TTGAGGACCT GAGAGACATC AGGTTTAGAA TGAGCCAAAG AAATCCTACA AGATGGGGAG 4140

AATTGGTGTG CAGCAGCCTA AGTGTTATAG TTAAGTCTAA AGAAGTATGA AAGATCCCCT 4200

GTGTTCTCTA AATTGAGCAG AGGGGCCTGC CTACCAATAT CACTTTTTAG GGGACTGAAC 4260

CATTGCAGGT TAGACTTGGC TTCCAAAGAG TCTGCCTAAG CCAGGGGTGG CAGGGTAGGC 4320

CATCATAGCT GGATGGCCTC AAAAGCAGAT GGGGGCAGAC TTGCCCTCGT GATGCCAGGA 4380

TTTGAGAGGC AGAGTTTCTA GAGGGAGACC AGTGCTGCCT CTCACAGTGG CAGTTTTTTC 4440

TCTTTGCAAG AGGAGGGGCT GTTCAATTCC ATAGACCAGT GGGCAGATAG CCAGTTGAAT 4500

ACTCTGTGCA TGGTTTGATC CTTTATTAGT TCGCTCTAAT ATTTTTCTGT AGATCCTTTT 4560

GTCCTGGACT CAAAATCTAA TCCATGCATT GTATGATACC GTAGCTCTCC TAAGGTTTGT 4620

GTTTCCTTCA AAATGTTTTA GTTTTCTTCA ACTAAATTTG ATTTTTGCTG TTAGAAGTGA 4680

CATATTTTTA TGGTATACAC TATGTTCCTT TTTTCTACTG CGAGTCAATT TTTTGAATTT 4740

TCGTGAGAAA GAATATATCT ACAAATTGCA CGAAAGTATC ATAAAAACAG TACTCTAGAG 4800

CAGCGCTGTC CAATAGAAAT ATAATCTGAG CCACATGTAT AATTTTATTT TCTTCTAGCC 4860

ACATTAAAGA AGTAAAAAGA TACAAGTAGA ACTAATTTTA ATGTTTTAAT TCAGTATATC 4920 

CAAAATATCA TTTGAACATG TAATTAATAT AAAATTATTA ATGTGATATT TTACATTCTT 4980 

TTGGTAATAC TAGTCTTCAA AATCTGGTAT GTATCTTACA TTGATAGCAC ATCTCACTTT 5040 

GTACTAGCCA CATTGCAAGT GCTCAGTAGC CACATGTGGC TAGTGGCTAC TGCACTGGAC 5100 

AGCACAGTTC TAGGTTCCAC CCTAACACCC AAGTCCTGTG GATTAGAATC CCAGAATCAG 5160 

AGCTGGAAGT AAACATAGAG ATCAAACCTC CTTTTAAAAA TGAGGACGCT GAGGCACAGA 5220 

GTTTAAATGG CTTGCATGAG GTCATACAGC TAAATTCAGC CTCAACAGGG TCTTCTGATT 5280 

CCAGGCACTC TTCCCACTCC ACTACATTAC TGTAGTGGTA ATTCTTAGGG TTAAAAAAAG 5340 

TGTAGAGTAG GCCGGGCGCA GTGGCTCATG CCTGTAATCC CAGCACTTTG GGAGGCCGAA 5400 

GTGGGCGGAT CACGAGGTCA GGAGATCGAG ACCATCCTGG CCAACATGGT GAAACCCCGT 5460 

CTCTACTGAA AATACAAAGC AAAATTAGCC AGGTGTGGTG GCGGGCGCCT GTGGTCCCAG 5520 

CTGCTCTGGA GGCTGAGGCA GAATGGCGTG AACCCAGGAG GCAGAGATGG CAGTGAGCCA 5580 

AGATCGCGCC ACTGCACCCC AGCCTGGGCG ACAGAGCGAG ACTCCATCTC AAAAAAAAAA 5640 

AAAAAAAAAA AAGAAAAGAA AAGAAAAGTC TAGAGAACAT TATATTAAGT GGTTATTATT 5700 

GAAGTAGACC AAAGTTTATA CCATAAGGAT ATTTTTCCTT AAATACCATG TTTGAAGAAC 5760 

AATTATTTAT TGATCCTTGA ATCTGTAAGA TCAAATAACA AGTCTCTATC CATGTTACCA 5820 

AATTTAACCT TTTGAAAATA ATAAACTTTA AAATATCAGA TGTGTTATTA CAGGATGATA 5880 

CTTGGAATCA AGTGAAATGA GTTATATGGT CATCACTAAA TTTAGAAATC TATTGTGAAA 5940 

CAAAGACAAA CAGGAAAGTA CAGAATAGAG ACTTTTAGTA AATAAATGGA ATTTAAAAGA 6000 

AAGTGTTTAT TTACAGTGTC ACGACAGAAA AGGATGTCTT TGTTGTCATA GTCTTTGAGG 6060 

GATCTCCGTA AAATCTGGGG CACAGGTACA AGAAATAGCC AATATTTAGT TCCCAGACCA 6120 

TGTTTAGTAG TGTCCAGTTT CAGATCATGC TGCCAAGAGG TATCTCCCCC TCAGGTGGGT 6180 

CATCACTGAG CCCTGGAATT GGAGACTCAT ACTTGCCCAG CACAATGTTA CGGGCAGACA 6240 

GGCCGACATC TATGATTAGC TAGAAGCCAT AAAGAAAAGC TGCTAAGTGG CCACTAGGTG 6300 

CCACTTTTCT GTTTTTGTAA TGCTTTCATT AGCAGATCTT TTTTTTCCAA GCTCCATGGG 6360 

GCCTATGAGA GGCATTTATG ATTTTTGTGC CTACAATAAG TCAGCCTGTC TGGTGTGAGT 6420 

TGTTTTATGA GAAATGCTTT CCAAGGGAGG TCTAGGAAGA TCCTGACACA TAAGAACTTT 6480 

GGCTTAGAGA GCTTTCCAGG TGTAGTGCCA ATAAAAACTG ACCTGGAAAG AAAACCTGCC 6540

CAGCACGGAA CATGCTTTCT GAACTCACTT GAGAGTGTAT GGTGTATGTC ACTTCTCATA 6600 

TATTCTTGAG TTTAGATTTG TCTTTTATAC AATTTTTAGC TCTTTTCCAG TTCACTTGTG 6660 

CTCGTCTGTA TATTGGTATT TTTAAATTTT TGTGGTAAAT AATGAAAAGA GTGAAATTAT 6720 

ATTTTATAAT TACTCATTTG TAGTTTTTTT TTTTAATTTA ATAAACTTCC TCCAAAAAGT 6780
GCTCCCTTAA AA 



Seq ID NO: 166 Protein sequence: 
Protein Accession #: AAG34652

1 11 21 31 41 51 

I I I I I I

MAAEEEAAAG GKVLREENQC IAPVVSSRVS PGTRPTAMGS FSSHMTEFPR KRKGSDSDPS 60

QVEDGEHQVK MKAFREAHSQ TEKRRRDKMN NLIEELSAMI PQCNPMARKL DKLTVLRMAV 120

QHLRSLKGLT NSYVGSNYRP SFLQDNELRH LILKTAEGFL FVVGCERGKI LFVSKSVSKI 180

LNYDQASLTG QSLFDFLHPK DVAKVKEQLS SFDISPREKL IDAKTGLQVH SNLHAGRTRV 240

YSGSRRSFFC RIKSCKISVK EEHGCLPNSK KKEHRKFYTI HCTGYLRSWP PNIVGMEEER 300

NSKKDNSNFT CLVAIGRLQP YIVPQNSGEI NVKPTEFITR FAVNGKFVYV DQRATAILGY 360

LPQELLGTSC YEYFHQDDHN NLTDKHKAVL QSKEKILTDS YKFRAKDGSF VTLKSQWFSF 420

TNPWTKELEY IVSVNTLVLG HSEPGEASFL PCSSQSSEES SRQSCMSVPG MSTGTVLGAG 480

SIGTDIANEI LDLQRLQSSS YLDDSSPTGL MKDTHTVNCR SMSNKELFPP SPSEMGELEA 540

TRQNQSTVAV HSHEPLLSDG AQLDFDALCD NDDTAMAAFM NYLEAEGGLG DPGDFSDIQW 600
TL

Seq ID NO: 167 DNA sequence 
Nucleic Acid Accession #: NM_014400
Coding sequence: 86-1126 

1 11 21 31 41 51 

I I I I I I 

GGTTACTCAT CCTGGGCTCA GGTAAGAGGG CCCGAGCTCG GAGGCGGCAC ACCCAGGGGG 60 

GACGCCAAGG GAGCAGGACG GAGCCATGGA CCCCGCCAGG AAAGCAGGTG CCCAGGCCAT 120 

GATCTGGACT GCAGGCTGGC TGCTGCTGCT GCTGCTTCGC GGAGGAGCGC AGGCCCTGGA 180 

GTGCTACAGC TGCGTGCAGA AAGCAGATGA CGGATGCTCC CCGAACAAGA TGAAGACAGT 240 

GAAGTGCGCG CCGGGCGTGG ACGTCTGCAC CGAGGCCGTG GGGGCGGTGG AGACCATCCA 300 

CGGACAATTC TCGCTGGCAG TGCSGGGTTG CGGTTCGGGA CTCCCCGGCA AGAATGACCG 360 

CGGCCTGGAT CTTCACGGGC TTCTGGCGTT CATCCAGCTG CAGCAATGCG CTCAGGATCG 420 

CTGCAACGCC AAGCTCAACC TCACCTCGCG GGCGCTCGAC CCGGCAGGTA ATGAGAGTGC 480 

ATACCCGCCC AACGGCGTGG AGTGCTACAG CTGTGTGGGC CTGAGCCGGG AGGCGTGCCA 540 

GGGTACATCG CCGCCGGTCG TGAGCTGCTA CAACGCCAGC GATCATGTCT ACAAGGGCTG 600 

CTTCGACGGC AACGTCACCT TGACGGCAGC TAATGTGACT GTGTCCTTGC CTGTCCGGGG 660 

CTGTGTCCAG GATGAATTCT GCACTCGGGA TGGAGTAACA GGCCCAGGGT TCACGCTCAG 720 

TGGCTCCTGT TGCCAGGGGT CCCGCTGTAA CTCTGACCTC CGCAACAAGA CCTACTTCTC 780 

CCCTCGAATC CCACCCCTTG TCCGGCTGCC CCCTCCAGAG CCCACGACTG TGGCCTCAAC 840

CACATCTGTC ACCACTTCTA CCTCGGCCCC AGTGAGACCC ACATCCACCA CCAAACCCAT 900 

GCCAGCGCCA ACCAGTCAGA CTCCGAGACA GGGAGTAGAA CACGAGGCCT CCCGGGATGA 960 

GGAGCCCAGG TTGACTGGAG GCGCCGCTGG CCACCAGGAC CGCAGCAATT CAGGGCAGTA 1020 

TCCTGCAAAA GGGGGGCCCC AGCAGCCCCA TAATAAAGGC TGTGTGGCTC CCACAGCTGG 1080 

ATTGGCAGCC CTTCTGTTGG CCGTGGCTGC TGGTGTCCTA CTGTGAGCTT CTCCACCTGG 1140 

AAATTTCCCT CTCACCTACT TCTCTGGCCC TGGGTACCCC TCTTCTCATC ACTTCCTGTT 1200 

CCCACCACTG GACTGGGCTG GCCCAGCCCC TGTTTTTCCA ACATTCCCCA GTATCCCCAG 1260 

CTTCTGCTGC GCTGGTTTGC GGCTTTGGGA AATAAAATAC CGTTGTATAT ATTCTGGCAG 1320 

GGGTGTTCTA GCTTTTTGAG GACAGCTCCT GTATCCTTCT CATCCTTGTC TCTCCGCTTG 1380 

TCCTCTTGTG ATGTTAGGAC AGAGTGAGAG AAGTCAGCTG TCACGGGGAA GGTGAGAGAG 1440 

AGGATGCTAA GCTTCCTACT CACTTTCTCC TAGCCAGCCT GGACTTTGGA GCGTGGGGTG 1500 

GGTGGGACAA TGGCTCCCCA CTCTAAGCAC TGCCTCCCCT ACTCCCCGCA TCTTTGGGGA 1560 

ATCGGTTCCC CATATGTCTT CCTTACTAGA CTGTGAGCTC CTCGAGGGCA GGGACCGTGC 1620 

CTTATGTCTG TGTGTGATCA GTTTCTGGCA CATAAATGCC TCAATAAAGA TTTAATTACT 1680 
TTGTATAGTG AAAAAAAA

Seq ID NO: 168 Protein sequence:
Protein Accession #: NP_055215

1 11 21 31 41 51

I I I I I I

MDPARKAGAQ AMIWTAGWLL LLLLRGGAQA LECYSCVQKA DDGCSPNKMK TVKCAPGVDV 60 

CTEAVGAVET IHGQFSLAVX GCGSGLPGKN DRGLDLHGLL AFIQLQQCAQ DRCNAKLNLT 120 

SRALDPAGNE SAYPPNGVEC YSCVGLSREA CQGTSPPVVS CYNASDHVYK GCFDGNVTLT 180

AANVTVSLPV RGCVQDEFCT RDGVTGPGFT LSGSCCQGSR CNSDLRNKTY FSPRIPPLVR 240

LPPPEPTTVA STTSVTTSTS APVRPTSTTK PMPAPTSQTP RQGVEHEASR DEEPRLTGGA 300 
AGHQDRSNSG QYPAKGGPQQ PHNKGCVAPT AGLAALLLAV AAGVLL 

Seq ID NO: 169 DNA sequence

Nucleic Acid Accession #: NM_006875 
Coding sequence: 186-1190 

1 11 21 31 41 51

I I I I I I

GAATTCGGCA CGAGCGCGCG GCGAATCTCA ACGCTGCGCC GTCTGCGGGC GCTTCCGGGC 60 

CACCAGTTTC TCTGCTTTCC ACCCTGGCGC CCCCCAGCCC TGGCTCCCCA GCTGCGCTGC 120 

CCCGGGCGTC CACGCCCTGC GGGCTTAGCG GGTTCAGTGG GCTCAATCTG CGCAGCGCCA 180 

CCTCCATGTT GACCAAGCCT CTACAGGGGC CTCCCGCGCC CCCCGGGACC CCCACGCCGC 240 

CGCCAGGAGG CAAGGATCGG GAAGCGTTCG AGGCCGAGTA TCGACTCGGC CCCCTCCTGG 300

GTAAGGGGGG CTTTGGCACC GTCTTCGCAG GACACCGCCT CACAGATCGA CTCCAGGTGG 360 

CCATCAAAGT GATTCCCCGG AATCGTGTGC TGGGCTGGTC CCCCTTGTCA GACTCAGTCA 420 

CATGCCCACT CGAAGTCGCA CTGCTATGGA AAGTGGGTGC AGGTGGTGGG CACCCTGGCG 480 

TGATCCGCCT GCTTGACTGG TTTGAGACAC AGGAAGGCTT CATGCTGGTC CTCGAGCGGC 540 

CTTTGCCCGC CCAGGATCTC TTTGACTATA TCACAGAGAA GGGCCCACTG GGTGAAGGCC 600

CAAGCCGCTG CTTCTTTGGC CAAGTAGTGG CAGCCATCCA GCACTGCCAT TCCCGTGGAG 660 

TTGTCCATCG TGACATCAAG GATGAGAACA TCCTGATAGA CCTACGCCGT GGCTGTGCCA 720 

AACTCATTGA TTTTGGTTCT GGTGCCCTGC TTCATGATGA ACCCTACACT GACTTTGATG 780 

GGACAAGGGT GTACAGCCCC CCAGAGTGGA TCTCTCGACA CCAGTACCAT GCACTCCCGG 840 

CCACTGTCTG GTCACTGGGC ATCCTCCTCT ATGACATGGT GTGTGGGGAC ATTCCCTTTG 900

AGAGGGACCA GGAGATTCTG GAAGCTGAGC TCCACTTCCC AGCCCATGTC TCCCCAGACT 960 

GCTGTGCCCT AATCCGCCGG TGCCTGGCCC CCAAACCTTC TTCCCGACCC TCACTGGAAG 1020 

AGATCCTGCT GGACCCCTGG ATGCAAACAC CAGCCGAGGA TGTTACCCCT CAACCCCTCC 1080 

AAAGGAGGCC CTGCCCCTTT GGCCTGGTCC TTGCTACCCT AAGCCTGGCC TGGCCTGGCC 1140 

TGGCCCCCAA TGGTCAGAAG AGCCATCCCA TGGCCATGTC ACAGGGATAG ATGGACATTT 1200

GTTGACTTGG TTTTACAGGT CATTACCAGT CATTAAAGTC CAGTATTACT AAGGTAAGGG 1260 

ATTGAGGATC AGGGGTTAGA AGACATAAAC CAAGTTTGCC CAGTTCCCTT CCCAATCCTA 1320 

CAAAGGAGCC TTCCTCCCAG AACCTGTGGT CCCTGATTTT GGAGGGGGAA CTTCTTGCTT 1380 

CTCATTTTGC TAAGGAAGTT TATTTTGGTG AAGTTGTTCC CATTTTGAGC CCCGGGACTC 1440 

TTATTTTGAT GATGTGTCAC CCCACATTGG CACCTCCTAC TACCACCACA CAAACTTAGT 1500

TCATATGCTT TTACTTGGGC AAGGGTGCTT TCCTTCCAAT ACCCCAGTAG CTTTTATTTT 1560 

AGTAAAGGGA CCCTTTCCCC TAGCCTAGGG TCCCATATTG GGTCAAGCTG CTTACCTGCC 1620 

TCAGCCCAGG ATTTTTTATT TTGGGGGAGG TAATGCCCTG TTGTTACCCC AAGGCTTCTT 1680 

TTTTTTTTTT TTTTTTTTTG GGTGAGGGGA CCCTACTTTG TTATCCCAAG TGCTCTTATT 1740 

CTGGTGAGAA GAACCTTAAT TCCATAATTT GGGAAGGAAT GGAAGATGGA CACCACCGGA 1800 

CACCACCAGA CAATAGGATG GGATGGATGG TTTTTTGGGG GATGGGCTAG GGGAAATAAG 1860 

GCTTGCTGTT TGTTTTCCTG GGGCGCTCCC TCCAATTTTG CAGATTTTTG CAACCTCCTC 1920 

CTGAGCCGGG ATTGTCCAAT TACTAAAATG TAAATAATCA CGTATTGTGG GGAGGGGAGT 1980 

TCCAAGTGTG CCCTCCTTTT TTTTCCTGCC TGGATTATTT AAAAAGCCAT GTGTGGAAAC 2040 
CCACTATTTA ATAAAAGTAA TAGAATCAGA AAAAAAAAAA AAAAAAAA 






Seq ID NO: 170 Protein sequence: 
Protein Accession #: NP_006866 

1 11 21 31 41 51 

I I I I I I 

MLTKPLQGPP APPGTPTPPP GGKDREAFEA EYRLGPLLGK GGFGTVFAGH RLTDRLQVAI 60 
KVIPRNRVLG WSPLSDSVTC PLEVALLWKV GAGGGHPGVI RLLDWFETQE GFMLVLERPL 120 
PAQDLFDYIT EKGPLGEGPS RCFFGQVVAA IQHCHSRGVV HRDIKDENIL IDLRRGCAKL 180
IDFGSGALLH DEPYTDFDGT RVYSPPEWIS RHQYHALPAT VWSLGILLYD MVCGDIPFER 240 
DQEILEAELH FPAHVSPDCC ALIRRCLAPK PSSRPSLEEI LLDPWMQTPA EDVTPQPLQR 300 
RPCPFGLVLA TLSLAWPGLA PNGQKSHPMA MSQG 
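
As a consistency check on the listing above, the coding range stated for Seq ID NO: 169 (186-1190) can be compared with its start codon and with the length of Seq ID NO: 170. The Python sketch below is illustrative only and is not part of the original listing; the helper names `join_listing_rows` and `codon_count` are hypothetical. It assumes that residue blocks are purely alphabetic and that any purely numeric token is a position counter or a page line number.

```python
def join_listing_rows(rows):
    """Concatenate the residue blocks of listing rows, ignoring numeric tokens."""
    blocks = []
    for row in rows:
        for token in row.split():
            # Assumption: residue blocks are alphabetic; counters are digits.
            if token.isalpha():
                blocks.append(token.upper())
    return "".join(blocks)


def codon_count(cds_start, cds_end):
    """Number of codons in a 1-based, inclusive coding range."""
    length = cds_end - cds_start + 1
    if length % 3 != 0:
        raise ValueError("coding range is not a whole number of codons")
    return length // 3


# First four rows of Seq ID NO: 169 as printed in the listing (positions 1-240).
rows = [
    "GAATTCGGCA CGAGCGCGCG GCGAATCTCA ACGCTGCGCC GTCTGCGGGC GCTTCCGGGC 60",
    "CACCAGTTTC TCTGCTTTCC ACCCTGGCGC CCCCCAGCCC TGGCTCCCCA GCTGCGCTGC 120",
    "CCCGGGCGTC CACGCCCTGC GGGCTTAGCG GGTTCAGTGG GCTCAATCTG CGCAGCGCCA 180",
    "CCTCCATGTT GACCAAGCCT CTACAGGGGC CTCCCGCGCC CCCCGGGACC CCCACGCCGC 240",
]
seq = join_listing_rows(rows)

# Positions are 1-based in the listing, so position 186 is index 185.
start_codon = seq[186 - 1:186 + 2]
codons = codon_count(186, 1190)

print(start_codon, codons)
```

Under these assumptions the stated range begins with ATG and spans 335 codons, that is, 334 residues plus a stop codon, which agrees with the 334 residues listed for Seq ID NO: 170.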

Seq ID NO: 171 DNA sequence

Nucleic Acid Accession #: NM_003646
Coding sequence: 89..2875

1 11 21 31 41 51

| | | | | |

GCGGCGCGGA GCGGGCGTGC TGAGCCCCGG CCGCCGGCCC GGCATGGGCG TCTCCCGCGG 60 

GCCCTCCGCC GGCCGGGGCT AGGGCCGGAT GGAGCCGCGG GACGGTAGCC CCGAGGCCCG 120 

GAGCAGCGAC TCCGAGTCGG CTTCCGCCTC GTCCAGCGGC TCCGAGCGCG ACGCCGGTCC 180 

CGAGCCGGAC AAGGCGCCGC GGCGACTCAA CAAGCGGCGC TTCCCGGGGC TGCGGCTCTT 240 

CGGGCACAGG AAAGCCATCA CCAAGTCGGG CCTCCAGCAC CTGGCCCCCC CTCCGCCCAC 300 

CCCTGGGGCC CCGTGCAGCG AGTCAGAGCG GCAGATCCGG AGTACAGTGG ACTGGAGCGA 360 

GTCAGCGACA TATGGGGAGC ACATCTGGTT CGAGACCAAC GTGTCCGGGG ACTTCTGCTA 420 

CGTTGGGGAG CAGTACTGTG TAGCCAGGAT GCTGAAGTCA GTGTCTCGAA GAAAGTGCGC 480 

AGCCTGCAAG ATTGTGGTGC ACACGCCCTG CATCGAGCAG CTGGAGAAGA TAAATTTCCG 540 

CTGTAAGCCG TCCTTCCGTG AATCAGGCTC CAGGAATGTC CGCGAGCCAA CCTTTGTACG 600 

GCACCACTGG GTACACAGAC GACGCCAGGA CGGCAAGTGT CGGCACTGTG GGAAGGGATT 660

CCAGCAGAAG TTCACCTTCC ACAGCAAGGA GATTGTGGCC ATCAGCTGCT CGTGGTGCAA 720 



GCAGGCATAC CACAGCAAGG TGTCCTGCTT
GCTGGGGGTC CACGCAGCCG TGGTCATCCC
CCAGAATACT CTGAAAGCAA GCAAGAAGAA 
CAAGAAAGGG CCTGAGGAGG GCCGCTGGAG 
GCTCATGAAG CCCCTGCTGG TGTTTGTGAA 
GATCATCCAG TCTTTCCTCT GGTATCTCAA 
AGGGCCCAAG GAGGCGCTGG AGATGTACCG 
CGGGGGCGAC GGCACGGTGG GCTGGATCCT 
GCCACCCCCT GTTGCCATCC TGCCCCTGGG 
CTGGGGTGGG GGCTACACAG ATGAGCCTGT 
GAACGTGGTA CAGCTGGACC GCTGGGACCT 
TGAGGACCGA GATGAAGGCG CCACCGACCG 
CAGCCTGGGC TTTGACGCCC ACGTCACCCT 
AGAGAAATTC AACAGCCGCT TTCGGAATAA 
CTTCCTGATG GGCAGCTCCA AGGACCTGGC 
GGACTTGACT CCCAAGATCC AGGACCTGAA 
CAGGTACTGT GCGGGCACCA TGCCCTGGGG 
CCAGCGGCAT GACGACGGCT ACCTCGAGGT 
GCTGCAGGTG GGCGGACACG GCGAGCGGCT 
ATCCAAGGCC ATCCCGGTGC AGGTGGATGG 
CCGCATCGCC CTGCGCAACC AGGCCACCAT 
CCCCCTGCAC AGCGACCAGC AGCCGGTGCC 
CAGCATGCAC GACTATGAGG CCCTGCACTA 
GCCGCTGGGC ACTGTGGTGG TCCCAGGAGA 
TGAGAGACTC CAGCAGGAGC CCGATGGTGC 
GTCCCCCAAG TGGTGCTTCC TGGACGCCAC 
AGCCCAGGAG CACCTCAACT ATGTGACTGA 
CCCTGAGCTG CTGGGGGCAT CGGCCCGGCC 
CACCTCACCC TGCTCACCCA CGCCCCGGTC 
TGAAGAGCTG ATTGAGGCTG CCAAGAGGAA 
AGCTGGGGGC GACCTCATGC ACCGAGACGA 
CAGCACTGGC AGCAAGGATG TGGTCCGCTA 
TGATGCGGTG GAGGAAAACG GGGAGACCTG 
CACCATCTGC CACTACATCG TGGAGGCCGG 
CGACACTCCC CGGCAGCGGG CTGAGAAGGC 
GAACCGGCAG CACTACCAGA TGATCCAGCG 

Seq ID NO: 172 Protein sequence: 
Protein Accession #: NP_003637






CATGCTGCAG 
GCCCACCTGG 
GAAGAGGGCA 
ACCCTTCATC 
CCCCAAGAGT 
TCCCCGACAA 
CAAAGTGCAC 
CTCCACCCTG 
TACTGGCAAC 
GTCCAAGATC 
CCACGCTGAG 
GTTGCCCCTG 
GGAGTTCCAC 
GATGTTCTAC 
CAAGCACATC 
ACCCCAGTGT 
CCACCCTGGG 
CATTGGCTTC 
GACGCAGTGT 
CGAGCCCTGC 
GGTGCAGAAG 
AGAGCAGTTG 
CGACAAGGAG 
CAGTGACCTA 
TGGAGCCAAG 
CACTGCCAGC 
GATCGCACAG 
TGACCTCCCA 
ACTGCAAGGG 
CGACTTCTGT 
GCAGAGTCGC 
CCTGCTGGAC 
TTTGCACCAA 
GGCCTCGCTC 
TCAGGACACC 
GGAGGACCAG 



CAGATCGAGG 
ATCCTCCGCG
TCCTTCAAGA 
ATCAGGCCCA 
GGGGGCAACC 
GTCTTCGACC 
AACCTGCGGA 
GACCAGCTAC 
GACTTGGCCC 
CTCTCCCACG 
CCCAACCCCG 
GATGTCTTCA 
GAGTCTCGAG 
GCCGGGACAG 
CGAGTGGTGT 
GTTGTTTTCC 
GAGCACCACG 
ACCATGACGT 
CGCGAGGTGG 
AAGCTTGCAG 
GCCAAGCGGC 
CGCATCCAGG 
CAGCTCAAGG 
GAGCTCTGCC 
TCCCCGACAT 
CGCTTCTACA 
GATGAGATTT 

acccccactt 
gatgctgcac 
aagctccagg 
acgctcctgc 
cacgcccccc 

GCAGCGGCCC 
ATGAAGACAG 
GAGCTGGCCG 
GAGACGGCTG 



AGCCGTGCTC 
CCCGGAGGCC 
GGAAGTCCAG 
CCCCCTCCCC 
AGGGTGCAAA 
TGAGCCAGGG 
TCCTGGCGTG 
GCCTGAAGCC 
GAACCCTCAA 
TGGAGGAGGG 
AGGCAGGGCC 
ACAACTACTT 
AGGCCAACCC 
CTTTCTCTGA 
GTGATGGAAT 
TGAACATCCC 
ACTTTGAGCC 
CGTTGGCCGC 
TGCTCACCAC 
CCTCACGCAT 
GGAGCGCCGC 
TGAGTCGCGT 
AGGCCTCTGT 
GTGCCCACAT 
GCCAGAAACT 
GGATCGACCG 
ATATCCTGGA 



CCCCTCAAGG 
AGCTGCACCG 
ACCACGCAGT 
CAGAGATCCT 
TGGGCCAGCG 
ACCAGCAGGG 
CCTACCTGGA 
TGTAGCGGGC 



MEPRDGSPEA 
GLQHLAPPPP 
MLKSVSRRKC
DGKCRHCGKG 
PPTWILRARR 
NPKSGGNQGA 
LSTLDQLRLK 
LHAEPNPEAG 
KMPYAGTAFS 
GHPGEHHDFE 
GEPCKLAASR 
YDKEQLKEAS 
TTASRFYRID 
SLQGDAAPPQ 
YLLDHAPPEI 
AQDTELAAYL 



11 
I 

RSSDSESASA 
TPGAPCSESE 
AACKIVVHTP
FQQKFTFHSK 
PQNTLKASKK
KIIQSFLWYL 
PPPPVAILPL 
PEDRDEGATD 
DFLMGSSKDL 
PQRHDDGYLE 
IRIALRNQAT 
VPLGTVVVPG
RAQEHLNYVT 
GEELIEAAKR 
LDAVEENGET 
ENRQHYQMIQ 



21 
1 

SSSGSERDAG 
RQIRSTVDWS 
CIEQLEKINF 
EIVAISCSWC 
KKRASFKRKS 
NPRQVFDLSQ 
GTGNDLARTL 
RLPLDVFNNY 
AKHIRVVCDG
VIGFTMTSLA 
MVQKAKRRSA 
DSDLELCRAH 
EIAQDEIYIL 
NDFCKLQELH 
CLHQAAALGQ 
REDQETAV 



31 
I 

PEPDKAPRRL 
ESATYGEHIW 
RCKPSFRESG 
KQAYHSKVSC 
SKKGPEEGRW 
GGPKEALEMY 
NWGGGYTDEP 
FSLGFDAHVT 
MDLTPKIQDL 
ALQVGGHGER 
APLHSDQQPV 
IERLQQEPDG 
DPELLGASAR 
RAGGDLMHRD 
RTICHYIVEA 



Seq ID NO: 173 DNA sequence
Nucleic Acid Accession #: AF232772
Coding sequence: 1-1662

1 11 21 31 41 51

I I I I I I

ATGCCGGTGC AGCTGACGAC AGCCCTGCGT GTGGTGGGCA CCAGCCTGTT TGCCCTGGCA 60
GTGCTGGGTG GCATCCTGGC AGCCTATGTG ACGGGCTACC AGTTCATCCA CACGGAAAAG 120
CACTACCTGT CCTTCGGCCT GTACGGCGCC ATCCTGGGCC TGCACCTGCT CATTCAGAGC 180
CTTTTTGCCT TCCTGGAGCA CCGGCGCATG CGACGTGCCG GCCAGGCCCT GAAGCTGCCC 240
TCCCCGCGGC GGGGCTCGGT GGCACTGTGC ATTGCCGCAT ACCAGGAGGA CCCTGACTAC 300
TTGCGCAAGT GCCTGCGCTC GGCCCAGCGC ATCTCCTTCC CTGACCTCAA GGTGGTCATG 360
GTGGTGGATG GCAACCGCCA GGAGGACGCC TACATGCTGG ACATCTTCCA CGAGGTGCTG 420
GGCGGCACCG AGCAGGCCGG CTTCTTTGTG TGGCGCAGCA ACTTCCATGA GGCAGGCGAG 480
GGTGAGACGG AGGCCAGCCT GCAGGAGGGC ATGGACCGTG TGCGGGATGT GGTGCGGGCC 540
AGCACCTTCT CGTGCATCAT GCAGAAGTGG GGAGGCAAGC GCGAGGTCAT GTACACGGCC 600
TTCAAGGCCC TCGGCGATTC GGTGGACTAC ATCCAGGTGT GCGACTCTGA CACTGTGCTG 660
GATCGAGCCT GCACCATCGA GATGCTTCGA GTCCTGGAGG AGGATCCCCA AGTAGGGGGA 720
GTCGGGGGAG ATGTCCAGAT CCTCAACAAG TACGACTCAT GGATTTCCTT CCTGAGCAGC 780
GTGCGGTACT GGATGGCCTT CAACGTGGAG CGGGCCTGCC AGTCCTACTT TGGCTGTGTG 840
CAGTGTATTA GTGGGCCCTT GGGCATGTAC CGCAACAGCC TCCTCCAGCA GTTCCTGGAG 900
GACTGGTACC ATCAGAAGTT CCTAGGCAGC AAGTGCAGCT TCGGGGATGA CCGGCACCTC 960
ACCAACCGAG TCCTGAGCCT TGGCTACCGA ACTAAGTATA CCGCGCGCTC CAAGTGCCTC 1020
ACAGAGACCC CCACTAAGTA CCTCCGGTGG CTCAACCAGC AAACCCGCTG GAGCAAGTCT 1080
TACTTCCGGG AGTGGCTCTA CAACTCTCTG TGGTTCCATA AGCACCACCT CTGGATGACC 1140
TACGAGTCAG TGGTCACGGG TTTCTTCCCC TTCTTCCTCA TTGCCACGGT TATACAGCTT 1200
TTCTACCGGG GCCGCATCTG GAACATTCTC CTCTTCCTGC TGACGGTGCA GCTGGTGGGC 1260
ATTATCAAGG CCACCTACGC CTGCTTCCTT CGGGGCAATG CAGAGATGAT CTTCATGTCC 1320



780 
840 
900 
960
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820
2880 



41 


51 




1 

NKRRFPGLRL 


1 

FGHRKAITKS 


60 


FETNVSGDFC 


YVGEQYCVAR 


120 


SRNVREPTFV 


RHHWVHRRRQ 


180 


FMLQQIEEPC 


SLGVHAAVVI


240 


RPFIIRPTPS 


PLMKPLLVFV 


300 


RKVHNLRILA 


CGGDGTVGWI 


360 


VSKILSHVEE 


GNVVQLDRWD


420 


LEFHESREAN 


PEKFNSRFRN 


480 


KPQCVVFLNI


PRYCAGTMPW 


540 


LTQCREVVLT


TSKAIPVQVD 


600 


PEQLRIQVSR 


VSMHDYEALH 


660 


AGAKSPTCQK 


LSPKWCFLDA 


720 


PDLPTPTSPL 


PTSPCSPTPR 


780 


EQSRTLLHHA 


VSTGSKDVVR


840 


GASLMKTDQQ 


GDTPRQRAEK 


900 



60 
120 
180 
240 
300 
360 
420
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 

CTCTACTCCC TCCTCTATAT GTCCAGCCTT 
ATCAACAAAT CTGGCTGGGG CACCTCTGGC 
CTCATTCCTG TGTCCATCTG GGTGGCAGTT 
TGCCAGGACC TGTTCAGTGA GACAGAGCTA 
GGCTGCTACT GGGTGGCCCT CCTCATGCTA 
AAGAAGCCGG AGCAGTACAG CTTGGCTTTT
CGGGTAAAGT GCAATGGGTA AGGGAGGGAA 
GGGAGGAGGG AGTGCTGTGT TTTAGTCTCT 
AAGAACGGTG ATGTAGTATG GCCTGACAGC 
AGATGCAGGG CTGCAGGGGA TTCTGTGTTT 
TAGGCAGTAG CCTCCTCCTG GGCTCCAGAG 
AGTCCCATTC AGTGGCAACT TGTGATAGGT 
TTCTCCCAGC CCATCTGAAC ACAACCAGAG 
GGCOGGTTAG TGTATGTCAC CCCCACCCCA 
GOGCGTGAGA TACAAGGCCC AGAAGCCTGA 
ATGGTGCTTT ATGTGAGATA CCCCACTCCA 
CAGGGAGTTA GCACTGAACT GCTTTTAAAA 
GGAACAAAGA GATTGTGGTG GTGCTAAAGG 
TTCCACCTGG AAACTGCTCA GACGTCTAGA 
GGGGAGATAA AAAGATTAAG CCCCAACATG 
TAACCTGTAT ACTCTTGAAT TCCTCTCAAA 
CCCCACTTCA CTTTCTTCAA AGCCACATTT 
CCCTCATCAT CATAGGTAAG GTTTTCAAGG 
TAGAAGCTTC AGCAGGAGGC AAGCGTGTTC 
TGATCAAATT GGCTACAATC TTGGAGCTGC 
ATGTGTGACT TTCAGGCTAC TGTTCTTGAC 
TCCCAAAGTG AACTCTCAAA TCCAAAATGG 
GGCTTCTCCA GGGAATTCTT ACAGCCAAGT 
TCCAGAAACC AAACTAGGAG ATGAAACTGG 
CTCATGCCTC CTGAGGCTGT TTTTGGCTGT 
GAAGCCATTT TCCAAGTGAC TTGCAATCCA 
TGGGATCCTG ACTAAGCCTT TGACTTAAGG 
TCAAAGGGGC CAACTAACCC GTGCAGAACC 
AAGCCTCTAA TGTACCAAGT GCTTCCTACA 
GGGCAAACCC TGGTGCTTTC CTTCATCTCC 
AACAGTTGCC ACATCACACA GACCTCCAGT
CCAACCCATT GAAGGCTGGA AGGCAGCAGG 
GTTCTGCTGG CCAAGAAGTT AAACTATTTT 
CAAGTGCAGA GTTCAGACTT CGCTAAGGGC 
AATGTAGGAT GACAGGCTCT CCTGGCTGTC 
GTCAACTTTC CTCAAATCAA AAACAGGCAG 
TCGACTGGTT TTTCTAAGTT ATTTTGTACA 
CAGCTTTATC CCCGTTTCTT GCAAGGGAAG 
TTTCCTCATT GAGAATTCAA ATCCTCTTTT 
TTTATTTTTA CCTGCTTTTT TTTTTTTTTT 
CTTATTTGTC AAAATAAAGA TTCTCACAT 

Seq ID NO: 174 Protein sequence: 
Protein Accession #: AAF36984 






CTGCCGGCCA 
CGAAAAACCA 
CTCCTGGAGG 
GCCTTCCTTG 
TATCTGGCCA 
GCTGAGGTGT 
GGGGAATGGA 
TAATGGTCCA 
TCTGTTTAGA 
TCAGACTGCC 
GGCACTCAGA 
ACCTGAGTGA 
GTGGCAGGAG 
CCCATAAGTA 
TCTTTGGGCA 
CATCAACATT 
GTGCACATTA 
AGGCCATAAG 
TGGGTTCTTA 
TTCAGAAAAG 
TTCAGCTCTG 
TTTGAGGTAT 
TGGCAATTGG 
TCAGCACATA 
TTGGACGGAT 
AATCATCTCC 
TTATCTTTGA 
TGTGACAGTC 
TTCCTACATC 



GGCTGTTCTC 
GTTGCTTGCT 
AGCACTAAGG 
AAGACGCAAG 
CACGAACTCA 
TTCTGGTAAG 
CATTTGCTAA 
GAGCATTAGA 

CTACCATCAG 
GTACAGGTAG 
TTTTTCAGCA 
AGCCTTTATA 
GTATTGTTTC 
TAATTTTCAG 



AGATCTTTGC 
TTGTGGTGAA 
GGCTGGCCTA 
TCTCTGGGGC
TCATCGCCCG 
GACATGGCCC 
AGAGAAAAGA 
AAGGACAAAT 
GGAGGCAACA 
TGTCTGCTTG 
AGTTGTGCTA 
CGGCAACCTG 
AATTTCTACT 
GTCATCAATG 
TCAGAAAACA 
CCAGGGATGA 
AAAAGGAAAG 
CTACACAGAG 
GCTTGTCTGT 
AAGTGAAGTC 
ATCTGAGGCT 
CACTGCAGTC 
GGCGGAGCCC 
TGGGAACTAT 
TCCTTGGCAG 
AATGGAAAGC 
GACCATCCAT 
ACTGCATTTG 
CTAAGGTTCT 
CTGCTTTTGG 
AGCGTTTTGA 
TGCCCTCCAA 
TGGACAGCAG 
GTGTGCTCCG 
AGGGTTTTCC 
ACTGCTGGTT 
GGCAGCTGAT 
ATGGAGGAAA 
TCAGCATTTA 
CTCTGCCTTG 
TGGGCTCACA 
GCAAAACCAA 
CAATTGGACG 
TACAATAATT 
GTCAAGTTTT 



MPVQLTTALR 
LFAFLEHRRM 
VVDGNRQEDA
STFSCIMQKW 
VGGDVQILNK 
DWYHQKFLGS 
YFREWLYNSL 
I IKATYACFL 
LIPVSIWVAV 
KKPEQYSLAF 



11 

I 

VVGTSLFALA
RRAGQALKLP 
YMLDIFHEVL 
GGKREVMYTA 
YDSWISFLSS 
KCSFGDDRHL 
WFHKHHLWMT
RGNAEMIFMS 
LLEGLAYTAY 
AEV



21 
I 

VLGGILAAYV 
SPRRGSVALC 
GGTEQAGFFV 
FKALGDSVDY 
VRYWMAFNVE 
TNRVLSLGYR 
YESVVTGFFP
LYSLLYMSSL 
CQDLFSETEL 



31 
I 

TGYQFIHTEK 
IAAYQEDPDY 
WRSNFHEAGE 
IQVCDSDTVL 
RACQSYFGCV 
TKYTARSKCL 
FFLIATVIQL
LPAKIFAIAT 
AFLVSGAILY 



41 

I 

HYLSFGLYGA 
LRKCLRSAQR 
GETEASLQEG 
DPACTIEMLR 
QCISGPLGMY 
TETPTKYLRW 
FYRGRIWNIL 
INKSGWGTSG 
GCYWVALLML 



CATTGCTACC 

CTTCATTGGC 

CACAGCTTAT 

TATACTGTAT 

GCGATGTGGG

CCAAGCAGAG 

CAGGGTGGGA 

CTAAAATGCA 

CTGATCCCCC 

CATCTGCACA 

AACCAAGTTA 

CGGAAGGAGG 

GAGCGAGGTG 

CAATAAGATT 

GGGTCCAGGA 

GCCAAACCAG 

TTTGCCAGGA 

GCCTTGGGTG 

GATCTCTGCT 

TTGGGTATTT 

AAGACACACT 

ACCTCTTCTA 

CGGCTTCTTA 

GAGGAGCCTC 

CCGGGTTAGC 

TTTTCAGTGT 

TCTCCTCAGT 

CCTGCTTCTT 

TGCTTTCTCT 

GGAATGAGGG 

GTTTAAAACC 

ATGTCCTTTC 

ACAAGAGGGC 

AACCACAGAT 

AAGTGTAGCT 

GACATCAGAC 

CCAGGCAATC 

TCCGGTCAGC 

CTTGAAGATT 

CACTGTGGTC 

ACGTTTGACC 

ACTGGGTCTT 

CATTTTGGTT 

TGTAAACATA 

TTATACTGCA 



51 
I 

ILGLHLLIQS 
ISFPDLKVVM
MDRVRDVVRA
VLEEDPQVGG 
RNSLLQQFLE 
LNQQTRWSKS 
LFLLTVQLVG 
RKTIVVNFIG
YLAIIARRCG 



1380 
1440 
1500 
1560 
1620 
1680 
1740 
2040
2100
2160
2460
2520
2640
2820
2880
3240
3300
3360
3900
3960 
4020 



60 
120 
180 
240 
300 
360 
420 
480 
540 



Seq ID NO: 175 DNA sequence
Nucleic Acid Accession #: NM_000691
Coding sequence: 43..1404

1 11 21 31 41 51

I I I I I I

CCAGGAGCCC CAGTTACCGG GAGAGGCTGT GTCAAAGGCG CCATGAGCAA GATCAGCGAG 60
GCCGTGAAGC GCGCCCGCGC CGCCTTCAGC TCGGGCAGGA CCCGTCCGCT GCAGTTCCGA 120
TTCCAGCAGC TGGAGGCGCT GCAGCGCCTG ATCCAGGAGC AGGAGCAGGA GCTGGTGGGC 180
GCGCTGGCCG CAGACCTGCA CAAGAATGAA TGGAACGCCT ACTATGAGGA GGTGGTGTAC 240
GTCCTAGAGG AGATCGAGTA CATGATCCAG AAGCTCCCTG AGTGGGCCGC GGATGAGCCC 300
GTGGAGAAGA CGCCCCAGAC TCAGCAGGAC GAGCTCTACA TCCACTCGGA GCCACTGGGC 360
GTGGTCCTCG TCATTGGCAC CTGGAACTAC CCCTTCAACC TCACCATCCA GCCCATGGTG 420
GGCGCCATCG CTGCAGGGAA CGCAGTGGTC CTCAAGCCCT CGGAGCTGAG TGAGAACATG 480
GCGAGCCTGC TGGCTACCAT CATCCCCCAG TACCTGGACA AGGATCTGTA CCCAGTAATC 540
AATGGGGGTG TCCCTGAGAC CACGGAGCTG CTCAAGGAGA GGTTCGACCA TATCCTGTAC 600
ACGGGCAGCA CGGGGGTGGG GAAGATCATC ATGACGGCTG CTGCCAAGCA CCTGACCCCT 660
GTCACGCTGG AGCTGGGAGG GAAGAGTCCC TGCTACGTGG ACAAGAACTG TGACCTGGAC 720
GTGGCCTGCC GACGCATCGC CTGGGGGAAA TTCATGAACA GTGGCCAGAC CTGCGTGGCC 780
CCAGACTACA TCCTCTGTGA CCCCTCGATC CAGAACCAAA TTGTGGAGAA GCTCAAGAAG 840
TCACTGAAAG AGTTCTACGG GGAAGATGCT AAGAAATCCC GGGACTATGG AAGAATCATT 900
AGTGCCCGGC ACTTCCAGAG GGTGATGGGC CTGATTGAGG GCCAGAAGGT GGCTTATGGG 960
GGCACCGGGG ATGCCGCCAC TCGCTACATA GCCCCCACCA TCCTCACGGA CGTGGACCCC 1020
CAGTCCCCGG TGATGCAAGA GGAGATCTTC GGGCCTGTGC TGCCCATCGT GTGCGTGCGC 1080
AGCCTGGAGG AGGCCATCCA GTTCATCAAC CAGCGTGAGA AGCCCCTGGC CCTCTACATG 1140
TTCTCCAGCA ACGACAAGGT GATTAAGAAG ATGATTGCAG AGACATCCAG TGGTGGGGTG 1200
GCGGCCAACG ATGTCATCGT CCACATCACC TTGCACTCTC TGCCCTTCGG GGGCGTGGGG 1260
AACAGCGGCA TGGGATCCTA CCATGGCAAG AAGAGCTTCG AGACTTTCTC TCACCGCCGC 1320
TCTTGCCTGG TGAGGCCTCT GATGAATGAT GAAGGCCTGA AGGTCAGATA CCCCCCGAGC 1380
CCGGCCAAGA TGACCCAGCA CTGAGGAGGG GTTGCTCCGC CTGGCCTGGC CATACTGTGT 1440
CCCATCGGAG TGCGGACCAC CCTCACTGGC TCTCCTGGCC CTGGAGAATC GCTCCTGCAG 1500
CCCCAGCCCA GCCCCACTCC TCTGCTGACC TGCTGACCTG TGCACACCCC ACTCCCACAT 1560
GGGCCCAGGC CTCACCATTC CAAGTCTCCA CCCCTTTCTA GACCAATAAA GAGACAAATA 1620
CAATTTTCTA ACTCGG

Seq ID NO: 176 Protein sequence:
Protein Accession #: NP_000682

1 11 21 31 41 51

I I I I I I

MSKISEAVKR ARAAPSSGRT RPLQFRFQQL EALQRLIQEQ EQELVGALAA DLHKKEWNAY 60
YEEVVYVLEE IEYMIQKLPE WAADEPVEKT PQTQQDELYI HSEPLGVVLV IGTWNYPFNL 120
TIQPMVGAIA AGNAVVLKPS ELSENMASLI ATIIPQYLDK DLYPVINGGV PETTELLKER 180
FDHILYTGST GVGKIIMTAA AKHLTPVTLE LGGKSPCYVD KNCDLDVACR RIAWGKFMNS 240
GQTCVAPDYI LCDPSIQNQI VEKLKKSLKE FYGEDAKKSR DYGRIISARH FQRVMGLIEG 300
QKVAYGGTGD AATRYIAPTI LTDVDPQSPV MQEEIFGPVL PIVCVRSLEE AIQFINQREK 360
PLALYMFSSN DKVIKKMIAE TSSGGVAAND VIVHITLHSL PFGGVGNSGM GSYHGKKSFE 420
TFSHRRSCLV RPLMNDEGLK VRYPPSPAKM TQH

Seq ID NO: 177 DNA sequence

Nucleic Acid Accession #: NM_001067

Coding sequence: 108-4703

1 11 21 31 41 51

I I I I I I

CTAACCGACG CGCGTCTGTG GAGAAGCGGC TTGGTCGGGG GTGGTCTCGT GGGGTCCTGC 60
CTGTTTAGTC GCTTTCAGGG TTCTTGAGCC CCTTCACGAC CGTCACCATG GAAGTGTCAC 120
CATTGCAGCC TGTAAATGAA AATATGCAAG TCAACAAAAT AAAGAAAAAT GAAGATGCTA 180
AGAAAAGACT GTCTGTTGAA AGAATCTATC AAAAGAAAAC ACAATTGGAA CATATTTTGC 240
TCCGCCCAGA CACCTACATT GGTTCTGTGG AATTAGTGAC CCAGCAAATG TGGGTTTACG 300
ATGAAGATGT TGGCATTAAC TATAGGGAAG TCACTTTTGT TCCTGGTTTG TACAAAATCT 360
TTGATGAGAT TCTAGTTAAT GCTGCGGACA ACAAACAAAG GGACCCAAAA ATGTCTTGTA 420
TTAGAGTCAC AATTGATCCG GAAAACAATT TAATTAGTAT ATGGAATAAT GGAAAAGGTA 480
TTCCTGTTGT TGAACACAAA GTTGAAAAGA TGTATGTCCC AGCTCTCATA TTTGGACAGC 540
TCCTAACTTC TAGTAACTAT GATGATGATG AAAAGAAAGT GACAGGTGGT CGAAATGGCT 600
ATGGAGCCAA ATTGTGTAAC ATATTCAGTA CCAAATTTAC TGTGGAAACA GCCAGTAGAG 660
AATACAAGAA AATGTTCAAA CAGACATGGA TGGATAATAT GGGAAGAGCT GGTGAGATGG 720
AACTCAAGCC CTTCAATGGA GAAGATTATA CATGTATCAC CTTTCAGCCT GATTTGTCTA 780
AGTTTAAAAT GCAAAGCCTG GACAAAGATA TTGTTGCACT AATGGTCAGA AGAGCATATG 840
ATATTGCTGG ATCCACCAAA GATGTCAAAG TCTTTCTTAA TGGAAATAAA CTGCCAGTAA 900
AAGGATTTCG TAGTTATGTG GACATGTATT TGAAGGACAA GTTGGATGAA ACTGGTAACT 960
CCTTGAAAGT AATACATGAA CAAGTAAACC ACAGGTGGGA AGTGTGTTTA ACTATGAGTG 1020
AAAAAGGCTT TCAGCAAATT AGCTTTGTCA ACAGCATTGC TACATCCAAG GGTGGCAGAC 1080
ATGTTGATTA TGTAGCTGAT CAGATTGTGA CTAAACTTGT TGATGTTGTG AAGAAGAAGA 1140
ACAAGGGTGG TGTTGCAGTA AAAGCACATC AGGTGAAAAA TCACATGTGG ATTTTTGTAA 1200
ATGCCTTAAT TGAAAACCCA ACCTTTGACT CTCAGACAAA AGAAAACATG ACTTTACAAC 1260
CCAAGAGCTT TGGATCAACA TGCCAATTGA GTGAAAAATT TATCAAAGCT GCCATTGGCT 1320
GTGGTATTGT AGAAAGCATA CTAAACTGGG TGAAGTTTAA GGCCCAAGTC CAGTTAAACA 1380
AGAAGTGTTC AGCTGTAAAA CATAATAGAA TCAAGGGAAT TCCCAAACTC GATGATGCCA 1440
ATGATGCAGG GGGCCGAAAC TCCACTGAGT GTACGCTTAT CCTGACTGAG GGAGATTCAG 1500
CCAAAACTTT GGCTGTTTCA GGCCTTGGTG TGGTTGGGAG AGACAAATAT GGGGTTTTCC 1560
CTCTTAGAGG AAAAATACTC AATGTTCGAG AAGCTTCTCA TAAGCAGATC ATGGAAAATG 1620
CTGAGATTAA CAATATCATC AAGATTGTGG GTCTTCAGTA CAAGAAAAAC TATGAAGATG 1680
AAGATTCATT GAAGACGCTT CGTTATGGGA AGATAATGAT TATGACAGAT CAGGACCAAG 1740
ATGGTTCCCA CATCAAAGGC TTGCTGATTA ATTTTATCCA TCACAACTGG CCCTCTCTTC 1800
TGCGACATCG TTTTCTGGAG GAATTTATCA CTCCCATTGT AAAGGTATCT AAAAACAAGC 1860
AAGAAATGGC ATTTTACAGC CTTCCTGAAT TTGAAGAGTG GAAGAGTTCT ACTCCAAATC 1920
ATAAAAAATG GAAAGTCAAA TATTACAAAG GTTTGGGCAC CAGCACATCA AAGGAAGCTA 1980
AAGAATACTT TGCAGATATG AAAAGACATC GTATCCAGTT CAAATATTCT GGTCCTGAAG 2040
ATGATGCTGC TATCAGCCTG GCCTTTAGCA AAAAACAGAT AGATGATCGA AAGGAATGGT 2100
TAACTAATTT CATGGAGGAT AGAAGACAAC GAAAGTTACT TGGGCTTCCT GAGGATTACT 2160
TGTATGGACA AACTACCACA TATCTGACAT ATAATGACTT CATCAACAAG GAACTTATCT 2220
TGTTCTCAAA TTCTGATAAC GAGAGATCTA TCCCTTCTAT GGTGGATGGT TTGAAACCAG 2280
GTCAGAGAAA GGTTTTGTTT ACTTGCTTCA AACGGAATGA CAAGCGAGAA GTAAAGGTTG 2340
CCCAATTAGC TGGATCAGTG GCTGAAATGT CTTCTTATCA TCATGGTGAG ATGTCACTAA 2400
TGATGACCAT TATCAATTTG GCTCAGAATT TTGTGGGTAG CAATAATCTA AACCTCTTGC 2460
AGCCCATTGG TCAGTTTGGT ACCAGGCTAC ATGGTGGCAA GGATTCTGCT AGTCCACGAT 2520
ACATCTTTAC AATGCTCAGC TCTTTGGCTC GATTGTTATT TCCACCAAAA GATGATCACA 2580
CGTTGAAGTT TTTATATGAT GACAACCAGC GTGTTGAGCC TGAATGGTAC ATTCCTATTA 2640
TTCCCATGGT GCTGATAAAT GGTGCTGAAG GAATCGGTAC TGGGTGGTCC TGCAAAATCC 2700
CCAACTTTGA TGTGCGTGAA ATTGTAAATA ACATCAGGCG TTTGATGGAT GGAGAAGAAC 2760
CTTTGCCAAT GCTTCCAAGT TACAAGAACT TCAAGGGTAC TATTGAAGAA CTGGCTCCAA 2820
ATCAATATGT GATTAGTGGT GAAGTAGCTA TTCTTAATTC TACAACCATT GAAATCTCAG 2880
AGCTTCCCGT CAGAACATGG ACCCAGACAT ACAAAGAACA AGTTCTAGAA CCCATGTTGA 2940
ATGGCACCGA GAAGACACCT CCTCTCATAA CAGACTATAG GGAATACCAT ACAGATACCA 3000
CTGTGAAATT TGTTGTGAAG ATGACTGAAG AAAAACTGGC AGAGGCAGAG AGAGTTGGAC 3060
TACACAAAGT CTTCAAACTC CAAACTAGTC TCACATGCAA CTCTATGGTG CTTTTTGACC 3120
ACGTAGGCTG TTTAAAGAAA TATGACACGG TGTTGGATAT TCTAAGAGAC TTTTTTGAAC 3180
TCAGACTTAA ATATTATGGA TTAAGAAAAG AATGGCTCCT AGGAATGCTT GGTGCTGAAT 3240
CTGCTAAACT GAATAATCAG GCTCGCTTTA TCTTAGAGAA AATAGATGGC AAAATAATCA 3300
TTGAAAATAA GCCTAAGAAA GAATTAATTA AAGTTCTGAT TCAGAGGGGA TATGATTCGG 3360

ATCCTGTGAA GGCCTGGAAA GAAGCCCAGC AAAAGGTTCC AGATGAAGAA GAAAATGAAG 3420 

AGAGTGACAA CGAAAAGGAA ACTGAAAAGA GTGACTCCGT AACAGATTCT GGACCAACCT 3480 

TCAACTATCT TCTTGATATG CCCCTTTGGT ATTTAACCAA GGAAAAGAAA GATGAACTCT 3540 

GCAGGCTAAG AAATGAAAAA GAACAAGAGC TGGACACATT AAAAAGAAAG AGTCCATCAG 3600 

ATTTGTGGAA AGAAGACTTG GCTACATTTA TTGAAGAATT GGAGGCTGTT GAAGCCAAGG 3660 

AAAAACAAGA TGAACAAGTC GGACTTCCTG GGAAAGGGGG GAAGGCCAAG GGGAAAAAAA 3720 

CACAAATGGC TGAAGTTTTG CCTTCTCCGC GTGGTCAAAG AGTCATTCCA CGAATAACCA 3780 

TAGAAATGAA AGCAGAGGCA GAAAAGAAAA ATAAAAAGAA AATTAAGAAT GAAAATACTG 3840 

AAGGAAGCCC TCAAGAAGAT GGTGTGGAAC TAGAAGGCCT AAAACAAAGA TTAGAAAAGA 3900 

AACAGAAAAG AGAACCAGGT ACAAAGACAA AGAAACAAAC TACATTGGCA TTTAAGCCAA 3960 

TCAAAAAAGG AAAGAAGAGA AATCCCTGGC CTGATTCAGA ATCAGATAGG AGCAGTGACG 4020 

AAAGTAATTT TGATGTCCCT CCACGAGAAA CAGAGCCACG GAGAGCAGCA ACAAAAACAA 4080 

AATTCACAAT GGATTTGGAT TCAGATGAAG ATTTCTCAGA TTTTGATGAA AAAACTGATG 4140 

ATGAAGATTT TGTCCCATCA GATGCTAGTC CACCTAAGAC CAAAACTTCC CCAAAACTTA 4200 

GTAACAAAGA ACTGAAACCA CAGAAAAGTG TCGTGTCAGA CCTTGAAGCT GATGATGTTA 4260 

AGGGCAGTGT ACCACTGTCT TCAAGCCCTC CTGCTACACA TTTCCCAGAT GAAACTGAAA 4320

TTACAAACCC AGTTCCTAAA AAGAATGTGA CAGTGAAGAA GACAGCAGCA AAAAGTCAGT 4380 

CTTCCACCTC CACTACCGGT GCCAAAAAAA GGGCTGCCCC AAAAGGAACT AAAAGGGATC 4440 

CAGCTTTGAA TTCTGGTGTC TCTCAAAAGC CTGATCCTGC CAAAACCAAG AATCGCCGCA 4500 

AAAGGAAGCC ATCCACTTCT GATGATTCTG ACTCTAATTT TGAGAAAATT GTTTCGAAAG 4560 

CAGTCACAAG CAAGAAATCC AAGGGGGAGA GTGATGACTT CCATATGGAC TTTGACTCAG 4620 

CTGTGGCTCC TCGGGCAAAA TCTGTACGGG CAAAGAAACC TATAAAGTAC CTGGAAGAGT 4680 

CAGATGAAGA TGATCTGTTT TAAAATGTGA GGCGATTATT TTAAGTAATT ATCTTACCAA 4740 

GCCCAAGACT GGTTTTAAAG TTACCTGAAG CTCTTAACTT CCTCCCCTCT GAATTTAGTT 4800 

TGGGGAAGGT GTTTTTAGTA CAAGACATCA AAGTGAAGTA AAGCCCAAGT GTTCTTTAGC 4860 

TTTTTATAAT ACTGTCTAAA TAGTGACCAT CTCATGGGCA TTGTTTTCTT CTCTGCTTTG 4920 

TCTGTGTTTT GAGTCTGCTT TCTTTTGTCT TTAAAACCTG ATTTTTAAGT TCTTCTGAAC 4980 

TGTAGAAATA GCTATCTGAT CACTTCAGCG TAAAGCAGTG TGTTTATTAA CCATCCACTA 5040 

AGCTAAAACT AGAGCAGTTT GATTTAAAAG TGTCACTCTT CCTCCTTTTC TACTTTCAGT 5100 

AGATATGAGA TAGAGCATAA TTATCTGTTT TATCTTAGTT TTATACATAA TTTACCATCA 5160 

GATAGAACTT TATGGTTCTA GTACAGATAC TCTACTACAC TCAGCCTCTT ATGTGCCAAG 5220 

TTTTTCTTTA AGCAATGAGA AATTGCTCAT GTTCTTCATC TTCTCAAATC ATCAGAGGCC 5280 

AAAGAAAAAC ACTTTGGCTG TGTCTATAAC TTGACACAGT CAATAGAATG AAGAAAATTA 5340 

GAGTAGTTAT GTGATTATTT CAGCTCTTGA CCTGTCCCCT CTGGCTGCCT CTGAGTCTGA 5400 

ATCTCCCAAA GAGAGAAACC AATTTCTAAG AGGACTGGAT TGCAGAAGAC TCGGGGACAA 5460 

CATTTGATCC AAGATCTTAA ATGTTATATT GATAACCATG CTCAGCAATG AGCTATTAGA 5520 

TTCATTTTGG GAAATCTCCA TAATTTCAAT TTGTAAACTT TGTTAAGACC TGTCTACATT 5580 

GTTATATGTG TGTGACTTGA GTAATGTTAT CAACGTTTTT GTAAATATTT ACTATGTTTT 5640 
TCTATTAGCT AAATTCCAAC AATTTTGTAC TTTAATAAAA TGTTCTAAAC ATTGC 

Seq ID NO: 178 Protein sequence: 
Protein Accession #: NP_001058.1 

1 11 21 31 41 51 

I I I I I I 

MEVSPLQPVN ENMQVNKIKK NEDAKKRLSV ERIYQKKTQL EHILLRPDTY IGSVELVTQQ 60

MWVYDEDVGI NYREVTFVPG LYKIFDEILV NAADNKQRDP KMSCIRVTID PENNLISIWN 120 

NGKGIPVVEH KVEKMYVPAL IFGQLLTSSN YDDDEKKVTG GRNGYGAKLC NIFSTKFTVE 180

TASREYKKMF KQTWMDNMGR AGEMELKPFN GEDYTCITFQ PDLSKFKMQS LDKDIVALMV 240 

RRAYDIAGST KDVKVFLNGN KLPVKGFRSY VDMYLKDKLD ETGNSLKVIH EQVNHRWEVC 300

LTMSEKGFQQ ISFVNSIATS KGGRHVDYVA DQIVTKLVDV VKKKNKGGVA VKAHQVKNHM 360 

WIFVNALIEN PTFDSQTKEN MTLQPKSFGS TCQLSEKFIK AAIGCGIVES ILNWVKFKAQ 420

VQLNKKCSAV KHNRIKGIPK LDDANDAGGR NSTECTLILT EGDSAKTLAV SGLGVVGRDK 480

YGVFPLRGKI LNVREASHKQ IMENAEINNI IKIVGLQYKK NYEDEDSLKT LRYGKIMIMT 540 

DQDQDGSHIK GLLINFIHHN WPSLLRHRFL EEFITPIVKV SKNKQEMAFY SLPEFEEWKS 600

STPNHKKWKV KYYKGLGTST SKEAKEYFAD MKRHRIQFKY SGPEDDAAIS LAFSKKQIDD 660 

RKEWLTNFME DRRQRKLLGL PEDYLYGQTT TYLTYNDFIN KELILFSNSD NERSIPSMVD 720 

GLKPGQRKVL FTCFKRNDKR EVKVAQLAGS VAEMSSYHHG EMSLMMTIIN LAQNFVGSNN 780 

LNLLQPIGQF GTRLHGGKDS ASPRYIFTML SSLARLLFPP KDDHTLKFLY DDNQRVEPEW 840 

YIPIIPMVLI NGAEGIGTGW SCKIPNFDVR EIVNNIRRLM DGEEPLPMLP SYKNFKGTIE 900 

ELAPNQYVIS GEVAILNSTT IEISELPVRT WTQTYKEQVL EPMLNGTEKT PPLITDYREY 960 

HTDTTVKFVV KMTEEKLAEA ERVGLHKVFK LQTSLTCNSM VLFDHVGCLK KYDTVLDILR 1020
DFFELRLKYY GLRKEWLLGM LGAESAKLNN QARFILEKID GKIIIENKPK KELIKVLIQR 1080

GYDSDPVKAW KEAQQKVPDE EENEESDNEK ETEKSDSVTD SGPTFNYLLD MPLWYLTKEK 1140 

KDELCRLRNE KEQELDTLKR KSPSDLWKED LATFIEELEA VEAKEKQDEQ VGLPGKGGKA 1200

KGKKTQMAEV LPSPRGQRVI PRITIEMKAE AEKKNKKKIK NENTEGSPQE DGVELEGLKQ 1260 

RLEKKQKREP GTKTKKQTTL AFKPIKKGKK RNPWPDSESD RSSDESNFDV PPRETEPRRA 1320 

ATKTKFTMDL DSDEDFSDFD EKTDDEDFVP SDASPPKTKT SPKLSNKELK PQKSVVSDLE 1380

ADDVKGSVPL SSSPPATHFP DETEITNPVP KKNVTVKKTA AKSQSSTSTT GAKKRAAPKG 1440 

TKRDPALNSG VSQKPDPAKT KNRRKRKPST SDDSDSNFEK IVSKAVTSKK SKGESDDFHM 1500 
DFDSAVAPRA KSVRAKKPIK YLEESDEDDL F 
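
The relationship between a stated coding range and the listed protein can be illustrated by translating the first codons of Seq ID NO: 177 (positions 108-140) and comparing them with the start of Seq ID NO: 178. The Python sketch below is illustrative and not part of the original listing; it assumes the standard genetic code, and the names `CODON_TABLE` and `translate` are hypothetical. Codons containing characters other than A, C, G or T are reported as X.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, with codons enumerated in TCAG order (TTT, TTC, TTA, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unreadable codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Positions 108-140 of Seq ID NO: 177, read from the listing.
cds_start = "ATGGAAGTGTCACCATTGCAGCCTGTAAATGAA"
peptide = translate(cds_start)
print(peptide)
```

Under these assumptions the translated fragment reads MEVSPLQPVNE, which matches the first eleven residues listed for Seq ID NO: 178.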



Seq ID NO: 179 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 148-7095 

1 11 21 31 41 51 

I I I I I I 

CACACATACG CACGCACGAT CTCACTTCGA TCTATACACT GGAGGATTAA AACAAACAAA 60 
CAAAAAAAAC ATTTCCTTCG CTCCCCCTCC CTCTCCACTC TGAGAAGCAG AGGAGCCGCA 120 
CGGCGAGGGG CCGCAGACCG TCTGGAAATG CGAATCCTAA AGCGTTTCCT CGCTTGCATT 180 
CAGCTCCTCT GTGTTTGCCG CCTGGATTGG GCTAATGGAT ACTACAGACA ACAGAGAAAA 240 
CTTGTTGAAG AGATTGGCTG GTCCTATACA GGAGCACTGA ATCAAAAAAA TTGGGGAAAG 300 
AAATATCCAA CATGTAATAG CCCAAAACAA TCTCCTATCA ATATTGATGA AGATCTTACA 360 
CAAGTAAATG TGAATCTTAA GAAACTTAAA TTTCAGGGTT GGGATAAAAC ATCATTGGAA 420 
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AACACATTCA TTCATAACAC TGGGAAAACA 
GTCAGCGGAG GAGTTTCAGA AATGGTGTTT 
AAATGCAATA TGTCATCTGA TGGATCAGAG 
GAGATGCAAA TCTACTGCTT TGATGCGGAC 
GGAAAAGGGA AGTTAAGAGC TTTATCCATT 
GATTTCAAAG CGATTATTGA TGGAGTCGAA 
TTAGATCCAT TCATACTGTT GAACCTTCTG 
AATGGCTCAT TGACATCTCC TCCCTGCACA 
ACAGTTAGCA TCTCTGAAAG CCAGTTGGCT 
TCTGGTTATG TCATGCTGAT GGACTACTTA 
TTCTCTAGAC AGGTGTTTTC CTCATACACT 
AGTTCAGAAC CAGAAAATGT TCAGGCTGAC 
TGGGAAAGAC CTCGAGTCGT TTATGATACC 
CAGTTGGATG GAGAGGACCA AACCAAGCAT 
GGTGCTATTC TCAATAATTT GCTACCCAAT 
TGCACTAATG GCTTATATGG AAAATACAGC 
AATCCTGAAC TTGATCTTTT CCCTGAATTA 
GAAGAGGGAA AAGACATTGA AGAAGGCGCT 
AACCAAATCA GGAAAAAGGA ACCCCAGATT 
ACGAAATACA ATGAAGCCAA GACTAACCGA 
AAGGGTGATG TTCCCAATAC ATCTTTAAAT 
ACAGAAAAAG ATATTTCCTT GACTTCTCAG 
GAAGGTACTT CAGCCTCTTT AAATGATGGC 
AACTTGTCGG GGACTGCAGA ATCCTTAAAT 
AGTTTATTGA CCAGTTTCAA GCTTGATACT 
GCAACTTCTG CTATCCCATT CATCTCTGAG 
GAAAACCCAG AGACAATAAC ATATGATGTC 
GAAGATTCAA CTTCATCAGG TTCAGAAGAA 
GTGTGGTTTC CTAGCTCTAC AGACATAACA 
AGCTTTCTCC AGACTAATTA CACTGAGATA 
TCCTTTTCTG CAGGCCCAGT GATGTCACAG 
CATTATTCTA CCTTTGCCTA CTTCCCAACT 
TCCAGACAAC AGGATTTGGT CTCCACGGTC 
GTATACAATG GTGAGACACC TCTTCAACCT 
ACCCCTTTGT TGCTTGACAA TCAGATCCTC 
TCGGCCTTGC ATGCTACGCC TGTATTTCCC 
TCTTCCTATG ATGGTGCACC TTTGCTTCCA 
TTTCGCCATC TGCATACAGT TTCTCAAATC 
GATAAGGTGC CCTTGCATGC TTCTCTGCCA 
AGCCTTGCTC AGTATTCTGA TGTGCTGTCC 
TTTGGTAGTG AATCTGGTGT TCTTTATAAA 
AGCAGTGATG CCATGATGCA TGCACGTTCT 
GATAATGAGG GCTCCCAACA CATCTTCACT 
GATTCTGTGG GTGTAACTTA TCAGGGTTCC 
CCTAAGTCTT CGTTAATAAC CCCAACTGCA 
GGTGATGGGG AATGGTCTGG AGCCTCTTCT 
GGGCTGACAG CCCTTAACAT TTCTTCACCT 
TCTGTGTTTG GTGATGATAA TAAGGCGCTT 
ACTGAACTGC AAATTCCTTC TTTCAATGAG 
CCCAACATGT ATGATAATGT AAATAAGTTG 
ATTTCTAGCA CCAAGGGCAT GTTTCCAGGG 
GATCATGAGA TTAGTCAAGT TCCAGAAAAT 
TCTCAAGCAT CTGGTGACAC TTCGCTTAAA 
TCCTCTGACC CTGCTTCTAG TGAAATGTTA 
ACCTCAGCTT CTTTTAGTAC TGAAGTATTG 
GACACCTTGC TTAAAACTGT TCTTCCAGCT 
CCCAAAGTTG ATAAAATTAG TTCTACAATG 
■ AGTGAAAACA TGCTGCACTC TACATCTGTA 
ATGCACTCTG CTTCACTTCA AGGTTTGACC 
GTTTTGTTAA AAAGTGAAAG TTCCCACCAA 
TTGTTCCAAA CGGCCAATTT GGAGATTAAC 
TTTGCTACAC CTGTTTTATC AATTGATGAA 
CATTCCGATG AAATTTTAAC CTCCACCAAA 
ATTCCAACAG TTGCTTCTGA TACATTTGTA 
GGGCATGTTG CCATTACAGC TGTTTCTCCC 
TTGCTGTTTC CTTCTAAGGC AACTTCTGAG 
TTAGTGGGTG GTGGTGAAGA TGGTGACACX 
AGAGGTAGTG ATGGCTTATC CATTCATAAG 
CAGGAAAAGG TAATGAATGA TTCAGACACC 
CCAATCTCAT ACTCACTATC TGAGAATTCT 
TCAGACAGTC AAACTGGTAT GGACAGAAGT 
TCCCAAAAGC ACAATGATGG AAAAGAGGAA 
CCTCTCAGCC CTGAATCTAA AGCATGGGCA 
GGGCAAGGTA CCTCAGATAG CCTTAATGAG 
GACACTAATG AAAAAGATGC TGATGGGATC 
GGATTCCCAC AGTCCCCAAC ATCATCTGTT 
TCAGAGGCAG AGGCCAGTAA TAGTAGCCAT 
GAATCCGAGA AGAAGGCAGT TATACCCCTT 
CTAGTGGTTC TTGTGGGTAT TCTCATCTAC 
TACTTAGAGG ACAGTACATC CCCTAGAGTT 
ATTTCAGATG ATGTCGGAGC AATTCCAATA 
CATGCAAGTA GTGGGTTTAC TGAAGAATTT 
CAGAGCTGTA CTGTTGACTT AGGTATTACA 
CACAAGAATC GATACATAAA TATCGTTGCC 
CTTGCTGAAA AGGATGGCAA ACTGACTGAT 
AACAGACCAA AAGCTTATAT TGCTGCCCAA 
TGGAGAATGA TATGGGAACA TAATGTGGAA 



GTGGAAATTA ATCTCACTAA TGACTACCGT 480 

AAAGCAAGCA AGATAACTTT TCACTGGGGA 540 

CATAGTTTAG AAGGACAAAA ATTTCCACTT 600 

CGATTTTCAA GTTTTGAGGA AGCAGTCAAA 660 

TTGTTTGAGG TTGGGACAGA AGAAAATTTG 720 

AGTGTTAGTC GTTTTGGGAA GCAGGCTGCT 780 

CCAAACTCAA CTGACAAGTA TTACATTTAC 840 

GACACAGTTG ACTGGATTGT TTTTAAAGAT 900 

GTTTTTTGTG AAGTTCTTAC AATGCAACAA 960 

CAAAACAATT TTCGAGAGCA ACAGTACAAG 1020 

GGAAAGGAAG AGATTCATGA AGCAGTTTGT 1080 

CCAGAGAATT ATACCAGCCT TCTTGTTACA 1140 

ATGATTGAGA AGTTTGCAGT TTTGTACCAG 1200 

GAATTTTTGA CAGATGGCTA TCAAGACTTG 1260 

ATGAGTTATG TTCTTCAGAT AGTAGCCATA 1320 

GACCAACTGA TTGTCGACAT GCCTACTGAT 1380 

ATTGGAACTG AAGAAATAAT CAAGGAGGAG 1440 

ATTGTGAATC CTGGTAGAGA CAGTGCTACA 1500 

TCTACCACAA CACACTACAA TCGCATAGGG 1560 

TCCCCAACAA GAGGAAGTGA ATTCTCTGGA 1620 

TCCACTTCCC AACCAGTCAC TAAATTAGCC 1680 

ACTGTGACTG AACTGCCACC TCACACTGTG 1740 

TCTAAAACTG TTCTTAGATC TCCACATATG 1800 

ACAGTTTCTA TAACAGAATA TGAGGAGGAG 1860 

GGAGCTGAAG ATTCTTCAGG CTCCAGTCCC 1920 

AACATATCCC AAGGGTATAT ATTTTCCTCC 1980 

CTTATACCAG AATCTGCTAG AAATGCTTCC 2040 

TCACTAAAGG ATCCTTCTAT GGAGGGAAAT 2100 

GCACAGCCCG ATGTTGGATC AGGCAGAGAG 2160 

CGTGTTGATG AATCTGAGAA GACAACCAAG 2220 

GGTCCCTCAG TTACAGATCT GGAAATGCCA 2280 

GAGGTAACAC CTCATGCTTT TACCCCATCC 2340 

AACGTGGTAT ACTCGCAGAC AACCCAACCG 2400 

TCCTACAGTA GTGAAGTCTT TCCTCTAGTC 2460 

AACACTACCC CTGCTGCTTC AAGTAGTGAT 2520 

AGTGTCGATG TGTCATTTGA ATCCATCCTG 2580 

TTTTCCTCTG CTTCCTTCAG TAGTGAATTG 2640 

CTTCCACAAG TTACTTCAGC TACCGAGAGT 2700 

GTGGCTGGGG GTGATTTGCT ATTAGAGCCC 2760 

ACTACTCATG CTGCTTCAGA GACGCTGGAA 2820 

ACGCTTATGT TTTCTCAAGT TGAACCACCC 2880 

TCAGGGCCTG AACCTTCTTA TGCCTTGTCT 2940 

GTTTCTTACA GTTCTGCAAT ACCTGTGCAT 3000

TTATTTAGCG GCCCTAGCCA TATACCAATA 3060 

TCATTACTGC AGCCTACTCA TGCCCTCTCT 3120 

GATAGTGAAT TTCTTTTACC TGACACAGAT 3180 

GTTTCTGTAG CTGAATTTAC ATATACAACA 3240 

TCTAAAAGTG AAATAATATA TGGAAATGAG 3300 

ATGGTTTACC CTTCTGAAAG CACAGTCATG 3360 

AATGCGTCTT TACAAGAAAC CTCTGTTTCC 3420 

TCCCTTGCTC ATACCACCAC TAAGGTTTTT 3480 

AACTTTTCAG TTCAACCTAC ACATACTGTC 3540 

CCTGTGCTTA GTGCAAACTC AGAGCCAGCA 3600

TCTCCTTCAA CTCAGCTCTT ATTTTATGAG 3660 

CTACAACCTT CCTTTCAGGC TTCTGATGTT 3720 

GTGCCCAGTG ATCCAATATT GGTTGAAACC 3780 

TTGCATCTCA TTGTATCAAA TTCTGCTTCA 3840 

CCAGTTTTTG ATGTGTCGCC TACTTCTCAT 3900 

ATTTCCTATG CAAGTGAGAA ATATGAACCA 3960 

GTGGTACCTT CTTTGTACAG TAATGATGAG 4020 

CAGGCCCATC CCCCAAAAGG AAGGCATGTA 4080 

CCATTAAATA CACTAATAAA TAAGCTTATA 4140 

AGTTCTGTTA CTGGTAAGGT ATTTGCTGGT 4200 

TCTACTGATC ATTCTGTTCC TATAGGAAAT 4260 

CACAGAGATG GTTCTGTAAC CTCAACAAAG 4320 

CTGAGTCATA GTGCCAAATC TGATGCCGGT 4380 

GATGATGATG GTGATGATGA TGATGATGAC 4440 

TGTATGTCAT GCTCATCCTA TAGAGAATCA 4500 

CACGAAAACA GTCTTATGGA TCAGAATAAT 4560 

GAAGAAGATA ATAGAGTCAC AAGTGTATCC 4620 

CCTGGTAAAT CACCATCAGC AAATGGGCTA 4680 

AATGACATTC AGACTGGTAG TGCTCTGCTT 4740 

GTTCTGACAA GTGATGAAGA AAGTGGATCA 4800 

AATGAGACTT CCACAGATTT CAGTTTTGCA 4860 

CTGGCAGCAG GTGACTCAGA AATAACTCCT 4920 

ACTAGCGAGA ACTCAGAAGT GTTCCACGTT 4980 

GAGTCTCGTA TTGGTCTAGC TGAGGGGTTG 5040 

GTGATCGTGT CAGCCCTGAC TTTTATCTGT 5100 

TGGAGGAAAT GCTTCCAGAC TGCACACTTT 5160 

ATATCCACAC CTCCAACACC TATCTTTCCA 5220 

AAGCACTTTC CAAAGCATGT TGCAGATTTA 5280 

GAGACACTGA AAGAGTTTTA CCAGGAAGTG 5340 

GCAGACAGCT CCAACCACCC AGACAACAAG 5400 

TATGATCATA GCAGGGTTAA GCTAGCACAG 5460 

TATATCAATG CCAATTATGT TGATGGCTAC 5520 

GGCCCACTGA AATCCACAGC TGAAGATTTC 5580 

GTTATTGTCA TGATAACAAA CCTCGTGGAG 5640 
AAAGGAAGGA GAAAATGTGA TCAGTACTGG CCTGCCGATG GGAGTGAGGA GTACGGGAAC 5700

TTTCTGGTCA CTCAGAAGAG TGTGCAAGTG CTTGCCTATT ATACTGTGAG GAATTTTACT 5760 

CTAAGAAACA CAAAAATAAA AAAGGGCTCC CAGAAAGGAA GACCCAGTGG ACGTGTGGTC 5820 

ACACAGTATC ACTACACGCA GTGGCCTGAC ATGGGAGTAC CAGAGTACTC CCTGCCAGTG 5880 

CTGACCTTTG TGAGAAAGGC AGCCTATGCC AAGCGCCATG CAGTGGGGCC TGTTGTCGTC 5940 

CACTGCAGTG CTGGAGTTGG AAGAACAGGC ACATATATTG TGCTAGACAG TATGTTGCAG 6000 

CAGATTCAAC ACGAAGGAAC TGTCAACATA TTTGGCTTCT TAAAACACAT CCGTTCACAA 6060 

AGAAATTATT TGGTACAAAC TGAGGAGCAA TATGTCTTCA TTCATGATAC ACTGGTTGAG 6120 

GCCATACTTA GTAAAGAAAC TGAGGTGCTG GACAGTCATA TTCATGCCTA TGTTAATGCA 6180 

CTCCTCATTC CTGGACCAGC AGGCAAAACA AAGCTAGAGA AACAATTCCA GCTCCTGAGC 6240 

CAGTCAAATA TACAGCAGAG TGACTATTCT GCAGCCCTAA AGCAATGCAA CAGGGAAAAG 6300 

AATCGAACTT CTTCTATCAT CCCTGTGGAA AGATCAAGGG TTGGCATTTC ATCCCTGAGT 6360 

GGAGAAGGCA CAGACTACAT CAATGCCTCC TATATCATGG GCTATTACCA GAGCAATGAA 6420 

TTCATCATTA CCCAGCACCC TCTCCTTCAT ACCATCAAGG ATTTCTGGAG GATGATATGG 6480 

GACCATAATG CCCAACTGGT GGTTATGATT CCTGATGGCC AAAACATGGC AGAAGATGAA 6540 

TTTGTTTACT GGCCAAATAA AGATGAGCCT ATAAATTGTG AGAGCTTTAA GGTCACTCTT 6600 

ATGGCTGAAG AACACAAATG TCTATCTAAT GAGGAAAAAC TTATAATTCA GGACTTTATC 6660 

TTAGAAGCTA CACAGGATGA TTATGTACTT GAAGTGAGGC ACTTTCAGTG TCCTAAATGG 6720 

CCAAATCCAG ATAGCCCCAT TAGTAAAACT TTTGAACTTA TAAGTGTTAT AAAAGAAGAA 6780 

GCTGCCAATA GGGATGGGCC TATGATTGTT CATGATGAGC ATGGAGGAGT GACGGCAGGA 6840 

ACTTTCTGTG CTCTGACAAC CCTTATGCAC CAACTAGAAA AAGAAAATTC CGTGGATGTT 6900 

TACCAGGTAG CCAAGATGAT CAATCTGATG AGGCCAGGAG TCTTTGCTGA CATTGAGCAG 6960 

TATCAGTTTC TCTACAAAGT GATCCTCAGC CTTGTGAGCA CAAGGCAGGA AGAGAATCCA 7020 

TCCACCTCTC TGGACAGTAA TGGTGCAGCA TTGCCTGATG GAAATATAGC TGAGAGCTTA 7080 

GAGTCTTTAG TTTAACACAG AAAGGGGTGG GGGGACTCAC ATCTGAGCAT TGTTTTCCTC 7140 

TTCCTAAAAT TAGGCAGGAA AATCAGTCTA GTTCTGTTAT CTGTTGATTT CCCATCACCT 7200 

GACAGTAACT TTCATGACAT AGGATTCTGC CGCCAAATTT ATATCATTAA CAATGTGTGC 7260 

CTTTTTGCAA GACTTGTAAT TTACTTATTA TGTTTGAACT AAAATGATTG AATTTTACAG 7320 

TATTTCTAAG AATGGAATTG TGGTATTTTT TTCTGTATTG ATTTTAACAG AAAATTTCAA 7380 

TTTATAGAGG TTAGGAATTC CAAACTACAG AAAATGTTTG TTTTTAGTGT CAAATTTTTA 7440 

GCTGTATTTG TAGCAATTAT CAGGTTTGCT AGAAATATAA CTTTTAATAC AGTAGCCTGT 7500 

AAATAAAACA CTCTTCCATA TGATATTCAA CATTTTACAA CTGCAGTATT CACCTAAAGT 7560 

AGAAATAATC TGTTACTTAT TGTAAATACT GCCCTAGTGT CTCCATGGAC CAAATTTATA 7620 

TTTATAATTG TAGATTTTTA TATTTTACTA CTGAGTCAAG TTTTCTAGTT CTGTGTAATT 7680 

GTTTAGTTTA ATGACGTAGT TCATTAGCTG GTCTTACTCT ACCAGTTTTC TGACATTGTA 7740 

TTGTGTTACC TAAGTCATTA ACTTTGTTTC AGCATGTAAT TTTAACTTTT GTGGAAAATA 7800 

GAAATACCTT CATTTTGAAA GAAGTTTTTA TGAGAATAAC ACCTTACCAA ACATTGTTCA 7860 

AATGGTTTTT ATCCAAGGAA TTGCAAAAAT AAATATAAAT ATTGCCATTA AAAAAAAAAA 7920 
AAAAAAAAAA AAAAAAAAAA AAAA 

Seq ID NO: 180 Protein sequence:
Protein Accession #: Eos sequence

1 11 21 31 41 51

| | | | | |

MRILKRFLAC IQLLCVCRLD WANGYYRQQR KLVEEIGWSY TGALNQKNWG KKYPTCNSPK 60 

QSPINIDEDL TQVNVNLKKL KFQGWDKTSL ENTPIHNTGK TVEINLTNDY RVSGGVSEMV 120 

FKASKITFHW GKCNMSSDGS EHSLEGQKFP LEMQIYCFDA DRFSSFEEAV KGKGKLRALS 180 

ILFEVGTEEN LDFKAIIDGV ESVSRFGKQA ALDPFILLNL LPNSTDKYYI YNGSLTSPPC 240

TDTVDWIVFK DTVSISESQL AVFCEVLTMQ QSGYVMLMDY LQNNFREQQY KFSRQVFSSY 300

TGKEEIHEAV CSSEPENVQA DPENYTSLLV TWERPRVVYD TMIEKFAVLY QQLDGEDQTK 360

HEFLTDGYQD LGAILNNLLP NMSYVLQIVA ICTNGLYGKY SDQLIVDMPT DNPELDLFPE 420 

LIGTEEIIKE EEEGKDIEEG AIVNPGRDSA TNQIRKKEPQ ISTTTHYNRI GTKYNEAKTN 480 

RSPTRGSEFS GKGDVPNTSL NSTSQPVTKL ATEKDISLTS QTVTELPPHT VEGTSASLND 540 

GSKTVLRSPH MNLSGTAESL NTVSITEYEE ESLLTSFKLD TGAEDSSGSS PATSAIPFIS 600 

ENISQGYIFS SENPETITYD VLIPESARNA SEDSTSSGSE ESLKDPSMEG NVWFPSSTDI 660 

TAQPDVGSGR ESFLQTNYTE IRVDESEKTT KSFSAGPVMS QGPSVTDLEM PHYSTFAYFP 720 

TEVTPHAFTP SSRQQDLVST VNVVYSQTTQ PVYNGETPLQ PSYSSEVFPL VTPLLLDNQI 780

LNTTPAASSS DSALHATPVF PSVDVSFESI LSSYDGAPLL PFSSASFSSE LFRHLHTVSQ 840 

ILPQVTSATE SDKVPLHASL PVAGGDLLLE PSLAQYSDVL STTHAASETL EFGSESGVLY 900 

KTLMFSQVEP PSSDAMMHAR SSGPEPSYAL SDNEGSQHIF TVSYSSAIPV HDSVGVTYQG 960

SLFSGPSHIP IPKSSLITPT ASLLQPTHAL SGDGEWSGAS SDSEFLLPDT DGLTALNISS 1020

PVSVAEFTYT TSVFGDDNKA LSKSEIIYGN ETELQIPSFN EMVYPSESTV MPNMYDNVNK 1080 

LNASLQETSV SISSTKGMFP GSLAHTTTKV FDHEISQVPE NNFSVQPTHT VSQASGDTSL 1140 

KPVLSANSEP ASSDPASSEM LSPSTQLLFY ETSASFSTEV LLQPSFQASD VDTLLKTVLP 1200 

AVPSDPILVE TPKVDKISST MLHLIVSNSA SSENMLHSTS VPVFDVSPTS HMHSASLQGL 1260 

TISYASEKYE PVLLKSESSH QVVPSLYSND ELFQTANLEI NQAHPPKGRH VFATPVLSID 1320

EPLNTLINKL IHSDEILTST KSSVTGKVFA GIPTVASDTF VSTDHSVPIG NGHVAITAVS 1380

PHRDGSVTST KLLFPSKATS ELSHSAKSDA GLVGGGEDGD TDDDGDDDDD DRGSDGLSIH 1440 

KCMSCSSYRE SQEKVMNDSD THENSLMDQN NPISYSLSEN SEEDNRVTSV SSDSQTGMDR 1500 

SPGKSPSANG LSQKHNDGKE ENDIQTGSAL LPLSPESKAW AVLTSDEESG SGQGTSDSLN 1560 

ENETSTDFSF ADTNEKDADG ILAAGDSEIT PGFPQSPTSS VTSENSEVFH VSEAEASNSS 1620 

HESRIGLAEG LESEKKAVIP LVIVSALTFI CLVVLVGILI YWRKCFQTAH FYLEDSTSPR 1680

VISTPPTPIF PISDDVGAIP IKHFPKHVAD LHASSGFTEE FETLKEFYQE VQSCTVDLGI 1740

TADSSNHPDN KHKNRYINIV AYDHSRVKLA QLAEKDGKLT DYINANYVDG YNRPKAYIAA 1800 

QGPLKSTAED FWRMIWEHNV EVIVMITNLV EKGRRKCDQY WPADGSEEYG NFLVTQKSVQ 1860 

VLAYYTVRNF TLRNTKIKKG SQKGRPSGRV VTQYHYTQWP DMGVPEYSLP VLTFVRKAAY 1920 

AKRHAVGPVV VHCSAGVGRT GTYIVLDSML QQIQHEGTVN IFGFLKHIRS QRNYLVQTEE 1980

QYVFIHDTLV EAILSKETEV LDSHIHAYVN ALLIPGPAGK TKLEKQFQLL SQSNIQQSDY 2040

SAALKQCNRE KNRTSSIIPV ERSRVGISSL SGEGTDYINA SYIMGYYQSN EFIITQHPLL 2100

HTIKDFWRMI WDHNAQLVVM IPDGQNMAED EFVYWPNKDE PINCESFKVT LMAEEHKCLS 2160

NEEKLIIQDF ILEATQDDYV LEVRHFQCPK WPNPDSPISK TFELISVIKE EAANRDGPMI 2220

VHDEHGGVTA GTFCALTTLM HQLEKENSVD VYQVAKMINL MRPGVFADIE QYQFLYKVIL 2280 
SLVSTRQEEN PSTSLDSNGA ALPDGNIAES LESLV 

Seq ID NO: 181 DNA sequence 

Nucleic Acid Accession #: Eos sequence
Coding sequence: 148-4518

1 11 21 31 41 51

| | | | | |

CACACATACG CACGCACGAT CTCACTTCGA TCTATACACT GGAGGATTAA AACAAACAAA 60

CAAAAAAAAC ATTTCCTTCG CTCCCCCTCC CTCTCCACTC TGAGAAGCAG AGGAGCCGCA 120

CGGCGAGGGG CCGCAGACCG TCTGGAAATG CGAATCCTAA AGCGTTTCCT CGCTTGCATT 180

CAGCTCCTCT GTGTTTGCCG CCTGGATTGG GCTAATGGAT ACTACAGACA ACAGAGAAAA 240

CTTGTTGAAG AGATTGGCTG GTCCTATACA GGAGCACTGA ATCAAAAAAA TTGGGGAAAG 300

AAATATCCAA CATGTAATAG CCCAAAACAA TCTCCTATCA ATATTGATGA AGATCTTACA 360 

CAAGTAAATG TGAATCTTAA GAAACTTAAA TTTCAGGGTT GGGATAAAAC ATCATTGGAA 420 

AACACATTCA TTCATAACAC TGGGAAAACA GTGGAAATTA ATCTCACTAA TGACTACCGT 480 

GTCAGCGGAG GAGTTTCAGA AATGGTGTTT AAAGCAAGCA AGATAACTTT TCACTGGGGA 540 

AAATGCAATA TGTCATCTGA TGGATCAGAG CATAGTTTAG AAGGACAAAA ATTTCCACTT 600 

GAGATGCAAA TCTACTGCTT TGATGCGGAC CGATTTTCAA GTTTTGAGGA AGCAGTCAAA 660 

GGAAAAGGGA AGTTAAGAGC TTTATCCATT TTGTTTGAGG TTGGGACAGA AGAAAATTTG 720 

GATTTCAAAG CGATTATTGA TGGAGTCGAA AGTGTTAGTC GTTTTGGGAA GCAGGCTGCT 780 

TTAGATCCAT TCATACTGTT GAACCTTCTG CCAAACTCAA CTGACAAGTA TTACATTTAC 840 

AATGGCTCAT TGACATCTCC TCCCTGCACA GACACAGTTG ACTGGATTGT TTTTAAAGAT 900 

ACAGTTAGCA TCTCTGAAAG CCAGTTGGCT GTTTTTTGTG AAGTTCTTAC AATGCAACAA 960 

TCTGGTTATG TCATGCTGAT GGACTACTTA CAAAACAATT TTCGAGAGCA ACAGTACAAG 1020 

TTCTCTAGAC AGGTGTTTTC CTCATACACT GGAAAGGAAG AGATTCATGA AGCAGTTTGT 1080 

AGTTCAGAAC CAGAAAATGT TCAGGCTGAC CCAGAGAATT ATACCAGCCT TCTTGTTACA 1140 

TGGGAAAGAC CTCGAGTCGT TTATGATACC ATGATTGAGA AGTTTGCAGT TTTGTACCAG 1200 

CAGTTGGATG GAGAGGACCA AACCAAGCAT GAATTTTTGA CAGATGGCTA TCAAGACTTG 1260 

GGTGCTATTC TCAATAATTT GCTACCCAAT ATGAGTTATG TTCTTCAGAT AGTAGCCATA 1320 

TGCACTAATG GCTTATATGG AAAATACAGC GACCAACTGA TTGTCGACAT GCCTACTGAT 1380 

AATCCTGAAC TTGATCTTTT CCCTGAATTA ATTGGAACTG AAGAAATAAT CAAGGAGGAG 1440 

GAAGAGGGAA AAGACATTGA AGAAGGCGCT ATTGTGAATC CTGGTAGAGA CAGTGCTACA 1500 

AACCAAATCA GGAAAAAGGA ACCCCAGATT TCTACCACAA CACACTACAA TCGCATAGGG 1560 

ACGAAATACA ATGAAGCCAA GACTAACCGA TCCCCAACAA GAGGAAGTGA ATTCTCTGGA 1620 

AAGGGTGATG TTCCCAATAC ATCTTTAAAT TCCACTTCCC AACCAGTCAC TAAATTAGCC 1680

ACAGAAAAAG ATATTTCCTT GACTTCTCAG ACTGTGACTG AACTGCCACC TCACACTGTG 1740 

GAAGGTACTT CAGCCTCTTT AAATGATGGC TCTAAAACTG TTCTTAGATC TCCACATATG 1800

AACTTGTCGG GGACTGCAGA ATCCTTAAAT ACAGTTTCTA TAACAGAATA TGAGGAGGAG 1860 

AGTTTATTGA CCAGTTTCAA GCTTGATACT GGAGCTGAAG ATTCTTCAGG CTCCAGTCCC 1920 

GCAACTTCTG CTATCCCATT CATCTCTGAG AACATATCCC AAGGGTATAT ATTTTCCTCC 1980 

GAAAACCCAG AGACAATAAC ATATGATGTC CTTATACCAG AATCTGCTAG AAATGCTTCC 2040 

GAAGATTCAA CTTCATCAGG TTCAGAAGAA TCACTAAAGG ATCCTTCTAT GGAGGGAAAT 2100 

GTGTGGTTTC CTAGCTCTAC AGACATAACA GCACAGCCCG ATGTTGGATC AGGCAGAGAG 2160 

AGCTTTCTCC AGACTAATTA CACTGAGATA CGTGTTGATG AATCTGAGAA GACAACCAAG 2220 

TCCTTTTCTG CAGGCCCAGT GATGTCACAG GGTCCCTCAG TTACAGATCT GGAAATGCCA 2280 

CATTATTCTA CCTTTGCCTA CTTCCCAACT GAGGTAACAC CTCATGCTTT TACCCCATCC 2340 

TCCAGACAAC AGGATTTGGT CTCCACGGTC AACGTGGTAT ACTCGCAGAC AACCCAACCG 2400 

GTATACAATG CAGAGGCCAG TAATAGTAGC CATGAGTCTC GTATTGGTCT AGCTGAGGGG 2460 

TTGGAATCCG AGAAGAAGGC AGTTATACCC CTTGTGATCG TGTCAGCCCT GACTTTTATC 2520 

TGTCTAGTGG TTCTTGTGGG TATTCTCATC TACTGGAGGA AATGCTTCCA GACTGCACAC 2580 

TTTTACTTAG AGGACAGTAC ATCCCCTAGA GTTATATCCA CACCTCCAAC ACCTATCTTT 2640 

CCAATTTCAG ATGATGTCGG AGCAATTCCA ATAAAGCACT TTCCAAAGCA TGTTGCAGAT 2700 

TTACATGCAA GTAGTGGGTT TACTGAAGAA TTTGAGACAC TGAAAGAGTT TTACCAGGAA 2760 

GTGCAGAGCT GTACTGTTGA CTTAGGTATT ACAGCAGACA GCTCCAACCA CCCAGACAAC 2820 

AAGCACAAGA ATCGATACAT AAATATCGTT GCCTATGATC ATAGCAGGGT TAAGCTAGCA 2880 

CAGCTTGCTG AAAAGGATGG CAAACTGACT GATTATATCA ATGCCAATTA TGTTGATGGC 2940 

TACAACAGAC CAAAAGCTTA TATTGCTGCC CAAGGCCCAC TGAAATCCAC AGCTGAAGAT 3000 

TTCTGGAGAA TGATATGGGA ACATAATGTG GAAGTTATTG TCATGATAAC AAACCTCGTG 3060 

GAGAAAGGAA GGAGAAAATG TGATCAGTAC TGGCCTGCCG ATGGGAGTGA GGAGTACGGG 3120 

AACTTTCTGG TCACTCAGAA GAGTGTGCAA GTGCTTGCCT ATTATACTGT GAGGAATTTT 3180 

ACTCTAAGAA ACACAAAAAT AAAAAAGGGC TCCCAGAAAG GAAGACCCAG TGGACGTGTG 3240 

GTCACACAGT ATCACTACAC GCAGTGGCCT GACATGGGAG TACCAGAGTA CTCCCTGCCA 3300 

GTGCTGACCT TTGTGAGAAA GGCAGCCTAT GCCAAGCGCC ATGCAGTGGG GCCTGTTGTC 3360 

GTCCACTGCA GTGCTGGAGT TGGAAGAACA GGCACATATA TTGTGCTAGA CAGTATGTTG 3420

CAGCAGATTC AACACGAAGG AACTGTCAAC ATATTTGGCT TCTTAAAACA CATCCGTTCA 3480 

CAAAGAAATT ATTTGGTACA AACTGAGGAG CAATATGTCT TCATTCATGA TACACTGGTT 3540 

GAGGCCATAC TTAGTAAAGA AACTGAGGTG CTGGACAGTC ATATTCATGC CTATGTTAAT 3600 

GCACTCCTCA TTCCTGGACC AGCAGGCAAA ACAAAGCTAG AGAAACAATT CCAGCTCCTG 3660 

AGCCAGTCAA ATATACAGCA GAGTGACTAT TCTGCAGCCC TAAAGCAATG CAACAGGGAA 3720 

AAGAATCGAA CTTCTTCTAT CATCCCTGTG GAAAGATCAA GGGTTGGCAT TTCATCCCTG 3780 

AGTGGAGAAG GCACAGACTA CATCAATGCC TCCTATATCA TGGGCTATTA CCAGAGCAAT 3840 

GAATTCATCA TTACCCAGCA CCCTCTCCTT CATACCATCA AGGATTTCTG GAGGATGATA 3900 

TGGGACCATA ATGCCCAACT GGTGGTTATG ATTCCTGATG GCCAAAACAT GGCAGAAGAT 3960 

GAATTTGTTT ACTGGCCAAA TAAAGATGAG CCTATAAATT GTGAGAGCTT TAAGGTCACT 4020 

CTTATGGCTG AAGAACACAA ATGTCTATCT AATGAGGAAA AACTTATAAT TCAGGACTTT 4080 

ATCTTAGAAG CTACACAGGA TGATTATGTA CTTGAAGTGA GGCACTTTCA GTGTCCTAAA 4140

TGGCCAAATC CAGATAGCCC CATTAGTAAA ACTTTTGAAC TTATAAGTGT TATAAAAGAA 4200 

GAAGCTGCCA ATAGGGATGG GCCTATGATT GTTCATGATG AGCATGGAGG AGTGACGGCA 4260 

GGAACTTTCT GTGCTCTGAC AACCCTTATG CACCAACTAG AAAAAGAAAA TTCCGTGGAT 4320 

GTTTACCAGG TAGCCAAGAT GATCAATCTG ATGAGGCCAG GAGTCTTTGC TGACATTGAG 4380 

CAGTATCAGT TTCTCTACAA AGTGATCCTC AGCCTTGTGA GCACAAGGCA GGAAGAGAAT 4440 

CCATCCACCT CTCTGGACAG TAATGGTGCA GCATTGCCTG ATGGAAATAT AGCTGAGAGC 4500 

TTAGAGTCTT TAGTTTAACA CAGAAAGGGG TGGGGGGACT CACATCTGAG CATTGTTTTC 4560 

CTCTTCCTAA AATTAGGCAG GAAAATCAGT CTAGTTCTGT TATCTGTTGA TTTCCCATCA 4620 

CCTGACAGTA ACTTTCATGA CATAGGATTC TGCCGCCAAA TTTATATCAT TAACAATGTG 4680 

TGCCTTTTTG CAAGACTTGT AATTTACTTA TTATGTTTGA ACTAAAATGA TTGAATTTTA 4740 

CAGTATTTCT AAGAATGGAA TTGTGGTATT TTTTTCTGTA TTGATTTTAA CAGAAAATTT 4800 

CAATTTATAG AGGTTAGGAA TTCCAAACTA CAGAAAATGT TTGTTTTTAG TGTCAAATTT 4860 

TTAGCTGTAT TTGTAGCAAT TATCAGGTTT GCTAGAAATA TAACTTTTAA TACAGTAGCC 4920 

TGTAAATAAA ACACTCTTCC ATATGATATT CAACATTTTA CAACTGCAGT ATTCACCTAA 4980 
AGTAGAAATA ATCTGTTACT TATTGTAAAT ACTGCCCTAG TCTCTCCATG GACCAAATTT 5040

ATATTTATAA TTGTAGATTT TTATATTTTA CTACTGAGTC AAGTTTTCTA GTTCTGTGTA 5100

ATTGTTTAGT TTAATGACGT AGTTCATTAG CTGGTCTTAC TCTACCAGTT TTCTGACATT 5160

GTATTGTGTT ACCTAAGTCA TTAACTTTGT TTCAGCATGT AATTTTAACT TTTGTGGAAA 5220

ATAGAAATAC CTTCATTTTG AAAGAAGTTT TTATGAGAAT AACACCTTAC CAAACATTGT 5280

TCAAATGGTT TTTATCCAAG GAATTGCAAA AATAAATATA AATATTGCCA TTAAAAAAAA 5340
AAAAAAAAAA AAAAAAAAAA AAAAAAA

Seq ID NO: 182 Protein sequence:
Protein Accession #: Eos sequence

1 11 21 31 41 51

| | | | | |

MRILKRFLAC IQLLCVCRLD WANGYYRQQR KLVEEIGWSY TGALNQKNWG KKYPTCNSPK 60

QSPINIDEDL TQVNVNLKKL KFQGWDKTSL ENTFIHNTGK TVEINLTNDY RVSGGVSEMV 120

FKASKITFHW GKCNMSSDGS EHSLEGQKFP LEMQIYCFDA DRFSSFEEAV KGKGKLRALS 180

ILFEVGTEEN LDFKAIIDGV ESVSRFGKQA ALDPFILLNL LPNSTDKYYI YNGSLTSPPC 240

TDTVDWIVFK DTVSISESQL AVFCEVLTMQ QSGYVMLMDY LQNNFREQQY KFSRQVFSSY 300

TGKEEIHEAV CSSEPENVQA DPENYTSLLV TWERPRVVYD TMIEKFAVLY QQLDGEDQTK 360

HEFLTDGYQD LGAILNNLLP NMSYVLQIVA ICTNGLYGKY SDQLIVDMPT DNPELDLFPE 420

LIGTEEIIKE EEEGKDIEEG AIVNPGRDSA TNQIRKKEPQ ISTTTHYNRI GTKYNEAKTN 480

RSPTRGSEFS GKGDVPNTSL NSTSQPVTKL ATEKDISLTS QTVTELPPHT VEGTSASLND 540

GSKTVLRSPH MNLSGTAESL NTVSITEYEE ESLLTSFKLD TGAEDSSGSS PATSAIPFIS 600

ENISQGYIFS SENPETITYD VLIPESARNA SEDSTSSGSE ESLKDPSMEG NVWFPSSTDI 660

TAQPDVGSGR ESFLQTNYTE IRVDESEKTT KSFSAGPVMS QGPSVTDLEM PHYSTFAYFP 720

TEVTPHAFTP SSRQQDLVST VNVVYSQTTQ PVYNAEASNS SHESRIGLAE GLESEKKAVI 780

PLVIVSALTF ICLVVLVGIL IYWRKCFQTA HFYLEDSTSP RVISTPPTPI FPISDDVGAI 840

PIKHFPKHVA DLHASSGFTE EFETLKEFYQ EVQSCTVDLG ITADSSNHPD NKHKNRYINI 900

VAYDHSRVKL AQLAEKDGKL TDYINANYVD GYNRPKAYIA AQGPLKSTAE DFWRMIWEHN 960

VEVIVMITNL VEKGRRKCDQ YWPADGSEEY GNFLVTQKSV QVLAYYTVRN FTLRNTKIKK 1020

GSQKGRPSGR VVTQYHYTQW PDMGVPEYSL PVLTFVRKAA YAKRHAVGPV VVHCSAGVGR 1080

TGTYIVLDSM LQQIQHEGTV NIFGFLKHIR SQRNYLVQTE EQYVFIHDTL VEAILSKETE 1140

VLDSHIHAYV NALLIPGPAG KTKLEKQFQL LSQSNIQQSD YSAALKQCNR EKNRTSSIIP 1200

VERSRVGISS LSGEGTDYIN ASYIMGYYQS NEFIITQHPL LHTIKDFWRM IWDHNAQLVV 1260

MIPDGQNMAE DEFVYWPNKD EPINCESFKV TLMAEEHKCL SNEEKLIIQD FILEATQDDY 1320

VLEVRHFQCP KWPNPDSPIS KTFELISVIK EEAANRDGPM IVHDEHGGVT AGTFCALTTL 1380

MHQLEKENSV DVYQVAKMIN LMRPGVFADI EQYQFLYKVI LSLVSTRQEE NPSTSLDSNG 1440
AALPDGNIAE SLESLV



Seq ID NO: 183 DNA sequence 

Nucleic Acid Accession #: EOS sequence 

Coding sequence: 148-4494 



1 11 21 31 41 51

| | | | | |

CACACATACG CACGCACGAT CTCACTTCGA TCTATACACT GGAGGATTAA AACAAACAAA 60

CAAAAAAAAC ATTTCCTTCG CTCCCCCTCC CTCTCCACTC TGAGAAGCAG AGGAGCCGCA 120

CGGCGAGGGG CCGCAGACCG TCTGGAAATG CGAATCCTAA AGCGTTTCCT CGCTTGCATT 180

CAGCTCCTCT GTGTTTGCCG CCTGGATTGG GCTAATGGAT ACTACAGACA ACAGAGAAAA 240

CTTGTTGAAG AGATTGGCTG GTCCTATACA GGAGCACTGA ATCAAAAAAA TTGGGGAAAG 300

AAATATCCAA CATGTAATAG CCCAAAACAA TCTCCTATCA ATATTGATGA AGATCTTACA 360

CAAGTAAATG TGAATCTTAA GAAACTTAAA TTTCAGGGTT GGGATAAAAC ATCATTGGAA 420

AACACATTCA TTCATAACAC TGGGAAAACA GTGGAAATTA ATCTCACTAA TGACTACCGT 480

GTCAGCGGAG GAGTTTCAGA AATGGTGTTT AAAGCAAGCA AGATAACTTT TCACTGGGGA 540

AAATGCAATA TGTCATCTGA TGGATCAGAG CATAGTTTAG AAGGACAAAA ATTTCCACTT 600

GAGATGCAAA TCTACTGCTT TGATGCAGAC CGATTTTCAA GTTTTGAGGA AGCAGTCAAA 660

GGAAAAGGGA AGTTAAGAGC TTTATCCATT TTGTTTGAGG TTGGGACAGA AGAAAATTTG 720

GATTTCAAAG CGATTATTGA TGGAGTCGAA AGTGTTAGTC GTTTTGGGAA GCAGGCTGCT 780

TTAGATCCAT TCATACTGTT GAACCTTCTG CCAAACTCAA CTGACAAGTA TTACATTTAC 840

AATGGCTCAT TGACATCTCC TCCCTGCACA GACACAGTTG ACTGGATTGT TTTTAAAGAT 900

ACAGTTAGCA TCTCTGAAAG CCAGTTGGCT GTTTTTTGTG AAGTTCTTAC AATGCAACAA 960

TCTGGTTATG TCATGCTGAT GGACTACTTA CAAAACAATT TTCGAGAGCA ACAGTACAAG 1020

TTCTCTAGAC AGGTGTTTTC CTCATACACT GGAAAGGAAG AGATTCATGA AGCAGTTTGT 1080

AGTTCAGAAC CAGAAAATGT TCAGGCTGAC CCAGAGAATT ATACCAGCCT TCTTGTTACA 1140

TGGGAAAGAC CTCGAGTCGT TTATGATACC ATGATTGAGA AGTTTGCAGT TTTGTACCAG 1200

CAGTTGGATG GAGAGGACCA AACCAAGCAT GAATTTTTGA CAGATGGCTA TCAAGACTTG 1260

GGTGCTATTC TCAATAATTT GCTACCCAAT ATGAGTTATG TTCTTCAGAT AGTAGCCATA 1320

TGCACTAATG GCTTATATGG AAAATACAGC GACCAACTGA TTGTCGACAT GCCTACTGAT 1380

AATCCTGAAC TTGATCTTTT CCCTGAATTA ATTGGAACTG AAGAAATAAT CAAGGAGGAG 1440

GAAGAGGGAA AAGACATTGA AGAAGGCGCT ATTGTGAATC CTGGTAGAGA CAGTGCTACA 1500

AACCAAATCA GGAAAAAGGA ACCCCAGATT TCTACCACAA CACACTACAA TCGCATAGGG 1560

ACGAAATACA ATGAAGCCAA GACTAACCGA TCCCCAACAA GAGGAAGTGA ATTCTCTGGA 1620

AAGGGTGATG TTCCCAATAC ATCTTTAAAT TCCACTTCCC AACCAGTCAC TAAATTAGCC 1680

ACAGAAAAAG ATATTTCCTT GACTTCTCAG ACTGTGACTG AACTGCCACC TCACACTGTG 1740

GAAGGTACTT CAGCCTCTTT AAATGATGGC TCTAAAACTG TTCTTAGATC TCCACATATG 1800

AACTTGTCGG GGACTGCAGA ATCCTTAAAT ACAGTTTCTA TAACAGAATA TGAGGAGGAG 1860

AGTTTATTGA CCAGTTTGAA GCTTGATACT GGAGCTGAAG ATTCTTCAGG CTCCAGTCCC 1920

GCAACTTCTG CTATCCCATT CATCTCTGAG AACATATCCC AAGGGTATAT ATTTTCCTCC 1980

GAAAACCCAG AGACAATAAC ATATGATGTC CTTATACCAG AATCTGCTAG AAATGCTTCC 2040

GAAGATTCAA CTTCATCAGG TTCAGAAGAA TCACTAAAGG ATCCTTCTAT GGAGGGAAAT 2100

GTGTGGTTTC CTAGCTCTAC AGACATAACA GCACAGCCCG ATGTTGGATC AGGCAGAGAG 2160

AGCTTTCTCC AGACTAATTA CACTGAGATA CGTGTTGATG AATCTGAGAA GACAACCAAG 2220

TCCTTTTCTG CAGGCCCAGT GATGTCACAG GGTCCCTCAG TTACAGATCT GGAAATGCCA 2280

CATTATTCTA CCTTTGCCTA CTTCCCAACT GAGGTAACAC CTCATGCTTT TACCCCATCC 2340

TCCAGACAAC AGGATTTGGT CTCCACGGTC AACGTGGTAT ACTCGCAGAC AACCCAACCG 2400

GTATACAATG AGGCCAGTAA TAGTAGCCAT GAGTCTCGTA TTGGTCTAGC TGAGGGGTTG 2460

GAATCCGAGA AGAAGGCAGT TATACCCCTT GTGATCGTGT CAGCCCTGAC TTTTATCTGT 2520 

CTAGTGGTTC TTGTGGGTAT TCTCATCTAC TGGAGGAAAT GCTTCCAGAC TGCACACTTT 2580 

TACTTAGAGG ACAGTACATC CCCTAGAGTT ATATCCACAC CTCCAACACC TATCTTTCCA 2640 

ATTTCAGATG ATGTCGGAGC AATTCCAATA AAGCACTTTC CAAAGCATGT TGCAGATTTA 2700 

CATGCAAGTA GTGGGTTTAC TGAAGAATTT GAGGAAGTGC AGAGCTGTAC TGTTGACTTA 2760 

GGTATTACAG CAGACAGCTC CAACCACCCA GACAACAAGC ACAAGAATCG ATACATAAAT 2820 

ATCGTTGCCT ATGATCATAG CAGGGTTAAG CTAGCACAGC TTGCTGAAAA GGATGGCAAA 2880 

CTGACTGATT ATATCAATGC CAATTATGTT GATGGCTACA ACAGACCAAA AGCTTATATT 2940 

GCTGCCCAAG GCCCACTGAA ATCCACAGCT GAAGATTTCT GGAGAATGAT ATGGGAACAT 3000 

AATGTGGAAG TTATTGTCAT GATAACAAAC CTCGTGGAGA AAGGAAGGAG AAAATGTGAT 3060 

CAGTACTGGC CTGCCGATGG GAGTGAGGAG TACGGGAACT TTCTGGTCAC TCAGAAGAGT 3120 

GTGCAAGTGC TTGCCTATTA TACTGTGAGG AATTTTACTC TAAGAAACAC AAAAATAAAA 3180 

AAGGGCTCCC AGAAAGGAAG ACCCAGTGGA CGTGTGGTCA CACAGTATCA CTACACGCAG 3240 

TGGCCTGACA TGGGAGTACC AGAGTACTCC CTGCCAGTGC TGACCTTTGT GAGAAAGGCA 3300 

GCCTATGCCA AGCGCCATGC AGTGGGGCCT GTTGTCGTCC ACTGCAGTGC TGGAGTTGGA 3360 

AGAACAGGCA CATATATTGT GCTAGACAGT ATGTTGCAGC AGATTCAACA CGAAGGAACT 3420 

GTCAACATAT TTGGCTTCTT AAAACACATC CGTTCACAAA GAAATTATTT GGTACAAACT 3480 

GAGGAGCAAT ATGTCTTCAT TCATGATACA CTGGTTGAGG CCATACTTAG TAAAGAAACT 3540 

GAGGTGCTGG ACAGTCATAT TCATGCCTAT GTTAATGCAC TCCTCATTCC TGGACCAGCA 3600 

GGCAAAACAA AGCTAGAGAA ACAATTCCAG CTCCTGAGCC AGTCAAATAT ACAGCAGAGT 3660 

GACTATTCTG CAGCCCTAAA GCAATGCAAC AGGGAAAAGA ATCGAACTTC TTCTATCATC 3720 

CCTGTGGAAA GATCAAGGGT TGGCATTTCA TCCCTGAGTG GAGAAGGCAC AGACTACATC 3780 

AATGCCTCCT ATATCATGGG CTATTACCAG AGCAATGAAT TCATCATTAC CCAGCACCCT 3840 

CTCCTTCATA CCATCAAGGA TTTCTGGAGG ATGATATGGG ACCATAATGC CCAACTGGTG 3900

GTTATGATTC CTGATGGCCA AAACATGGCA GAAGATGAAT TTGTTTACTG GCCAAATAAA 3960

GATGAGCCTA TAAATTGTGA GAGCTTTAAG GTCACTCTTA TGGCTGAAGA ACACAAATGT 4020 

CTATCTAATG AGGAAAAACT TATAATTCAG GACTTTATCT TAGAAGCTAC ACAGGATGAT 4080 

TATGTACTTG AAGTGAGGCA CTTTCAGTGT CCTAAATGGC CAAATCCAGA TAGCCCCATT 4140 

AGTAAAACTT TTGAACTTAT AAGTGTTATA AAAGAAGAAG CTGCCAATAG GGATGGGCCT 4200 

ATGATTGTTC ATGATGAGCA TGGAGGAGTG ACGGCAGGAA CTTTCTGTGC TCTGACAACC 4260 

CTTATGCACC AACTAGAAAA AGAAAATTCC GTGGATGTTT ACCAGGTAGC CAAGATGATC 4320 

AATCTGATGA GGCCAGGAGT CTTTGCTGAC ATTGAGCAGT ATCAGTTTCT CTACAAAGTG 4380 

ATCCTCAGCC TTGTGAGCAC AAGGCAGGAA GAGAATCCAT CCACCTCTCT GGACAGTAAT 4440 

GGTGCAGCAT TGCCTGATGG AAATATAGCT GAGAGCTTAG AGTCTTTAGT TTAACACAGA 4500 

AAGGGGTGGG GGGACTCACA TCTGAGCATT GTTTTCCTCT TCCTAAAATT AGGCAGGAAA 4560 

ATCAGTCTAG TTCTGTTATC TGTTGATTTC CCATCACCTG ACAGTAACTT TCATGACATA 4620 

GGATTCTGCC GCCAAATTTA TATCATTAAC AATGTGTGCC TTTTTGCAAG ACTTGTAATT 4680 

TACTTATTAT GTTTGAACTA AAATGATTGA ATTTTACAGT ATTTCTAAGA ATGGAATTGT 4740 

GGTATTTTTT TCTGTATTGA TTTTAACAGA AAATTTCAAT TTATAGAGGT TAGGAATTCC 4800 

AAACTACAGA AAATGTTTGT TTTTAGTGTC AAATTTTTAG CTGTATTTGT AGCAATTATC 4860 

AGGTTTGCTA GAAATATAAC TTTTAATACA GTAGCCTGTA AATAAAACAC TCTTCCATAT 4920 

GATATTCAAC ATTTTACAAC TGCAGTATTC ACCTAAAGTA GAAATAATCT GTTACTTATT 4980 

GTAAATACTG CCCTAGTGTC TCCATGGACC AAATTTATAT TTATAATTGT AGATTTTTAT 5040 

ATTTTACTAC TGAGTCAAGT TTTCTAGTTC TGTGTAATTG TTTAGTTTAA TGACGTAGTT 5100 

CATTAGCTGG TCTTACTCTA CCAGTTTTCT GACATTGTAT TGTGTTACCT AAGTCATTAA 5160 

CTTTGTTTCA GCATGTAATT TTAACTTTTG TGGAAAATAG AAATACCTTC ATTTTGAAAG 5220 

AAGTTTTTAT GAGAATAACA CCTTACCAAA CATTGTTCAA ATGGTTTTTA TCCAAGGAAT 5280 

TGCAAAAATA AATATAAATA TTGCCATTAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 5340 
AAA 

Seq ID NO: 184 Protein sequence: 
Protein Accession #: EOS sequence

1 11 21 31 41 51

| | | | | |

MRILKRFLAC IQLLCVCRLD WANGYYRQQR KLVEEIGWSY TGALNQKNWG KKYPTCNSPK 60 

QSPINIDEDL TQVNVNLKKL KFQGWDKTSL ENTFIHNTGK TVEINLTNDY RVSGGVSEMV 120 

FKASKITFHW GKCNMSSDGS EHSLEGQKFP LEMQIYCFDA DRFSSFEEAV KGKGKLRALS 180

ILFEVGTEEN LDFKAIIDGV ESVSRFGKQA ALDPFILLNL LPNSTDKYYI YNGSLTSPPC 240

TDTVDWIVFK DTVSISESQL AVFCEVLTMQ QSGYVMLMDY LQNNFREQQY KFSRQVFSSY 300 

TGKEEIHEAV CSSEPENVQA DPENYTSLLV TWERPRVVYD TMIEKFAVLY QQLDGEDQTK 360

HEFLTDGYQD LGAILNNLLP NMSYVLQIVA ICTNGLYGKY SDQLIVDMPT DNPELDLFPE 420 

LIGTEEIIKE EEEGKDIEEG AIVNPGRDSA TNQIRKKEPQ ISTTTHYNRI GTKYNEAKTN 480 

RSPTRGSEFS GKGDVPNTSL NSTSQPVTKL ATEKDISLTS QTVTELPPHT VEGTSASLND 540 

GSKTVLRSPH MNLSGTAESL NTVSITEYEE ESLLTSFKLD TGAEDSSGSS PATSAIPFIS 600

ENISQGYIFS SENPETITYD VLIPESARNA SEDSTSSGSE ESLKDPSMEG NVWFPSSTDI 660

TAQPDVGSGR ESFLQTNYTE IRVDESEKTT KSFSAGPVMS QGPSVTDLEM PHYSTFAYFP 720

TEVTPHAFTP SSRQQDLVST VNVVYSQTTQ PVYNEASNSS HESRIGLAEG LESEKKAVIP 780

LVIVSALTFI CLVVLVGILI YWRKCFQTAH FYLEDSTSPR VISTPPTPIF PISDDVGAIP 840

IKHFPKHVAD LHASSGFTEE FEEVQSCTVD LGITADSSNH PDNKHKNRYI NIVAYDHSRV 900 

KLAQLAEKDG KLTDYINANY VDGYNRPKAY IAAQGPLKST AEDFWRMIWE HNVEVIVMIT 960 

NLVEKGRRKC DQYWPADGSE EYGNFLVTQK SVQVLAYYTV RNFTLRNTKI KKGSQKGRPS 1020 

GRVVTQYHYT QWPDMGVPEY SLPVLTFVRK AAYAKRHAVG PVVVHCSAGV GRTGTYIVLD 1080

SMLQQIQHEG TVNIFGFLKH IRSQRNYLVQ TEEQYVFIHD TLVEAILSKE TEVLDSHIHA 1140 

YVNALLIPGP AGKTKLEKQF QLLSQSNIQQ SDYSAALKQC NREKNRTSSI IPVERSRVGI 1200 

SSLSGEGTDY INASYIMGYY QSNEFIITQH PLLHTIKDFW RMIWDHNAQL VVMIPDGQNM 1260

AEDEFVYWPN KDEPINCESF KVTLMAEEHK CLSNEEKLII QDFILEATQD DYVLEVRHFQ 1320 

CPKWPNPDSP ISKTFELISV IKEEAANRDG PMIVHDEHGG VTAGTFCALT TLMHQLEKEN 1380 

SVDVYQVAKM INLMRPGVFA DIEQYQFLYK VILSLVSTRQ EENPSTSLDS NGAALPDGNI 1440 
AESLESLV 



Seq ID NO: 185 DNA sequence 

Nucleic Acid Accession #: EOS sequence 

Coding sequence: 501-4514 

1 11 21 31 41 51 
CACACATACG CACGCACGAT CTCACTTCGA TCTATACACT GGAGGATTAA AACAAACAAA 60
CAAAAAAAAC ATTTCCTTCG CTCCCCCTCC CTCTCCACTC TGAGAAGCAG AGGAGCCGCA 120 
CGGCGAGGGG CCGCAGACCG TCTGGAAATG CGAATCCTAA AGCGTTTCCT CGCTTGCATT 180 
CAGCTCCTCT GTGTTTGCCG CCTGGATTGG GCTAATGGAT ACTACAGACA ACAGAGAAAA 240 
CTTGTTGAAG AGATTGGCTG GTCCTATACA GGAGCACTGA ATCAAAAAAT TGGGGAAAGA 300 
AATATCCAAC ATGTAATAGC CCAAAACAAT CTCCTATCAA TATTGATGAA GATCTTACAC 360 
AAGTAAATGT GAATCTTAAG AAACTTAAAT TTCAGGGTTG GGATAAAACA TCATTGGAAA 420 
ACACATTCAT TCATAACACT GGGAAAACAG TGGAAATTAA TCTCACTAAT GACTACCGTG 480
TCAGCGGAGG AGTTTCAGAA ATGGTGTTTA AAGCAAGCAA GATAACTTTT CACTGGGGAA 540 
AATGCAATAT GTCATCTGAT GGATCAGAGC ATAGTTTAGA AGGACAAAAA TTTCCACTTG 600 
AGATGCAAAT CTACTGCTTT GATGCGGACC GATTTTCAAG TTTTGAGGAA GCAGTCAAAG 660 
GAAAAGGGAA GTTAAGAGCT TTATCCATTT TGTTTGAGGT TGGGACAGAA GAAAATTTGG 720 
ATTTCAAAGC GATTATTGAT GGAGTCGAAA GTGTTAGTCG TTTTGGGAAG CAGGCTGCTT 780 
TAGATCCATT CATACTGTTG AACCTTCTGC CAAACTCAAC TGACAAGTAT TACATTTACA 840 
ATGGCTCATT GACATCTCCT CCCTGCACAG ACACAGTTGA CTGGATTGTT TTTAAAGATA 900 
CAGTTAGCAT CTCTGAAAGC CAGTTGGCTG TTTTTTGTGA AGTTCTTACA ATGCAACAAT 960
CTGGTTATGT CATGCTGATG GACTACTTAC AAAACAATTT TCGAGAGCAA CAGTACAAGT 1020 
TCTCTAGACA GGTGTTTTCC TCATACACTG GAAAGGAAGA GATTCATGAA GCAGTTTGTA 1080 
GTTCAGAACC AGAAAATGTT CAGGCTGACC CAGAGAATTA TACCAGCCTT CTTGTTACAT 1140 
GGGAAAGACC TCGAGTCGTT TATGATACCA TGATTGAGAA GTTTGCAGTT TTGTACCAGC 1200 
AGTTGGATGG AGAGGACCAA ACCAAGCATG AATTTTTGAC AGATGGCTAT CAAGACTTGG 1260 
GTGCTATTCT CAATAATTTG CTACCCAATA TGAGTTATGT TCTTCAGATA GTAGCCATAT 1320
GCACTAATGG CTTATATGGA AAATACAGCG ACCAACTGAT TGTCGACATG CCTACTGATA 1380 
ATCCTGAACT TGATCTTTTC CCTGAATTAA TTGGAACTGA AGAAATAATC AAGGAGGAGG 1440 
AAGAGGGAAA AGACATTGAA GAAGGCGCTA TTGTGAATCC TGGTAGAGAC AGTGCTACAA 1500 
ACCAAATCAG GAAAAAGGAA CCCCAGATTT CTACCACAAC ACACTACAAT CGCATAGGGA 1560 
CGAAATACAA TGAAGCCAAG ACTAACCGAT CCCCAACAAG AGGAAGTGAA TTCTCTGGAA 1620 
AGGGTGATGT TCCCAATACA TCTTTAAATT CCACTTCCCA ACCAGTCACT AAATTAGCCA 1680 
CAGAAAAAGA TATTTCCTTG ACTTCTCAGA CTGTGACTGA ACTGCCACCT CACACTGTGG 1740 
AAGGTACTTC AGCCTCTTTA AATGATGGCT CTAAAACTGT TCTTAGATCT CCACATATGA 1800 
ACTTGTCGGG GACTGCAGAA TCCTTAAATA CAGTTTCTAT AACAGAATAT GAGGAGGAGA 1860 
GTTTATTGAC CAGTTTCAAG CTTGATACTG GAGCTGAAGA TTCTTCAGGC TCCAGTCCCG 1920 
CAACTTCTGC TATCCCATTC ATCTCTGAGA ACATATCCCA AGGGTATATA TTTTCCTCCG 1980 
AAAACCCAGA GACAATAACA TATGATGTCC TTATACCAGA ATCTGCTAGA AATGCTTCCG 2040 
AAGATTCAAC TTCATCAGGT TCAGAAGAAT CACTAAAGGA TCCTTCTATG GAGGGAAATG 2100 
TGTGGTTTCC TAGCTCTACA GACATAACAG CACAGCCCGA TGTTGGATCA GGCAGAGAGA 2160 
GCTTTCTCCA GACTAATTAC ACTGAGATAC GTGTTGATGA ATCTGAGAAG ACAACCAAGT 2220 
CCTTTTCTGC AGGCCCAGTG ATGTCACAGG GTCCCTCAGT TACAGATCTG GAAATGCCAC 2280 
ATTATTCTAC CTTTGCCTAC TTCCCAACTG AGGTAACACC TCATGCTTTT ACCCCATCCT 2340 
CCAGACAACA GGATTTGGTC TCCACGGTCA ACGTGGTATA CTCGCAGACA ACCCAACCGG 2400 
TATACAATGA GGCCAGTAAT AGTAGCCATG AGTCTCGTAT TGGTCTAGCT GAGGGGTTGG 2460 
AATCCGAGAA GAAGGCAGTT ATACCCCTTG TGATCGTGTC AGCCCTGACT TTTATCTGTC 2520 
TAGTGGTTCT TGTGGGTATT CTCATCTACT GGAGGAAATG CTTCCAGACT GCACACTTTT 2580 
ACTTAGAGGA CAGTACATCC CCTAGAGTTA TATCCACACC TCCAACACCT ATCTTTCCAA 2640 
TTTCAGATGA TGTCGGAGCA ATTCCAATAA AGCACTTTCC AAAGCATGTT GCAGATTTAC 2700
ATGCAAGTAG TGGGTTTACT GAAGAATTTG AGACACTGAA AGAGTTTTAC CAGGAAGTGC 2760 
AGAGCTGTAC TGTTGACTTA GGTATTACAG CAGACAGCTC CAACCACCCA GACAACAAGC 2820 
ACAAGAATCG ATACATAAAT ATCGTTGCCT ATGATCATAG CAGGGTTAAG CTAGCACAGC 2880 
TTGCTGAAAA GGATGGCAAA CTGACTGATT ATATCAATGC CAATTATGTT GATGGCTACA 2940 
ACAGACCAAA AGCTTATATT GCTGCCCAAG GCCCACTGAA ATCCACAGCT GAAGATTTCT 3000 
GGAGAATGAT ATGGGAACAT AATGTGGAAG TTATTGTCAT GATAACAAAC CTCGTGGAGA 3060 
AAGGAAGGAG AAAATGTGAT CAGTACTGGC CTGCCGATGG GAGTGAGGAG TACGGGAACT 3120 
TTCTGGTCAC TCAGAAGAGT GTGCAAGTGC TTGCCTATTA TACTGTGAGG AATTTTACTC 3180 
TAAGAAACAC AAAAATAAAA AAGGGCTCCC AGAAAGGAAG ACCCAGTGGA CGTGTGGTCA 3240 
CACAGTATCA CTACACGCAG TGGCCTGACA TGGGAGTACC AGAGTACTCC CTGCCAGTGC 3300 
TGACCTTTGT GAGAAAGGCA GCCTATGCCA AGCGCCATGC AGTGGGGCCT GTTGTCGTCC 3360 
ACTGCAGTGC TGGAGTTGGA AGAACAGGCA CATATATTGT GCTAGACAGT ATGTTGCAGC 3420 
AGATTCAACA CGAAGGAACT GTCAACATAT TTGGCTTCTT AAAACACATC CGTTCACAAA 3480 
GAAATTATTT GGTACAAACT GAGGAGCAAT ATGTCTTCAT TCATGATACA CTGGTTGAGG 3540 
CCATACTTAG TAAAGAAACT GAGGTGCTGG ACAGTCATAT TCATGCCTAT GTTAATGCAC 3600 
TCCTCATTCC TGGACCAGCA GGCAAAACAA AGCTAGAGAA ACAATTCCAG CTCCTGAGCC 3660 
AGTCAAATAT ACAGCAGAGT GACTATTCTG CAGCCCTAAA GCAATGCAAC AGGGAAAAGA 3720 
ATCGAACTTC TTCTATCATC CCTGTGGAAA GATCAAGGGT TGGCATTTCA TCCCTGAGTG 3780 
GAGAAGGCAC AGACTACATC AATGCCTCCT ATATCATGGG CTATTACCAG AGCAATGAAT 3840 
TCATCATTAC CCAGCACCCT CTCCTTCATA CCATCAAGGA TTTCTGGAGG ATGATATGGG 3900 
ACCATAATGC CCAACTGGTG GTTATGATTC CTGATGGCCA AAACATGGCA GAAGATGAAT 3960
TTGTTTACTG GCCAAATAAA GATGAGCCTA TAAATTGTGA GAGCTTTAAG GTCACTCTTA 4020 
TGGCTGAAGA ACACAAATGT CTATCTAATG AGGAAAAACT TATAATTCAG GACTTTATCT 4080 
TAGAAGCTAC ACAGGATGAT TATGTACTTG AAGTGAGGCA CTTTCAGTGT CCTAAATGGC 4140 
CAAATCCAGA TAGCCCCATT AGTAAAACTT TTGAACTTAT AAGTGTTATA AAAGAAGAAG 4200 
CTGCCAATAG GGATGGGCCT ATGATTGTTC ATGATGAGCA TGGAGGAGTG ACGGCAGGAA 4260 
CTTTCTGTGC TCTGACAACC CTTATGCACC AACTAGAAAA AGAAAATTCC GTGGATGTTT 4320 
ACCAGGTAGC CAAGATGATC AATCTGATGA GGCCAGGAGT CTTTGCTGAC ATTGAGCAGT 4380 
ATCAGTTTCT CTACAAAGTG ATCCTCAGCC TTGTGAGCAC AAGGCAGGAA GAGAATCCAT 4440 
CCACCTCTCT GGACAGTAAT GGTGCAGCAT TGCCTGATGG AAATATAGCT GAGAGCTTAG 4500 
AGTCTTTAGT TTAACACAGA AAGGGGTGGG GGGACTCACA TCTGAGCATT GTTTTCCTCT 4560 
TCCTAAAATT AGGCAGGAAA ATCAGTCTAG TTCTGTTATC TGTTGATTTC CCATCACCTG 4620 
ACAGTAACTT TCATGACATA GGATTCTGCC GCCAAATTTA TATCATTAAC AATGTGTGCC 4680 
TTTTTGCAAG ACTTGTAATT TACTTATTAT GTTTGAACTA AAATGATTGA ATTTTACAGT 4740 
ATTTCTAAGA ATGGAATTGT GGTATTTTTT TCTGTATTGA TTTTAACAGA AAATTTCAAT 4800 
TTATAGAGGT TAGGAATTCC AAACTACAGA AAATGTTTGT TTTTAGTGTC AAATTTTTAG 4860 
CTGTATTTGT AGCAATTATC AGGTTTGCTA GAAATATAAC TTTTAATACA GTAGCCTGTA 4920 
AATAAAACAC TCTTCCATAT GATATTCAAC ATTTTACAAC TGCAGTATTC ACCTAAAGTA 4980 
GAAATAATCT GTTACTTATT GTAAATACTG CCCTAGTGTC TCCATGGACC AAATTTATAT 5040 
TTATAATTGT AGATTTTTAT ATTTTACTAC TGAGTCAAGT TTTCTAGTTC TGTGTAATTG 5100 
TTTAGTTTAA TGACGTAGTT CATTAGCTGG TCTTACTCTA CCAGTTTTCT GACATTGTAT 5160 
TGTGTTACCT AAGTCATTAA CTTTGTTTCA GCATGTAATT TTAACTTTTG TGGAAAATAG 5220
AAATACCTTC ATTTTGAAAG AAGTTTTTAT GAGAATAACA CCTTACCAAA CATTGTTCAA 5280
ATGGTTTTTA TCCAAGGAAT TGCAAAAATA AATATAAATA TTGCCATTAA AAAAAAAAAA 5340
AAAAAAAAAA AAAAAAAAAA AAA

Seq ID NO: 186 Protein sequence:
Protein Accession #: EOS sequence



MVFKASKITF 
LSILFEVGTE 
PCTDTVDWIV 
SYTGKEEIHE 
TKHEFLTDGY 
PELIGTEEII 
TNRSPTRGSE 
NDGSKTVLRS 
ISENISQGYI 
DITAQPDVGS 
FPTEVTPHAF 
IPLVIVSALT 
IPIKHFPKHV 
IVAYDHSRVK 
NVEVIVMITN 



RTGTYIVLDS 
EVLDSHIHAY 
PVERSRVGIS 
VMIPDGQNMA 
YVLEVRHFQC 
LMHQLEKENS 
GAALPDGNIA 



11 
I 

HWGKCNMSSD 
ENLDFKAIID 
FKDTVSISES 
AVCSSEPENV 
QDLGAILNNL 
KEEEEGKDIE 
FSGKGDVPNT 
PHMNLSGTAE 
FSSENPETIT 
GRESFLQTNY 
TPSSRQQDLV 
FICLVVLVGI 
ADLHASSGFT 
LAQLAEKDGK 
LVEKGRRKCD 
RVVTQYHYTQ 
MLQQIQHEGT 
VNALLIPGPA 
SLSGEGTDYI 
EDEFVYWPNK 
PKWPNPDSPI 
VDVYQVAKMI 
ESLESLV 



21 
I 

GSEHSLEGQK 
GVESVSRFGK 
QLAVFCEVLT 
QADPENYTSL 
LPNMSYVLQI 
EGAIVNPGRD 
SLNSTSQPVT 
SLNTVSITEY 
YDVLIPESAR 
TEIRVDESEK 
STVNVVYSQT 
LIYWRKCFQT 
EEFETLKEFY 
LTDYINANYV 
QYWPADGSEE 
WPDMGVPEYS 
VNIFGFLKHI 
GKTKLEKQFQ 
NASYIMGYYQ 
DEPINCESFK 
SKTFELISVI 
NLMRPGVFAD 



31 

i 

FPLEMQIYCF 
QAALDPFILL 
MQQSGYVMLM 
LVTWERPRVV 
VAICTNGLYG 
SATNQIRKKE 
KLATEKDISL 
EEESLLTSFK 
NASEDSTSSG 
TTKSFSAGPV 
TQPVYNEASN 
AHFYLEDSTS 
QEVQSCTVDL 
DGYNRPKAYI 
YGNFLVTQKS 
LPVLTFVRKA 
RSQRNYLVQT 
LLSQSNIQQS 
SNEFIITQHP 
VTLMAEEHKC 
KEEAANRDGP 
IEQYQFLYKV 



41 
DADRFSSFEE 
NLLPNSTDKY 
DYLQNNFREQ 
YDTMIEKFAV 
KYSDQLIVDM 
PQISTTTHYN 
TSQTVTELPP 
LDTGAEDSSG 
SEESLKDPSM 
MSQGPSVTDL 
SSHESRIGLA 
PRVISTPPTP 
GITADSSNHP 
AAQGPLKSTA 
VQVLAYYTVR 
AYAKRHAVGP 
EEQYVFIHDT 
DYSAALKQCN 
LLHTIKDFWR 
LSNEEKLIIQ 
MIVHDEHGGV 
ILSLVSTRQE 



51 
t 

AVKGKGKLRA 
YIYNGSLTSP 
QYKFSRQVFS 
LYQQLDGEDQ 
PTDNPELDLF 
RIGTKYNEAK 
HTVEGTSASL 
SSPATSAIPF 
EGNVWFPSST 
EMPHYSTFAY 
EGLESEKKAV 
IFPISDDVGA 
DNKHKNRYIN 
EDFWRMIWEH 
NFTLRNTKIK 
VVVHCSAGVG 
LVEAILSKET 
REKNRTSSII 
MIWDHNAQLV 
DFILEATQDD 
TAGTFCALTT 
ENPSTSLDSN 



Seq ID NO: 187 DNA sequence 

Nucleic Acid Accession #: EOS sequence 

Coding sequence: 148-4632 



1 11 21 31 41 51 

| | | | | | 

CACACATACG CACGCACGAT CTCACTTCGA TCTATACACT GGAGGATTAA AACAAACAAA 60 

CAAAAAAAAC ATTTCCTTCG CTCCCCCTCC CTCTCCACTC TGAGAAGCAG AGGAGCCGCA 120 

CGGCGAGGGG CCGCAGACCG TCTGGAAATG CGAATCCTAA AACGTTTCCT CGCTTGCATT 180 

CAGCTCCTCT GTGTTTGCCG CCTGGATTGG GCTAATGGAT ACTACAGACA ACAGAGAAAA 240 

CTTGTTGAAG AGATTGGCTG GTCCTATACA GGAGCACTGA ATCAAAAAAA TTGGGGAAAG 300 

AAATATCCAA CATGTAATAG CCCAAAACAA TCTCCTATCA ATATTGATGA AGATCTTACA 360 

CAAGTAAATG TGAATCTTAA GAAACTTAAA TTTCAGGGTT GGGATAAAAC ATCATTGGAA 420 

AACACATTCA TTCATAACAC TGGGAAAACA GTGGAAATTA ATCTCACTAA TGACTACCGT 480 

GTCAGCGGAG GAGTTTCAGA AATGGTGTTT AAAGCAAGCA AGATAACTTT TCACTGGGGA 540 

AAATGCAATA TGTCATCTGA TGGATCAGAG CATAGTTTAG AAGGACAAAA ATTTCCACTT 600 

GAGATGCAAA TCTACTGCTT TGATGCGGAC CGATTTTCAA GTTTTGAGGA AGCAGTCAAA 660 

GGAAAAGGGA AGTTAAGAGC TTTATCCATT TTGTTTGAGG TTGGGACAGA AGAAAATTTG 720 

GATTTCAAAG CGATTATTGA TGGAGTCGAA AGTGTTAGTC GTTTTGGGAA GCAGGCTGCT 780 

TTAGATCCAT TCATACTGTT GAACCTTCTG CCAAACTCAA CTGACAAGTA TTACATTTAC 840 

AATGGCTCAT TGACATCTCC TCCCTGCACA GACACAGTTG ACTGGATTGT TTTTAAAGAT 900 

ACAGTTAGCA TCTCTGAAAG CCAGTTGGCT GTTTTTTGTG AAGTTCTTAC AATGCAACAA 960 

TCTGGTTATG TCATGCTGAT GGACTACTTA CAAAACAATT TTCGAGAGCA ACAGTACAAG 1020 

TTCTCTAGAC AGGTGTTTTC CTCATACACT GGAAAGGAAG AGATTCATGA AGCAGTTTGT 1080 

AGTTCAGAAC CAGAAAATGT TCAGGCTGAC CCAGAGAATT ATACCAGCCT TCTTGTTACA 1140 

TGGGAAAGAC CTCGAGTCGT TTATGATACC ATGATTGAGA AGTTTGCAGT TTTGTACCAG 1200 

CAGTTGGATG GAGAGGACCA AACCAAGCAT GAATTTTTGA CAGATGGCTA TCAAGACTTG 1260 

GGTGCTATTC TCAATAATTT GCTACCCAAT ATGAGTTATG TTCTTCAGAT AGTAGCCATA 1320 

TGCACTAATG GCTTATATGG AAAATACAGC GACCAACTGA TTGTCGACAT GCCTACTGAT 1380 

AATCCTGAAC TTGATCTTTT CCCTGAATTA ATTGGAACTG AAGAAATAAT CAAGGAGGAG 1440 

GAAGAGGGAA AAGACATTGA AGAAGGCGCT ATTGTGAATC CTGGTAGAGA CAGTGCTACA 1500 

AACCAAATCA GGAAAAAGGA ACCCCAGATT TCTACCACAA CACACTACAA TCGCATAGGG 1560 

ACGAAATACA ATGAAGCCAA GACTAACCGA TCCCCAACAA GAGGAAGTGA ATTCTCTGGA 1620 

AAGGGTGATG TTCCCAATAC ATCTTTAAAT TCCACTTCCC AACCAGTCAC TAAATTAGCC 1680 

ACAGAAAAAG ATATTTCCTT GACTTCTCAG ACTGTGACTG AACTGCCACC TCACACTGTG 1740 

GAAGGTACTT CAGCCTCTTT AAATGATGGC TCTAAAACTG TTCTTAGATC TCCACATATG 1800 

AACTTGTCGG GGACTGCAGA ATCCTTAAAT ACAGTTTCTA TAACAGAATA TGAGGAGGAG 1860 

AGTTTATTGA CCAGTTTCAA GCTTGATACT GGAGCTGAAG ATTCTTCAGG CTCCAGTCCC 1920 

GCAACTTCTG CTATCCCATT CATCTCTGAG AACATATCCC AAGGGTATAT ATTTTCCTCC 1980 

GAAAACCCAG AGACAATAAC ATATGATGTC CTTATACCAG AATCTGCTAG AAATGCTTCC 2040 

GAAGATTCAA CTTCATCAGG TTCAGAAGAA TCACTAAAGG ATCCTTCTAT GGAGGGAAAT 2100 

GTGTGGTTTC CTAGCTCTAC AGACATAACA GCACAGCCCG ATGTTGGATC AGGCAGAGAG 2160 

AGCTTTCTCC AGACTAATTA CACTGAGATA CGTGTTGATG AATCTGAGAA GACAACCAAG 2220 

TCCTTTTCTG CAGGCCCAGT GATGTCACAG GGTCCCTCAG TTACAGATCT GGAAATGCCA 2280 

CATTATTCTA CCTTTGCCTA CTTCCCAACT GAGGTAACAC CTCATGCTTT TACCCCATCC 2340 

TCCAGACAAC AGGATTTGGT CTCCACGGTC AACGTGGTAT ACTCGCAGAC AACCCAACCG 2400 

GTATACAATG AGGCCAGTAA TAGTAGCCAT GAGTCTCGTA TTGGTCTAGC TGAGGGGTTG 2460 

GAATCCGAGA AGAAGGCAGT TATACCCCTT GTGATCGTGT CAGCCCTGAC TTTTATCTGT 2520 

CTAGTGGTTC TTGTGGGTAT TCTCATCTAC TGGAGGAAAT GCTTCCAGAC TGCACACTTT 2580 

TACTTAGAGG ACAGTACATC CCCTAGAGTT ATATCCACAC CTCCAACACC TATCTTTCCA 2640 

ATTTCAGATG ATGTCGGAGC AATTCCAATA AAGCACTTTC CAAAGCATGT TGCAGATTTA 2700 

CATGCAAGTA GTGGGTTTAC TGAAGAATTT GAGACACTGA AAGAGTTTTA CCAGGAAGTG 2760 

CAGAGCTGTA CTGTTGACTT AGGTATTACA GCAGACAGCT CCAACCACCC AGACAACAAG 2820 

CACAAGAATC GATACATAAA TATCGTTGCC TATGATCATA GCAGGGTTAA GCTAGCACAG 2880 

CTTGCTGAAA AGGATGGCAA ACTGACTGAT TATATCAATG CCAATTATGT TGATGGCTAC 2940 

AACAGACCAA AAGCTTATAT TGCTGCCCAA GGCCCACTGA AATCCACAGC TGAAGATTTC 3000 

TGGAGAATGA TATGGGAACA TAATGTGGAA GTTATTGTCA TGATAACAAA CCTCGTGGAG 3060 

AAAGGAAGGA GAAAATGTGA TCAGTACTGG CCTGCCGATG GGAGTGAGGA GTACGGGAAC 3120 

TTTCTGGTCA CTCAGAAGAG TGTGCAAGTG CTTGCCTATT ATACTGTGAG GAATTTTACT 3180 

CTAAGAAACA CAAAAATAAA AAAGGGCTCC CAGAAAGGAA GACCCAGTGG ACGTGTGGTC 3240 

ACACAGTATC ACTACACGCA GTGGCCTGAC ATGGGAGTAC CAGAGTACTC CCTGCCAGTG 3300 

CTGACCTTTG TGAGAAAGGC AGCCTATGCC AAGCGCCATG CAGTGGGGCC TGTTGTCGTC 3360 

CACTGCAGTG CTGGAGTTGG AAGAACAGGC ACATATATTG TGCTAGACAG TATGTTGCAG 3420 

CAGATTCAAC ACGAAGGAAC TGTCAACATA TTTGGCTTCT TAAAACACAT CCGTTCACAA 3480 

AGAAATTATT TGGTACAAAC TGAGGAGCAA TATGTCTTCA TTCATGATAC ACTGGTTGAG 3540 

GCCATACTTA GTAAAGAAAC TGAGGTGCTG GACAGTCATA TTCATGCCTA TGTTAATGCA 3600 

CTCCTCATTC CTGGACCAGC AGGCAAAACA AAGCTAGAGA AACAATTCCA GGGTCTCACT 3660 

CTGTCACCCA GGCTGGAGTG CAGAGGCACA ATCTCGGCTC ACTGCAACCT TCCTCTCCCT 3720 

GGCTTAACTG ATCCTCCTAC CTCAGCCTCC CGAGTGGCTG GGACTATACT CCTGAGCCAG 3780 

TCAAATATAC AGCAGAGTGA CTATTCTGCA GCCCTAAAGC AATGCAACAG GGAAAAGAAT 3840 

CGAACTTCTT CTATCATCCC TGTGGAAAGA TCAAGGGTTG GCATTTCATC CCTGAGTGGA 3900 

GAAGGCACAG ACTACATCAA TGCCTCCTAT ATCATGGGCT ATTACCAGAG CAATGAATTC 3960 

ATCATTACCC AGCACCCTCT CCTTCATACC ATCAAGGATT TCTGGAGGAT GATATGGGAC 4020 

CATAATGCCC AACTGGTGGT TATGATTCCT GATGGCCAAA ACATGGCAGA AGATGAATTT 4080 

GTTTACTGGC CAAATAAAGA TGAGCCTATA AATTGTGAGA GCTTTAAGGT CACTCTTATG 4140 

GCTGAAGAAC ACAAATGTCT ATCTAATGAG GAAAAACTTA TAATTCAGGA CTTTATCTTA 4200 

GAAGCTACAC AGGATGATTA TGTACTTGAA GTGAGGCACT TTCAGTGTCC TAAATGGCCA 4260 

AATCCAGATA GCCCCATTAG TAAAACTTTT GAACTTATAA GTGTTATAAA AGAAGAAGCT 4320 

GCCAATAGGG ATGGGCCTAT GATTGTTCAT GATGAGCATG GAGGAGTGAC GGCAGGAACT 4380 

TTCTGTGCTC TGACAACCCT TATGCACCAA CTAGAAAAAG AAAATTCCGT GGATGTTTAC 4440 

CAGGTAGCCA AGATGATCAA TCTGATGAGG CCAGGAGTCT TTGCTGACAT TGAGCAGTAT 4500 

CAGTTTCTCT ACAAAGTGAT CCTCAGCCTT GTGGGCACAA GGCAGGAAGA GAATCCATCC 4560 

ACCTCTCTGG ACAGTAATGG TGCAGCATTG CCTGATGGAA ATATAGCTGA GAGCTTAGAG 4620 

TCTTTAGTTT AACACAGAAA GGGGTGGGGG GACTCACATC TGAGCATTGT TTTCCTCTTC 4680 

CTAAAATTAG GCAGGAAAAT CAGTCTAGTT CTGTTATCTG TTGATTTCCC ATCACCTGAC 4740 

AGTAACTTTC ATGACATAGG ATTCTGCCGC CAAATTTATA TCATTAACAA TGTGTGCCTT 4800 

TTTGCAAGAC TTGTAATTTA CTTATTATGT TTGAACTAAA ATGATTGAAT TTTACAGTAT 4860 

TTCTAAGAAT GGAATTGTGG TATTTTTTTC TGTATTGATT TTAACAGAAA ATTTCAATTT 4920 

ATAGAGGTTA GGAATTCCAA ACTACAGAAA ATGTTTGTTT TTAGTGTCAA ATTTTTAGCT 4980 

GTATTTGTAG CAATTATCAG GTTTGCTAGA AATATAACTT TTAATACAGT AGCCTGTAAA 5040 

TAAAACACTC TTCCATATGA TATTCAACAT TTTACAACTG CAGTATTCAC CTAAAGTAGA 5100 

AATAATCTGT TACTTATTGT AAATACTGCC CTAGTGTCTC CATGGACCAA ATTTATATTT 5160 

ATAATTGTAG ATTTTTATAT TTTACTACTG AGTCAAGTTT TCTAGTTCTG TGTAATTGTT 5220 

TAGTTTAATG ACGTAGTTCA TTAGCTGGTC TTACTCTACC AGTTTTCTGA CATTGTATTG 5280 

TGTTACCTAA GTCATTAACT TTGTTTCAGC ATGTAATTTT AACTTTTGTG GAAAATAGAA 5340 

ATACCTTCAT TTTGAAAGAA GTTTTTATGA GAATAACACC TTACCAAACA TTGTTCAAAT 5400 

GGTTTTTATC CAAGGAATTG CAAAAATAAA TATAAATATT GCCATTAAAA AAAAAAAAAA 5460 
AAAAAAAAAA AAAAAAAAAA A 

Seq ID NO: 188 Protein sequence: 
Protein Accession #: EOS sequence 

1 11 21 31 41 51 

| | | | | | 

MRILKRFLAC IQLLCVCRLD WANGYYRQGR KLVEEIGWSY TGALNQKNWG KKYPTCNSPK 60 

QSPINIDEDL TQVNVNLKKL KFQGWDKTSL ENTFIHNTGK TVEINLTNDY RVSGGVSEMV 120 

FKASKITFHW GKCNMSSDGS EHSLEGQKFP LEMQIYCFDA DRFSSFEEAV KGKGKLRALS 180 

ILFEVGTEEN LDFKAIIDGV ESVSRFGKQA ALDPFILLNL LPNSTDKYYI YNGSLTSPPC 240 

TDTVDWIVFK DTVSISESQL AVFCEVLTMQ QSGYVMLMDY LQNNFREQQY KFSRQVFSSY 300 

TGKEEIHEAV CSSEPENVQA DPENYTSLLV TWERPRVVYD TMIEKFAVLY QQLDGEDQTK 360 

HEFLTDGYQD LGAILNNLLP NMSYVLQIVA ICTNGLYGKY SDQLIVDMPT DNPELDLFPE 420 

LIGTEEIIKE EEEGKDIEEG AIVNPGRDSA TNQIRKKEPQ ISTTTHYNRI GTKYNEAKTN 480 

RSPTRGSEFS GKGDVPNTSL NSTSQPVTKL ATEKDISLTS QTVTELPPHT VEGTSASLND 540 

GSKTVLRSPH MNLSGTAESL NTVSITEYEE ESLLTSFKLD TGAEDSSGSS PATSAIPFIS 600 

ENISQGYIFS SENPETITYD VLIPESARNA SEDSTSSGSE ESLKDPSMEG NVWFPSSTDI 660 

TAQPDVGSGR ESFLQTNYTE IRVDESEKTT KSFSAGPVMS QGPSVTDLEM PHYSTFAYFP 720 

TEVTPHAFTP SSRQQDLVST VNVVYSQTTQ PVYNEASNSS HESRIGLAEG LESEKKAVIP 780 

LVIVSALTFI CLVVLVGILI YWRKCFQTAH FYLEDSTSPR VISTPPTPIF PISDDVGAIP 840 

IKHFPKHVAD LHASSGFTEE FETLKEFYQE VQSCTVDLGI TADSSNHPDN KHKNRYINIV 900 

AYDHSRVKLA QLAEKDGKLT DYINANYVDG YNRPKAYIAA QGPLKSTAED FWRMIWEHNV 960 

EVIVMITNLV EKGRRKCDQY WPADGSEEYG NFLVTQKSVQ VLAYYTVRNF TLRNTKIKKG 1020 

SQKGRPSGRV VTQYHYTQWP DMGVPEYSLP VLTFVRKAAY AKRHAVGPVV VHCSAGVGRT 1080 

GTYIVLDSML QQIQHEGTVN IFGFLKHIRS QRNYLVQTEE QYVFIHDTLV EAILSKETEV 1140 

LDSHIHAYVN ALLIPGPAGK TKLEKQFQGL TLSPRLECRG TISAHCNLPL PGLTDPPTSA 1200 

SRVAGTILLS QSNIQQSDYS AALKQCNREK NRTSSIIPVE RSRVGISSLS GEGTDYINAS 1260 

YIMGYYQSNE FIITQHPLLH TIKDFWRMIW DHNAQLVVMI PDGQNMAEDE FVYWPNKDEP 1320 

INCESFKVTL MAEEHKCLSN EEKLIIQDFI LEATQDDYVL EVRHFQCPKW PNPDSPISKT 1380 

FELISVIKEE AANRDGPMIV HDEHGGVTAG TFCALTTLMH QLEKENSVDV YQVAKMINLM 1440 
RPGVFADIEQ YQFLYKVILS LVGTRQEENP STSLDSNGAA LPDGNIAESL ESLV 



Seq ID NO: 189 DNA sequence 
Nucleic Acid Accession #: NM_002820 
Coding sequence: 304..831 

1 11 21 31 41 51 

| | | | | | 

CCGGTTCGCA AAGAAGCTGA CTTCAGAGGG GGAAACTTTC TTCTTTTAGG AGGCGGTTAG 60 
CCCTGTTCCA CGAACCCAGG AGAACTGCTG GCCAGATTAA TTAGACATTG CTATGGGAGA 120 
CGTGTAAACA CACTACTTAT CATTGATGCA TATATAAAAC CATTTTATTT TCGCTATTAT 180 
TTCAGAGGAA GCGCCTCTGA TTTGTTTCTT TTTTCCCTTT TTGCTCTTTC TGGCTGTGTG 240 
GTTTGGAGAA AGCACAGTTG GAGTAGCCGG TTGCTAAATA AGTCCCGAGC GCGAGCGGAG 300 
ACGATGCAGC GGAGACTGGT TCAGCAGTGG AGCGTCGCGG TGTTCCTGCT GAGCTACGCG 360 
GTGCCCTCCT GCGGGCGCTC GGTGGAGGGT CTCAGCCGCC GCCTCAAAAG AGCTGTGTCT 420 
GAACATCAGC TCCTCCATGA CAAGGGGAAG TCCATCCAAG ATTTACGGCG ACGATTCTTC 480 
CTTCACCATC TGATCGCAGA AATCCACACA GCTGAAATCA GAGCTACCTC GGAGGTGTCC 540 
CCTAACTCCA AGCCCTCTCC CAACACAAAG AACCACCCCG TCCGATTTGG GTCTGATGAT 600 
GAGGGCAGAT ACCTAACTCA GGAAACTAAC AAGGTGGAGA CGTACAAAGA GCAGCCGCTC 660 
AAGACACCTG GGAAGAAAAA GAAAGGCAAG CCCGGGAAAC GCAAGGAGCA GGAAAAGAAA 720 
AAACGGCGAA CTCGCTCTGC CTGGTTAGAC TCTGGAGTGA CTGGGAGTGG GCTAGAAGGG 780 
GACCACCTGT CTGACACCTC CACAACGTCG CTGGAGCTCG ATTCACGGTA ACAGGCTTCT 840 
CTGGCCCGTA GCCTCAGCGG GGTGCTCTCA GCTGGGTTTT GGAGCCTCCC TTCTGCCTTG 900 
GCTTGGACAA ACCTAGAATT TTCTCCCTTT ATGTATCTCT ATCGATTGTG TAGCAATTGA 960 
CAGAGAATAA CTCAGAATAT TGTCTGCCTT AAAGCAGTAC CCCCCTACCA CACACACCCC 1020 
TGTCCTCCAG CACCATAGAG AGGCGCTAGA GCCCATTCCT CTTTCTCCAC CGTCACCCAA 1080 
CATCAATCCT TTACCACTCT ACCAAATAAT TTCATATTCA AGCTTCAGAA GCTAGTGACC 1140 
ATCTTCATAA TTTGCTGGAG AAGTGTATTT CTTCCCCTTA CTCTCACACC TGGGCAAACT 1200 
TTCTTCAGTG TTTTTCATTT CTTACGTTCT TTCACTTCAA GGGAGAATAT AGAAGCATTT 1260 
GATATTATCT ACAAACACTG CAGAACAGCA TCATGTCATA AACGATTCTG AGCCATTCAC 1320 
ACTTTTTATT TAATTAAATG TATTTAATTA AATCTCAAAT TTATTTTAAT GTAAAGAACT 1380 
TAAATTATGT TTTAAACACA TGCCTTAAAT TTGTTTAATT AAATTTAACT CTGGTTTCTA 1440 
CCAGCTCATA CAAAATAAAT GGTTTCTGAA AATGTTTAAG TATTAACTTA CAAGGATATA 1500 
GGTTTTTCTC ATGTATCTTT TTGTTCATTG GCAAGATGAA ATAATTTTTC TAGGGTAATG 1560 
CCGTAGGAAA AATAAAACTT CACATTTAAA AAAAA 

Seq ID NO: 190 Protein sequence: 
Protein Accession #: NP_002811 

1 11 21 31 41 51 

| | | | | | 

MQRRLVQQWS VAVFLLSYAV PSCGRSVEGL SRRLKRAVSE HQLLHDKGKS IQDLRRRFFL 60 
HHLIAEIHTA EIRATSEVSP NSKPSPNTKN HPVRFGSDDE GRYLTQETNK VETYKEQPLK 120 
TPGKKKKGKP GKRKEQEKKK RRTRSAWLDS GVTGSGLEGD HLSDTSTTSL ELDSR 

Seq ID NO: 191 DNA sequence 
Nucleic Acid Accession #: XM_059328 
Coding sequence: 52..1023 



GGGCTGTCCG 
CCTCGCATGC 
GGTATCGTGG 
GCGGCCACGG 
GCCAACCTGT 
GGCCCGGAAG 
GTGGATTTGC 
CTGGGCAGGG 
TGCCAGGTGT 
GAGCGCGGTG 
GTGGAGCGCG 
GACGCCTTCG 
GCCCTGGCGC 
GCGCACCCCG 
TTCTCTTGCT 
GCCCAGCTTG 
CCAGGGGAGG 
TGACCCCCTA 
TTAGTCCTGG 
GGACACTGCC 
AGCCTTCTTG 
TGGTGCCCCT 
CTATATTAAT 



11 
I 

GCCCACTCCC 
GCCTGGTGGT 
AGGCCTTTCT 
AGAGCGCGGC 
CCGAGGGCCG 

CTCAGGTGCG 
CCCCCACGCA 
TCGCCGAGGC 
TGGGTGGCTG 
ACGCCCGGGC 
TGGGCCTGAG 
GGGTCCTGGA 
GCTACCCCAG 
CTTGGGAGCG 
CCCAGGATGG 
AGGTCCCCTG 
CAGACAACCA 
CCCAGCCCAG 
ACCTCTGGGC 
GCTGCAGGCA 
CCATGTTGCA 
AAAATAACGT 



21 
I 

CTGGGAGCGC 
CACCGCGGAC 
GGCCGGGGCT 
GGAGCTGGCC 
CCCCGTGGGT 
TGGCAAGATG 
GGAGGAGCTC 
CGCGGACGGG 
GCTGCAGGCC 
CACTTGGCTG 
CGCCGTGGGC 
CACTTGCGGC 
AGGTACCCTA 
TGTGCCTCCC 
GCTGCATGAG 
CGTGCAGCTT 
TGAGCCCACT 
AGCACTAATC 
AGCTGGGACC 
TCAGGTCCTC 
GGCCTAGCCT 
ATGCAAACAC 
GTGTCTTTC 



31 
I 

GAGCGGTGGA 
GACTTTGGTT 
GTGACCAGCG 
CGCAGGCACA 
CCGGCCCGCC 
GGATTCCGGG 
GAGGCCCAAC 
CACCAGCACG 
TATGGGGTGC 
GAGGCCCCCG 
CCCTTCTCCC 
CGGCACATGT 
GCGGGCCACA 
ACCGGCGGCT 
CTGCGCGTCC 
TGCGCCCTCG 
CTGGAACCCT 
CCCTTAGTAC 
TGGAGCACGA 
ATGCCTCCAA 
GTGGCAGCGG 
CTTCACCACT 



41 
I 

CCCAGGCGGC 
ACTGCCCGCG 
TGTCCCTGCT 
GCATCCCCAC 
GTGGCGCCTC 
AGGCGGTGGC 
TAAGCTGCTT 
TGCACGTGCT 
GCTTTACGCG 
CGCGTGCCTT 
GCCACGGCCT 
CCGCTCACCG 
CCCTGACAGC 
GCGGTGAAGG 
TCACCGCGCC 
ACGACCTGGA 
TCCTGGAACC 
CAAGAAAGGG 
TCTGTTGACT 
ATGGCATCTA 
GCTAGGGCCC 
GGGGCAGTGG 



51 
I 

CATGTCCCGC 
ACGCGATGAG 
GGTCAACGGT 
GGGCCTCCAC 
ATCGCTGCTC 
GGCCGGAGAC 
CCGGGAGCTG 
CCCAGGCGTG 
ACTGCCGCTG 
CGCCTGCGCC 
GCGGTGGACA 
CGTGTCCGGG 
CGAGCTGATG 
CCCCGACGCT 
CACGCTGCGG 
CTCCAAGAGG 
CTCCCTACTC 
GAGCCAGGAT 
TCCCTGGGTA 
GAGTTTGAGC 
GCAGAGCATT 
GGAGAGATGG 



Seq ID NO: 192 Protein sequence: 
Protein Accession #: XP_059328 



1 11 21 31 41 51 

| | | | | | 

MSRPRMRLVV TADDFGYCPR RDEGIVEAFL AGAVTSVSLL VNGAATESAA ELARRHSIPT 60 
GLHANLSEGR PVGPARRGAS SLLGPEGFFL GKMGFREAVA AGDVDLPQVR EELEAQLSCF 120 
RELLGRAPTH ADGHQHVHVL PGVCQVFAEA LQAYGVRFTR LPLERGVGGC TWLEAPARAF 180 
ACAVERDARA AVGPFSRHGL RWTDAFVGLS TCGRHMSAHR VSGALARVLE GTLAGHTLTA 240 
ELMAHPGYPS VPPTGGCGEG PDAFSCSWER LHELRVLTAP TLRAQLAQDG VQLCALDDLD 300 
SKRPGEEVPC EPTLEPFLEP SLL 



Seq ID NO: 193 DNA sequence 

Nucleic Acid Accession #: NM_005688.1 

Coding sequence: 126..4439 



1 11 21 31 41 51 

| | | | | | 

CCGGGCAGGT GGCTCATGCT CGGGAGCGTG GTTGAGCGGC TGGCGCGGTT GTCCTGGAGC 60 

AGGGGCGCAG GAATTCTGAT GTGAAACTAA CAGTCTGTGA GCCCTGGAAC CTCCGCTCAG 120 

AGAAGATGAA GGATATCGAC ATAGGAAAAG AGTATATCAT CCCCAGTCCT GGGTATAGAA 180 

GTGTGAGGGA GAGAACCAGC ACTTCTGGGA CGCACAGAGA CCGTGAAGAT TCCAAGTTCA 240 

GGAGAACTCG ACCGTTGGAA TGCCAAGATG CCTTGGAAAC AGCAGCCCGA GCCGAGGGCC 300 

TCTCTCTTGA TGCCTCCATG CATTCTCAGC TCAGAATCCT GGATGAGGAG CATCCCAAGG 360 

GAAAGTACCA TCATGGCTTG AGTGCTCTGA AGCCCATCCG GACTACTTCC AAACACCAGC 420 

ACCCAGTGGA CAATGCTGGG CTTTTTTCCT GTATGACTTT TTCGTGGCTT TCTTCTCTGG 480 

CCCGTGTGGC CCACAAGAAG GGGGAGCTCT CAATGGAAGA CGTGTGGTCT CTGTCCAAGC 540 

ACGAGTCTTC TGACGTGAAC TGCAGAAGAC TAGAGAGACT GTGGCAAGAA GAGCTGAATG 600 

AAGTTGGGCC AGACGCTGCT TCCCTGCGAA GGGTTGTGTG GATCTTCTGC CGCACCAGGC 660 

TCATCCTGTC CATCGTGTGC CTGATGATCA CGCAGCTGGC TGGCTTCAGT GGACCAGCCT 720 

TCATGGTGAA ACACCTCTTG GAGTATACCC AGGCAACAGA GTCTAACCTG CAGTACAGCT 780 

TGTTGTTAGT GCTGGGCCTC CTCCTGACGG AAATCGTGCG GTCTTGGTCG CTTGCACTGA 840 

CTTGGGCATT GAATTACCGA ACCGGTGTCC GCTTGCGGGG GGCCATCCTA ACCATGGCAT 900 

TTAAGAAGAT CCTTAAGTTA AAGAACATTA AAGAGAAATC CCTGGGTGAG CTCATCAACA 960 

TTTGCTCCAA CGATGGGCAG AGAATGTTTG AGGCAGCAGC CGTTGGCAGC CTGCTGGCTG 1020 

GAGGACCCGT TGTTGCCATC TTAGGCATGA TTTATAATGT AATTATTCTG GGACCAACAG 1080 

GCTTCCTGGG ATCAGCTGTT TTTATCCTCT TTTACCCAGC AATGATGTTT GCATCACGGC 1140 

TCACAGCATA TTTCAGGAGA AAATGCGTGG CCGCCACGGA TGAACGTGTC CAGAAGATGA 1200 

ATGAAGTTCT TACTTACATT AAATTTATCA AAATGTATGC CTGGGTCAAA GCATTTTCTC 1260 

AGAGTGTTCA AAAAATCCGC GAGGAGGAGC GTCGGATATT GGAAAAAGCC GGGTACTTCC 1320 

AGGGTATCAC TGTGGGTGTG GCTCCCATTG TGGTGGTGAT TGCCAGCGTG GTGACCTTCT 1380 

CTGTTCATAT GACCCTGGGC TTCGATCTGA CAGCAGCACA GGCTTTCACA GTGGTGACAG 1440 

TCTTCAATTC CATGACTTTT GCTTTGAAAG TAACACCGTT TTCAGTAAAG TCCCTCTCAG 1500 

AAGCCTCAGT GGCTGTTGAC AGATTTAAGA GTTTGTTTCT AATGGAAGAG GTTCACATGA 1560 

TAAAGAACAA ACCAGCCAGT CCTCACATCA AGATAGAGAT GAAAAATGCC ACCTTGGCAT 1620 

GGGACTCCTC CCACTCCAGT ATCCAGAACT CGCCCAAGCT GACCCCCAAA ATGAAAAAAG 1680 

ACAAGAGGGC TTCCAGGGGC AAGAAAGAGA AGGTGAGGCA GCTGCAGCGC ACTGAGCATC 1740 

AGGCGGTGCT GGCAGAGCAG AAAGGCCACC TCCTCCTGGA CAGTGACGAG CGGCCCAGTC 1800 

CCGAAGAGGA AGAAGGCAAG CACATCCACC TGGGCCACCT GCGCTTACAG AGGACACTGC 1860 

ACAGCATCGA TCTGGAGATC CAAGAGGGTA AACTGGTTGG AATCTGCGGC AGTGTGGGAA 1920 

GTGGAAAAAC CTCTCTCATT TCAGCCATTT TAGGCCAGAT GACGCTTCTA GAGGGCAGCA 1980 

TTGCAATCAG TGGAACCTTC GCTTATGTGG CCCAGCAGGC CTGGATCCTC AATGCTACTC 2040 

TGAGAGACAA CATCCTGTTT GGGAAGGAAT ATGATGAAGA AAGATACAAC TCTGTGCTGA 2100 

ACAGCTGCTG CCTGAGGCCT GACCTGGCCA TTCTTCCCAG CAGCGACCTG ACGGAGATTG 2160 

GAGAGCGAGG AGCCAACCTG AGCGGTGGGC AGCGCCAGAG GATCAGCCTT GCCCGGGCCT 2220 

TGTATAGTGA CAGGAGCATC TACATCCTGG ACGACCCCCT CAGTGCCTTA GATGCCCATG 2280 

TGGGCAACCA CATCTTCAAT AGTGCTATCC GGAAACATCT CAAGTCCAAG ACAGTTCTGT 2340 

TTGTTACCCA CCAGTTACAG TACCTGGTTG ACTGTGATGA AGTGATCTTC ATGAAAGAGG 2400 

GCTGTATTAC GGAAAGAGGC ACCCATGAGG AACTGATGAA TTTAAATGGT GACTATGCTA 2460 

CCATTTTTAA TAACCTGTTG CTGGGAGAGA CACCGCCAGT TGAGATCAAT TCAAAAAAGG 2520 

AAACCAGTGG TTCACAGAAG AAGTCACAAG ACAAGGGTCC TAAAACAGGA TCAGTAAAGA 2580 

AGGAAAAAGC AGTAAAGCCA GAGGAAGGGC AGCTTGTGCA GCTGGAAGAG AAAGGGCAGG 2640 

GTTCAGTGCC CTGGTCAGTA TATGGTGTCT ACATCCAGGC TGCTGGGGGC CCCTTGGCAT 2700 

TCCTGGTTAT TATGGCCCTT TTCATGCTGA ATGTAGGCAG CACCGCCTTC AGCACCTGGT 2760 

GGTTGAGTTA CTGGATCAAG CAAGGAAGCG GGAACACCAC TGTGACTCGA GGGAACGAGA 2820 

CCTCGGTGAG TGACAGCATG AAGGACAATC CTCATATGCA GTACTATGCC AGCATCTACG 2880 

CCCTCTCCAT GGCAGTCATG CTGATCCTGA AAGCCATTCG AGGAGTTGTC TTTGTCAAGG 2940 

GCACGCTGCG AGCTTCCTCC CGGCTGCATG ACGAGCTTTT CCGAAGGATC CTTCGAAGCC 3000 

CTATGAAGTT TTTTGACACG ACCCCCACAG GGAGGATTCT CAACAGGTTT TCCAAAGACA 3060 

TGGATGAAGT TGACGTGCGG CTGCCGTTCC AGGCCGAGAT GTTCATCCAG AACGTTATCC 3120 

TGGTGTTCTT CTGTGTGGGA ATGATCGCAG GAGTCTTCCC GTGGTTCCTT GTGGCAGTGG 3180 

GGCCCCTTGT CATCCTCTTT TCAGTCCTGC ACATTGTCTC CAGGGTCCTG ATTCGGGAGC 3240 

TGAAGCGTCT GGACAATATC ACGCAGTCAC CTTTCCTCTC CCACATCACG TCCAGCATAC 3300 

AGGGCCTTGC CACCATCCAC GCCTACAATA AAGGGCAGGA GTTTCTGCAC AGATACCAGG 3360 

AGCTGCTGGA TGACAACCAA GCTCCTTTTT TTTTGTTTAC GTGTGCGATG CGGTGGCTGG 3420 

CTGTGCGGCT GGACCTCATC AGCATCGCCC TCATCACCAC CACGGGGCTG ATGATCGTTC 3480 

TTATGCACGG GCAGATTCCC CCAGCCTATG CGGGTCTCGC CATCTCTTAT GCTGTCCAGT 3540 

TAACGGGGCT GTTCCAGTTT ACGGTCAGAC TGGCATCTGA GACAGAAGCT CGATTCACCT 3600 

CGGTGGAGAG GATCAATCAC TACATTAAGA CTCTGTCCTT GGAAGCACCT GCCAGAATTA 3660 

AGAACAAGGC TCCCTCCCCT GACTGGCCCC AGGAGGGAGA GGTGACCTTT GAGAACGCAG 3720 

AGATGAGGTA CCGAGAAAAC CTCCCTCTTG TCCTAAAGAA AGTATCCTTC ACGATCAAAC 3780 

CTAAAGAGAA GATTGGCATT GTGGGGCGGA CAGGATCAGG GAAGTCCTCG CTGGGGATGG 3840 

CCCTCTTCCG TCTGGTGGAG TTATCTGGAG GCTGCATCAA GATTGATGGA GTGAGAATCA 3900 

GTGATATTGG CCTTGCCGAC CTCCGAAGCA AACTCTCTAT CATTCCTCAA GAGCCGGTGC 3960 

TGTTCAGTGG CACTGTCAGA TCAAATTTGG ACCCCTTCAA CCAGTACACT GAAGACCAGA 4020 

TTTGGGATGC CCTGGAGAGG ACACACATGA AAGAATGTAT TGCTCAGCTA CCTCTGAAAC 4080 

TTGAATCTGA AGTGATGGAG AATGGGGATA ACTTCTCAGT GGGGGAACGG CAGCTCTTGT 4140 

GCATAGCTAG AGCCCTGCTC CGCCACTGTA AGATTCTGAT TTTAGATGAA GCCACAGCTG 4200 

CCATGGACAC AGAGACAGAC TTATTGATTC AAGAGACCAT CCGAGAAGCA TTTGCAGACT 4260 

GTACCATGCT GACCATTGCC CATCGCCTGC ACACGGTTCT AGGCTCCGAT AGGATTATGG 4320 

TGCTGGCCCA GGGACAGGTG GTGGAGTTTG ACACCCCATC GGTCCTTCTG TCCAACGACA 4380 

GTTCCCGATT CTATGCCATG TTTGCTGCTG CAGAGAACAA GGTCGCTGTC AAGGGCTGAC 4440 

TCCTCCCTGT TGACGAAGTC TCTTTTCTTT AGAGCATTGC CATTCCCTGC CTGGGGCGGG 4500 

CCCCTCATCG CGTCCTCCTA CCGAAACCTT GCCTTTCTCG ATTTTATCTT TCGCACAGCA 4560 

GTTCCGGATT GGCTTGTGTG TTTCACTTTT AGGGAGAGTC ATATTTTGAT TATTGTATTT 4620 

ATTCCATATT CATGTAAACA AAATTTAGTT TTTGTTCTTA ATTGCACTCT AAAAGGTTCA 4680 

GGGAACCGTT ATTATAATTG TATCAGAGGC CTATAATGAA GCTTTATACG TGTAGCTATA 4740 

TCTATATATA ATTCTGTACA TAGCCTATAT TTACAGTGAA AATGTAAGCT GTTTATTTTA 4800 

TATTAAAATA AGCACTGTGC TAATAACAGT GCATATTCCT TTCTATCATT TTTGTACAGT 4860 

TTGCTGTACT AGAGATCTGG TTTTGCTATT AGACTGTAGG AAGAGTAGCA TTTCATTCTT 4920 

CTCTAGCTGG TGGTTTCACG GTGCCAGGTT TTCTGGGTGT CCAAAGGAAG ACGTGTGGCA 4980 

ATAGTGGGCC CTCCGACAGC CCCCTCTGCC GCCTCCCCAC AGCCGCTCCA GGGGTGGCTG 5040 

GAGACGGGTG GGCGGCTGGA GACCATGCAG AGCGCCGTGA GTTCTCAGGG CTCCTGCCTT 5100 

CTGTCCTGGT GTCACTTACT GTTTCTGTCA GGAGAGCAGC GGGGCGAAGC CCAGGCCCCT 5160 

TTTCACTCCC TCCATCAAGA ATGGGGATCA CAGAGACATT CCTCCGAGCC GGGGAGTTTC 5220 

TTTCCTGCCT TCTTCTTTTT GCTGTTGTTT CTAAACAAGA ATCAGTCTAT CCACAGAGAG 5280 

TCCCACTGCC TCAGGTTCCT ATGGCTGGCC ACTGCACAGA GCTCTCCAGC TCCAAGACCT 5340 

GTTGGTTCCA AGCCCTGGAG CCAACTGCTG CTTTTTGAGG TGGCACTTTT TCATTTGCCT 5400 

ATTCCCACAC CTCCACAGTT CAGTGGCAGG GCTCAGGATT TCGTGGGTCT GTTTTCCTTT 5460 

CTCACCGCAG TCGTCGCACA GTCTCTCTCT CTCTCTCCCC TCAAAGTCTG CAACTTTAAG 5520 

CAGCTCTTGC TAATCAGTGT CTCACACTGG CGTAGAAGTT TTTGTACTGT AAAGAGACCT 5580 

ACCTCAGGTT GCTGGTTGCT GTGTGGTTTG GTGTGTTCCC GCAAACCCCC TTTGTGCTGT 5640 

GGGGCTGGTA GCTCAGGTGG GCGTGGTCAC TGCTGTCATC AGTTGAATGG TCAGCGTTGC 5700 

ATGTCGTGAC CAACTAGACA TTCTGTCGCC TTAGCATGTT TGCTGAACAC CTTGTGGAAG 5760 

CAAAAATCTG AAAATGTGAA TAAAATTATT TTGGATTTTG TAAAAAAAAA AAAAAAAAAA 5820 
AAAAAAAAAA AAAAAAAA 

Seq ID NO: 194 Protein sequence: 
Protein Accession #: NP_005679.1 



1 11 21 31 41 51 

| | | | | | 

MKDIDIGKEY IIPSPGYRSV RERTSTSGTH RDREDSKFRR TRPLECQDAL ETAARAEGLS 60 

LDASMHSQLR ILDEEHPKGK YHHGLSALKP IRTTSKHQHP VDNAGLFSCM TFSWLSSLAR 120 

VAHKKGELSM EDVWSLSKHE SSDVNCRRLE RLWQEELNEV GPDAASLRRV VWIFCRTRLI 180 

LSIVCLMITQ LAGFSGPAFM VKHLLEYTQA TESNLQYSLL LVLGLLLTEI VRSWSLALTW 240 

ALNYRTGVRL RGAILTMAFK KILKLKNIKE KSLGELINIC SNDGQRMFEA AAVGSLLAGG 300 

PVVAILGMIY NVIILGPTGF LGSAVFILFY PAMMFASRLT AYFRRKCVAA TDERVQKMNE 360 

VLTYIKFIKM YAWVKAFSQS VQKIREEERR ILEKAGYFQG ITVGVAPIVV VIASVVTFSV 420 

HMTLGFDLTA AQAFTVVTVF NSMTFALKVT PFSVKSLSEA SVAVDRFKSL FLMEEVHMIK 480 

NKPASPHIKI EMKNATLAWD SSHSSIQNSP KLTPKMKKDK RASRGKKEKV RQLQRTEHQA 540 

VLAEQKGHLL LDSDERPSPE EEEGKHIHLG HLRLQRTLHS IDLEIQEGKL VGICGSVGSG 600 

KTSLISAILG QMTLLEGSIA ISGTFAYVAQ QAWILNATLR DNILFGKEYD EERYNSVLNS 660 

CCLRPDLAIL PSSDLTEIGE RGANLSGGQR QRISLARALY SDRSIYILDD PLSALDAHVG 720 

NHIFNSAIRK HLKSKTVLFV THQLQYLVDC DEVIFMKEGC ITERGTHEEL MNLNGDYATI 780 

FNNLLLGETP PVEINSKKET SGSQKKSQDK GPKTGSVKKE KAVKPEEGQL VQLEEKGQGS 840 

VPWSVYGVYI QAAGGPLAFL VIMALFMLNV GSTAFSTWWL SYWIKQGSGN TTVTRGNETS 900 

VSDSMKDNPH MQYYASIYAL SMAVMLILKA IRGVVFVKGT LRASSRLHDE LFRRILRSPM 960 

KFFDTTPTGR ILNRFSKDMD EVDVRLPFQA EMFIQNVILV FFCVGMIAGV FPWFLVAVGP 1020 

LVILFSVLHI VSRVLIRELK RLDNITQSPF LSHITSSIQG LATIHAYNKG QEFLHRYQEL 1080 

LDDNQAPFFL FTCAMRWLAV RLDLISIALI TTTGLMIVLM HGQIPPAYAG LAISYAVQLT 1140 

GLFQFTVRLA SETEARFTSV ERINHYIKTL SLEAPARIKN KAPSPDWPQE GEVTFENAEM 1200 

RYRENLPLVL KKVSFTIKPK EKIGIVGRTG SGKSSLGMAL FRLVELSGGC IKIDGVRISD 1260 

IGLADLRSKL SIIPQEPVLF SGTVRSNLDP FNQYTEDQIW DALERTHMKE CIAQLPLKLE 1320 

SEVMENGDNF SVGERQLLCI ARALLRHCKI LILDEATAAM DTETDLLIQE TIREAFADCT 1380 
MLTIAHRLHT VLGSDRIMVL AQGQVVEFDT PSVLLSNDSS RFYAMFAAAE NKVAVKG 

Seq ID NO: 195 DNA sequence 
Nucleic Acid Accession #: NM_006470 
Coding sequence: 228..1922 



1 11 21 31 41 51 

| | | | | | 

GCTGTCCTGA GCCTGAGTAC TCTAGCTGCC TTGTCGCCAT CGCATCTGGC TGCCATCCAG 60 

CGCCAGCACA CAGTAATGAG TGGCCGAGCT TCCTCTGGGA GGGAGGAAAC AGTTAAAATC 120 

TTGCAGCAGC TGCAATCATC TAGGCGTGGT TCTCTTGTCT GACTTGGGCT GCACAGATCC 180 

TGGGCCAAGG GACAGAAGAA AGACAGCCTA GGAGCAGAGC CTCCCAGATG GCTGAGTTGG 240 

ATCTAATGGC TCCAGGGCCA CTGCCCAGGG CCACTGCTCA GCCCCCAGCC CCTCTCAGCC 300 

CAGACTCTGG GTCACCCAGC CCAGATTCTG GGTCAGCCAG CCCAGTGGAA GAAGAGGACG 360 

TGGGCTCCTC GGAGAAGCTT GGCAGGGAGA CGGAGGAACA GGACAGCGAC TCTGCAGAGC 420 

AGGGGGATCC TGCTGGTGAG GGGAAAGAGG TCCTGTGTGA CTTCTGCCTT GATGACACCA 480 

GAAGAGTGAA GGCAGTGAAG TCCTGTCTAA CCTGCATGGT GAATTACTGT GAAGAGCACT 540 

TGCAGCCGCA TCAGGTGAAC ATCAAACTGC AAAGCCACCT GCTGACCGAG CCAGTGAAGG 600 

ACCACAACTG GCGATACTGC CCTGCCCACC ACAGCCCACT GTCTGCTTTC TGCTGCCCTG 660 

ATCAGCAGTG CATCTGCCAG GACTGTTGCC AGGAGCACAG TGGCCACACC ATAGTCTCCC 720 

TGGATGCAGC CCGCAGGGAC AAGGAGGCTG AACTCCAGTG CACCCAGTTA GACTTGGAGC 780 

GGAAACTCAA GTTGAATGAA AATGCCATCT CCAGGCTCCA GGCTAACCAA AAGTCTGTTC 840 

TGGTGTCGGT GTCAGAGGTC AAAGCGGTGG CTGAAATGCA GTTTGGGGAA CTCCTTGCTG 900 

CTGTGAGGAA GGCCCAGGCC AATGTGATGC TCTTCTTAGA GGAGAAGGAG CAAGCTGCGC 960 

TGAGCCAGGC CAACGGTATC AAGGCCCACC TGGAGTACAG GAGTGCCGAG ATGGAGAAGA 1020 

GCAAGCAGGA GCTGGAGAGG ATGGCGGCCA TCAGCAACAC TGTCCAGTTC TTGGAGGAGT 1080 

ACTGCAAGTT TAAGAACACT GAAGACATCA CCTTCCCTAG TGTTTACGTA GGGCTGAAGG 1140 

ATAAACTCTC GGGCATCCGC AAAGTTATCA CGGAATCCAC TGTACACTTA ATCCAGTTGC 1200 

TGGAGAACTA TAAGAAAAAG CTCCAGGAGT TTTCCAAGGA AGAGGAGTAT GACATCAGAA 1260 

CTCAAGTGTC TGCCGTTGTT CAGCGCAAAT ATTGGACTTC CAAACCTGAG CCCAGCACCA 1320 

GGGAACAGTT CCTCCAATAT GCGTATGACA TCACGTTTGA CCCGGACACA GCACACAAGT 1380 

ATCTCCGGCT GCAGGAGGAG AACCGCAAGG TCACCAACAC CACGCCCTGG GAGCATCCCT 1440 

ACCCGGACCT CCCCAGCAGG TTCCTGCACT GGCGGCAGGT GCTGTCCCAG CAGAGTCTGT 1500 

ACCTGCACAG GTACTATTTT GAGGTGGAGA TCTTCGGGGC AGGCACCTAT GTTGGCCTGA 1560 

CCTGCAAAGG CATCGACCGG AAAGGGGAGG AGCGCAACAG TTGCATTTCC GGAAACAACT 1620 

TCTCCTGGAG CCTCCAATGG AACGGGAAGG AGTTCACGGC CTGGTACAGT GACATGGAGA 1680 

CCCCACTCAA AGCTGGCCCT TTCCGGAGGC TCGGGGTCTA TATCGACTTC CCGGGAGGGA 1740 

TCCTTTCCTT CTATGGCGTA GAGTATGATA CCATGACTCT GGTTCACAAG TTTGCCTGCA 1800 

AATTTTCAGA ACCAGTCTAT GCTGCCTTCT GGCTTTCCAA GAAGGAAAAC GCCATCCGGA 1860 

TTGTAGATCT GGGAGAGGAA CCCGAGAAGC CAGCACCGTC CTTGGGGGTG ACTGCTCCCT 1920 

AGACTCCAGG AGCCATATCC CAGACCTTTG CCAGCTACAG TGATGGGATT TGCATTTTAG 1980 

GGTGATTTGT GGGCAGAAAT AACTGCTGAT GGTAGCTGGC TTTTGAAATC CTATGGGGTC 2040 

TCTGAATGAA AACATTCTCC AGCTGCTCTC TTTTGCTCCA TATGGTGCTG TTCTCTATGT 2100 

GTTTGCAGTA ATTCTTTTTT TTTTTTTTGA GACGGAGTCT CGCACTGTTG CCCAGGCTGG 2160 

AGAGCAGTGG CGCGATCTTG GCTCACTGCA AGCTCCGCCT CCCGAGTTCA AGCAATTCTC 2220 

CTGCCTCAGC CTCCCGAGTA GCTGGGATTA CAGGTGCCTG CCACCACACC CAGCTAATGT 2280 

TTTGTATTTT TAGTAGAGAT GGGGTTTCAC CATGTTGGCC AGGCAGATCT CAAACTCCTG 2340 
ACCTCGTGAT GCACCCACCT CGGCCTCCCA AAGTGCTGGG ATTACATGCG TGAGCCACTG 2400 
CGCCCTGCCT GTTTGTAGTA ATTTTTAGGC ACCAAATCTC CCTCATCTTC TAGTGCCATT 2460 
CTCCTCTCTG TTCAGGTAAA TGTCACACTG TGCCCAGAAT GGATGACCAG GAACCTTAAA 2520 
GAGTGGCTGA AAAGATTGCA GAGTTATCAT AATAAATTGC TAACTTGCGT 



Seq ID NO: 196 Protein sequence: 
Protein Accession #: NP_006461 

1 11 21 31 41 51 

| | | | | | 

MAELDLMAPG PLPRATAQPP APLSPDSGSP SPDSGSASPV EEEDVGSSEK LGRETEEQDS 60 
DSAEQGDPAG EGKEVLCDFC LDDTRRVKAV KSCLTCMVNY CEEHLQPHQV NIKLQSHLLT 120 
EPVKDHNWRY CPAHHSPLSA FCCPDQQCIC QDCCQEHSGH TIVSLDAARR DKEAELQCTQ 180 
LDLERKLKLN ENAISRLQAN QKSVLVSVSE VKAVAEMQFG ELLAAVRKAQ ANVMLFLEEK 240 
EQAALSQANG IKAHLEYRSA EMEKSKQELE RMAAISNTVQ FLEEYCKFKN TEDITFPSVY 300 
VGLKDKLSGI RKVITESTVH LIQLLENYKK KLQEFSKEEE YDIRTQVSAV VQRKYWTSKP 360 
EPSTREQFLQ YAYDITFDPD TAHKYLRLQE ENRKVTNTTP WEHPYPDLPS RFLHWRQVLS 420 
QQSLYLHRYY FEVEIFGAGT YVGLTCKGID RKGEERNSCI SGNNFSWSLQ WNGKEFTAWY 480 
SDMETPLKAG PFRRLGVYID FPGGILSFYG VEYDTMTLVH KFACKFSEPV YAAFWLSKKE 540 
NAIRIVDLGE EPEKPAPSLG VTAP 

Seq ID NO: 197 DNA sequence 
Nucleic Acid Accession #: NM_004316 
Coding sequence: 433-1149 

1 11 21 31 41 51 

| | | | | | 

CCCGAGACCC GGCGCAAGAG AGCGCAGCCT TAGTAGGAGA GGAACGCGAG ACGCGGCAGA 60 
GCGCGTTCAG CACTGACTTT TGCTGCTGCT TCTGCTTTTT TTTTTCTTAG AAACAAGAAG 120 
GCGCCAGCGG CAGCCTCACA CGCGAGCGCC ACGCGAGGCT CCCGAAGCCA ACCCGCGAAG 180 
GGAGGAGGGG AGGGAGGAGG AGGCGGCGTG CAGGGAGGAG AAAAAGCATT TTCACCTTTT 240 
TTGCTCCCAC TCTAAGAAGT CTCCCGGGGA TTTTGTATAT ATTTTTTAAC TTCCGTCAGG 300 
GCTCCCGCTT CATATTTCCT TTTCTTTCCC TCTCTGTTCC TGCACCCAAG TTCTCTCTGT 360 
GTCCCCCTCG CGGGCCCCGC ACCTCGCGTC CCGGATCGCT CTGATTCCGC GACTCCTTGG 420 
CCGCCGCTGC GCATGGAAAG CTCTGCCAAG ATGGAGAGCG GCGGCGCCGG CCAGCAGCCC 480 
CAGCCGCAGC CCCAGCAGCC CTTCCTGCCG CCCGCAGCCT GTTTCTTTGC CACGGCCGCA 540 
GCCGCGGCGG CCGCAGCCGC CGCAGCGGCA GCGCAGAGCG CGCAGCAGCA GCAGCAGCAG 600 
CAGCAGCAGC AGCAGCAGCA GCAGGCGCCG CAGCTGAGAC CGGCGGCCGA CGGCCAGCCC 660 
TCAGGGGGCG GTCACAAGTC AGCGCCCAAG CAAGTCAAGC GACAGCGCTC GTCTTCGCCC 720 
GAACTGATGC GCTGCAAACG CCGGCTCAAC TTCAGCGGCT TTGGCTACAG CCTGCCGCAG 780 
CAGCAGCCGG CCGCCGTGGC GCGCCGCAAC GAGCGCGAGC GCAACCGCGT CAAGTTGGTC 840 
AACCTGGGCT TTGCCACCCT TCGGGAGCAC GTCCCCAACG GCGCGGCCAA CAAGAAGATG 900 
AGTAAGGTGG AGACACTGCG CTCGGCGGTC GAGTACATCC GCGCGCTGCA GCAGCTGCTG 960 
GACGAGCATG ACGCGGTGAG CGCCGCCTTC CAGGCAGGCG TCCTGTCGCC CACCATCTCC 1020 
CCCAACTACT CCAACGACTT GAACTCCATG GCCGGCTCGC CGGTCTCATC CTACTCGTCG 1080 
GACGAGGGCT CTTACGACCC GCTCAGCCCC GAGGAGCAGG AGCTTCTCGA CTTCACCAAC 1140 
TGGTTCTGAG GGGCTCGGCC TGGTCAGGCC CTGGTGCGAA TGGACTTTGG AAGCAGGGTG 1200 
ATCGCACAAC CTGCATCTTT AGTGCTTTCT TGTCAGTGGC GTTGGGAGGG GGAGAAAAGG 1260 
AAAAGAAAAA AAAAGAAGAA GAAGAAGAAA AGAGAAGAAG AAAAAAACGA AAACAGTCAA 1320 
CCAACCCCAT CGCCAACTAA GCGAGGCATG CCTGAGAGAC ATGGCTTTCA GAAAACGGGA 1380 
AGCGCTCAGA ACAGTATCTT TGCACTCCAA TCATTCACGG AGATATGAAG AGCAACTGGG 1440 
ACCTGAGTCA ATGCGCAAAA TGCAGCTTGT GTGCAAAAGC AGTGGGCTCC TGGCAGAAGG 1500 
GAGCAGCACA CGCGTTATAG TAACTCCCAT CACCTCTAAC ACGCACAGCT GAAAGTTCTT 1560 
GCTCGGGTCC CTTCACCTCC CCGCCCTTTC TTAGAGTGCA GTTCTTAGCC CTCTAGAAAC 1620 
GAGTTGGTGT CTTTC 

Seq ID NO: 198 Protein sequence: 
Protein Accession #: NP_004307 

1 11 21 31 41 51 

| | | | | | 

MESSAKMESG GAGQQPQPQP QQPFLPPAAC FFATAAAAAA AAAAAAAQSA QQQQQQQQQQ 60 
QQQQAPQLRP AADGQPSGGG HKSAPKQVKR QRSSSPELMR CKRRLNFSGF GYSLPQQQPA 120 
AVARRNERER NRVKLVNLGF ATLREHVPNG AANKKMSKVE TLRSAVEYIR ALQQLLDEHD 180 
AVSAAFQAGV LSPTISPNYS NDLNSMAGSP VSSYSSDEGS YDPLSPEEQE LLDFTNWF 

Seq ID NO: 199 DNA sequence 
Nucleic Acid Accession #: NM_007015 
Coding sequence: 1-1005 

1 11 21 31 41 51 

I I I I I I 

ATGACAGAGA ACTCCGACAA AGTTCCCATT GCCCTGGTGG GACCTGATGA CGTGGAATTC 60 

TGCAGCCCCC CGGCGTACGC TACGCTGACG GTGAAGCCCT CCAGCCCCGC GCGGCTGCTC 120 

AAGGTGGGAG CCGTGGTCCT CATTTCGGGA GCTGTGCTGC TGCTCTTTGG GGCCATCGGG 180 

GCCTTCTACT TCTGGAAGGG GAGCGACAGT CACATTTACA ATGTCCATTA CACCATGAGT 240 

ATCAATGGGA AACTACAAGA TGGGTCAATG GAAATAGACG CTGGGAACAA CTTGGAGACC 300 

TTTAAAATGG GAAGTGGAGC TGAAGAAGCA ATTGCAGTTA ATGATTTCCA GAATGGCATC 360 

ACAGGAATTC GTTTTGCTGG AGGAGAGAAG TGCTACATTA AAGCGCAAGT GAAGGCTCGT 420 

ATTCCTGAGG TGGGCGCCGT GACCAAACAG AGCATCTCCT CCAAACTGGA AGGCAAGATC 480 

ATGCCAGTCA AATATGAAGA AAATTCTCTT ATCTGGGTGG CTGTAGATCA GCCTGTGAAG 540 

GACAACAGCT TCTTGAGTTC TAAGGTGTTA GAACTCTGCG GTGACCTTCC TATTTTCTGG 600 

CTTAAACCAA CCTATCCAAA AGAAATCCAG AGGGAAAGAA GAGAAGTGGT AAGAAAAATT 660 

GTTCCAACTA CCACAAAAAG ACCACACAGT GGACCACGGA GCAACCCAGG CGCTGGAAGA 720 

CTGAATAATG AAACCAGACC CAGTGTTCAA GAGGACTCAC AAGCCTTCAA TCCTGATAAT 780 
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CCTTATCATC AGCAGGAAGG GGAAAGCATG ACATTCGACC CTAGACTGGA TCACGAAGGA 840 

ATCTGTTGTA TAGAATGTAG GCGGAGCTAC ACCCACTGCC AGAAGATCTG TGAACCCCTG 900 

GGGGGCTATT ACCCATGGCC TTATAATTAT CAAGGCTGCC GTTCGGCCTG CAGAGTCATC 960 

ATGCCATGTA GCTGGTGGGT GGCCCGTATC TTGGGCATGG TGTGAAATCA CTTCATATAT 1020 

CACGTGCTGT AAAATAAGAA CTAGCTGAAG AGACAACCAA AGAAGCATTA AGGCAGGTTG 1080 

ATGCTGATGG GACCATAAAA TATTTTTACA CGCAGCCTGA GCGGTTATTC TTGACACTCT 1140 

TAACAGAATT TTTTTAATCG TTTTCCAGAA CTTTAGTATA TGCAAATGCA CTGAAAGGGT 1200 

AGTTCAAGTC TAAAATGCCA TAACCCCGTT ATTTGTTATT TTTTATTTGC ATTGATTTGC 1260 

CATAAGTCTT CCCTTGCTTG CATCTTCCAA AGCTATTTCG AAATAAACAC GAAAATTTAC 1320 
AGTTTGCC 



Seq ID NO: 200 Protein sequence: 
Protein Accession #: NP_008946

1 11 21 31 41 51

| | | | | |

MTENSDKVPI ALVGPDDVEF CSPPAYATLT VKPSSPARLL KVGAVVLISG AVLLLFGAIG 60

AFYFWKGSDS HIYNVHYTMS INGKLQDGSM EIDAGNNLET FKMGSGAEEA IAVNDFQNGI 120 

TGIRFAGGEK CYIKAQVKAR IPEVGAVTKQ SISSKLEGKI MPVKYEENSL IWVAVDQPVK 180 

DNSFLSSKVL ELCGDLPIFW LKPTYPKEIQ RERREVVRKI VPTTTKRPHS GPRSNPGAGR 240

LNNETRPSVQ EDSQAFNPDN PYHQQEGESM TFDPRLDHEG ICCIECRRSY THCQKICEPL 300 
GGYYPWPYNY QGCRSACRVI MPCSWWVARI LGMV 

Seq ID NO: 201 DNA sequence 

Nucleic Acid Accession #: NM_000728.2

Coding sequence: 112..495

1 11 21 31 41 51

| | | | | |

GTAATAAGAG CGGGGTCTCC GCGGGGAAGG CGCCCACAGC AGGTGTGGTG TTCATCCCGG 60 

GTCGACCGGC CGCTCGCGCT GCCCTGAAAC TCTAGTCGCC AGAGAGGCGG CATGGGTTTC 120 

CGGAAGTTCT CCCCCTTCCT GGCTCTCAGT ATCTTGGTCC TGTACCAGGC GGGCAGCCTC 180 

CAGGCGGCGC CATTCAGGTC TGCCCTGGAG AGCAGCCCAG ACCCGGCCAC ACTCAGTAAA 240 

GAGGACGCGC GCCTCCTGCT GGCTGCACTG GTGCAGGACT ATGTGCAGAT GAAGGCCAGT 300 

GAGCTGAAGC AGGAGCAGGA GACACAGGGC TCCAGCTCCG CTGCCCAGAA GAGAGCCTGC 360 

AACACTGCCA CCTGTGTGAC TCATCGGCTG GCAGGCTTGC TGAGCAGATC AGGGGGCATG 420 

GTGAAGAGCA ACTTCGTGCC CACCAATGTG GGTTCCAAAG CCTTTGGCAG GCGCCGCAGG 480 

GACCTTCAAG CCTGAGCAGA TGAATGACTC CAGGAAGAAG GTGTGTCCTA AATCCAATGA 540 

CATATCCTTA TAAGAGATTC ACTCAGAAGA CACATGTGGA GAAGGTGACA TGACAGAGGC 600 

AAGGAGGCAC AAGCCAAGGA AGTCTGTGTC TACCAGAAGC CAGAATCACA GAACAGTCTC 660 

TGGAAGAAGA GCAGCCCTGC TGACACCTAG AGTTTGGACT TCCAGCTTCC AGAACTGTGA 720 

GAGAATAATT TCTGTTGTTT TAAGCCACAA AGTTTGTGGT AATTTGTTAT GACAGCCCTA 780 

GGAAACTAAT ACAATACATT TTCATTTATT TTGGGTAAAT GCCTTGGAGT GGGATTGCTG 840 

GGTTATTTGG AAAGTGTGTA TTTAACTCTG TAAGAAACTG CCAAACTATT TTCTGAAGTG 900 

ACTGTACCAC TTCGCCTTCT TGCCAGCCAC ATATGAGAGC TCTAGTATTT CCACAAATAG 960 

GTATGTAGCA GTATCTCATT GCTGTTTTAA TTTGTATTTC CCCAATGACT AATGACGTTG 1020 

AGCATCTATT TTACCATATG TTTATCACCT TTATTGAAGG GTCTGTTTAA ATCTTCTGCT 1080 

AAATTTTTGT TGGCTTGCTT GCTTTATTAG TGTTGAGTTT TTAGAGCTCT TTATATGTTG 1140 

TGGATGCAAG ATTGTTTTCA GATATATAGT TTGGAAACTT CCTTCCCCTG AATCTGCGGA 1200 

TTGCTTTTTC ATTTTCTTAG CAGTGTCTCT CACAGAGAAA AAGTTGTAAT TTGAATAAGA 1260 

TCCAATTCAT CTTTTTTTTT CTTTTATGTA TTGTGCTTTT AGTTCATGTC TAAGAACTCT 1320 

TTGCCTAACT AAGGTCCCAA GGTCACAATA ACCTTATTCT ATACTTTCTT GTAAAAGTTT 1380 

TATAGTTTTA TATTTTATAT GTAGATTAGT GATCTATTTT GAGTTAATTT TTGTATAAGG 1440 

TGAGAGGTGT AGGTTGAAAT TCATACCTGT GAATATAGAT ACCCAATTGT TTCAGTGCCA 1500 

TTTGTTAAAA AGACTGTTAT TTCACCATTT AATTGCCCCT GCACCTTTGT CAAAAAGCAA 1560 

CTGATCATAT TTGTGTGGGT ATATTTCTGG GTTCTCAATT CTGTCTCATT GATTGATTTG 1620 

ACCATTCTTT TGCCAATGTC ATACTGCCTT GATTAGTGTA GTGTTAAAGT GAATCTCAAA 1680 

ACCAGATAAT GTGGGTCTAC CAACATTGTT CATTCTTGTT CAAAAAGATT TTAGCTACAT 1740 

CTAAAATATT TTCTACATCT TTTATACATT TTAGAATCAG TGTGTTACTA TCTACAAAAT 1800 

TTCTGATGAG ATTTTTAATG GGATTGTGTT AAATCAGTGG GTTAATTTTG GGAGAATTAG 1860 

CATATTAATA ATATTAAGTC GTTCAATTCA TGAACACAAT ACATGTTTTC ACTTATTTAG 1920 

GTTTTCTCTG TTTTTTTTTT TTTAACAGTG TTCTCAGTTT TCAACAGAAA TATTCTACAC 1980 

ATATCTTGTT AGATTTTTAA CTATTTTATT TTTTGGTGCT AATGTAAATG GTACTTAAAC 2040 

ATTTTTGTTT TTAATTGTTC ATTGCTAGTA GATAGAAATA CAATATTTAA AATATTAGGA 2100 
AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA

Seq ID NO: 202 Protein sequence: 
Protein Accession #: NP_000719.1

1 11 21 31 41 51

| | | | | |

MGFRKFSPFL ALSILVLYQA GSLQAAPFRS ALESSPDPAT LSKEDARLLL AALVQDYVQM 60
KASELKQEQE TQGSSSAAQK RACNTATCVT HRLAGLLSRS GGMVKSNFVP TNVGSKAFGR 120 
RRRDLQA 

Seq ID NO: 203 DNA sequence 
Nucleic Acid Accession #: NM_001741
Coding sequence: 71..496

1 11 21 31 41 51 

CTCTGGCTGG ACGCCGCCGC CGCCGCTGCC ACCGCCTCTG ATCCAAGCCA CCTCCCGCCA 60 

GAGAGGTGTC ATGGGCTTCC AAAAGTTCTC CCCCTTCCTG GCTCTCAGCA TCTTGGTCCT 120 

GTTGCAGGCA GGCAGCCTCC ATGCAGCACC ATTCAGGTCT GCCCTGGAGA GCAGCCCAGC 180 

AGACCCGGCC ACGCTCAGTG AGGACGAAGC GCGCCTCCTG CTGGCTGCAC TGGTGCAGGA 240 

CTATGTGCAG ATGAAGGCCA GTGAGCTGGA GCAGGAGCAA GAGAGAGAGG GCTCCAGCCT 300

WO 02/086443

GGACAGCCCC AGATCTAAGC GGTGCGGTAA TCTGAGTACT TGCATGCTGG GCACATACAC 360

GCAGGACTTC AACAAGTTTC ACACGTTCCC CCAAACTGCA ATTGGGGTTG GAGCACCTGG 420

AAAGAAAAGG GATATGTCCA GCGACTTGGA GAGAGACCAT CGCCCTCATG TTAGCATGCC 480

CCAGAATGCC AACTAAACTC CTCCCTTTCC TTCCTAATTT CCCTTCTTGC ATCCTTCCTA 540

TAACTTGATG CATGTGGTTT GGTTCCTCTC TGGTGGCTCT TTGGGCTGGT ATTGGTGGCT 600

TTCCTTGTGG CAGAGGATGT CTCAAACTTC AGATGGGAGG AAAGAGAGCA GGACTCACAG 660

GTTGGAAGAG AATCACCTGG GAAAATACCA GAAAATGAGG GCCGCTTTGA GTCCCCCAGA 720

GATGTCATCA GAGCTCCTCT GTCCTGCTTC TGAATGTGCT GATCATTTGA GGAATAAAAT 780

TATTTTTCCC C



Seq ID NO: 204 Protein sequence: 
Protein Accession #: NP_001732 

1 11 21 31 41 51 

| | | | | |

MGFQKFSPFL ALSILVLLQA GSLHAAPFRS ALESSPADPA TLSEDEARLL LAALVQDYVQ 60
MKASELEQEQ EREGSSLDSP RSKRCGNLST CMLGTYTQDF NKFHTFPQTA IGVGAPGKKR 120
DMSSDLERDH RPHVSMPQNA N

Seq ID NO: 205 DNA sequence 
Nucleic Acid Accession #: NM_005361
Coding sequence: 1-945 



PCT/US02/12476 



1 11 21 31 41 51

| | | | | |

ATGCCTCTTG AGCAGAGGAG TCAGCACTGC AAGCCTGAAG AAGGCCTTGA GGCCCGAGGA 60
GAGGCCCTGG GCCTGGTGGG TGCGCAGGCT CCTGCTACTG AGGAGCAGCA GACCGCTTCT 120
TCCTCTTCTA CTCTAGTGGA AGTTACCCTG GGGGAGGTGC CTGCTGCCGA CTCACCGAGT 180
CCTCCCCACA GTCCTCAGGG AGCCTCCAGC TTCTCGACTA CCATCAACTA CACTCTTTGG 240
AGACAATCCG ATGAGGGCTC CAGCAACCAA GAAGAGGAGG GGCCAAGAAT GTTTCCCGAC 300
CTGGAGTCCG AGTTCCAAGC AGCAATCAGT AGGAAGATGG TTGAGTTGGT TCATTTTCTG 360
CTCCTCAAGT ATCGAGCCAG GGAGCCGGTC ACAAAGGCAG AAATGCTGGA GAGTGTCCTC 420
AGAAATTGCC AGGACTTCTT TCCCGTGATC TTCAGCAAAG CCTCCGAGTA CTTGCAGCTG 480
GTCTTTGGCA TCGAGGTGGT GGAAGTGGTC CCCATCAGCC ACTTGTACAT CCTTGTCACC 540
TGCCTGGGCC TCTCCTACGA TGGCCTGCTG GGCGACAATC AGGTCATGCC CAAGACAGGC 600
CTCCTGATAA TCGTCCTGGC CATAATCGCA ATAGAGGGCG ACTGTGCCCC TGAGGAGAAA 660
ATCTGGGAGG AGCTGAGTAT GTTGGAGGTG TTTGAGGGGA GGGAGGACAG TGTCTTCGCA 720
CATCCCAGGA AGCTGCTCAT GCAAGATCTG GTGCAGGAAA ACTACCTGGA GTACCGGCAG 780
GTGCCCGGCA GTGATCCTGC ATGCTACGAG TTCCTGTGGG GTCCAAGGGC CCTCATTGAA 840
ACCAGCTATG TGAAAGTCCT GCACCATACA CTAAAGATCG GTGGAGAACC TCACATTTCC 900
TACCCACCCC TGCATGAACG GGCTTTGAGA GAGGGAGAAG AGTGA






Seq ID NO: 206 Protein sequence: 
Protein Accession #: NP_005352 



1 11 21 31 41 51

| | | | | |

MPLEQRSQHC KPEEGLEARG EALGLVGAQA PATEEQQTAS SSSTLVEVTL GEVPAADSPS 60
PPHSPQGASS FSTTINYTLW RQSDEGSSNQ EEEGPRMFPD LESEFQAAIS RKMVELVHFL 120
LLKYRAREPV TKAEMLESVL RNCQDFFPVI FSKASEYLQL VFGIEVVEVV PISHLYILVT 180
CLGLSYDGLL GDNQVMPKTG LLIIVLAIIA IEGDCAPEEK IWEELSMLEV FEGREDSVFA 240
HPRKLLMQDL VQENYLEYRQ VPGSDPACYE FLWGPRALIE TSYVKVLHHT LKIGGEPHIS 300
YPPLHERALR EGEE



Seq ID NO: 207 DNA sequence 
Nucleic Acid Accession #: NM_021115 
Coding sequence: 743-2893 



1 11 21 31 41 51

| | | | | |

AAAGGAAGGG AGGGAGGGAG AAAGGAGAAG TTGGTTTAGA GGCCAGCCGG ACGAGCTTTG 60
GGCACCGCCC TTAGGAGGGC CACCCTCAGA GTCTGACAGC AGGTGAAGGT CCTAAATCTC 120
CCCAAACTAA CTGGTGTCTT TTCTCCTCTT CCAAGATGCT CTTCCCGAGG GAGATGCTAG 180
CCCTTTGGGT CCTTACCTCC TGCCCTCAGG AGCCCCGGAG AGAGGCAGTC CTGGCAAAGA 240
GCACCCTGAA GAGAGAGTGG TAACAGCGCC CCCCAGTTCC TCACAGTCGG CGGAAGTGCT 300
GGGCGkGCTG GTGCTGGATG GGACCGCACC CTCTGCACAT CACGACATCC CAGCCCTGTC 360
ACCGCTGCTT CCAGAGGAGG CCCGCCCCAA GCACGCCTTG CCCCCCAAGA AGAAACTGCC 420
TTCGCTCAAG CAGGTGAACT CTGCCAGGAA GCAGCTGAGG CCCAAGGCCA CCTCCGCAGC 480
CACTGTCCAA AGGGCAGGGT CCCAGCCAGC GTCCCAGGGC CTAGATCTCC TCTCCTCCTC 540
CACGGAGAAG CCTGGCCCAC CGGGGGACCC GGACCCCATC GTGGCCTCCG AGGAGGCATC 600
AGAAGTGCCC CTTTGGCTGG ACCGAAAGGA GAGTGCGGTC CCTACAACAC CCGCACCCCT 660
GCAAATCTCC CCCTTCACTT CGCAGCCCTA TGTGGCCCAC ACACTCCCCC AGAGGCCAGA 720
ACCCGGGGAG CCTGGGCCTG ACATGGCCCA GGAGGCCCCC CAGGAGGACA CCAGCCCCAT 780
GGCCCTGATG GACAAAGGTG AGAATGAGCT GACTGGGTCA GCCTCAGAGG AGAGCCAGGA 840
GACCACTACC TCCACCATTA TCACCACCAC GGTCATCACC ACCGAGCAGG CACCAGCTCT 900
CTGCAGTGTG AGCTTCTCCA ATCCTGAGGG GTACATTGAC TCCAGCGACT ACCCACTGCT 960
GCCCCTCAAC AACTTTCTGG AGTGCACATA CAACGTGACA GTCTACACTG GCTATGGGGT 1020
GGAGCTCCAG GTGAAGAGTG TGAACCTGTC CGATGGGGAA CTGCTCTCCA TCCGCGGGGT 1080
GGACGGCCCT ACCCTGACCG TCCTGGCCAA CCAGACACTC CTGGTGGAGG GGCAGGTAAT 1140
CCGAAGCCCC ACCAACACCA TCTCCGTCTA CTTCCGGACC TTCCAGGACG ACGGCCTTGG 1200
GACCTTCCAG CTTCACTACC AGGCCTTCAT GCTGAGCTGC AACTTTCCCC GCCGGCCTGA 1260
CTCTGGGGAT GTCACGGTGA TGGACCTGCA CTCAGGTGGG GTGGCCCACT TTCACTGCCA 1320
CCTGGGCTAT GAGCTCCAGG GCGCTAAGAT GCTGACATGC ATCAATGCCT CCAAGCCGCA 1380
CTGGAGCAGC CAGGAGCCCA TCTGCTCAGC TCCTTGTGGA GGGGCAGTGC ACAATGCCAC 1440
CATCGGCCGC GTCCTCTCCC CAAGTTACCC TGAAAACACA AATGGGAGCC AATTCTGCAT 1500
CTGGACGATT GAAGCTCCAG AGGGCCAGAA GCTGCACCTG CACTTTGAGA GGCTGTTGCT 1560
GCATGACAAG GACAGGATGA CGGTTCACAG CGGGCAGACC AACAAGTCAG CTCTTCTCTA 1620
CGACTCCCTT CAAACCGAGA GTGTCCCTTT TGAGGGCCTG CTGAGCGAAG GCAACACCAT 1680
CCGCATCGAG TTCACGTCCG ACCAGGCCCG GGCGGCCTCC ACCTTCAACA TCCGATTTGA 1740
AGCGTTTGAG AAAGGCCACT GCTATGAGCC CTACATCCAG AATGGGAACT TCACTACATC 1800
CGACCCGACC TATAACATTG GGACTATAGT GGAGTTCACC TGCGACCCCG GCCACTCCCT 1860
GGAGCAGGGC CCGGCCATCA TCGAATGCAT CAATGTGCGG GACCCATACT GGAATGACAC 1920
AGAGCCCCTG TGCAGAGCCA NNNNNNNNNN GGAGCTCTCT GCTGTGGCTG GGGTGGTATT 1980
GTCCCCAAAC TGGCCCGAGC CCTACGTGGA AGGTGAAGAT TGTATCTGGA AGATCCACGT 2040
GGGAGAAGAG AAACGGATCT TCTTAGATAT CCAGTTCCTG AATCTGAGCA ACAGTGACAT 2100
CTTGACCATC TACGATGGCG ACGAGGTCAT GCCCCACATC TTGGGGCAGT ACCTTGGGAA 2160
CAGTGGCCCC CAGAAACTGT ACTCCTCCAC GCCAGACTTA ACCATCCAGT TCCATTCGGA 2220
CCCTGCTGGC CTCATCTTTG GAAAGGGCCA GGGATTTATC ATGAACTACA TAGAGGTATC 2280
AAGGAATGAC TCCTGCTCGG ATTTACCCGA GATCCAGAAT GGCTGGAAAA CCACTTCTCA 2340
CACGGAGTTG GTGCGGGGAG CCAGAATCAC CTACCAGTGT GACCCCGGCT ATGACATCGT 2400
GGGGAGTGAC ACCCTCACCT GCCAGTGGGA CCTCAGCTGG AGCAGCGACC CCCCATTTTG 2460
TGAGAAAATT ATGTACTGCA CCGACCCCGG AGAGGTGGAT CACTCGACCC GCTTAATTTC 2520
GGATCCTGTG CTGCTGGTGG GGACCACCAT CCAATACACC TGCAACCCCG GTTTTGTGCT 2580
TGAAGGGAGT TCTCTTCTGA CCTGCTACAG CCGTGAAACA GGGACTCCCA TCTGGACGTC 2640
TCGCCTGCCC CACTGCGTTT CAGAAGCGGC AGCAGAGACG TCGCTGGAAG GGGGGAACAT 2700
GGCCCTGGCT ATCTTCATCC CGGTCCTCAT CATCTCCTTA CTGCTGGGAG GAGCCTACAT 2760
TTACATCACA AGATGTCGCT ACTATTCCAA CCTCCGCCTG CCTCTGATGT ACTCCCACCC 2820
CTACAGCCAG ATCACCGTGG AAACCGAGTT TGACAACCCC ATTTACGAGA CAGGGGGAAC 2880
CCAAAAGGTT TAGGGTTTCA TTTAAAAAGA GGTACCCTTT AAAAAGGGGC TTGTGAACTC 2940
AACCCCAATT TCCCCGAGAC ATTTATCCAA AGGCCCTGGG GGCCTTGATT TAAACCCCCA 3000
AAAGGCGGCT GTTTTTTGGT TAAACTTTTT AACAAAGGGT TACGGGTTTT TTCCCCGGAT 3060
TTTATAAATT TTAAAAGTG



Seq ID NO: 208 Protein sequence: 
Protein Accession #: NP_066938



1 11 21 31 41 51

| | | | | |

MAQEAPQEDT SPMALMDKGE NELTGSASEE SQETTTSTII TTTVITTEQA PALCSVSFSN 60
PEGYIDSSDY PLLPLNNFLE CTYNVTVYTG YGVELQVKSV NLSDGELLSI RGVDGPTLTV 120
LANQTLLVEG QVIRSPTNTI SVYFRTFQDD GLGTFQLHYQ AFMLSCNFPR RPDSGDVTVM 180
DLHSGGVAHF HCHLGYELQG AKMLTCINAS KPHWSSQEPI CSAPCGGAVH NATIGRVLSP 240
SYPENTNGSQ FCIWTIEAPE GQKLHLHFER LLLHDKDRMT VHSGQTNKSA LLYDSLQTES 300
VPFEGLLSEG NTIRIEFTSD QARAASTFNI RFEAFEKGHC YEPYIQNGNF TTSDPTYNIG 360
TIVEFTCDPG HSLEQGPAII ECINVRDPYW NDTEPLCRAM CGGELSAVAG VVLSPNWPEP 420
YVEGEDCIWK IHVGEEKRIF LDIQFLNLSN SDILTIYDGD EVMPHILGQY LGNSGPQKLY 480
SSTPDLTIQF HSDPAGLIFG KGQGFIMNYI EVSRNDSCSD LPEIQNGWKT TSHTELVRGA 540
RITYQCDPGY DIVGSDTLTC QWDLSWSSDP PFCEKIMYCT DPGEVDHSTR LISDPVLLVG 600
TTIQYTCNPG FVLEGSSLLT CYSRETGTPI WTSRLPHCVS EAAAETSLEG GNMALAIFIP 660
VLIISLLLGG AYIYITRCRY YSNLRLPLMY SHPYSQITVE TEFDNPIYET GGTQKV



Seq ID NO: 209 DNA sequence 

Nucleic Acid Accession #: NM_001327.1 

Coding sequence: 89-631 



1 11 21 31 41 51

| | | | | |

AGCAGGGGGC GCTGTGTGTA CCGAGAATAC GAGAATACCT CGTGGGCCCT GACCTTCTCT 60
CTGAGAGCCG GGCAGAGGCT CCGGAGCCAT GCAGGCCGAA GGCCGGGGCA CAGGGGGTTC 120
GACGGGCGAT GCTGATGGCC CAGGAGGCCC TGGCATTCCT GATGGCCCAG GGGGCAATGC 180
TGGCGGCCCA GGAGAGGCGG GTGCCACGGG CGGCAGAGGT CCCCGGGGCG CAGGGGCAGC 240
AAGGGCCTCG GGGCCGGGAG GAGGCGCCCC GCGGGGTCCG CATGGCGGCG CGGCTTCAGG 300
GCTGAATGGA TGCTGCAGAT GCGGGGCCAG GGGGCCGGAG AGCCGCCTGC TTGAGTTCTA 360
CCTCGCCATG CCTTTCGCGA CACCCATGGA AGCAGAGCTG GCCCGCAGGA GCCTGGCCCA 420
GGATGCCCCA CCGCTTCCCG TGCCAGGGGT GCTTCTGAAG GAGTTCACTG TGTCCGGCAA 480
CATACTGACT ATCCGACTGA CTGCTGCAGA CCACCGCCAA CTGCAGCTCT CCATCAGCTC 540
CTGTCTCCAG CAGCTTTCCC TGTTGATGTG GATCACGCAG TGCTTTCTGC CCGTGTTTTT 600
GGCTCAGCCT CCCTCAGGGC AGAGGCGCTA AGCCCAGCCT GGCGCCCCTT CCTAGGTCAT 660
GCCTCCTCCC CTAGGGAATG GTCCCAGCAC GAGTGGCCAG TTCATTGTGG GGGCCTGATT 720
GTTTGTCGCT GGAGGAGGAC GGCTTACATG TTTGTTTCTG TAGAAAATAA AACTGAGCTA



Seq ID NO: 210 Protein sequence: 
Protein Accession #: NP_001318.1

1 11 21 31 41 51 

| | | | | |

MQAEGRGTGG STGDADGPGG PGIPDGPGGN AGGPGEAGAT GGRGPRGAGA ARASGPGGGA 60
PRGPHGGAAS GLNGCCRCGA RGPESRLLEF YLAMPFATPM EAELARRSLA QDAPPLPVPG 120
VLLKEFTVSG NILTIRLTAA DHRQLQLSIS SCLQQLSLLM WITQCFLPVF LAQPPSGQRR



Seq ID NO: 211 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 52-459 



1 11 21 31 41 51

| | | | | |

CCTCGTGGGC CCTGACCTTC TCTCTGAGAG CCGGGCAGAG GCTCCGGAGC CATGCAGGCC 60
GAAGGCCAGG GCACAGGGGG TTCGACGGGC GATGCTGATG GCCCAGGAGG CCCTGGCATT 120
CCTGATGGCC CAGGGGGCAA TGCTGGCGGC CCAGGAGAGG CGGGTGCCAC GGGCGGCAGA 180
GGTCCCCGGG GCGCAGGGGC AGCAAGGGCC TCGGGGCCGA GAGGAGGCGC CCCGCGGGGT 240
CCGCATGGCG GTGCCGCTTC TGCGCAGGAT GGAAGGTGCC CCTGCGGGGC CAGGAGGCCG 300
GACAGCCGCC TGCTTCAGTT CCGACTGACT GCTGCAGACC ACCGCCAACT GCAGCTCTCC 360
ATCAGCTCCT GTCTCCAGCA GCTTTCCCTG TTGATGTGGA TCACGCAGTG CTTTCTGCCC 420
GTGTTTTTGG CTCAGGCTCC CTCAGGGCAG AGGCGCTAAG CCCAGCCTGG CGCCCCTTCC 480
TAGGTCATGC CTCCTCCCCT AGGGAATGGT CCCAGCACGA GTGGCCAGTT CATTGTGGGG 540
GCCTGATTGT TTGTCGCTGG AGGAGGACGG CTTACATGTT TGTTTCTGTA GAAAATAAAG 600
CTGAGCTA

Seq ID NO: 212 Protein sequence: 
Protein Accession #: Eos sequence

1 11 21 31 41 51 

I I I I I I 

MQAEGQGTGG STGDADGPGG PGIPDGPGGN AGGPGEAGAT GGRGPRGAGA ARASGPRGGA 60
PRGPHGGAAS AQDGRCPCGA RRPDSRLLQF RLTAADHRQL QLSISSCLQQ LSLLMWITQC 120
FLPVFLAQAP SGQRR 

Seq ID NO: 213 DNA sequence 
Nucleic Acid Accession #: NM_000555 
Coding sequence: 416-1498






1 11 21 31 41 51

| | | | | |

CTTATTTTTT ATGAATGTCG GATAGCTGCA CCAGCTTGGT GGGGAAAGGG TTTGATGAAT 60
AGCACAAAGA CACTGGCTGT TCCCTGGAGG CTGTCCCTTT AAAGGAGAAT CTTAGTTTAT 120
TCTGGGGGGA GGGGATGCAC ACATTAGAGT AGGAAAGAGG GCTTGGAATA AAATGAAAAC 180
ACTCCCCCTT CATAGTCATT GTACTGAAAT GCAAAGACTG CTTCCTAAGC TGGAGATGCT 240
AACCTTGGGT AGCTCCTTCT GTTCTCTTCA AGGGGAATTT TGTCAGGCTA TGGATTCATT 300
TACAACTGTT AGTCATGTGG GCATGTGTGA GGAAACAGAT GCCAGTTTTA ATGTATTTAG 360
CCCGAAGTTC CAATTTGATA GGAGCCACTG TCAGTCTCTG AGGTTCCACC AAAATATGGA 420
ACTTGATTTT GGACACTTTG ACGAAAGAGA TAAGACATCC AGGAACATGC GAGGCTCCCG 480
GATGAATGGG TTGCCTAGCC CCACTCACAG CGCCCACTGT AGCTTCTACC GAACCAGAAC 540
CTTGCAGGCA CTGAGTAATG AGAAGAAAGC CAAGAAGGTA CGTTTCTACC GCAATGGGGA 600
CCGCTACTTC AAGGGGATTG TGTACGCTGT GTCCTCTGAC CGTTTTCGCA GCTTTGACGC 660
CTTGCTGGCT GACCTGACGC GATCTCTGTC TGACAACATC AACCTGCCTC AGGGAGTGCG 720
TTACATTTAC ACCATTGATG GATCCAGGAA GATCGGAAGC ATGGATGAAC TGGAGGAAGG 780
GGAAAGCTAT GTCTGTTCCT CAGACAACTT CTTTAAAAAG GTGGAGTACA CCAAGAATGT 840
CAATCCCAAC TGGTCTGTCA ACGTAAAAAC ATCTGCCAAT ATGAAAGCCC CCCAGTCCTT 900
GGCTAGCAGC AACAGTGCAC AGGCCAGGGA GAACAAGGAC TTTGTGCGCC CCAAGCTGGT 960
TACCATCATC CGCAGTGGGG TGAAGCCTCG GAAGGCTGTG CGTGTGCTTC TGAACAAGAA 1020
GACAGCCCAC TCTTTTGAGC AAGTCCTCAC TGATATCACA GAAGCCATCA AACTGGAGAC 1080
CGGGGTTGTC AAAAAACTCT ACACTCTGGA TGGAAAACAG GTAACTTGTC TCCATGATTT 1140
CTTTGGTGAT GATGATGTGT TTATTGCCTG TGGTCCTGAA AAATTTCGCT ATGCTCAGGA 1200
TGATTTTTCT CTGGATGAAA ATGAATGCCG AGTCATGAAG GGAAACCCAT CAGCCACAGC 1260
TGGCCCAAAG GCATCCCCAA CACCTCAGAA GACTTCAGCC AAGAGCCCTG GTCCTATGCG 1320
CCGAAGCAAG TCTCCAGCTG ACTCAGCAAA CGGAACCTCC AGCAGCCAGC TCTCTACCCC 1380
CAAGTCTAAG CAGTCTCCCA TCTCTACGCC CACCAGTCCT GGCAGCCTCC GGAAGCACAA 1440
GGACCTGTAC CTGCCTCTGT CCTTGGATGA CTCGGACTCG CTTGGTGATT CCATGTAAAG 1500
GAGGGGAGAG TGCTCAGAGT CCAGAGTACA AATCCAAGCC TATCATTGTA GTAGGGTACT 1560
TCTGCTCAAG TGTCCAACAG GGCTATTGGT GCTTTCAAGT TTTTATTTTG TTGTTGTTGT 1620
TATTTTGAAA AACACATTGT AATATGTTGG GTTTATTTTC CTGTGATTTC TCCTCTGGGC 1680
CACTGATCCA CAGTTACCAA TTATGAGAGA TAGATTGATA ACCATCCTTT GGGGCAGCAT 1740
TCCAGGGATG CAAAATGTGC TAGTCCATGA CCTTTCAATG GAAAGCTTAG GGGCCTGGGG 1800
TAAATTTGCC CCGTTTAAAT TTGCCCAAAC AGTTTTCCTT TTGTAGAGGG GTGTTTAAAT 1860
ATACAGCAAT TAAAAAGTTT GTGTGGGGAA AAAAAAAACT CATTGGCAGA TCCAAGAATG 1920
ACAAACACAA GTGCCCCTTT TCTCTGGATC TCAAGAATGG TGGAGGACCC TGGAAGGACA 1980
GCAAGGCAGC TCCCCAGCCT CACTCTTCAC TCCTGATTGA GGCCCGGGTT TGTTGTCCAG 2040
CACCAATTCT GGCTGTCAAT GGGGAGAAAT AAACCAACAA CTTATAATTG TGACACCAGA 2100
TGCTTAGGAT CCTGGTGCTG GGTTAGCTAA GAGAATAGAC AGAATTGGAA AATACTGCAG 2160
ACATTTCCGA AGAGTTTATA AAGCACAGTG AATTCCTGGT CAATCTCTCC ACTGAGGCAA 2220
TTTGGAATCA ATAAGCAATT GATAATAGTT TGGAGTAAGG GACTTCATAT ACCTGATTCC 2280
TCTAGAAGGC TGTCTAACAT ACCACATGAT TACATGAACT GTATGGTATC CATCTATCTC 2340
TGTTCTATTG AATGCCTTGT TAACAGCCAA CACTGAAAAC ACTGTGAGAA TTTGTTTTCA 2400
GGTCTGACAC CTTTCAGTCT CTTTTTATAG CAAGAAATCA ATATCCTTTT TATAAAAATT 2460
CATGTCTGTA TTTCAGGAGC AAACTCTTCA GGCTCCTTTT TTATAAACTG GTGATTTTTC 2520
TTTTGTCTAA AAAACACATG AAGAAAATTT ACCAGAAAAA AAAAAAAAAG CCGAAGAATA 2580
ATGTTATTTA GAAATTATGC TGTCACTGCC AAACAGTAAC CTCCAGGAGA AAACAAGATG 2640
AATAGCAGAG GCCAATTCAA TAGAATCAGT TTTTTGATAG CTTTTTAACA GTTATGCTTG 2700
CATTAATAAT TTCAATGTGG ACCAGACATT CTAATTATAT TTTAAATGAA ATGTTACAGC 2760
ATATTTTAAG CAACTCTTTT TATCTATAAT CCTAATATTT CATACTGAAG ACACAGAAAT 2820
CTTTCACTTG TCTTTAACAT TAGAAAGGAT TTCTCTTTAC TAAGGACTGA TCATTTGAAA 2880
TAGTTTTCAG TCTTTTGAGA TACAGGTTTA TAACACTGCT TTTTTTTTCC TGTAAACATA 2940
GCCCATAATG GCAAAAACAA CTAATTTTAA TTGAAGGTCT TGCTTGCCAN TCCTGTGTTG 3000
GCTTTNACCA AATATAAAAA TTCCCTTATT CCTTGGTAAT GGTGCAAATN TTTGGAAAGG 3060
CACAGCATCC AAACCAAGCT GCTGTTTGGC TACTGAATGG CTTGCAGTTG TTCCTCCACT 3120
CTAAATGGAA TGAGCTTGCT GTGTGTGTGT GTGGTGGTGG TGGGAGGGGG TGGTGCATGT 3180
GTGTGTGTGT GTGTGCATCT GCAGCTGCTT CAAAATTAAG AAATACTACA AGACACCCCT 3240
GTAATGGATT GGTGGCAACT GGGTGGCACT GCTGATGTGC ACTGTGTAGG GGGGAACCCA 3300
GTGGTGGTGG GGTATCTCAA ATGCCCCTAG ACAAGCTTCA GATGTCTGTA GCTACCAAAA 3360
ACATTTTCGG TTCAAGAAAA GTGAGATGAT GGTAGTACTG GTTTCTGGTG AAATTGAAAA 3420
ACCCCAAATG ATGAGGATCT CTTTTTGCCC CCTCTCCTTT TTTTGTAAAC CCATTCAAAA 3480
CCATTAATAA GCCCATTTTA CTAANCCCCT ATTTCTTTCT AGAAGCTCAG GGTTTNCTTA 3540
GTGCCTCCCA NAACATTTTG TAGTTAATTG GGAAAAAGTG ATACTTGGAT TAGGGGGTGT 3600
GGGCATAAAG AATGGTGGGA GGCCTGATTT TAAAATTCAG GCCAGAACCC CCAATGACTC 3660
CACCCATAGT NTCACTTTAG GTCTCATTTA GTCCATCACC TTTATTTTAA GTTGAGGAAG 3720
TGGAGGCTGG TAAAGAGCAG GACCAGAGGA AGAATCCAGA TTTCCTTATG CTTGGGCCTC 3780
ACACTAGCTC TNTGAGTATT TCCTTGATTG CGGTATATGT ACTACTAGAA AATACCAAAT 3840
GGATATATTT TCTTTAGGAT AACCTTTGAA CCAACAATNT TCAATAACAA TAGTACATCT 3900
TCCATCTTAC TTTTAATCGA GTATAAGGAA ATGTTTCTTT ATGGCCATTT TGGAGGGAGC 3960
AGGGGATGAG GCTTGGCATA GTCCAAAATT TAAGNCTCCA ATAATTAATT GCATTTTAAA 4020

TTGTTTTAAA TTGGCCCACT TTCAAGGCAA TTTTTTTTGT GTGTCTGTAA CTGAGCTCCT 4080 

CCACCCCTGT CATTCACTTC CAATTTTACC CAATCCAATT TTAGCACTCA AGTTCCATTG 4140 

TGTTAATTTT TGCACGGTCT ACACACATCA AGTCAGCAAG CATTTGCCAC CACTCCCTAT 4200 

ACTTCTCCCT CTTTTTTACA CACACACACA CACACACACA CACAATCCAT CTCTTGCTTG 4260 

TTCCTACCTC CCTGATTTTT CTTCCCTACA GAAATAGAAA TAGGGACAAA GAAGGGGAAA 4320 

ATGTATATAT TGGGGCTGGG CTGAACAACT AACTTCATAA GTAGTATTAA CTAGGGGTAA 4380 

ATTGAGAGAA AAGCTCCTTT TCTCTTCACT GTTTTGGAAA GGATAGCCAT TAGCATGACT 4440 

GCTTTGTGTC CTTATGGACT TTAGTATTAG CCTAGATTGA ATTATAGCGT TTTTCTAGCT 4500 

GAAGGAACCT TAAGATCACA TCATCTACTC CTCTACTCCA AATTTCTCAT TCTTCAGGCC 4560

AGGAAACCGA GACACAGAGG TAAAGTAATT TCCCCAAGGT CACACAGCTG GCTGGGGCAG 4620 

GATTGGGTTT ACAACCCACA TCTCCTGGCT CTTATTCCAG GGCCTTTTCC CACTAAGTAG 4680 

TATTGCCTTC CATTAGGCTC CTGAGAGTTA TTTCTCAGGG TCATGTTGCA TCTTGGAGCC 4740 

ACATGCTGCT GCCCTGATCT CAGTGGGAAA TNCACCCAGC AACCTAATAC AGCCCCTTTT 4800 

CCCTGCATTC ACCTGGTTCC CATCCACATG GGTTGCAGAT GTCCTTGAAG AGAGTGAGGC 4860 

ATTGAGGGCC AATAGGAGCA ATGGGGTCCC TGGCCTTGTC CATCTGATTC AGGAGATCAC 4920 

TGCTCCATCG TGAGGAGCCC TCTGAATAGC CCCCCACTGA ATGCTTGCCT TGCCCAAATG 4980 

GAATGGAGGA AGATTGATTT TCTCCATCAG TTCACCTTGT GTCATCTCAT AATGGTTGGT 5040 

CTTTCCAGGC TGAGGGAAAT GTTTCTTGTT TCCANAGTAN AAAAAAGAAA GAGTGGAACA 5100 

ATANCTTTGT TCATCCTAAC TTTCTGAGAT GGCTTTTCAA CATTTAAAAA AAACTAGTGT 5160 

GGTACCATTC ACTGGCANGA TTTNTTTTAG AATATGGGAG TAAGATGAGG TAGAGAAAAT 5220 

AACCTGGTCT CACTGTGGTT GCCCTCATCC ACAATGTCCC CAAAGCCATC CTGCTNTGAT 5280 

GAGGACAATT TCCAGGTATA AGCAAGGGGC TTTGTGACAA AAATGTACCC TGGCTGATGT 5340 

TAAACATTGG CTCCTGTGTT TGCACCAAAA TAGCAAGCTG TGTGCTCTAT ACACTCTTCC 5400 

CATCGTCTTG TGTACACTGC TCCTGTGGCC TTCCACAGCA GAAACCAGGG CAAAAGGGTC 5460 

CAAACACATG GTTTTCCTTG CTGCAAGGCT NTTCCTGGGA ACTAAGGGGG TATTTATTAG 5520 

TTCAGTTNTA AGAGACCTCC TTCTGGGCTT ACCCCACTCC TCAGGTACTT CTCTCTCCTT 5580 

CCTCCTTCTC CTCCACAGTC ACAAGTAACC AAGGAACCTG AAAGTGGATG TGTAGCTATT 5640 

TGAAGAAGGC AAGGAACCCT GAGATTCTTC TTTGAATCCT TTAGTCCAAG TCTTAGACCA 5700 

GTGATTGGTG CTTACCTTGA ACAAAATTTT GTCTGTGTTC CTAATCCCTT CAATACTNTG 5760 

GGTACAATGC TCCCAATCAC CCTGCACATT TGATTCTAAA TGGCTTTTAT TTTTTAAAAA 5820 

TCCATATCCC TAGGACAAGA NAACAGGATG CCTATATCCC CAAAATGAGC TCCAGGACAC 5880 

TGATGGGAAT GATCCCAANG ATCACCCCAC CTCAGAAAAC GTCTGTGCCA ANAGACTTCC 5940 

CCAGATAGAA NCACTGGGAC AGTGGTTTGA ACGACTTCTT TTATGGTTGT CCAGTTTGCT 6000 

ATGGAAATAA AAGGCATTGA TTTTTTAAAA AAGATGATTG GAACCTGTCT TTGGCCACAT 6060 

AGGGCCACTT GGATCCATTT CCAGGCCTTA CTCATATATT GCCTTCACTG AAGGGCTTTG 6120 

GCTTTAAGTC CCAGACTGGT CTCCCAAGTG AACCATAAGT GTTTTGGAGC TCATCTGGGG 6180 

TGAGGCATGA GAATGTTGCC CCATCTATCC CTTCAGGAAA AGGTGCCTTC CCTCCCTTTC 6240 

TCCTAAAGCC TGGTCCCCAA AAATTGTTTT TGTCTCCAAA AGTCTAGTAT GGTCTTTATA 6300 

CACCCANACT CTTAGTGTTG CGTCCTGCCT TGTTTCCTTG TTAAGGATCT ATGCANACCT 6360 

CCCGCTTTGG CTTAGCTAGC GTGACATTGG CTATCATTTG ACAAGACTAA CTTTTTTTTT 6420 

TTTTTTTTTG ACTGAGTCTC CCTCTGTCAC CTAGGCTGGA GTGCAGTGGC ACAATCTTGG 6480 

CTCGCTGCAA CCTTCACCCT TCACCTCCCA GGTCGAAGCG ATTCTCCTGC CTCAGTCTCC 6540 

CGAGTAGCTG GGATTACAGG CGTGCGCCAC CAAATCTGGC TATTTTTTTA TTATTATTAT 6600 

TTTTAGTAGA GATGGGGTTT CACCATGTTG GCCAGACTGG TCTTGAACTC TTGGCCTCAA 6660 

ATTATCTGCC CACCTCGGCC TCCCAAAGTG CTGGGATTAC AGGCATGAGC ACCATGCCCA 6720 

GCTGACAAGA CTAATTTTTT ATCCCTTGGT TTATTGGCTT CAACATCTTC TGGAATCAGA 6780 

GGTGATTTTT TCTTACCTTG GATGCCTGAG ACTAGGGGAG TATAGAATTC CAATTGGTAA 6840 

TTAAGGCATC TTTCTGCTCC TGATCAGAAG GGCAGGTTAG TTGGGAGAGG TCAGATGGCA 6900 

CAACAGAAGT CACCTTGTAA GTAAGGCAAA GACTTTGAAG GCATTAGCGT TTCTCATTAC 6960 

TTAGGTCAAT AACCTTGAGG GAATCAATGG CTTTTTTGCC GCTCTACCTC TTTGTGTATC 7020 

TCTTTGACTT TTCTTTCTCT GTCTAGTTTC CTCTGTTCTC AGTTTATATT CTATGTTATC 7080 

AGTCTCTCTT TCCACAGTAC AAACATCCAT CCTTTCTCCT GTGCAATTCT GTCTCTCCCT 7140 

CTTATTATCT TTATTTGTAC TTTTTCCTTC CTCCCTGTCT AGGCATTGGG CATGTGCCTC 7200 

TTCTTAGCCT GTGATTTTGC CTTGGGACTG ATGATAAATT ATTTCCAGAT TCAATCAGCC 7260 

CTGGTCCTAC CCCAGTCCAA TCAGAAGTAT GTTGGTGGGG AATCAACCTG ATCCTGGCCC 7320 

TTTCTTCTTC TCCATTTTCA TTCGTAATCC CCCTCAGCAG ATCTTTACAA GCAGTTTCCT 7380 

TATAGCTCAT GTATCTTTAG GTCTTTGCCT TCCAAGCACT GTACAGAATA CTTTGTGGTT 7440 

CCTTTTTAGT CTGACATTTT GTGGAGCAGT GAAGCGTGCT CAGAGACATA ATCAGCTGAA 7500 

GAGAAAAAAT CCACCCATGG ATTTATATCA GCTAAATACT AATAATTGAT TTTGTTTGAT 7560 

GTGCCCATAA TTTTTAAAGC TGCAATATAA TATAATGAGG GACCACAGGT AATTTCTCCT 7620 

GTCATTTGTT TTGGCTGGAT GGGGGTGGGG GAGTAATTGC TTAAAGTTTT ACCATTACAC 7680 

ATTAAACTCT CTATAATAAT CTTGTTTGGG GCTTGCTAAC TGTTGAGCTG TTTTAACTAA 7740 

ACTGGTAGGC AATCGGAGTT GATTTAAATG AAAAGATAAT TTAACAAATC TATACTATAA 7800 

AAAGAGACAT TTGCTTAATT GACATGTATT TTTTCCTTCT GAGTCACCTA AACATTTACT 7860 

CTTGACACCA ACTGTTCATG ATACTGAATA GACAGTCCAT ATAAGAGAAA TTAGTGGACC 7920 

TAAAGAAGCC AGATTGTAGG TGTTAATTTA TTAAACAGAA TTGCAAAGCC CTTGGAAATG 7980 

TCACTGCTTG GCAATACCAT ATGGCATGCC AAAATTTACA ATGACTTTTC TTTATAAGTT 8040 

ATCCAAAAGG GATTTGAACA AGTAAGAGGT TATGCCAAAA TGTCTCCAAT GTATGGTCCT 8100

GTAATATATT GCAGCTTGAA GCCAATGATC CCTTATGACT TGTATACAAC TAATGCATGT 8160 

TTTATTGAAT TTTGCATTTC CCACGTGTGG TAAGTCTTTA AAATGTTTTT GATCACCTTT 8220 

NTGTGCCATT AAACTTGTAC AGAAAATGTT TTTATGGCCA TTTTCAAAGG GAGAAAGTTT 8280 

AAAATGGAAA CAGCCCACCC TTTCTGCCCT ATAGCTGTAG TTAGAATTGA GTACCTGTAG 8340 

CAAAACAGCT GTAATTGGTG GTTGTAGTGT TAGAGGTGTT AGCTTGCTAG TGACTAGCTT 8400 

TGGAGAGTAA ATGCATGGTA TTGTACATCA CATTTCTTAA CTCGTTTTAA CCTCTGAAAA 8460 

GAATATATTC TTCTTTGTAG TCCTTCTTCC CACCCCCTTG CCCTCTCCCT CTCCCTGCTC 8520 

CCAGTTGTCT TACAGTTGTA AATATCTGAT TTGAGGCCCA ATAACTCTTG CCAAGTAAAG 8580 

TCAGCAAACA ACAAACAAAC CAAAATGTGG GGAAAAGGCA TTTCTCAACC ATCTCTCAGC 8640 

AGTTATTGAT CATTTCTTAA GGAACAGCAT TGTGATCAAA GACTCAACTT TACGTAAAAA 8700 

TCAGTGGTAA ATTGGGGTTG TATTGGCCAT TGATTACATT CAGGATTGAA TAGTTTTCAG 8760 

AATCACATGT AATCCAAAGA CAGTAGGTAG TGATGTCCCT TATCCCTGCA GCTGTTTTAA 8820 

GATAGAGACC TCAGAAGACT CTGCTTGACC GATGACCAAT AATTATTTGA AAAAAAAAGA 8880 

AAAAATGAGA GAAATAAAAC AGATATTTAA GAACTTTAGC CACCTATTTA GAATAGTTAT 8940 

AGCCAGAAAA AAAAACAAGG GCATGAGTTC AAATGCATTA CTATCAGTGT CCTAGGCAAT 9000 

ACCTAACCTA CTCTGAAATT GTGATTCAAA AGCAGTATTT CAAGAGGCAT TCTCCTTTTT 9060 

TGGTTTGCTG ACCCCACTTG GACTGGTAGG TTTGGTGAGG CCCCCATAAA CCAGCTGGAG 9120

CAGACCCTTT TCATCTCCTG TGCCTGTAAC ACCCCTCTTC CCCCACCCCC TCCGCAATTC 9180
AATGAGGGCT TTCTTGGGTC AGAGGACTTC AAGGTTGTCT AGAGAAGTTT GCCATGTGTG 9240
TAAGGTGCTG TGAACTGTGA GTGCTGAAGA TTCGCAGCAT TCAATACCAG GCAGCCAAAG 9300
AGCTGCTCTT GCAATTATTT TGGCTCTCAA GCTCTGTTCT TCATCGCATT CTCATTTCTG 9360
TGTACATTTG CAAGATGTGT GTAATGTCAT TTTCCAAAAA TAAAATTTGA TTTCAAT



Seq ID NO: 214 Protein sequence: 
Protein Accession #: NP_000546



1 11 21 31 41 51

| | | | | |

MELDFGHFDE RDKTSRNMRG SRMNGLPSPT HSAHCSFYRT RTLQALSNEK KAKKVRFYRN 60
GDRYFKGIVY AVSSDRFRSF DALLADLTRS LSPNINLPQG VRYIYTIDGS RKIGSMDELE 120
EGESYVCSSD NFFKKVEYTK NVNPNWSVNV KTSANMKAPQ SLASSNSAQA RENKDFVRPK 180
LVTIIRSGVK PRKAVRVLLN KKTAHSFEQV LTDITEAIKL ETGVVKKLYT LDGKQVTCLH 240
DFFGDDDVFI ACGPEKFRYA QDDFSLDENE CRVMKGNPSA TAGPKASPTP QKTSAKSPGP 300
MRRSKSPADS ANGTSSSQLS TPKSKQSPIS TPTSPGSLRK HKDLYLPLSL DDSDSLGDSM



Seq ID NO: 215 DNA sequence 
Nucleic Acid Accession #: NM_130467
Coding sequence: 312..644



1 11 21 31 41 51

| | | | | |

GGCACGAGGC AGAGCTCTGC AAGGAGAGGT TGTGTCTTCG TTCTTTCCGC CATCTTCGTT 60
CTTTCCAACA TCTTCGTTCT TTCTCACTGA CCGAGACTCA GCCGGTAGGT CTGCAGAGTG 120
GTCTTCCTGG TAATTTAGTT GTGAGTGAAT GTGTGGAGGA GCCAGCGGGC TTAGGACAGG 180
TCCTGTGGCA CAGTCCGTGG CTTTGAGGGA AAAGGGCCTC GCGGTGGTCC TCCGCCTTCC 240
CCCAGGTCGT GATGCAGGCG CCATGGGCCG GTAATCGTGG CTGGGCTGGA ACGAGGGAGG 300
AAGTGAGAGA TATGAGTGAG CATGTAACAA GATCCCAATC CTCAGAAAGA GGAAATGACC 360
AAGAGTCTTC CCAGCCAGTT GGACCTGTGA TTGTCCAGCA GCCCACTGAG GAAAAACGTC 420
AAGAAGAGGA ACCACCAACT GATAATCAGG GTATTGCACC TAGTGGGGAG ATCAAAAATG 480
AAGGAGCACC TGCTGTTCAA GGGACTGATG TGGAAGCTTT TCAACAGGAA CTGGCTCTGC 540
TTAAGATAGA GGATGCACCT GGAGATGGTC CTGATGTCAG GGAGGGGACT CTGCCCACTT 600
TTGATCCCAC TAAAGTGCTG GAAGCAGGTG AAGGGCAACT ATAGGTTTAA ACCAAGACAA 660
ATGAAGACTG AAACCAAGAA TATTGTTCTT ATGCTGGAAA TTTGACTGCT AACATTCTCT 720
TAATAAAGTT TTACAGTTTT CTGCAAAAAA AAAAAAAAAA AAA



Seq ID NO: 216 Protein sequence: 
Protein Accession #: NP_569734

1 11 21 31 41 51

I I I I I I 

MSEHVTRSQS SERGNDQESS QPVGPVIVQQ PTEEKRQEEE PPTDNQGIAP SGEIKNEGAP 60 
AVQGTDVEAF QQELALLKIE DAPGDGPDVR EGTLPTFDPT KVLEAGEGQL 

Seq ID NO: 217 DNA sequence 

Nucleic Acid Accession #: NM_001476.1

Coding sequence: 82..435

1 11 21 31 41 51

| | | | | |

GCCAGGGAGC TGTGAGGCAG TGCTGTGTGG TTCCTGCCGT CCGGACTCTT TTTCCTCTAC 60 
TGAGATTCAT CTGTGTGAAA TATGAGTTGG CGAGGAAGAT CGACCTATTA TTGGCCTAGA 120 
CCAAGGCGCT ATGTACAGCC TCCTGAAGTG ATTGGGCCTA TGCGGCCCGA GCAGTTCAGT 180 
GATGAAGTGG AACCAGCAAC ACCTGAAGAA GGGGAACCAG CAACTCAACG TCAGGATCCT 240 
GCAGCTGCTC AGGAGGGAGA GGATGAGGGA GCATCTGCAG GTCAAGGGCC GAAGCCTGAA 300 
GCTGATAGCC AGGAACAGGG TCACCCACAG ACTGGGTGTG AGTGTGAAGA TGGTCCTGAT 360 
GGGCAGGAGG TGGACCCGCC AAATCCAGAG GAGGTGAAAA CGCCTGAAGA AGGTGAAAAG 420 
CAATCACAGT GTTAAAAGAA GACACGTTGA AATGATGCAG GCTGCTCCTA TGTTGGAAAT 480 
TTGTTCATTA AAATTCTCCC AATAAAGCTT TACAGCCTTC TGCAAAA 

Seq ID NO: 218 Protein sequence: 
Protein Accession #: NP_001467.1 

1 11 21 31 41 51 

I I I I I I 

MSWRGRSTYY WPRPRRYVQP PEVIGPMRPE QFSDEVEPAT PEEGEPATQR QDPAAAQEGE 60
DEGASAGQGP KPEADSQEQG HPQTGCECED GPDGQEVDPP NPEEVKTPEE GEKQSQC 

Seq ID NO: 219 DNA sequence 
Nucleic Acid Accession #: NM_001476
Coding sequence: 90-3671

1 11 21 31 41 51

| | | | | |

ACAGCGGAGC GCAGAGTGAG AACCACCAAC CGAGGCGCCG GGCAGCGACC CCTGCAGCGG 60 
AGACAGAGAC TGAGCGGCCC GGCACCGCCA TGCCTGCGCT CTGGCTGGGC TGCTGCCTCT 120 
GCTTCTCGCT CCTCCTGCCC GCAGCCCGGG CCACCTCCAG GAGGGAAGTC TGTGATTGCA 180 
ATGGGAAGTC CAGGCAGTGT ATCTTTGATC GGGAACTTCA CAGACAAACT GGTAATGGAT 240
TCCGCTGCCT CAACTGCAAT GACAACACTG ATGGCATTCA CTGCGAGAAG TGCAAGAATG 300 
GCTTTTACCG GCACAGAGAA AGGGACCGCT GTTTGCCCTG CAATTGTAAC TCCAAAGGTT 360
CTCTTAGTGC TCGATGTGAC AACTCTGGAC GGTGCAGCTG TAAACCAGGT GTGACAGGAG 420

CCAGATGCGA CCGATGTCTG CCAGGCTTCC ACATGCTCAC GGATGCGGGG TGCACCCAAG 480 

ACCAGAGACT GCTAGACTCC AAGTGTGACT GTGACCCAGC TGGCATCGCA GGGCCCTGTG 540 

ACGCGGGCCG CTGTGTCTGC AAGCCAGCTG TTACTGGAGA ACGCTGTGAT AGGTGTCGAT 600

CAGGTTACTA TAATCTGGAT GGGGGGAACC CTGAGGGCTG TACCCAGTGT TTCTGCTATG 660

GGCATTCAGC CAGCTGCCGC AGCTCTGCAG AATACAGTGT CCATAAGATC ACCTCTACCT 720

TTCATCAAGA TGTTGATGGC TGGAAGGCTG TCCAACGAAA TGGGTCTCCT GCAAAGCTCC 780 

AATGGTCACA GCGCCATCAA GATGTGTTTA GCTCAGCCCA ACGACTAGAC CCTGTCTATT 840 

TTGTGGCTCC TGCCAAATTT CTTGGGAATC AACAGGTGAG CTATGGGCAA AGCCTGTCCT 900 

TTGACTACCG TGTGGACAGA GGAGGCAGAC ACCCATCTGC CCATGATGTG ATTCTGGAAG 960

GTGCTGGTCT ACGGATCACA GCTCCCTTGA TGCCACTTGG CAAGACACTG CCTTGTGGGC 1020 

TCACCAAGAC TTACACATTC AGGTTAAATG AGCATCCAAG CAATAATTGG AGCCCCCAGC 1080 

TGAGTTACTT TGAGTATCGA AGGTTACTGC GGAATCTCAC AGCCCTCCGC ATCCGAGCTA 1140

CATATGGAGA ATACAGTACT GGGTACATTG ACAATGTGAC CCTGATTTCA GCCCGCCCTG 1200 

TCTCTGGAGC CCCAGCACCC TGGGTTGAAC AGTGTATATG TCCTGTTGGG TACAAGGGGC 1260

AATTCTGCCA GGATTGTGCT TCTGGCTACA AGAGAGATTC AGCGAGACTG GGGCCTTTTG 1320 

GCACCTGTAT TCCTTGTAAC TGTCAAGGGG GAGGGGCCTG TGATCCAGAC ACAGGAGATT 1380 

GTTATTCAGG GGATGAGAAT CCTGACATTG AGTGTGCTGA CTGCCCAATT GGTTTCTACA 1440 

ACGATCCGCA CGACCCCCGC AGCTGCAAGC CATGTCCCTG TCATAACGGG TTCAGCTGCT 1500 

CAGTGATGCC GGAGACGGAG GAGGTGGTGT GCAATAACTG CCCTCCCGGG GTCACCGGTG 1560

CCCGCTGTGA GCTCTGTGCT GATGGCTACT TTGGGGACCC CTTTGGTGAA CATGGCCCAG 1620 

TGAGGCCTTG TCAGCCCTGT CAATGCAACA ACAATGTGGA CCCCAGTGCC TCTGGGAATT 1680 

GTGACCGGCT GACAGGCAGG TGTTTGAAGT GTATCCACAA CACAGCCGGC ATCTACTGCG 1740 

ACCAGTGCAA AGCAGGCTAC TTCGGGGACC CATTGGCTCC CAACCCAGCA GACAAGTGTC 1800 

GAGCTTGCAA CTGTAACCCC ATGGGCTCAG AGCCTGTAGG ATGTCGAAGT GATGGCACCT 1860

GTGTTTGCAA GCCAGGATTT GGTGGCCCCA ACTGTGAGCA TGGAGCATTC AGCTGTCCAG 1920 

CTTGCTATAA TCAAGTGAAG ATTCAGATGG ATCAGTTTAT GCAGCAGCTT CAGAGAATGG 1980 

AGGCCCTGAT TTCAAAGGCT CAGGGTGGTG ATGGAGTAGT ACCTGATACA GAGCTGGAAG 2040 

GCAGGATGCA GCAGGCTGAG CAGGCCCTTC AGGACATTCT GAGAGATGCC CAGATTTCAG 2100 

AAGGTGCTAG CAGATCCCTT GGTCTCCAGT TGGCCAAGGT GAGGAGCCAA GAGAACAGCT 2160

ACCAGAGCCG CCTGGATGAC CTCAAGATGA CTGTGGAAAG AGTTCGGGCT CTGGGAAGTC 2220 

AGTACCAGAA CCGAGTTCGG GATACTCACA GGCTCATCAC TCAGATGCAG CTGAGCCTGG 2280 

CAGAAAGTGA AGCTTCCTTG GGAAACACTA ACATTCCTGC CTCAGACCAC TACGTGGGGC 2340 

CAAATGGCTT TAAAAGTCTG GCTCAGGAGG CCACAAGATT AGCAGAAAGC CACGTTGAGT 2400 

CAGCCAGTAA CATGGAGCAA CTGACAAGGG AAACTGAGGA CTATTCCAAA CAAGCCCTCT 2460

CACTGGTGCG CAAGGCCCTG CATGAAGGAG TCGGAAGCGG AAGCGGTAGC CCGGACGGTG 2520 

CTGTGGTGCA AGGGCTTGTG GAAAAATTGG AGAAAACCAA GTCCCTGGCC CAGCAGTTGA 2580 

CAAGGGAGGC CACTCAAGCG GAAATTGAAG CAGATAGGTC TTATCAGCAC AGTCTCCGCC 2640 

TCCTGGATTC AGTGTCTCGG CTTCAGGGAG TCAGTGATCA GTCCTTTCAG GTGGAAGAAG 2700 

CAAAGAGGAT CAAACAAAAA GCGGATTCAC TCTCAACGCT GGTAACCAGG CATATGGATG 2760

AGTTCAAGCG TACACAAAAG AATCTGGGAA ACTGGAAAGA AGAAGCACAG CAGCTCTTAC 2820 

AGAATGGAAA AAGTGGGAGA GAGAAATCAG ATCAGCTGCT TTCCCGTGCC AATCTTGCTA 2880 

AAAGCAGAGC ACAAGAAGCA CTGAGTATGG GCAATGCCAC TTTTTATGAA GTTGAGAGCA 2940 

TCCTTAAAAA CCTCAGAGAG TTTGACCTGC AGGTGGACAA CAGAAAAGCA GAAGCTGAAG 3000 

AAGCCATGAA GAGACTCTCC TACATCAGCC AGAAGGTTTC AGATGCCAGT GACAAGACCC 3060

AGCAAGCAGA AAGAGCCCTG GGGAGCGCTG CTGCTGATGC ACAGAGGGCA AAGAATGGGG 3120 

CCGGGGAGGC CCTGGAAATC TCCAGTGAGA TTGAACAGGA GATTGGGAGT CTGAACTTGG 3180 

AAGCCAATGT GACAGCAGAT GGAGCCTTGG CCATGGAAAA GGGACTGGCC TCTCTGAAGA 3240 

GTGAGATGAG GGAAGTGGAA GGAGAGCTGG AAAGGAAGGA GCTGGAGTTT GACACGAATA 3300 

TGGATGCAGT ACAGATGGTG ATTACAGAAG CCCAGAAGGT TGATACCAGA GCCAAGAACG 3360

CTGGGGTTAC AATCCAAGAC ACACTCAACA CATTAGACGG CCTCCTGCAT CTGATGGACC 3420 

AGCCTCTCAG TGTAGATGAA GAGGGGCTGG TCTTACTGGA GCAGAAGCTT TCCCGAGCCA 3480 

AGACCCAGAT CAACAGCCAA CTGCGGCCCA TGATGTCAGA GCTGGAAGAG AGGGCACGTC 3540 

AGCAGAGGGG CCACCTCCAT TTGCTGGAGA CAAGCATAGA TGGGATTCTG GCTGATGTGA 3600 

AGAACTTGGA GAACATTAGG GACAACCTGC CCCCAGGCTG CTACAATACC CAGGCTCTTG 3660

AGCAACAGTG AAGCTGCCAT AAATATTTCT CAACTGAGGT TCTTGGGATA CAGATCTCAG 3720 

GGCTCGGGAG CCATGTCATG TGAGTGGGTG GGATGGGGAC ATTTGAACAT GTTTAATGGG 3780 

TATGCTCAGG TCAACTGACC TGACCCCATT CCTGATCCCA TGGCCAGGTG GTTGTCTTAT 3840 

TGCACCATAC TCCTTGCTTC CTGATGCTGG GCAATGAGGC AGATAGCACT GGGTGTGAGA 3900 

ATGATCAAGG ATCTGGACCC CAAAGAATAG ACTGGATGGA AAGACAAACT GCACAGGCAG 3960

ATGTTTGCCT CATAATAGTC GTAAGTGGAG TCCTGGAATT TGGACAAGTG CTGTTGGGAT 4020 

ATAGTCAACT TATTCTTTGA GTAATGTGAC TAAAGGAAAA AACTTTGACT TTGCCCAGGC 4080 

ATGAAATTCT TCCTAATGTC AGAACAGAGT GCAACCCAGT CACACTGTGG CCAGTAAAAT 4140 

ACTATTGCCT CATATTGTCC TCTGCAAGCT TCTTGCTGAT CAGAGTTCCT CCTACTTACA 4200 

ACCCAGGGTG TGAACATGTT CTCCATTTTC AAGCTGGAAG AAGTGAGCAG TGTTGGAGTG 4260

AGGACCTGTA AGGCAGGCCC ATTCAGAGCT ATGGTGCTTG CTGGTGCCTG CCACCTTCAA 4320 

GTTCTGGACC TGGGCATGAC ATCCTTTCTT TTAATGATGC CATGGCAACT TAGAGATTGC 4380 

ATTTTTATTA AAGCATTTCC TACCAGCAAA GCAAATGTTG GGAAAGTATT TACTTTTTCG 4440 

GTTTCAAAGT GATAGAAAAG TGTGGCTTGG GCATTGAAAG AGGTAAAATT CTCTAGATTT 4500 

ATTAGTCCTA ATTCAATCCT ACTTTTCGAA CACCAAAAAT GATGCGCATC AATGTATTTT 4560

ATCTTATTTT CTCAATCTCC TCTCTCTTTC CTCCACCCAT AATAAGAGAA TGTTCCTACT 4620 

CACACTTCAG CTGGGTCACA TCCATCCCTC CATTCATCCT TCCATCCATC TTTCCATCCA 4680 

TTACCTCCAT CCATCCTTCC AACATATATT TATTGAGTAC CTACTGTGTG CCAGGGGCTG 4740 

GTGGGACAGT GGTGACATAG TCTCTGCCCT CATAGAGTTG ATTGTCTAGT GAGGAAGACA 4800 

AGCATTTTTA AAAAATAAAT TTAAACTTAC AAACTTTGTT TGTCACAAGT GGTGTTTATT 4860

GCAATAACCG CTTGGTTTGC AACCTCTTTG CTCAACAGAA CATATGTTGC AAGACCCTCC 4920 

CATGGGGGCA CTTGAGTTTT GGCAAGGCTG ACAGAGCTCT GGGTTGTGCA CATTTCTTTG 4980 

CATTCCAGCT GTCACTCTGT GCCTTTCTAC AACTGATTGC AACAGACTGT TGAGTTATGA 5040 

TAACACCAGT GGGAATTGCT GGAGGAACCA GAGGCACTTC CACCTTGGCT GGGAAGACTA 5100

TGGTGCTGCC TTGCTTCTGT ATTTCCTTGG ATTTTCCTGA AAGTGTTTTT AAATAAAGAA 5160
CAATTGTTAG ATGCC

Seq ID NO: 220 Protein sequence:
Protein Accession #: NP_005553

1 11 21 31 41 51

| | | | | |

MPALWLGCCL CFSLLLPAAR ATSRREVCDC NGKSRQCIFD RELHRQTGNG FRCLNCNDNT 60
DGIHCEKCKN GFYRHRERDR CLPCNCNSKG SLSARCDNSG RCSCKPGVTG ARCDRCLPGF 120
HMLTDAGCTQ DQRLLDSKCD CDPAGIAGPC DAGRCVCKPA VTGERCDRCR SGYYNLDGGN 180
PEGCTQCFCY GHSASCRSSA EYSVHKITST FHQDVDGWKA VQRNGSPAKL QWSQRHQDVF 240
SSAQRLDPVY FVAPAKFLGN QQVSYGQSLS FDYRVDRGGR HPSAHDVILE GAGLRITAPL 300
MPLGKTLPCG LTKTYTFRLN EHPSNNWSPQ LSYFEYRRLL RNLTALRIRA TYGEYSTGYI 360
DNVTLISARP VSGAPAPWVE QCICPVGYKG QFCQDCASGY KRDSARLGPF GTCIPCNCQG 420
GGACDPDTGD CYSGDENPDI ECADCPIGFY NDPHDPRSCK PCPCHNGFSC SVMPETEEVV 480
CNNCPPGVTG ARCELCADGY FGDPFGEHGP VRPCQPCQCN NNVDPSASGN CDRLTGRCLK 540
CIHNTAGIYC DQCKAGYFGD PLAPNPADKC RACNCNPMGS EPVGCRSDGT CVCKPGFGGP 600
NCEHGAFSCP ACYNQVKIQM DQFMQQLQRM EALISKAQGG DGVVPDTELE GRMQQAEQAL 660
QDILRDAQIS EGASRSLGLQ LAKVRSQENS YQSRLDDLKM TVERVRALGS QYQNRVRDTH 720
RLITQMQLSL AESEASLGNT NIPASDHYVG PNGFKSLAQE ATRLAESHVE SASNMEQLTR 780
ETEDYSKQAL SLVRKALHEG VGSGSGSPDG AVVQGLVEKL EKTKSLAQQL TREATQAEIE 840
ADRSYQHSLR LLDSVSRLQG VSDQSFQVEE AKRIKQKADS LSTLVTRHMD EFKRTQKNLG 900
NWKEEAQQLL QNGKSGREKS DQLLSRANLA KSRAQEALSM GNATFYEVES ILKNLREFDL 960
QVDNRKAEAE EAMKRLSYIS QKVSDASDKT QQAERALGSA AADAQRAKNG AGEALEISSE 1020
IEQEIGSLNL EANVTADGAL AMEKGLASLK SEMREVEGEL ERKELEFDTN MDAVQMVITE 1080
AQKVDTRAKN AGVTIQDTLN TLDGLLHLMD QPLSVDEEGL VLLEQKLSRA KTQINSQLRP 1140
MMSELEERAR QQRGHLHLLE TSIDGILADV KNLENIRDNL PPGCYNTQAL EQQ

Seq ID NO: 221 DNA sequence
Nucleic Acid Accession #: NM_016529
Coding sequence: 13-1854



1 11 21 31 41 51

| | | | | |

GTCAAGAAAA GAATGTCTGT AATTGTTCGA ACTCCTTCAG GACGACTTCG GCTTTACTGT 60
AAAGGGGCTG ATAATGTGAT TTTTGAGAGA CTTTCAAAAG ACTCAAAATA TATGGAGGAA 120
ACATTATGCC ATCTGGAATA CTTTGCCACG GAAGGCTTGC GGACTCTCTG TGTGGCTTAT 180
GCTGATCTCT CTGAGAATGA GTATGAGGAG TGGCTGAAAG TCTATCAGGA AGCCAGCACC 240
ATATTGAAGG ACAGAGCTCA ACGGTTGGAA GAGTGTTACG AGATCATTGA GAAGAATTTG 300
CTGCTACTTG GAGCCACAGC CATAGAAGAT CGCCTTCAAG CAGGAGTTCC AGAAACCATC 360
GCAACACTGT TGAAGGCAGA AATTAAAATA TGGGTGTTGA CAGGAGACAA ACAAGAAACT 420
GCGATTAATA TAGGGTATTC CTGCCGATTG GTATCGCAGA ATATGGCCCT TATCCTATTG 480
AAGGAGGACT CTTTGGATGC CACAAGGGCA GCCATTACTC AGCACTGCAC TGACCTTGGG 540
AATTTGCTGG GCAAGGAAAA TGACGTGGCC CTCATCATCG ATGGCCACAC CCTGAAGTAC 600
GCGCTCTCCT TCGAAGTCCG GAGGAGTTTC CTGGATTTGG CACTCTCGTG CAAAGCGGTC 660
ATATGCTGCA GAGTGTCTCC TCTGCAGAAG TCTGAGATAG TGGATGTGGT GAAGAAGCGG 720
GTGAAGGCCA TCACCCTCGC CATCGGAGAC GGCGCCAACG ATGTCGGGAT GATCCAGACA 780
GCCCACGTGG GTGTGGGAAT CAGTGGGAAT GAAGGCATGC AGGCCACCAA CAACTCGGAT 840
TACGCCATCG CACAGTTTTC CTACTTAGAG AAGCTTCTGT TGGTTCATGG AGCCTGGAGC 900
TACAACCGGG TGACCAAGTG CATCTTGTAC TGCTTCTATA AGAACGTGGT CCTGTATATT 960
ATTGAGCTTT GGTTCGCCTT TGTTAATGGA TTTTCTGGGC AGATTTTATT TGAACGTTGG 1020
TGCATCGGCC TGTACAATGT GATTTTCACC GCTTTGCCGC CCTTCACTCT GGGAATCTTT 1080
GAGAGGTCTT GCACTCAGGA GAGCATGCTC AGGTTTCCCC AGCTCTACAA AATCACCCAG 1140
AATGGCGAAG GCTTCAACAC AAAGGTTTTC TGGGGTCACT GCATCAACGC CTTGGTCCAC 1200
TCCCTCATCC TCTTCTGGTT TCCCATGAAA GCTCTGGAGC ATGATACTGT GTTTGACAGT 1260
GGTCATGCTA CCGACTATTT ATTTGTTGGA AATATTGTTT ACACATATGT TGTTGTTACT 1320
GTTTGTCTGA AAGCTGGTTT GGAGACCACA GCTTGGACTA AATTCAGTCA TCTGGCTGTC 1380
TGGGGAAGCA TGCTGACCTG GCTGGTGTTT TTTGGCATCT ACTCGACCAT CTGGCCCACC 1440
ATTCCCATTG CTCCAGATAT GAGAGGACAG GCAACTATGG TCCTGAGCTC CGCACACTTC 1500
TGGTTGGGAT TATTTCTGGT TCCTACTGCC TGTTTGATTG AAGATGTGGC ATGGAGAGCA 1560
GCCAAGCACA CCTGCAAAAA GACATTGCTG GAGGAGGTGC AGGAGCTGGA AACCAAGTCT 1620
CGAGTCCTGG GAAAAGCGGT GCTGCGGGAT AGCAATGGAA AGAGGCTGAA CGAGCGCGAC 1680
CGCCTGATCA AGAGGCTGGG CCGGAAGAGG CCCCCGACGC TGTTCCGGGG CAGCTCCCTG 1740
CAGCAGGGCG TCCCGCATGG GTATGCTTTT TCTCAAGAAG AACACGGAGC TGTTAGTCAG 1800
GAAGAAGTCA TCCGTGCTTA TGACACCACC AAAAAGAAAT CCAGGAAGAA ATAAGACATG 1860
AATTTTCCTG ACTGATCTTA GGAAAGAGAT TCAGTTTGTT GCACCCAGTG TTAACACATC 1920
TTTGTCAGAG AAGACTGGCG TCCAAGGCCA AAACACCAGG AAACACATTT CTGTGGCCTT 1980
AGTTAAGCAG TTTGTTAGTT ACATATTCCC TCGCAAACCT GGAGTGCAGA CCACAGGGGA 2040
AGCTATCTTT GCCCTCCCAA CTCGTCTGCA GTGCTTAGCC TAACTTTTGT TTATGTCGTT 2100
ATGAAGCATT CAACTGTGCT CTGTGAGGTC TCAAATTAAA AACATTATGT TTCACCAATA 2160
AGAAAAAAAA AAAAAAA

Seq ID NO: 222 Protein sequence:
Protein Accession #: NP_057613

1 11 21 31 41 51

| | | | | |

MSVIVRTPSG RLRLYCKGAD NVIFERLSKD SKYMEETLCH LEYFATEGLR TLCVAYADLS 60
ENEYEEWLKV YQEASTILKD RAQRLEECYE IIEKNLLLLG ATAIEDRLQA GVPETIATLL 120
KAEIKIWVLT GDKQETAINI GYSCRLVSQN MALILLKEDS LDATRAAITQ HCTDLGNLLG 180
KENDVALIID GHTLKYALSF EVRRSFLDLA LSCKAVICCR VSPLQKSEIV DVVKKRVKAI 240
TLAIGDGAND VGMIQTAHVG VGISGNEGMQ ATNNSDYAIA QFSYLEKLLL VHGAWSYNRV 300
TKCILYCFYK NVVLYIIELW FAFVNGFSGQ ILFERWCIGL YNVIFTALPP FTLGIFERSC 360
TQESMLRFPQ LYKITQNGEG FNTKVFWGHC INALVHSLIL FWFPMKALEH DTVFDSGHAT 420
DYLFVGNIVY TYVVVTVCLK AGLETTAWTK FSHLAVWGSM LTWLVFFGIY STIWPTIPIA 480
PDMRGQATMV LSSAHFWLGL FLVPTACLIE DVAWRAAKHT CKKTLLEEVQ ELETKSRVLG 540
KAVLRDSNGK RLNERDRLIK RLGRKTPPTL FRGSSLQQGV PHGYAFSQEE HGAVSQEEVI 600
RAYDTTKKKS RKK


Seq ID NO: 223 DNA sequence 
Nucleic Acid Accession #: BC017001 
Coding sequence: 1-394 



1 11 21 31 41 51

| | | | | |

AACGCTGGGC AGGGCCGGCG CGGGTCGGGG GGCGCCCGAG GGGCCCGGGC CGAGCGGCGG 60

CGCGCAGGGC GGCAGCATCC ACTCGGGCCG CATCGCCGCG GTGCACAACG TGCCGCTGAG 120 

CGTGCTCATC CGGCCGCTGC CGTCCGTGTT GGACCCCGCC AAGGTGCAGA GCCTCGTGGA 180 

CACGATCCGG GAGGACCCAG ACAGCGTGCC CCCCATCGAT GTCCTCTGGA TCAAAGGGGC 240 

CCAGGGAGGT GACTACTTCT ACTCCTTTGG GGGCTGCCAC CGCTACGCGG CCTACCAGCA 300 

ACTGCAGCGA GAGACCATCC CCGCCAAGCT TGTCCAGTCC ACTCTCTCAG ACCTAAGGGT 360 

GTACCTGGGA GCATCCACAC CAGACTTGCA GTAGCAGCCT CCTTGGCACC TGCTGCCACC 420 

TTCAAGAGCC CAGAAGACAC ACCTGGCCTC CAGCAGGCTG GGCCATGCAG AAGGGATAGC 480 

AGGGGTGCAT TCTCTTTGCA CCTGGCGAGA GGGTCTGACT CTGGGCACCC CTCTCACCGG 540 

CTACAAGGCC TTGGACTCAC TGTACAGTGT GGGAGCCCCA GTTCCCACCT CTGTGACAAT 600 

AGGATCATGG CCTTACCCTT GAAGCATTAC CGAGAAGGAG AACAGAGATG GGCTTGAAGA 660 

GCCACGTGCT GCCGGCTCCA AATTCCCAAG GACAAGGATC CCTCTGCATT TTTGTCTATG 720 

TAACCTCTTA TATGGACTAC ATTCAGCTGC AAGGAAAGGA AAACCTTGAT TGCAGTGGTT 780 

TAAACAAACA GAAGATTGTT TTTCCACATA GCATGGATTC TGGAGATGGG TGGCTAATGG 840 

TATTGGTTCA ACAACTCCAC GGAGGTAGGG GTCACGTCTT GGATCCTTTT GCCTTAATCT 900 

CAGTGCTCGT TACTTCATGG TCCCAAGATG GCTGCTGTAT CCCCAAGAAT CATGTCTGCG 960 

TTCAAGGAAG GAGGGGTGGA GGAAGAGGAA GGGCCAAACT AGCTGGACCC GTCACCTTCT 1020 

ATCAGAAAGT AAAACCTCGT CAGAAGTCTG TTTCCTGCTC TCTCCCTCTG CATATCTTCA 1080 

CTTAGATGCC CTTGGCCCGA GCCAGCTACC ATTGCACCTC TAGCTGCAAA CAAAGCTAAG 1140 

ACAGCAGGGA ACAGAATTGT CATGGCTGAA TAGACCAATC GTGTTCCATC TACTGAGACT 1200 

GGCACACTGC CTCCTGCAAT AAAACTGGGA TCCCATTACC AAGAGAGAAA TGCAGAATTG 1260 

TGTACCAGTT AGCTTTTGCT GTGTAACAAA CCATCCCCAA ACTTGGCAGC TAGAAACAAA 1320 

CCCTGTATTT TCCCACAATC CTATGGGTTG GCAATTTGGG CTGGGCTCAA CAGGGCAGTT 1380 

CTGCTGCTCA CACCTGGGAT CCCTCATGGA GCTAAGGTCA GCTGTTACCT CAGCTGGGCC 1440 

TGGATGGTCT AGGATAGCCT TACTCACTTG CCTGGCAGGT GACAGGCTGT TGGCTGGAAT 1500 

TGCTTGGTTC TCCTCCATGT GGCCTCTCCA GCAGGCTAGC TCAGGCTTAT TCACATGATG 1560 

GCTTCAGGAT TCCAAAGAGA GTGAGAGTAG AAGCTGAAAG ACTTCTTGAG TTCTTGGCCT 1620 

GGAACTGGGA CTAGGACAGT GTCACTTCTG CTAAGTTCTT TTGGTCAGAG CAAATCACAA 1680 

GGCTTTACCC AGATTCAAGG GATGAGAAAC AGACTACATG TCTTGATGAG GGGAACCACA 1740 

AAGAGCTTGT GGCCATTTTT CACCTATCAC AAATAATTTT GGATGGGTAT TTATTTGGAT 1800 

AAAGGTATTT CCCTCTTCCC CCTTTCTCTC TGTCTCATGG GGCCTCACTC TGCCAAGTTG 1860 

GAAGGCACTA AGACATTGTC CTGGCCCTCA GGGTCTAGGG GAAGAGGTGT TGGGGCAGGA 1920 

AGTGAGTCTC TCCATGGGCT GGACCCACTG TAGTAGGAGT GCCTCCTTGT CTGCACTGCT 1980 

GGTATGGGGT TAGGCCAGGT AGGACATTCC AGAGGGGCTT CTGAAAACCA AGAGTCCCTG 2040 

GGGAAAGGGA ACAGAGTAAG GCAGGCCTTG TTCTCACTGC CCTCTAAGGG AACTTGGTCA 2100

CTCGGCACTT TTAAGCCTCA GTTTCTCCAG TTCAATAATA AGGACAAGAG CTTTTCCCAT 2160 

GCATTCTCTT TCCCCGGGAA AGTTGACTGA GGTGACCAGT AATAGAATTG AAAAGGGAGA 2220 

GTGTCTTCAG TGCAATGTGG CATCCTGGAT TGGGTCTTGG AACAAAAACA GGACATTAGT 2280 

GGGAAAATTG GAAATCTGAA AAAAGTCTGA ATTTTAGTTA ATATACCAAT TTCAGTCTCT 2340

TGGTTTTGAC AGATGTACCA TGGTGATGTA AGATGTTGAC CTTGGGGTAG GCTGGGTGAA 2400 

GGGTATACAG GAACTCTTTG TACTATCTCT GCAACTTCTC TGTAAATCTA GTATCATTCC 2460 
AAAATAAAAG TTTATTTAAT TTAAAAAAAA AAAAAAAAAA AA 



Seq ID NO: 224 Protein sequence:
Protein Accession #: AAH17001.1

1 11 21 31 41 51

| | | | | |

TLGRAGAGRG APEGPGPSGG AQGGSIHSGR IAAVHNVPLS VLIRPLPSVL DPAKVQSLVD 60
TIREDPDSVP PIDVLWIKGA QGGDYFYSFG GCHRYAAYQQ LQRETIPAKL VQSTLSDLRV 120
YLGASTPDLQ 

Seq ID NO: 225 DNA sequence 
Nucleic Acid Accession #: NM_021048 
Coding sequence: 1..1110

1 11 21 31 41 51 

I I I I I I 

ATGCCTCGAG CTCCAAAGCG TCAGCGCTGC ATGCCTGAAG AAGATCTTCA ATCCCAAAGT 
GAGACACAGG GCCTCGAGGG TGCACAGGCT CCCCTGGCTG TGGAGGAGGA TGCTTCATCA 
TCCACTTCCA CCAGCTCCTC TTTTCCATCC TCTTTTCCCT CCTCCTCCTC TTCCTCCTCC 
TCCTCCTGCT ATCCTCTAAT ACCAAGCACC CCAGAGGAGG TTTCTGCTGA TGATGAGACA 
CCAAATCCTC CCCAGAGTGC TCAGATAGCC TGCTCCTCCC CCTCGGTCGT TGCTTCCCTT 
CCATTAGATC AATCTGATGA GGGCTCCAGC AGCCAAAAGG AGGAGAGTCC AAGCACCCTA 
CAGGTCCTGC CAGACAGTGA GTCTTTACCC AGAAGTGAGA TAGATGAAAA GGTGACTGAT 
TTGGTGCAGT TTCTGCTCTT CAAGTATCAA ATGAAGGAGC CGATCACAAA GGCAGAAATA 
CTGGAGAGTG TCATAAAAAA TTATGAAGAC CACTTCCCTT TGTTGTTTAG TGAAGCCTCC 
GAGTGCATGC TGCTGGTCTT TGGCATTGAT GTAAAGGAAG TGGATCCCAC TGGCCACTCC 
TTTGTCCTTG TCACCTCCCT GGGCCTCACC TATGATGGGA TGCTGAGTGA TGTCCAGAGC 
ATGCCCAAGA CTGGCATTCT CATACTTATC CTAAGCATAA TCTTCATAGA GGGCTACTGC 
ACCCCTGAGG AGGTCATCTG GGAAGCACTG AATATGATGG GGCTGTATGA TGGGATGGAG 
CACCTCATTT ATGGGGAGCC CAGGAAGCTG CTCACCCAAG ATTGGGTGCA GGAAAACTAC 
CTGGAGTACC GGCAGGTGCC TGGCAGTGAT CCTGCACGGT ATGAGTTTCT GTGGGGTCCA 
AGGGCTCATG CTGAAATTAG GAAGATGAGT CTCCTGAAAT TTTTGGCCAA GGTAAATGGG 
AGTGATCCAA GATCCTTCCC ACTGTGGTAT GAGGAGGCTT TGAAAGATGA GGAAGAGAGA 
GCCCAGGACA GAATTGCCAC CACAGATGAT ACTACTGCCA TGGCCAGTGC AAGTTCTAGC 
GCTACAGGTA GCTTCTCCTA CCCTGAATAA 

Seq ID NO: 226 Protein sequence:
Protein Accession #: NP_066386

1 11 21 31 41 51

| | | | | |

MPRAPKRQRC MPEEDLQSQS ETQGLEGAQA PLAVEEDASS STSTSSSFPS SFPSSSSSSS 60
SSCYPLIPST PEEVSADDET PNPPQSAQIA CSSPSVVASL PLDQSDEGSS SQKEESPSTL 120
QVLPDSESLP RSEIDEKVTD LVQFLLFKYQ MKEPITKAEI LESVIKNYED HFPLLFSEAS 180
ECMLLVFGID VKEVDPTGHS FVLVTSLGLT YDGMLSDVQS MPKTGILILI LSIIFIEGYC 240
TPEEVIWEAL NMMGLYDGME HLIYGEPRKL LTQDWVQENY LEYRQVPGSD PARYEFLWGP 300
RAHAEIRKMS LLKFLAKVNG SDPRSFPLWY EEALKDEEER AQDRIATTDD TTAMASASSS 360
ATGSFSYPE

Seq ID NO: 227 DNA sequence

Nucleic Acid Accession #: NM_005025.1

Coding sequence: 82-1314



GCGGAGCACA 
GAGGCTTGAA 
AGTATGGCTA 
TATAATCGTC 
GCTCTTGCAA 
CACTCAATGG 
TCAAACATGG 
GTGCAAAATG 
GCAGCAGTAA 
TGGGTGGAGA 
GCTGCCACTT 
TTTAGGCCTG 
ATTCCAATGA 
GAAGCTGGTG 
ATGCTGGTGC 
CAGCTGGTTG 
AGGTTCACAG 
GAAATTTTCA 
TCCAAAGCAA 
GTCTCAGGAA 
CATCCATTTT 
GTCATGCATC 
TTATTTGAAT 
TAGGATTTGT 
AATATATGTA 
TGTTATGTCA 



11 
I 

GTCCGCCGAG 
ACTGTTACAA 
CAGGGGCCAC 
TTAGAGCCAC 
TGGGAATGAT 
GATATGACAG 
TAACTGCTAA 
GATTTCATGT 
ATCATGTGGA 
ATAACACAAA 
ATCTGGCCCT 
AAAATACTAG 
TGTATCAGCA 
GTATCTACCA 
TGTCCAGACA 
AAGAATGGGC 
TGGAACAGGA 
TCAAAGATGC 
TTCACAAGTC 
TGATTGCAAT 
TCTTTCTTAT 
CTGAAACAAT 
AACAAGGAAA 
GTTTTACAGT 
AATTATAAGT 
TTGTGTTTGT 



21 
I 

CACAAGCTCC 
TATGGCTTTC 
TTTCCCTGAG 
TGGTGAAGAT 
GGAACTTGGG 
CCTAAAAAAT 
AGAGAGCCAA 
CAATGAGGAG 
CTTCAGTCAA 
CAATCTGGTG 
CATTAATGCT 
AACCTTTTCT 
AGGAGAATTT 
AGTCCTAGAA 
GGAAGTTCCT 
AAACTCTGTG 
AATTGATTTA 
AAATTTGACA 
CTTCCTAGAG 
TAGTAGGATG 
CAGAAACAGG 
GAACACAAGT 
ACAGTAACTA 
ATATCTTAAG 
AACTTGTCAA 
GTGCTGTTGT 



31 
I 

AGCATCCCGT 
CTTGGACTCT 
GAAGCCATTG 
GAAAATATTC 
GCCCAAGGAT 
GGTGAAGAAT 
TATGTGATGA 
TTTTTGCAAA 
AATGTAGCCG 
AAAGATTTGG 
GTCTATTTCA 
TTCACTAAAG 
TATTATGGGG 
ATACCATATG 
CTTGCTACTC 
AAGAAGCAAA 
AAAGATGTTT 
GGCCTCTCTG 
GTTAATGAAG 
GCTGTGCTGT 
AGAACTGGTA 
GGACATGATT 
AGCACATTAT 
ATAATATTTA 
GGAATGTTAT 
TTAAAATAAA 



41 
I 

CAGGGGTTGC 
TCTCTTTGCT 
CTGACTTGTC 

CTACCCAGAA 
TTTCTTTCTT 
AAATTGCCAA 
TGATGAAAAA 
TGGCCAACTA 
TATCCCCAAG 
AGGGGAACTG 
ATGATGAAAG 
AATTTAGTGA 
AAGGAGATGA 
TGGAGCCATT 
AAGTAGAAGT 
TGAAGGCTCT 
ATAATAAGGA 
AAGGCTCAGA 
ATCCTCAAGT 
CAATTCTATT 
TCGAAGAACT 
GTTTGCAACT 
AAATAGTTCC 
CAGTATTAAG 
AGTACCTATT 



51 
I 

AGGTGTGTGG 
GGTTCTGCAA 
AGTGAATATG 
ATTGAGTATT 
AGAAATCCGC 
GAAGGAGTTT 
TTCCTTGTTT 
ATATTTTAAT 
CATCAATAAG 
GGATTTTGAT 
GAAGTCGCAG 
TGAAGTCCAA 
TGGCTCCAAT 
AATAAGCATG 
AGTCAAAGCA 
ATACCTGCCC 
TGGAATAACT 
GATTTTTCTT 
AGCTGCTGCT 
TATTGTCGAC 
CATGGGACGA 
TTAAGTTACT 
GGTATATATT 
AGATAAAAAC 
CTAATGGTCC 
GAACATGTG 



Seq ID NO: 228 Protein sequence:
Protein Accession #: NP_005016.1

1 11 21 31 41 51

| | | | | |

MAFLGLFSLL VLQSMATGAT FPEEAIADLS VNMYNRLRAT GEDENILFSP LSIALAMGMM 60
ELGAQGSTQK EIRHSMGYDS LKNGEEFSFL KEFSNMVTAK ESQYVMKIAN SLFVQNGFHV 120
NEEFLQMMKK YFNAAVNHVD FSQNVAVANY INKWVENNTN NLVKDLVSPR DFDAATYLAL 180
INAVYFKGNW KSQFRPENTR TFSFTKDDES EVQIPMMYQQ GEFYYGEFSD GSNEAGGIYQ 240
VLEIPYEGDE ISMMLVLSRQ EVPLATLEPL VKAQLVEEWA NSVKKQKVEV YLPRFTVEQE 300
IDLKDVLKAL GITEIFIKDA NLTGLSDNKE IFLSKAIHKS FLEVNEEGSE AAAVSGMIAI 360
SRMAVLYPQV IVDHPFFFLI RNRRTGTILF MGRVMHPETM NTSGHDFEEL



Seq ID NO: 229 DNA sequence
Nucleic Acid Accession #: NM_003695
Coding sequence: 12-398

1 11 21 31 41 51

| | | | | |

CGACATCAGA GATGAGGACA GCATTGCTGC TCCTTGCAGC CCTGGCTGTG GCTACAGGGC 60
CAGCCCTTAC CCTGCGCTGC CACGTGTGCA CCAGCTCCAG CAACTGCAAG CATTCTGTGG 120
TCTGCCCGGC CAGCTCTCGC TTCTGCAAGA CCACGAACAC AGTGGAGCCT CTGAGGGGGA 180
ATCTGGTGAA GAAGGACTGT GCGGAGTCGT GCACACCCAG CTACACCCTG CAAGGCCAGG 240
TCAGCAGCGG CACCAGCTCC ACCCAGTGCT GCCAGGAGGA CCTGTGCAAT GAGAAGCTGC 300
ACAACGCTGC ACCCACCCGC ACCGCCCTCG CCCACAGTGC CCTCAGCCTG GGGCTGGCCC 360
TGAGCCTCCT GGCCGTCATC TTAGCCCCCA GCCTGTGACC TTCCCCCCAG GGAAGGCCCC 420
TCATGCCTTT CCTTCCCTTT CTCTGGGGAT TCCACACCTC TCTTCCCCAG CCGGCAACGG 480
GGGTGCCAGG AGCCCCAGGC TGAGGGCTTC CCCGAAAGTC TGGGACCAGG TCCAGGTGGG 540
CATGGAATGC TGATGACTTG GAGCAGGCCC CACAGACCCC ACAGAGGATG AAGCCACCCC 600
ACAGAGGATG CAGCCCCCAG CTGCATGGAA GGTGGAGGAC AGAAGCCCTG TGGATCCCCG 660
GATTTCACAC TCCTTCTGTT TTGTTGCCGT TTATTTTGTA CTCAAATCTC TACATGGAGA 720
TAAATGATTT AAACC


Seq ID NO: 230 Protein sequence:
Protein Accession #: NP_003686

1 11 21 31 41 51

| | | | | |

MRTALLLLAA LAVATGPALT LRCHVCTSSS NCKHSVVCPA SSRFCKTTNT VEPLRGNLVK
KDCAESCTPS YTLQGQVSSG TSSTQCCQED LCNEKLHNAA PTRTALAHSA LSLGLALSLL 
AVILAPSL 



Seq ID NO: 231 DNA sequence 

Nucleic Acid Accession #: Eos sequence

Coding sequence: 126-752 







1 11 21 31 

I I I I 

CCGGGCAGGT GGCTCATGCT CGGGAGCGTG GTTGAGCGGC 
AGGGGCGCAG GAATTCTGAT GTGAAACTAA CAGTCTGTGA 
AGAAGATGAA GGATATCGAC ATAGGAAAAG AGTATATCAT 
GTGTGAGGGA GAGAACCAGC ACTTCTGGGA CGCACAGAGA 
GGAGAACTCG ACCGTTGGAA TGCCAAGATG CCTTGGAAAC 
TCTCTCTTGA TGCCTCCATG CATTCTCAGC TCAGAATCCT 
GAAAGTACCA TCATGGCTTG AGTGCTCTGA AGCCCATCCG 
ACCCAGTGGA CAATGCTGGG CTTTTTTCCT GTATGACTTT 
CCCGTGTGGC CCACAAGAAG GGGGAGCTCT CAATGGAAGA 
ACGAGTCTTC TGACGTGAAC TGCAGAAGAC TAGAGAGACT 
AAGTTGGGCC AGACGCTGCT TCCCTGCGAA GGGTTGTGTG 
TCATCCTGTC CATCGTGTGC CTGATGATCA CGCAGCTGGC 
TTCAGGATGG CTGTATTCTG CGGTCAGAAT GAGAGAGTCA 
CAAGAGTTCA GCCTTCCTTT GGAGACTGCT CCATCAGTGC 
TTCACTGCAC CGCCATCTTA CTGAGTTGCT TCACGTGAGG 
TGACTCAGTT CCACATTTTG GATTGCATAC TGGAAAAGAA 
ACCAGCAACC CGGCTGTATA CAGTGGTGAC CCAAGCAATG 
AGGGAGGGGA GAGGTGGAAT ACAGTAGTTC TTGGAATCTG 
TTATTTCCTG GGACTTGGCA AAAATCTGAT TGGTGGGGAT 
TCTGGTATTA ATTTAATCTC AGGAAAAACA AGAAATTAAC 
GGAATTCAGC GTAGCTACCT CCAGACCGTG GTGTCTGGCC 
AGCTCTGACT TACAGCTGCA GTCACCTTTG CTATAAGGCA 
GGCTTCACAT CAATTTTTTT CTTCCTTTAG GGTGGGGGAT 
GTGGTTTTTT GTTTTATTTT TGTCAAGATT GATTTTTAGA 
CAGAAGGATG CCACCAGTTT TTCCTTGAGG CCTAGGATTT 
GGTAATTCCT CACAACTTAG TGCACCAGTA GCACCAGCCA 
TGGGGAGCTT TTCGTTTTGT TTTGTTTTTA ATTCTCTTTC 
TCCTAGAGAA TCTACTCCGT TGCAGAATCA TTGCAACCTC 
TGCTGTCAGC CTGATATACT ACTTTGGACT CTGGAAACAG 
TTCTACTGTG TGTCGTTAAA CAACCGTCGG AGACCAGATG 
TGTATAACTC GACTCTGTAT GTTTCAATGT ATGTTACTGC 
GTGTTTGTGA GATGCTCTTT GAAGATGGTA CTTTTATATT 

Seq ID NO: 232 Protein sequence: 
Protein Accession #: Eos sequence 



11 



21 



31 



41 
I 

TGGCGCGGTT 
GCCCTGGAAC 
CCCCAGTCCT 
CCGTGAAGAT 
AGCAGCCCGA 
GGATGAGGAG 
GACTACTTCC 
TTOGTGGCTT 
CGTGTGGTCT 
GTGGCAAGAA 
GATCTTCTGC 
TGGCTTCAGT 
AGCTGGGCAG 
CGAGGTGTGT 
AAAAGGGGGC 
GCCAATCTTC 
GATATAAACC 
AAGTCTCCTA 
CTCCTAGGAC 
CCAGAGAGAG 
TCCATTTTTG 
CCTGGGTAGA 
TGGTTTGGCT 
TGCAAGGACT 
TTTATTCTGT 
TTTTGAGCAG 
CTTAGCAGCA 
AGGAGCCCTC 
ATATGGGTTC 
ACCTGTTAGA 
AATGCTTCAC 
T 



41 



LSIVCLMITQ LAGFSGPNFQ DGCILRSE 

Seq ID NO: 233 DNA sequence 

Nucleic Acid Accession #: CAT cluster



50 | 



TTTTAATGGT 
CACAACATGA 
TGTTTCTTAG 
TAGTAAACTT 
TTTGTTTCCA 
TGTTTGACTG 
TTGTTATGGT 
AAGCTATTTT 
TATATTATGG 
ATTAGCCTAC 
AATAGGTTAA 
AATTAAAGTA 
TATGAAAGCT 
TATATATTTT 
TATCCTGTAT 
TAGAAGACCC 
TTTTTTTTAG 
GATTCTGTTA 
TCCAGATAAA 
CTTAGGCTCT 
CCATAGTATC 
ACTATAATAG 
TATCAATGTT 
TACATAAAAA 
AAGTCATTAA 
AGAATCATCT 
GGAAGATGGA 
GAGTTGAAGA 
CTTTTAAAAC 
TATTCTTCTA 
AGCCCTTGTG 
CTGTTCAGGT 
TGAGATGGAG 
ATCATATGGA 
TTTTAATTCT 
CAATCCCCAA 
TTTCTGGAAT 



11 

I 

GCTCATATAT 
ATCACATAAT 
ATCGTAGTCA 
CTCTTTCATC 
AAGTCACAAT 
AAAGCAAAAC 
ATATCTTTTT 
AATCATCAAG 
ATTAACCAGA 
TATAGTATTA 
AAAGTAGTTA 
CATTTTAAAT 
CTGGCTATCA 
TTAGTTCCTT 
TTTTTTTAAG 
ACTCTTACTA 
GTAGTTTAAA 
AGCATCCAAA 
AGATGGAGAA 
CTCCATGTAT 
AAGTGGAGGG 
AAGTTTGAGT 
ATCCAATGAT 
GTGCTCATGT 
TAATTTAATA 
GAGGACTTTT 
GCCATGCTTG 
CCACTGCTCT 
TAGTAATGTA 
ATAACTAGCA 
TAAGATTATT 
GAGAAAACAT 
GAGGTGGGCA 
AGTTTGGAAA 
CACTGTCTCA 
GGGGCAGTGT 
TGCACCTACA 



21 
I 

ACTGTATTTT 
CATGATTTTT 
TTGAGAAGTC 
TTTGTGTTAG 
TGAATTATTC 
AACGTGACAG 
ATTAAATATT 
TATGGAAAAC 
ATTGTATCAT 
ATAAATTTTT 
CAAATTAAAC 
GAGCTTTATA 
TCCTGGGATA 
TGAGATAACT 
AATTGTTTTA 
GGTTCCCTAA 
GCAAGCACTG 
AACAATGCCT 
TACCTCATGT 
CTTTCTTAAG 
TAGTTCAGAA 
AATATTTTAA 
TTTTATTAAA 
ATTTGAATTT 
ATTGTTTTAA 
AATATGGAAT 
TTTTTCCAAA 
AAATTAGTGC 
CCCAGTTAAG 
TTTATTACAT 
ATTTCTTCTC 
AATGGATTTT 
CATTTAAGGT 
AGAGAGCTTA 
AAAGAGAATC 
TACCTTACTC 
TGTTTTCTTA 



31 
I 

TTGTTGTTTA 
TTTTTTTACT 
CCAATAACTC 
CTCTGTAGTC 
TTAGATACCT 
TTTATTTTCA 
TATTTTGACT 
AAATTACTAT 
TTTTGGCCTA 
CAGTTGGTTT 
TTACTAATTT 
ATACCTTAAA 
GTAATTTCTA 
AATTTCTAAT 
TAAATAGGTC 
GGATCTGCCA 
ATACCAGTGG 
AATTTCAGTT 
ACTGTGACTT 
GAAAAGTTTC 
AAGTTAATAG 
TAAATTTATA 
AAATTACCTT 
TAAATAATTT 
ATCAGTGGTT 
CCACCTCATA 
AGCTCTTTGA 
AGGAAAATGC 
TTTTGATGGT 
GAAATTTAAG 
TATAACTTCA 
TTTTTTTTTC 
CAGTTCACTA 
TCACAGGTTT 
AGCTCTCCAG 
CTTCACTGCT 
TTAACATTCA 



41 
I 

GTTTTACTTA 
TTTACTCCCC 
TAAACTTTTG 
TTAACCTGGA 
TAAGCCACTG 
AACACTAACT 
AAGCTTTCAT 
TGCATTTTCC 
ATGTCTGGAT 
GGGCAAATTT 
ATACCTGATT 
AAGTTGGTTC 
ATTATATAGT 
TATATATGTT 
ATAAGATACA 
TAGATTTTTT 
GAGTTGGTCT 
CTTAGGTTAT 
GAAAATGAAT 
TGAGTGTGAT 
GAAATCTTTT 
TAATTCAAAT 
ATTATTAGAA 
ATTTAAATCA 
TTCAACCCTC 
ACAATTAAGT 
GTGATTCTAA 
TTTTATTTCT 
TTAAATTCCA 
AGTTTAAGTT 
AAATAGATAT 
CTCTGGAGCT 
ACCTATGGTT 
GTATGCTGGT 
CAGTTCTAGA 
TCTTAGAAGG 
GAATTGGGAA 



51 




1 

GTCCTGGAGC 


60 


CTCCACTCAG 


120 


GGGTATAGAA 


180 


TCCAAGTTCA 


240 


GCCGAGGGCC 


300 


CATCCCAAGG 


360 


AAACACCAGC 


420 


TCTTCTCTGG 


480 


CTGTCCAAGC 


540 


GAGCTGAATG 


600 


CGCACCAGGC 


660 


GGACCAAATT 


720 


AATCTCTCGC 


780 


GGGAACAGGC 


840 


TTTGGCCCTG 


900 


TTGCTAGTAA 


960 


TAAAAATCTG 


1020 


TTTGATCAGG 


1080 


CTAGTGGACA 


1140 


TCTGGGTTTT 


1200 


TCTGTCATTC 


1260 


AGGGTGGATG 


1320 


TTCTTTTGTT 


1380 


TGAAAAGACC 


1440 


CCCGAGCAGA 


1500 


AGTACCTCTT 


1560 


AGGTCTTTTT 


1620 


ACTGATTGAG 


1680 


TATTCTCTAT 


1740 


TGGCTAGTCC 


1800 


CTGCTGTACA 


1860 


51 




1 

ETAARAEGLS 


60 


TFSWLSSLAR 


120 


VWIFCRTRLI 


180 


51 




1 

TTGAGAGTGT 


60 


AAATTATTCA 


120 


AGTTATAACG 


180 


TTTTAATTTT 


240 


AATTCAGTTC 


300


TCTTGATATT 


360 


AAAATATTTG 


420 


TATATATGCA 


480 


ATAAAAGATA 


540 


AAACCTGAAA 


600 


TTTTTTCTTG 


660 


TAATTTAAAA 


720 


ATTTCAAAAC 


780 


TCAAAAACCA 


840 


AGGTCTGCAT 


900 


TTTTTTTTTT 


960 


TGATCTAGGA 


1020 


GGCTTGTGAC 


1080 


TCTTAAAATT 


1140 


CTCTCTTTTG 


1200 


GTGACAGCAG 


1260 


GATAAAAATG 


1320 


CTGTGCCTAT 


1380 


AGACCACCAT 


1440 


ACTTCATATT 


1500 


CTAAATTTCT 


1560 


TTTGTAGTCA 


1620 


CCCATGTTAA 


1680 


CTAAAGAACA 


1740 


CCATCAAACT 


1800 


TTCATTCAAA 


1860 


GCCTGTTCAG 


1920 


CAGAGTTCTG 


1980 


GAATGGATAG 


2040 


AAAGCTTTGA 


2100 


TAGAATTAAG 


2160 


TATTAATTTT 


2220 




TCCAGTGAGT AGTTTTCTGA AATTGGTAAC TTGGAGAGTA AAATAACGTA TTTTGCTTTT 2280
CAATTTTGTG TTTGTTTACT TTTATGTAAA AATTTGATAT GTGAATTACA CAGTTCTAAT 2340
AAAACCTCAT GCCTTTTCAT TACATCTAAT TTGAACTCTC AACTTCAGTG CCAGAAGTGC 2400
TTTAAAGATG CTTTAATGAA AAGTATTAAG AAAATATATA GATTTGTATG TCAGTTTATA 2460
CTTCAGAAAT CCATATATTT GTCATATTTA TTTTTTTAGA AACCTCCTAA TTGGATAACT 2520
AGATGGTATT TAAAATGAAT GCCCAAAAAT ATCTTGTACC TTTGTCCAAA AGTTTATCTG 2580
TTGGAAGCCG CCAGCCATTC ATGTAGAGAG TTTATAAGAA AATAATTTAA AATTGTATGC 2640
ATTTTATATT ACTATGGTAT CTGTGTACCA TATTTCTAAG TATTCATTAT TAAATTGGTA 2700
CTTCTTAAAA CCATAACCTG GCTTGCCTTT TAGTGTTAAA CACAAAATCC AACATTGTAT 2760
ATAGAGATTC TTCTTTTATG AAGAAGAGCT GACGTAATTT ATTACCAGTG CATCTGCACA 2820
AAGACATTAA CATAAGTCTC TGAGCAGTGA TACATTTTCA AACATGAAGA GTGACAACCA 2880
CCACATTAAA CAACCACGGC AACACTCAGA CTTGGCACTT TCCTACGAAT CCATCCTATA 2940
TGTGCCTGGT ATCGCCTCTG GCATAACTTA CACGAATCGT CCTCCCTACT TGTCTACGCT 3000
CCTTCATCAA GCACTTGCCA ACACATTCAC CTCTAACTTG TACAACCTTA CCAACTCACC 3060
ACAACATCTG CAACTCTACC CTATCAACTG CCAACCTAAA GACCCCCAAC ACAACACAAC 3120
CCCCAAACAC AAAACCACTA AATCATAACC ACCACACACG CCACACACCA CACACCCACC 3180
CACACAACCA ACACACCACG ACCAAACACC CCACCACAAA CAAGCTAACA ACCACAAACA 3240
GACAACACAT CACATACACT CACTACCCCC CCATACTCCC ACCCACCA

Seq ID NO: 234 DNA sequence

Nucleic Acid Accession #: Eos sequence

Coding sequence: 27-281

1 11 21 31 41 51

| | | | | |

AGCAGGAGGA GAGCTGGCGG GAAGACATGC ACCCCTTGAA GACCCAGAGA GAGGCCGTCT 60
GTCTACCGCG TAGCAGTTAC ATCAGACTGA GACACTTCCT GTTTACAGGA GACTATAAAA 120
TTCCTGCCCC GTGCTCATTT GGGGCTGACG CCATTTTAGG CCTCAGCCCA TCTGCACCCA 180
GGCGCTCACT GAAACAGTGT GTTGCTCCAC ACCGCCTTGT TTTGCTTGTT GGCGCGCTCT 240
CAGGGTTCCG ACCAATCCAA GAGCCTTGCA GAAAGCATTA ACGTGCTTTT CTCTTTGGCA 300
GAGTTTTTCT TTGCTCTGAT CTTGGAGACA TCCCTCTGCC TAGTGGAAAC ATAAGGAATA 360
CAGAAAGAAT GCAAGGAGAT AGACCAACGT GAGATTCTCC TTCATGCACT CAAGAGAAAG 420
ATGTTGCAGG AAGAGCTAGT CTTTCAGGCT GGGCTGGTGA CCTGAGAAAG AATGTCCAGC 480
TTTTCTTCTC CACTTGGCAT ATCAAGAGCC AGGCGTGGAA GACTAAAACA GGAAATGTTT 540
ATAAAAACTG TTCAGCGGTT CGCCAACAAG AAGTGGTAAA GTAGCAAAAA TGGGGATGGA 600
GATGCCAGGA GGAAAGATGC CAGGGGTAAA GTGGGAAAAT GGGAACCTGA AGCCAGGAGG 660
TCAAGCCAAG CCAACAGGTG TTCTGTTTTT CATCACAGAA CTAATAAGTG GTGCTGAGGA 720
CTCAAACCCG GGGAAGCCCA CTCTAGAACC CATGCTGGTC ATCCATATCC CCAAGGCCCT 780
GGTCAGAACA CAGCTAAGCA GATGGCTTGG GTCATCAGGA CGTCCATTAC ATCCAAAGGA 840
AGACAGCCTG TGACGTTTCA AAAGCAAAAG TCCCCTACCA GCCAGTGAAG CTACCTGATT 900
TCTCAGTATC TTACGCCCAG TGACACGATC TACCCTCAAA ACTTAAAAAA AAAAGGGAAA 960
CATAAACACA TAACAGCAGC AGCAATAATT AAAGATGAGA TGAGAACAAT TAAGAAAAAA 1020
GGAAAGGTCT CCTGTGACTG TTTTATTTTT AGGGAAACAG AGAGGAAGAA GAATGATTTT 1080
TCTTTTGATG ACTCTATATC CAACTCTGAG GTTTGATTAA AGAAATGACC TTGAACCACA 1140
GCAAAGAAAA ATAAAAGACA ATTTCCAGTA AGTATGCCAG TTCGAATTAA TGATTTACTT 1200
TTTATTTTTA AACTGAATTC AGCAGAGATT TACATGCATT ACGATGATTA ACATCTGAAA 1260
TTTGACCTTG AAATAATCTT TACATTGTAA ATTCTTAATG ATCAAAACAA GGTTCTCAGT 1320
GATTAAAACA TATTAGTAAT TAATTATTAA AGGAGAATAA TTGCAAATAC AACATTCCTA 1380
AAATCTCAAG GCTTTTAAAG CATTTGTACA AATGACTGGA CATTTTTTAA ATTTGAAAAA 1440
AAAAAAAAGC CCTCCATCTG ATTCTCATTT TCATTGTCAG TGCAACAACA AAAAAGGTAT 1500
GCACTTCTCT TCTCATTTTC CACTGTCTCG CAAGCTAGAA ATTCTCACGA CTACCTTTGA 1560
TCCCATCAAA GCCAAAGAAA GAAAAGAAAA TTGTTCTGTA CAGATATATG ACATTAAAAA 1620
ATAATCCC



Seq ID NO: 235 Protein sequence: 
Protein Accession #: Eos sequence 



1 11 21 31 41 51



MHPLKTQREA VCLPRSSYIR LRHFLFTGDY KIPAPCSFGA DAILGLSPSA PRRSLKQCVA 
PHRLVLLVGA LSGFRPIQEP CRKH 

Seq ID NO: 236 DNA sequence 

Nucleic Acid Accession #: NM_002075 

Coding sequence: 406.. 1428 



1 
I 

CCACAATAGG 
ACAGGATCAG 
AGTCCTTTCT 
ACCACCCTGA 
GGCCAGGCCA 
CGTCGCAGCT 
CCAGCCAGAG 
CAACTGCGTC 
GCTGACGTTA 
CGGACGCGGC 
GATTCTAAGC 
ACCACCAACA 
GCCCCATCAG 
CTCAAATCCC 
CTCTCCTGCT 
TGTGCCTTGT 
GACTGCATGA 
GCCAGTGCCA 



11 

t 

GGCAGACCTG 
ACCCAGAGGC 
AATCTCAGCT 
GCTGAGGAGC 
GGCCAGCTCC 
GAGGGAGTAA 
CCCAAGAGCC 
AGGAAGCGGA 
CTCTGGCAGA 
GGACGTTAAG 
TGCTGGTAAG 
AGGTGCACGC 
GGAACTTTGT 
GTGAGGGCAA 
GCCGCTTCCT 
GGGACATTGA 
GCCTGGCTGT 
AGCTCTGGGA 



21 
I 

TCCATCCTTC 
AGCTGGTTGG 
CCTGCCTGTA 
ACAGTTTGAG 
TCTGGCAGCA 
GGAGGCTCCC 
AGAGTGACCC 
GCAGCTCAAG 
GCTGGTGTCT 
GGGACACCTG 
TGCCTCGCAA 
CATCCCACTG 
GGCATGTGGG 
TGTCAAGGTC 
GGATGACAAC 
GACTGGGCAG 
GTCTCCTGAC 
TGTGCGAGAG 



31 
I

TCTGTGGGTC 
GGTTTGTCGA 
CCCTCCCATA 
GCCCCCCCAA 
GAGCCTGGGC 
AGGAACCGGA 
CTCGACCTGT 
AAGCAGATTG 
GGCCTAGAGG 
GCCAAGATTT 
GATGGGAAGC 
CGCTCCTCCT 
GGGCTGGACA 
AGCCGGGAGC 
AATATTGTGA 
CAGAAGACTG 
TTCAATCTCT 
GGGACCTGCC 



41 
I 

CCCTGTACCT 
GAAGAAGGAT 
CTCACCAAAC 
CCCCCCGCCG 
AGGTGACGGG 
GCTGGAAACC 
CAGCCATGGG 
CAGATGCCAG 
TGGTGGGACG 
ACGCCATGCA 
TGATCGTGTG 
GGGTCATGAC 
ACATGTGTTC 
TTTCTGCTCA 
CCAGCTCGGG 
TATTTGTGGG 
TCATTTCGGG 
GTCAGACTTT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 



60 



51 
I 

TTCTCCCCCA 
TATCCAGATC 
CCTCTTCCCC 
GTCGGGGCCA 
CGGGCGCGGG 
CGGCCGAGGT 
GGAGATGGAG 
GAAAGCCTGT 
AGTCCAGATG 
CTGGGCCACT 
GGACAGCTAC 
CTGTGCCTAT 
CATCTACAAC 
CACAGGTTAT 
GGACACCACG 
ACACACGGGT 
GGCCTGTGAT 
CACTGGCCAC 



60 
120 
180
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 






WO 02/086443 



PCT/US02/12476 



GAGTCGGACA
GATGACGCTT 
CACGAGAGCA 
TTCGCTGGCT 
GGCATCCTCT 
GCTGTGGCCA 
AAGGGAAGTG 
GGTGTTCTCT 
GGGAGCATGG 
CCATCTCCTC 
GGACAACCTG 
GCCCTAGGAT 
TTGGCCCTGT 
TTCTCCTTTT 
GT 



TCAACGCCAT 
CCTGCCGCTT 
TCATCTGCGG
ACGACGACTT 
CTGGCCACGA 
CAGGTTCCTG 
GAAGGCAGTG 
TCTATATTCC 
GACTGTGCCT 
CCATGGCCTT 
CCCCTCCCCA 
TCCTCCCCCA 
GACTATGGCT 
TCTACCTTTT 



CTGTTTCTTC 
GTTTGACCTG 
CATCACGTCC 
CAACTGCAAT 
TAACAGGGTG 
GGACAGCTTC 
AACACACTCA 
GGGTGCCATT 
TTGGGAGGCA 
CCCTCCCCAC 
GCCCTTTGCA 
GAGCCACTAC 
CTGGCACCAC 
TTTCTCTCCT 



Seq ID NO: 237 Protein sequence: 
Protein Accession #: NP_002066 



MGEMEQLRQE 
MHWATDSKLL 
CSIYNLKSRE 
VGHTGDCMSL 
ICTGSDDASC 
KSERVGILSG 



11 
I 

AEQLKKQIAD
VSASQDGKLI 
GNVKVSRELS 
AVSPDFNLFI 
RLFDLRADQE 
HDNRVSCLGV 



21 
I 

ARKACADVTL 
VWDSYTTNKV 
AHTGYLSCCR 
SGACDASAKL 
LICFSHESII 
TADGMAVATG 



CCCAATGGAG 
CGGGCAGACC 
GTGGCCTTCT 
GTCTGGGACT 
AGCTGCCTGG 
CTCAAAATCT 
GCAGCCCCCT 
CCCACTAAGC 
GCATCAGGGA 
AGTCCTCACA 
GGCCCAGCAG 
CTTTGTCCAG 
TAGGGTCCTG 
AAGACACCTG 



31 
I 

AELVSGLEVV
HAIPLRSSWV 
FLDDNNIVTS 
WDVREGTCRQ 
CGITSVAFSL 
SWDSFLKIWN 



AGGCCATCTG 
AGGAGCTGAT 
CCCTCAGTGG 
CCATGAAGTC 
GAGTCACAGC 
GGAACTGAGG 
GCCCGACCCC 
TTTCTCCTTT 
CACAGGGGCA 
GCCTCTCCCT 
ACTTGAGTCT 
GCCTGGGTGG 
GCCCTCTTCT 
CAATAAAGTG 



41 
I 

GRVQMRTRRT 
MTCAYAPSGN 
SGDTTCALWD 
TFTGHESDIN 
SGRLLFAGYD 



CACGGGCTCG 
CTGCTTCTCC 
CCGCCTACTA 
TGAGCGTGTG 
TGACGGGATG 
AGGCTGGAGA 
ATCTCATTCA 
GAGGGCAGTG 
AAGAACTGCC 
TAATGAGCAA 
GAGGCCCCAG 
TATAGGGCGT 
TATTCATGCT 
TAGCACCCTG 



51 
I 

LRGHLAKIYA 
FVACGGLDNM 
IETGQQKTVF 
AICFFPNGEA 
DFNCNVWDSM 



Seq ID NO: 238 DNA sequence 

Nucleic Acid Accession #: CAT cluster



1 
I 

TCCCAATGTG 
TACCATTTGC 
ACTGATTCTG 
TGCATTGACC 
TAAGTGACTG 
TTTTGACGAG 
CAAAAAGGGG 
AAGAAAGAAA 
GGATCAAGAA 
CAGAAGTGAA 
TAGAAAAGTT 
CAACTACTCA 
CTTGTGTTCC 
ACTGTTGTTT 
GACATGAAAG 
TAGGCTAAGT 
AAAATCCCAA 



11 

I 

TNGAACCTAC 
TTTTAAGGCA 
TGAATTATGA 
AGTGTGAAGC 
GAAAGCTGAA 
TATCGGGTGA 
AAAAAAAAGA 
AATAAAATAC 
GTTTGTGTAC 
TCAAAATATT 
TTTCTGTAAA 
ACTTTCCTAC 
AATAAAGCTT 
GCCAAGTCCT 
TTCATTGGGT 
TATAATACAC 
ATAAAA 



21 
I 

CATAAATTCT 
GATAATCCTC 
AATCTGAAAA 
ACAGTGGAAT 
GAATCACCGG 
CTTTGAGGTG 
GCAACCAAAG 
ACAATATGGA 
ACATAATCTC 
TCAAAATGCT 
AGTCAGATAG 
TGTAGCACAA 
CATTTACAAA 
AATATAGTTG 
TGCTAAAAAG 
TGTTTTAACA 



31 
I 

TTTCTTACNG 
CAAGTTTTCT 
GGAATTGGAA 
GAGAATGCGT 
CTTCAGTGAC 
GTCAAGAAAC 
AAAAAAAATC 
CGATGGAGAA 
ATTTTGAGAT 
GTCTTATGAA 
TAAATATTTT 
GAGTAGCTGT 
AACATGCCAT 
CTTAGCAAGT 
TATGTAGAAA 
ATTGTAAAAT 



41 
I 

GACAATCTTA 
AATGATATCT 
GTTGCTAAAA 
GCCCTGACAC 
ATGGAACCCA 
CACACTTTAA 
CATAAAATTG 
AAACAGTTAC 
ATATAACTAT 
ACTACAATAT 
AGGTTTTGCA 
GGTACTGTGC 
GGGCCATATT 
ATTGTGAGCT 
TTCAAAGGAA 
GTAAGAGAAA 



Seq ID NO: 239 DNA sequence 

Nucleic Acid Accession #: NM_001786.1 

Coding sequence: 130-1023 



1 
I 

GGGGGGGGGG 
GTTGTTGTAG 
TGACTAACTA 
GTGTATAAGG 
GAAAGTGAAG 
CTTCGTCATC 
CTCATCTTTG 
CAGTACATGG 
TTTTGTCACT 
GACAAAGGAA 
AGAGTATATA 
TCAGCTCGTT 
GCAACTAAGA 
AGAGCTTTGG 
AAGAATACAT 
GAAAATGGCT 
GGCAAAATGG 
TAGCTTTCTG 
AACTCTTGTC 
GTCTTCTAAT 
ATTCTGTAAA 



11 
I 

GGCACTTGGC 
CTGCCGCTGC 
TGGAAGATTA 
GTAGACACAA 
AGGAAGGGGT 
CAAATATAGT 
AGTTTCTTTC 
ATTCTTCACT 
CTAGAAGAGT 
CAATTAAACT 
CACATGAGGT 
ACTCAACTCC 
AACCACTTTT 
GCACTCCCAA 
TTCCCAAATG 
TGGATTTGCT 
CACTGAATCA 
ACAAAAAGTT 
TATTTTTGTC 
TTCAAAAATA 
TGTGAAAAAA 



21 
I 

TTCAAAGCTG 
GGCCGCCGCG 
TACCAAAATA 
AACTACAGGT 
TCCTAGTACT 
CAGTCTTCAG 
CATGGATCTG 
TGTTAAGAGT 
TCTTCACAGA 
GGCTGATTTT 
AGTAACACTC 
AGTTGACATT 
CCATGGGGAT 
TAATGAAGTG 
GAAACCAGGA 
CTCGAAAATG 
TCCATATTTT 
TCCATATGTT 
TTATATATAT 
TAACTTAAAA 
AAAAAAAAAA 



31 
I 

GCTCTTGGAA 
GAATAATAAG 
GAGAAAATTG 
CAAGTGGTAG 
GCAATTCGGG 
GATGTGCTTA 
AAGAAATACT
TATTTATACC 
GACTTAAAAC 
GGCCTTGCCA 
TGGTACAGAT 
TGGAGTATAG 
TCAGAAATTG 
TGGCCAGAAG 
AGCCTAGCAT 
TTAATCTATG 
AATGATTTGG 
ATGTCAACAG 
TTCTTTGTTA 
ATGTAAATAT 
AAAAA 



41
I 

ATTGAGCGGA 
CCGGGATCTA 
GAGAAGGTAC 
CCATGAAAAA 
AAATTTCTCT 
TGCAGGATTC 
TGGATTCTAT 
AAATCCTACA 
CTCAAAATCT 
GAGCTTTTGG 
CTCCAGAAGT 
GCACCATATT 
ATCAACTCTT 
TGGAATCTTT 
CCCATGTCAA 
ATCCAGCCAA 
ACAATCAGAT 
ATAGTTGTGT 
TCAAACTTCA 
TCTATATGAA 



1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 



51 
I 

TNCTAANCAA 
GAAACTATTA 
ATCTATCATT 
CAAAGAAAAA 
GTGATTTGAT 
GAACAATGTC 
CACAGAAGAA 
ATTTCTTTAT
TTTTGTCTTT 
TCTCACAGAT 
GTGTCTTTTG 
AAATAAATTG 
TGGCCTGTAC 
ATTTGAGGAA 
AATTAAAATT 
TTTACAAATA 



51 
I 

GAGCGACGCG 
CCATACCCAT 
CTATGGAGTT 
AATCAGACTA 
ATTAAAGGAA 
CAGGTTATAT 
CCCTCCTGGT 
GGGGATTGTG 
CTTGATTGAT 
AATACCTATC 
ATTGCTGGGG 
TGCTGAACTA 
CAGGATTTTC 
ACAGGACTAT 
AAACTTGGAT 
ACGAATTTCT 
TAAGAAGATG 
TTTTATTGTT 
GCTGTACTTC 
TTTAAATATA 



Seq ID NO: 240 Protein sequence: 
Protein Accession #: NP_001777.1

1 11 21 31 41 51

I I I I I I

MEDYTKIEKI GEGTYGVVYK GRHKTTGQVV AMKKIRLESE EEGVPSTAIR EISLLKELRH
PNIVSLQDVL MQDSRLYLIF EFLSMDLKKY LDSIPPGQYM DSSLVKSYLY QILQGIVFCH



60 
120 
180 
240 
300 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 



60 
120 




SRRVLHRDLK PQNLLIDDKG TXKLADFGLA RAFGIPIRVY THEVVTLWYR SPEVLLGSAR
YSTPVDIWSI GTIFAELATK KPLFHGDSEI DQLFRIFRAL GTPNNEVWPE VESLQDYKNT 
FPKWKPGSLA SHVKNLDENG LDLLSKMLIY DPAKRISGKM ALNHPYFNDL DNQIKKM 



Seq ID NO: 241 DNA sequence 

Nucleic Acid Accession #: NM_033379.1

Coding sequence: 132-854 






180 
240 



1 
I 

CGCCCGCGCG 
GCTTTGCAGA 
ATTGACTAAC 
TTGTGTATAA 
TAGAAAGTGA 
AACTTCGTCA 
ATCTCATCTT 
GTCAGTACAT 
TATTGCTGGG 
TTGCTGAACT 
TCAGGATTTT 
TACAGGACTA 
AAAACTTGGA 
AACGAATTTC
TTAAGAAGAT 
TTTTTATTGT 
AGCTGTACTT 
ATTTAAATAT 



11 
I 

CGGGCTCAAC 
GAGCGCCCTC 
TATGGAAGAT 
GGGTAGACAC 
AGAGGAAGGG 
TCCAAATATA 
TGAGTTTCTT 
GGATTCTTCA 
GTCAGCTCGT 
AGCAACTAAG 
CAGAGCTTTG 
TAAGAATACA 
TGAAAATGGC 
TGGCAAAATG 
GTAGCTTTCT 
TAACTCTTGT 
CGTCTTCTAA 
AATTCTGTAA 



21 
I 

TTTGTAGAGC 
CAGGGACTAT 
TATACCAAAA 
AAAACTACAG 
GTTCCTAGTA 
GTCAGTCTTC 
TCCATGGATC 
CTTGTTAAGG 
TACTCAACTC 
AAACCACTTT 
GGCACTCCCA 
TTTCCCAAAT 
TTGGATTTGC 
GCACTGAATC 
GACAAAAAGT 
CTATTTTTGT 
TTTCAAAAAT 
ATGTGAAAAA 



31 
I 

GAGGGGCCAA 
GCGTGCGGGG 
TAGAGAAAAT 
GTCAAGTGGT 
CTGCAATTCG 
AGGATGTGCT 
TGAAGAAATA 
TAGTAACACT 
CAGTTGACAT 
TCCATGGGGA 
ATAATGAAGT 
GGAAACCAGG 
TCTCGAAAAT 
ATCCATATTT 
TTCCATATGT 
CTTATATATA 
ATAACTTAAA 
AAAAAAAAAA 



41 
I 

CTTGGCAGAG 
ACACGGGATC 
TGGAGAAGGT 
AGCCATGAAA 
GGAAATTTCT 
TATGCAGGAT 
CTTGGATTCT 
CTGGTACAGA 
TTGGAGTATA 
TTCAGAAATT 
GTGGCCAGAA 
AAGCCTAGCA 
GTTAATCTAT 
TAATGATTTG 
TATGTCAACA 

AATGTAAATA 



51 
I 

CGCGCGGCCA 
TACCCATACC 
ACCTATGGAG 
AAAATCAGAC 
CTATTAAAGG 
TCCAGGTTAT 
ATCCCTCCTG 
TCTCCAGAAG 
GGCACCATAT 
GATCAACTCT 
GTGGAATCTT 
TCCCATGTCA 
GATCCAGCCA 
GACAATCAGA 
GATAGTTGTG 
ATCAAACTTC 
TTCTATATGA 



Seq ID NO: 242 Protein sequence: 
Protein Accession #: NP_203698.1 



1 11 21 31 41 51 

I I I I I I 

MEDYTKIEKI GEGTYGVVYK GRHKTTGQVV AMKKIRLESE EEGVPSTAIR EISLLKELRH
PNIVSLQDVL MQDSRLYLIF EFLSMDLKKY LDSIPPGQYM DSSLVKVVTL WYRSPEVLLG
SARYSTPVDI WSIGTIFAEL ATKKPLFHGD SEIDQLFRIF RALGTPNNEV WPEVESLQDY 
KNTFPKWKPG SLASHVKNLD ENGLDLLSKM LIYDPAKRIS GKMALNHPYF NDLDNQIKKM 

Seq ID NO: 243 DNA sequence 

Nucleic Acid Accession #: AF101051.1 

Coding sequence: 221-856 



GAGCAACCTC 
CGACCCAGAG 
GCGGGGCCCA 
ACCTGCCACC 
GCTGTTGGGC 
GCCCCAGTGG 
CGAGGGGCTG 
TGACTCCTTG 
CATCCTCCTG 
CTTGGAAGAC 
TCTTGCAGGT 
ATTCTATGAC 
TGGCTGGGCT 
CCGAAAAACA 
GAAAGACTAC 
GGACATTGAG 
GTATGGTATT 
AAACATGGCT 
TTGTATTACT 
TATATATAGA 
CTCATTATGT 
CCATATTGAT 
CAGTCAAATA 
CTAATTTACC 
TTATTTTTTA 
TTTCATTGGT 
AGCCAAGAAG 
GTGATAAATT 
TTTGCTTTGA 
CACAACTTTA 
ACCTTTTTGT 
TATATCTTCC 
GATAATCTGG 
TCTTTTTTCT 
AATATTAATT 
TTTATTTGCT 
CTTCATGTGA 
ACACATACCT 
AAACCTACGC 
ATTCTTTCAG 
TTTCCAGTCT 



11 

I 

AGCTTCTAGT 
CTTCTCCAGC 
GCCACCTTCG 
CCTGAGCCAG 
TTCATTCTCG 
AGGATTTACT 
TGGATGTCCT 
CTGAATCTGA 
GGAGTGATAG 
GATGAGGTGC 
CTGGCTATTT 
CCTATGACCC 
GCTGCTTCTC 
ACCTCTTACC 
GTGTGACACA 
ATACTATCAT 
ACAAAACAAA 
TAATCTTATT 
GCTTCCCATT 
TATGTATATA 
TGATACTAGC 
GAAGATGTTT 
TCATTTACTC 
AAGGATGAAT 
CCATAATCTT 
CTCTATCTCC 
AATTTATTAC 
CCTGTTGACC 
AAATATTTGT 
TTGATTGAAT 
TCCCCATTCC 
TAATAAGGTG 
TGACAAATAT 
ATCTGCCAAA 
AGTTTATATT 
CAGCTGGCTG 
TTCACTGCCT 
TCATGTGGTT 
ACATACCTTC 
CTGTGTCTGA 
GTACAGAATG 



21 

I 

ATCCAGACTC 
GGCGGCGCAG 
GGAGTCCGGG 
CGCGGGCGCC 
CCTTCCTGGG 
CCTATGCCGG 
GCGTGTCGCA 
GCAGCACATT 
CAATCTTTGT 
AGAAGATGAG 
TAGTTGCCAC 
CAGTCAATGC 
TCTGCCTTCT 
CAACACCAAG 
GAGGCAAAAG 
TAACATTAGG 
CAAACAAACA 
TTATCTTCTT 
GAGTAATCAT 
TACATGTTTT 
ATACTTAAAA 
ATTGGTATAT 
TTCTTCATTA 
TCTTTCAATT 
ATAGCACTTG 
TGAATCTAAC 
AAATCAGAAC 
TTCCCACACA 
CCAATTGAGT 
TTTTAAGCTA 
TTAATTGTAT 
TGGTCTGTTT 
TCTCTCTGTA 
TTGAGATAAT 
ACTCTCATTC 
AGACACTGAA 
TCCTCTCTCT 
CAGTGCCTTC 
ATGTGGCTCA 
CATGTTTGTG 
CTATTTCACT 



31 
I 

CAGCGCCGCC 
CGAGCAGGGC 
TTGCCCACCT 
CGAGCGAGTC 
ATGGATCGGC 
CGACAACATC 
GAGCACCGGG 
GCAAGCAACC 
GGCCACCGTT 
GATGGCTGTC 
AGCATGGTAT 
CAGGTACGAA 
GGGAGGTGCC 
GCCCTATCCA 
GAGAAAATCA 
ACCTTAGAAT 
AAAAACCCAT 
TCCTCAATAT 
ACTCAAATGG 
TCTATTAAAA 
TATCTCTAAA 
TTTCTTTTTC 
GCTTTGGGTG 
CTTCATGCGT 
CATCGTTATT 
ACATTTCATA 
TTTGGAGGCA 
ATCCCTGTAC 
AGCTGCATGC 
CTTATTCATA 
TGTTTTCCCA 
GTCTGAACAA 
GCTGTAAGCA 
GATACTTAAC 
TTTGAACATG 
GAAGTCACTG 
ACCAGTCTAT 
CTCTCTCTAC 
GTGCCTTCCT 
CTCTGTTCCA 
TGAGCAAGAT 



41 
I 

CCGGGCGCGG 
TCCCCGCCTT 
GCAAACTCTC 
ATGGCCAACG 
GCCATCGTCA 
GTGACCGCCC 
CAGATCCAGT 
CGTGCCTTGA 
GGCATGAAGT 
ATTGGGGGTG 
GGCAATAGAA 
TTTGGTCAGG 
CTACTTTGCT 
AAACCTGCAC 
TGTTGAAACA 
TTTGGGTATT 
GTGTTAAAAT 
AGGAGGGAAG 
GGGAAGGGGT 
ATAGACAGTA 
ATAGGTAAAT 
GTCCTTATAT 
CCTTTGCCAC 
GCCCTTTTCA 
AAGCCCTTAT 
GCCTACATTT 
AATCTTTCTG 
TCTGACCCAT 
TGTTCCCCCA 
GTTTTATATC 
AGTGTAATTA 
AGTGCTAGAC 
AGTCACTTAA 
CAGTTAGAAG 
AACTATGCCT 
AACAAAACCT 
TTCCACTGAA 
CAGTCTATTT 
CTCTCTACCA 
TTTTAACAAC 
GATGTATGGA 



51 
I 

ACCCCAACCC 
AACTTCCTCC 
CGCCTTCTGC 
CGGGGCTGCA 
GCACTGCCCT 
AGGCCATGTA 
GCAAAGTCTT 
TGGTGGTTGG 
GTATGAAGTG 
CGATATTTCT 
TCGTTCAAGA 
CTCTCTTCAC 
GTTCCTGTCC 
CTTCCAGCGG 
AACCGAAAAT 
GTAATCTGAA 
ACTCAGTGCT 
ATTTTACCAT 
GCTCCTTAAA 
AAATACTATT 
GTATTTAATT 
ACATATGTAA 
AAGACCTAGC 
TATACTTATT 
TTGTTTTGTG 
TAGTTTCTAA 
CATGACCAAA 
AGCACTCTTG 
GGTGTTGTAA 
CCCCTAAACT 
TCATGCGTTT 
TTTCTGGAGT 
TCTTTCTACC 
AGGTAGTGTG 
ATGTAGTGTC 
ACACACGTAC 
CAAAACCTAC 
CCACTGAACA 
GTCTATTTCC 
TGCTCTTACT 
AAGGGTGTTG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 



60 
120 
180 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 




GCACTGGTGT CTGGAGACCT GGATTTGAGT 
AGCAAGGCAT TTGGCTGCTG TAAGCTTATT 
CTGATCTTCC CACCTCACAG TGATGTTGTG 
GTGGTTTTGT AATTTGAAAA GTGCTATACT 
CGTTTTGGTG TTGCTTTTCA AATGTTTGAA 
GCCTTAACCA GTCTCTCAAG TGATGAGACA 
AAGATTCTGA GGAAGTCTTA TCTTCTGCAG 
ACAGATGTAA TGGGAAGAAA TAAAAGCCTA 
TTTTGAATCA TAATAACTCA TAAGGTGCTA 
TGTTAGCTGG CAGCTGACGC TGCTAGGATA 
CTACACAAGG AAAGTCAGCC ACCGTGTCTT 
TGCCTTCCAA ACCTGAGAAT ATATGCTTTT 
ATACATAGAT CTTCATGATG TGTGAGTGTA 
ACAAAAAAAT TTTATGGCCC AAAATGACCA 
TTTGATCTTT TTATATTCTT CTACCACACC 
TTATAATGGG AATTTGTATA AAGCATTACT 
AAAAGGAAAA AAAAAAAAAA AAA 



Seq ID NO: 244 Protein sequence: 
Protein Accession #: AAD16433.1

1 11 21 31 41 51 

I I I I I I 

MANAGLQLLG FILAFLGWIG AIVSTALPQW RIYSYAGDNI VTAQAMYEGL WMSCVSQSTG
QIQCKVFDSL LNLSSTLQAT RALMVVGILL GVIAIFVATV GMKCMKCLED DEVQKMRMAV
IGGAIFLLAG LAILVATAWY GNRIVQEFYD PMTPVNARYE FGQALFTGWA AASLCLLGGA 
LLCCSCPRKT TSYPTPRPYP KPAPSSGKDY V 

Seq ID NO: 245 DNA sequence 

Nucleic Acid Accession #: CAT cluster 






CTTGGTGCTA 
GCTTCATCTG 
GGGATCCAGT 
AAGGGAAAGA 
AATAAAAAAA 
GTGAAGTAAA 
TGAGTATGGC 
CGTGTTGGTA 
TCTGTTCAGT 
GTTAGTTTGG 
ATGAGGAATT 
GGAAGTTAAA 
ATTCCATGTG 
ACGAAATTGT 
TGGAAACAGA 
CTTTTTCAAT 



TCAATCACCG 
TAAGCGGTGG 
GAGATAGAAT 
ATTGAGGAAT 
TGTTAAGAAA 
ATTGAGTGCA 
CCAATGCTTT 
AATCCAACAG 
GATGCCCTCA 
AAATGGTACT 
GGACCTAATA 
ATTTAAATGG 
GATATCAGTT 
TACAATAGAA 
CCAATAGACA 
AAATTGTTTT 



TCTGTGTTTG 
TTTGTAATTC 
ACATGTAAGT 
TAACTGCATA 
TGGGTTTCTT 
CTAAACGAAT 
CTGTGGCTAA 
CAAGGGAGAT 
GAGCTCTTGC 
TCATAATAAA 
AATTTTAGTG 
CTTTTGCCAC 
ACCAAACATT 
TTTATCCAAT 
TTTTGGGGTT 
TTAATTTAAA 



1 
I 

TTTTTTTTTT 
TTAATGGTTA 
AGCATGGTCC 
TTTTCTTCCT 
CAGTTGTTAT 
AGGTGGGGAG 
TTAATAGCCA 
GTCCTACGCC 



11 
I 

TTTTTTTTTT 
AATGCTGTTT 
CGAGAGTCTG 
GAGATTTAGT 
GAAGAATGCA 
GTCGCTCAAG 
CTGCACTTCA 
CACGGAGTCT 



21 
I 

TTTTTCAAGG 
ACCAAGTGAC 
ACAAACCTCA 
TTCTTCATCG 
TATATTAGAA 
CCCAGGAATT 
GCCTGGGCAA 
CGCTGATTGC 



31 
I 

AGAGCACAAG 
CCAGAGGCAG 
GTTCAAATCC 
TTAACAATGA 
TGCCTGTAGT 
CAAAGCTGCA 
TGTAGTAAGA 
TAGCACAGCA 



Seq ID NO: 246 DNA sequence 

Nucleic Acid Accession #: XM_058553.2 

Coding sequence: 897-1400 



AATTTTCAGA 
TAAATGTATT 
GTGAAACCAT 
CTGTTATCCA 
ATAGAGGGAA 
TGGGATGAGA 
GATCATGTTT 
GTTGAGTGTA 
ACTGGTAACC 
TTATTTCTGT 
TTGTCCAGGC 
GCCTTTGCCT 
TTTTTTTGTT 
TAGTCTTGCT 
CAGCCTCCCA 
AAGAAACTTA 
ACCATCAAAT 
CTGATGTTGC 
CTGAAATTAG 
TCAACCAAAC 
CTTGCGATGA 
GCACAACTCA 
ATAACCTGGC 
ACAATGGAAA 
GTTGCTTCTT 
AACTCCCTGT 
TTTAATGCAA 



11 
I 

AGTTTCGTAT 
TAGTCTCAGT 
TTCTCTTTTA 
TAATATGGAC 
TGAGTATTAA 
GGAGGTGAAA 
AAGAAAAGTC 
TACTGTCTGT 
TGCCTATCTG 
GTTTATGTAT 
CAAGTGCAAT 
CCTGAGTAGC 
TGTTTGTTTG 
TTGTTGCCAG 
GAGTGCTAGG 
CACCGACTCC 
CAGGGCTTGC 
AAGCAAATTG 
TCATCATATC 
CAGGAGCCTT 
AGACTGGGAT 
CTACTCTGAC 
TTCAGGCATG 
TGCACAGTAA 
CTTCTACCAG 
GACTTTCCAA 
GAACCCTCAT 



21 
I 

GGGGATGGTT 
GCTCAATAGA 
ATGTTTCACA 
AGTTCTTGAG 
TTGGAGAAGC 
CCTCACTAGA 
ATGAAAATGG 
CAAAGACTTC 
TATTTTTAAG 
AAGGGGTTTT 
GGCACGAACC 
TGGGACTACA 
TTTGTTTTTG 
GCTAGTCTCA 
ATTACAGCAC 
CTGGACCCTG 
AGGTTTCCTT 
GCTACTTGTC 
TCAAGCTGTG 
AGACAAGAGA 
AAAGATTTGT 
AACAACAGCC 
CGAGTTCCCA 
CTGAATACCT 
TGGGTTCTCA 
ACTGACAAGC 
ACTCAGAAGC 



31 
I 

TTATATAAAT 
AGAGATTTCT 
TTCCTGTTAC 
TCCTAACATT 
TTAAAGTATT 
AAAAGGGACA 
TGAACTAGTG 
CAGCATTTCC 
AACCCAGGAG 
TTGTTTTTTT 
TCATAGCTCC 
GGCATGAGCC 
GGGGGGGTTG 
AACTCCTGGC 
TTGGATTCAG 
AGAAGCTATT 
ATCATCTTAT 
CCTTCAATGC 
ATGACAGAAG 
CTCTGGCTGA 
GGGAGCAGAC 
CTGCGAGCAA 
AATCTCTGCC 
ATCTCATCAA 
TTTTCCTCCT 
ACACTTTTTT 
TTCCAAATAA 



41 
I 

TCAGGTTTTT 
AATAGAAAAG 
AGATTTGTTC 
GAGAGGTTTT 
GCCACTTTAG 
ATGTTAGTGT 
TTTCCAAGCA 
AGGTCCTAGA 
GAAAGCTTTA 
AAAGACAGGA 
TGGACTTAAG 
CCCATGCCTG 
TTTTGTTTTT 
TTCAAGTGAT 
CTTCTTCATT 
GCAATGCCCC 
CAAGTGCAGA 
TCGCCACCAG 
TTGTATTGAG 
GAGCACTTGG 
CAGCACCCCA 
CATAGTTACA 
GTATGTTCTG 
ATGCCAGACC 
AATCTAATTA 
CCTCCCCCCT 
ACCTTTGATA 



51 
I 

CCCACAATAA 
GATTCAAACT 
TCTTGTGACT 
CCCTTAGTGC 
CACTGAAGAT 
GGCCCTTCCT 
TATTGGAAGG 
GAGGAACAAG 
TAATAGAACA 
TCTCACTCCA 
TGATCTGCCT 
GCTAAGTTTG 
TGTAGAGACG 
CCTCCTGCCT 
TCCAACATGG 
TATGACAAAA 
AAGAATCATC 
GTTCCTCGAG 
CAAGATGTTG 
CAGTGCCCTC 
TTTGTCTGGG 
GAACATAAGA 
CCATGGAAAA 
CTAGAAGACT 
TAGAATGGTA 
TGAATCCTCA 
CAGATTG 



2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 



60 
120 
180 



41 51

I I

GAACTTTATT AATGACTTTC 60
CGTGGTTTAG TGGTTTCAAC 120
TTCTTTTGTC TTCACTTAGT 180
GGATATTAAT ATGTTTCACA 240
CTCAGCTACT CAGGAGGCTA 300
ATGCATTATG ATTACAGCTG 360
TCCCATCTCT GGCTCGGAGG 420
GTCTGAGATC AAACTGCA





60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 






Seq ID NO: 247 Protein sequence: 
Protein Accession #: XP_058553.1

1 11 21 31 41 51 

I I I I I I 

MEETYTDSLD PEKLLQCPYD KNHQIRACRF PYHLIKCRKN HPDVASKLAT CPFNARHQVP 
RAEISHHISS CDDRSCIEQD WNQTRSLRQ ETLAESTWQC PPCDEDWDKD LWEQTSTPFV 
WGTTHYSDNN SPASNIVTEH KNNIiASGMRV PKSLPYVLPW KNNGNAQ 



60 
120 




Seq ID NO: 248 DNA sequence 
Nucleic Acid Accession #: NM_003392
Coding sequence: 758..1855



TTAAGGAAAT
AACTGATTAT 
CGCGCTGGTC 
CATCCTCCAC 
TCCACTCGCC 
GGCCCAGGTT 
GAGGGACTGA 
GGAAGGAGGC 
TCGCTGGCGT 
TGGGTGGATT 
GAGAAGCGCA 
TGTATGCTTG 
GTCCATTGGA 
CAAGTTCTTC 
CAATTCTTGG 
AGGAGCACAG
CCACTTGTAT 
ATGCCAGTAT 
TTTTGGCAGG 
AGCAGGGGTG 
CTGCAGCCGC 
CGACAACATC 
GCGCATCCAC 
CGAGGCCGGC 
GTCCGGCTCA 
TGATGCCCTG 
GTTGGTACAG 
CCCCAGCCCT 
CCTGTGCAAC 
GTACGACCAG 
CTACGTCAAG 
GCCACCCAGC 
TGGTTTTTGG 
TTTTTTCCTG 
GGCAATAATG 
TACAAGACTT 
CTGTGTGGGA 
TGCCATCATA 
TTCAGCTTCT 
AAAACAAAAC 
CATTTTCAAA 
GGTATATCAC 
AATAGCTCAT 
CTCTTATGTC 
AAGACCCCCA 
ATGAAATATC 
TACATGAATC 
GCACTGCACC 
ACACTGAGCC 
GCAGCTCCAC 
TGGAAAACAG 
TAGTTTTCTA 
TATCACTGTT 
GTGTACCTTA 
GGTTTAATGG 
ATATATAAAT 
CTCTGGGGTT 
ATTCCAAAAG 
GCACGACGAA 
TGTTGGGTTG 
TTCTGTTCAC 
ATTCAAAACT 
AACCCATGCC 
CACTACATAG 
CAGGCCATCA 
ACATCTTTTC 
CTCTTAATTT 
ATAATGATAT 
AAAGCACTAA 
TACTTTTTTT 
GTTGAGTTTA 
CATTCAGATA 
GTCTTGCGTG 
AATGGAAGAT 



11 
I 

CCGGGCTGCT 
GAAACATACG 
CCCGGGGCCT 
CCCCCGCGCT 
TCCCGTGCTC 
GGGGAGAGGA 
GCACAGCACC 
AGCGCCTGGC 
GCCCCGCGCA 
AATTTGGAAA 
GTCAATCAAC 
AAAATTATCT 
ATATTAAGCC 
CTAGTGGCTT 
TGGTCGCTAG 
CCTCTCTGCA 
CAGGACCACA 
CAATTCCGAC 
GTGATGCAGA 
GTGAACGCCA 
GCCGCGCGCC 
GACTATGGCT 
GCCAAGGGCT 
CGCAGGACGG 
TGTAGCCTGA 
AAGGAGAAGT 
GTCAACAGCC 
GACTACTGCG 
AAGACGTCGG 
TTCAAGACCG 
TGCAAGAAGT 
ACTCAGCCCC 
TTTTTAGAAA 
TTACCATCTA 
GGGGTGGGAA 
CTTTTGGATA 
CATGGTACAC 
TGGGATGGGT 
ATGACCAAAA 
AAAACAAAAC 
ATGATAATTT 
ATGTCTCATT 
GAAATTTGGG 
AAGATGTTGA 
ATGATTCTGG 
CTGTATTTTC 
CCATTCACAG 
AGAGCAGACA 
CTCTCTGATT 

TTCACTACTT 
TGTTTTAATG 
ATGATCCTGT 
AAACTGTTCC 
TGCCTGATAT 
ATAAATATAA 
ATCTCTCTGT 
TTTTTTGAGT 
GCAACCTCGT 
AAGATATCTT 
TTTGTGGAGA 
CAGAAGCATC 
TATTAGAAAT 
ATAGCTTTTT 
AATATGTTCT 
ATACCCCCCC 
ATTGCATAAT 
TCACATCCCC 
TTAGTTTAAA 
ATTTGCTAAA 
ACAATCCTAG 
TTATGTATAT 
ATTTGTATAT 
AGAATATAAA 



21 
I 

CTTCCCCATC 
ATGTTAATTC 
CGCCCCCCAC 
GGCCACCCCG 
CTCTCGCCCA 
CGGAGGGTGG 
AACTAGAGAG 
ACCAGGGCTT 
CAGGATCCCA 
AAGAAACTGC 
AGTAAACTTA 
GAGAGGGAAT 
CAGGAGTTGC 
TGGCCATATT 
GTATGAATAA 
GCCAACTGGC 
TGCAGTACAT 
ATCGACGGTG 
TAGGCAGCCG 
TGAGCCGGGC 
CCAAGGACCT 
ACCGCTTTGC 
CCTACGAGAG 
TGTACAACCT 
AGACATGCTG 
ACGACAGCGC 
GCTTCAACTC 
TGCGCAATGA 
AGGGCATGGA 
TGCAGACGGA 
GCACGGAGAT 
GCTCCCAGGA 
TATTTTTTAT 
AGAACTCTGT 
CCACGAAAAA 
GTATAGAATG 
ATCCAGAAGG 
AGGTTCCAGT 
TGAGTTGTAA 
AAACCTCCCT 
ACAATGGAAG 
CTCCTCAAAT 
CAGCAGGGAG 
TTTGAAGCTG 
ACACTAGATT 
TTAGGGATAC 
GTTTCTCAGC 
ACCTATTTGA 
CCTCCGTGTT 
TTGGTTGTAG 
AGGGATTTTT 
ACAGAACTTG 
GTTTAGATTA 
CAGTGTACTT 
CTCAAAGTCT 
ATATATCTCA 
CTAGAGCATT 
CTTGAGCTTG 
TTCTGAGGAA 
TTTTTCTTTT 
GGGCATTACT 
AGCAATGTTT 
GACAGTACTT 

TATCTCAGAC 
TTAGGAGGTT 
GATATCCACA 
TCAGTTGCAG 
ATGTCACTTT 
TCAGATTGTT 
CTTTTAAAAG 
CTTCTAGCCT 
TTCACTGGTT 
ATAAAACGTT 



31 
I 

TGGAAGTGGC 
GGAGCTGCAT 
CCCCTGCCCT 
CCTCCTTGGC 
TGGAATTAAT 
CCGCAGCGGG 
GGGTCAGGGG 
TGACTCAACA 
GCGAAAATCA 
CTATATCTTG 
AGAGACCCCC 
AAACATCTTT 
TTTGGGGATG 
TTTCTCCTTC 
CCCTGTTCAG 
AGGACTTTCT 
CGGAGAAGGC 
GAACTGCAGC 
CGAGACGGCC 
GTGCCGCGAG 
GCCGCGGGAC 
CAAGGAGTTC 
TGCTCGCATC 
GGCTGATGTG 
GCTGCAGCTG 
GGCGGCCATG 
GCCCACCACA 
GAGCACCGGC 
TGGCTGCGAG 
GCGCTGCCAC 
CGTGGACCAG 
CCCGCTTATT 
TTTTCCCCAA 
GGTTTATTAT 
TATTTATTTT 
AAGGGGGAAA 
TAAAGAAATA 
TGAAAGAGGG 
ATTCTCTGGT 
TCCCCAGCAG 
GACAAGAATG 
ATTCCATTTG 
GAAAGTCCCC 
TTATAAGAAT 
TTTTGTTTGG 
TTGGTTAGTA 
CCAAGCAACA 
GGAAAAACAG 
GTGATGTGAT 
GACAGGAAAT 
GTTTCCTAAA 
GCTAATGGAA 
TCCACTCATG 
GAACAGTTGC 
TTTGTACATA 
TTGCAGCCAG 
GTTGTCCTTC 
GGCTGTGGCC 
GAAGCTTGAG 
CTGCCTCACC 
TGTTCGTTAT 
CTCTTTTCTT 
ATTAATTGAG 
TTTTTTTTAA 
TTACGTTGTT 
GGGCTTTCAT 
TCAGCCAACT 
TGAATTGTGA 
TTTGGTTTTT 
CCTTTTTAGT 
AAACTATTTA 
TTATTCTGTA 
TAAAAAACAA 
ACTTGTAAAA 



41 
I 

TTTCCCCACA 
TTCCCAGCTG 
TCCCTCCCGC 
AGCCTCTGGC 
TCTGGCTCCA 
TTCCTGAGTG 
GTGCGGGACT 
GAATTGAGAC 
GATTTCCTGG 
CCATCAAAAA 
GATGCTCCCC 
TCCTTCTTCC 
GCTGGAAGTG 
GCCCAGGTTG 
ATGTCAGAAG 
CAAGGACAGA 
GCGAAGACAG 
ACTGTGGATA 
TTCACATACG 
GGCGAGCTGT 
TGGCTCTGGG 
GTGGACGCCC 
CTCATGAACC 
GCCTGCAAGT 
GCAGACTTCC 
CGGCTCAACA 
CAAGACCTGG 
TCGCTGGGCA 
CTCATGTGCT 
TGCAAGTTCC 
TTTGTGTGCA 
TATAGAAAGT 
GAATTGCAAC 
TAATATTATA 
GTGGATCTTT 
TAACACATAC 
CATTTTCTTT 
TGGTAGAAAT 
GCAAGATAAA 
GGCTGCTAGC 
TCATATTCTC 
CAGACAGACC 
AGAAATTAAA 
TGGGATTCCA 
GGAGGTTGGC 
AATTATAATA 
AGGTAATTGC 
TGAAATCCAC 
GCTGGCCACG 
GAAACATTAG 
ACTTTTATTT 
TTCACAGAGG 
CTTCTCCTAT 
ATTTATAAGG 
ACATATATAT 
TGATTTAGAT 
ACTGCAGTCC 
CCGCTGTGAT 
TTCTGACTCA 
CCTTTGTCTC 
AGACATGGAC 
AGTTCATTCT 
TCCCTAAGGA 
TAAGGACACC 
TTAAAAGTTT 
ATCACCTCAG 
GTGGCTCTTT 
GCAAAAGATC 
ATTATACAAA 
GACTCATGTT 
ATGTAAAATA 
CTTTTAATGT 
ACATCGAAAG 
AAAAAAAA 



Seq ID NO: 249 Protein sequence: 
Protein Accession #: NP_003383



11 



21 



31 



51
I 

TCGGCTCGTA 
GGCACTCTCG 
GTCCTGCCCC 
GGCAGCGCGC 
CTTGTTGCTC 
AATTACCCAG 
CGAGCGAGCA 
ACGTTTGTAA 
TGAGGTTGCG 
ACTCACGGAG 
TGGTTTAACT 
CTCTCCAGAA 
CAATGTCTTC 
TAATTGAAGC 
TATATATTAT 
AGAAACTGTG 
GCATCAAAGA 
ACACCTCTGT 
CCGTGAGCGC 
CCACCTGCGG 
GCGGCTGCGG 
GCGAGCGGGA 
TGCACAACAA 
GCCATGGGGT 
GCAAGGTGGG 
GCCGGGGCAA 
TCTACATCGA 
CGCAGGGCCG 
GCGGCCGTGG 
ACTGGTGCTG 
AGTAGTGGGT 
ACAGTGATTC 
CGGAACCATT 
ATTATTATTT 
GAAAAGGTAA 
CCTAACTTAG 
TTCTCAAATA 
CTATTCACAA 
AGGTCTTGGG 
TTGCTTTCTG 
AAGGAAAAAA 
GTCATATTCT 
AAATTTAAAA 
GATTTGTAAA 
TTGAACATAA 
GTAGAAATAA 
GTGCCATTCA 
CTTCCTCTTC 
TTTCCAAACG 
GAGCTCTGCT 
TGAGGAGCAG 
TGTTGCAGCG 
TGTACTGCAG 
GGGGAAATGT 
ATATATACAT 
TTACAGCTTA 
AGTTGGGATT 
CATACCCTGA 
CTGAAATGCG 
CAACCTCCAT 
GTTAAGAGAT 
GCAGAATGGA 
ATATTCAGCC 
TCTTTCCAAA 
GGAAAGATAC 
CCAACTGTGG 
AATTTATTGC 
TTGAAAGCAA 
AACCATGAAG 
TATGAAGAGA 
TTCTACATGT 
ACATATTTCT 
GCTTATTCCA 



51 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 




MAGSAMSSKF FLVALAIFFS FAQVVIEANS WWSLGMNNPV QMSEVYIIGA QPLCSQLAGL
SQGQKKLCHL YQDHMQYIGE GAKTGIKECQ YQFRHRRWNC STVDNTSVFG RVMQIGSRET
AFTYAVSAAG VVNAMSRACR EGELSTCGCS RAARPKDLPR DWLWGGCGDN IDYGYRFAKE
FVDARERERI HAKGSYESAR ILMNLHNNEA GRRTVYNLAD VACKCHGVSG SCSLKTCWLQ
LADFRKVGDA LKEKYDSAAA MRLNSRGKLV QVNSRFNSPT TQDLVYIDPS PDYCVRNEST
GSLGTQGRLC NKTSEGMDGC ELMCCGRGYD QFKTVQTERC HCKFHWCCYV KCKKCTEIVD
QFVCK

Seq ID NO: 250 DNA sequence
Nucleic Acid Accession #: NM_014058
Coding sequence: 56..1324



TGACTTGGAT 
TCGGCCAGAT 
CGTCATCTTC 
GAGATATAAT 
ACTATATGCT 
TGAATCAATG 
TCAGGTTATC 
TAGATTTCAC 
TGAAAAGCTG 
AAAAATCAAC 
TAAAACTCTA 
GCCCTGGCAG 
TGCCACATGG 
GACTGCTTCC 
AATTGTCCAT 
TTCTAGCCCT 
TGAGTTTCAA 
TTACAGTCAA 
TGAACCTCAA 
AGGAAAAACA 
AGATATCTGG 
GCCTGGTGTT 
CTAAGAGAGA 
CCATTTTTAG 
AATAAACTGT 



11 
I 

GTAGACCTCG 
GTGGTGAGGG 
ATATCCCTGA 
CAAAAGAAGA 
GAGTTTGGCA 
GTGAAAAATG 
AAGTTCAGTC 
TCTACTGAGG 
CAAGATGCTG 
AAGACAGAAA 
GGTCAGAGTC 
GCTAGCCTGC 
CTTGTGAGTG 
TTTGGAGTAA 
GAAAAATACA 
GTTCCCTACA 
CCAGGTGATG 
AATCATCTTC 
GCTTACAATG 
GATGCATGCC 
TACCTTGCTG 
TATACTAGAG 
AAAGCCTCAT 
AGATACAGAA 
TTGCTTGATG 



21 
I 

ACCTTCACAG 
CTAGGAAAAG 
TTGTCCTGGC 
CCTACAATTA 
GAGAGGCTTC 
CATTTTATAA 
AACAGAAGCA 
ATCCTGAAAC 
TAGGACCCCC 
CAGACAGCTA 
TCAGGATCGT 
AGTGGGATGG 
CTGCTCACTG 
CAATAAAACC 
AACACCCATC 
CAAATGCAGT 
TGATGTTTGT 
GACAAGCACA 
ACGCCATAAC 
AGGGTGACTC 
GAATAGTGAG 
TTACGGCCTT 
GGAACAGATA 
TTGGAGAAGA 
CAAAAAAAAA 



31 
I 

GACTCTTCAT 
AGTTTGTTGG 
AGTGTGCATT 
CTATAGCACA 
TAACAATTTT 
ATCTCCATTA 
TGGAGTGTTG 
TGTAGATAAA 
TAAAGTAGAT 
TCTAAACCAT 
TGGTGGGACA 
GAGTCATCGC 
TTTTACAACA 
TTCGAAAATG 
ACATGACTAT 
ACATAGAGTT 
GACAGGATTT 
GGTGACTCTC 
TCCTAGAATG 
TGGAGGACCA 
CTGGGGAGAT 
GCGGGACTGG 
ACATTTTTTT 
CTTGCAAAAC 
A 



41 
I 

TGCTGGTTGG 
GAACCCTGGG 
GGACTCACTG
TTGTCATTTA 
ACAGAAATGA 
AGGGAAGAAT 
GCTCATATGC 
ATTGTTCAAC 
CCTCACTCAG 
TGCTGCGGAA 
GAAGTAGAAG 
TGTGGAGCAA 
TATAAGAACC 
AAACGGGGTC 
GATATTTCTC 
TGTCTCCCTG 
GGAGCACTGA 
ATAGACGCTA 
TTATGTGCTG 
CTGGTTAGTT 
GAATGTGCGA 
ATTACTTCAA 
TTGTTTTTTG 
AGCTAGATTT 



51 
I 

CAATGATGTA
TTATCGGCCT 
TTCATTATGT 
CAACTGACAA 
GCCAGAGACT 
TTGTCAAGTC 
TGTTGATTTG 
TTGTTTTACA 
TTAAAATTAA 
CACGAAGAAG 
AGGGTGAATG 
CCTTAATTAA 
CTGCCAGATG 
TCCGGAGAAT 
TTGCAGAGCT 
ATGCATCCTA 
AAAATGATGG 
CAACTTGCAA 
GCTCCTTAGA 
CAGATGCTAG 
AACCCAACAA 
AAACTGGTAT 
GGTGTGGAGG 
GACTGATCTC 



60 
120 
180 
240 
300 
360 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 



Seq ID NO: 251 Protein sequence: 
Protein Accession #: NP_054777






1 11 21 31 41 51

I I I I I I

MYRPDVVRAR KRVCWEPWVI GLVIFISLIV LAVCIGLTVH YVRYNQKKTY NYYSTLSFTT
DKLYAEFGRE ASNNFTEMSQ RLESMVKNAF YKSPLREEFV KSQVIKFSQQ KHGVLAHMLL
ICRFHSTEDP ETVDKIVQLV LHEKLQDAVG PPKVDPHSVK IKKINKTETD SYLNHCCGTR
RSKTLGQSLR IVGGTEVEEG EWPWQASLQW DGSHRCGATL INATWLVSAA HCFTTYKNPA
RWTASFGVTI KPSKMKRGLR RIIVHEKYKH PSHDYDISLA ELSSPVPYTN AVHRVCLPDA
SYEFQPGDVM FVTGFGALKN DGYSQNHLRQ AQVTLIDATT CNEPQAYNDA ITPRMLCAGS
LEGKTDACQG DSGGPLVSSD ARDIWYLAGI VSWGDECAKP NKPGVYTRVT ALRDWITSKT
GI



Seq ID NO: 252 DNA sequence 

Nucleic Acid Accession #: NM_003504.2

Coding sequence: 71-1771 



GGCACGAGGC 
CGCCGTGGCT 
GAGGGTCCTT 
GGCCTTGTTC 
ACTTGAAACT 
TGGAGCTAAT 
GTGTGACACC
ACTCATTAAA 
AGAGGAGGAT 
CACACGGTTA 
GGAGGCCCGG 
GTCAGCCATG 
GTGGTGGGCC 
ATACGTGACT 
GGATGAGGAG 
CCTGGTGCTC 
AGCCAGGTTC 
CATGGGTCTT 
GGAGAATTTG 
CGTGCAGACT 
CTTTGCCACC 
CATCCAGGCT 
ACTCGCCAAG 
CCTCGTCATC 
CATGCTGTTC 
TGTGTGTTCG 



11 
I 

CTCGTGCCGC 
ATGTTCGTGT 
CTCTTCGTGG 
CAGTGTGACC 
GCATTTCTTG 
GTAGACCTAT 
CATAGGCCAG 
CAAGATGATG 
GAAGAGCATT 
GAAGAGGAGA 
AGAAGAGACA 
GTGATGTTTG 
ATCGTTGGAC 
GATGTTGGTG 
AACACACTCT 
TACCAGCACT 
AAGCTGTGGT 
CCCCTGAAGC 
CGGGAAATGA 
TTCAGCATTC 
ATGTCTTTGA 
CTGGACAGCC 
AAGCAGCTGC 
TCCCAGGGGC 
TCTAGGCCGG 
ACAAAGAACC 



21 

I 

CGGGCTCTTG 
CCGATTTCCG 
CCTCGGACGT 
ACGTGCAATA 
AGCATAAAGA 
TGGATATTCT 
TCAATGTCGT 
ACCTTGAAGT 
CAGGAAATGA 
TAGTGGAGCA 
TCCTCTTTGA 
AGCTGGCTTG 
TAACAGACCA 
TCCTGCAGCG 
CCGTGGACTG 
GGTCCCTCCA 
CTGTGCATGG 
AGGTGAAGCA 
TTGAAGAGTC 
ATTTTGGGTT 
TGGAGAGCCC 
TCTCCAGGAG 
GAGCCACCCA 
CTTTCCTGTA 
CATCCCTAAG 
GGCGCTGCAA 



31 
I 

GTACCTCAGC 
CAAAGAGTTC 
GGATGCTCTG 
TACGCTGGTT 
ACAGTTTCAT 
TCAACCTGAT 
CAATGTATAC 
TCCCGCCTAT 
CAGTGATGGG 
AACCATGCGG 
CTACGAGCAG 
GATGCTGTCC 
GTGGGTGCAA 
CCACGTTTCC 
CACACGGATC 
TGACAGCCTG 
ACAGAAGCGG 
GAAGTTCCAG 
TGCAAATAAA 
CAAGCACAAG 
CGAGAAGGAT 
TAACCTGGAC 
GCAGACCATT 
CTGCTCTCTC 
CCTGCTCAGC 
ACTGCTGCCC 



41 
I 

GCGAGCGCCA 
TACGAGGTGG 
TGTGCGTGCA 
CCAGTTTCTG 
TATTTTATTC 
GAAGACACTA 
AACGATACCC 
GAAGACATCT 
TCAGAGCCTT 
AGGAGGCAGC 
TATGAATATC 
AAGGACCTGA 
GACAAGATCA 
CGCCACAACC 
TCCTTTGAGT 
TGCAACACCA 
CTCCAGGAGT 
GCCATGGACA 
TTTGGGATGA 
TTTCTGGCCA 
GGCTCAGGGA 
AAGCTGTACC 
GCCAGCTGCC 
ATGGAGGGCA 
AAACACCTGC 
CTGGTGATGG 



51 
I 

GGCGTCCGGC 
TCCAGAGCCA 
AGATCCTTCA 
GGTGGCAAGA 
TCATAAACTG 
TATTCTTTGT 
AGATCAAATT 
TCAGGGATGA 
CTGAGAAGCG 
GGCGAGAGTG 
ATGGGACATC 
ATGACATGCT 
CTCAAATGAA 
ACCGGAACGA 
ATGACCTCCG 
GCTATACCGC 
TCCTTGCAGA 
TCTCCTTGAA 
AGGACATGCG 
GCGACGTGGT 
CAGATCACTT 
ATGGCCTGGA 
TTTGCACCAA 
CTCCAGATGT 
TCAAGTCCTT 
CTGCCCCCCT 



60 
120 
180 
240 
300 
360 
420 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 




GAGCATGGAG CATGGCACAG TGACCGTGGT GGGCATCCCC CCAGAGACCG ACAGCTCGGA 
CAGGAAGAAC TTTTTTGGGA GGGCGTTTGA GAAGGCAGCG GAAAGCACCA GCTCCCGGAT 
GCTGCACAAC CATTTTGACC TCTCAGTAAT TGAGCTGAAA GCTGAGGATC GGAGCAAGTT 
TCTGGACGCA CTTATTTCCC TCCTGTCCTA GGAATTTGAT TCTTCCAGAA TGACCTTCTT 
ATTTATGTAA CTGGCTTTCA TTTAGATTGT AAGTTATGGA CATGATTTGA GATGTAGAAG 
CCATTTTTTA TTAAATAAAA TGCTTATTTT AGGCTCCGTC CCCAAAAAAA AAAAAAAAAA 
AAAAAAAAAA AA 

Seq ID NO: 253 Protein sequence: 
Protein Accession #: NP_003495.1






1 11 21 31 41 51

I I I I I I

MFVSDFRKEF YEVVQSQRVL LFVASDVDAL CACKILQALF QCDHVQYTLV PVSGWQELET
AFLEHKEQFH YFILINCGAN VDLLDILQPD EDTIFFVCDT HRPVNVVNVY NDTQIKLLIK
QDDDLEVPAY EDIFRDEEED EEHSGNDSDG SEPSEKRTRL EEEIVEQTMR RRQRREWEAR
RRDILFDYEQ YEYHGTSSAM VMFELAWMLS KDLNDMLWWA IVGLTDQWVQ DKITQMKYVT
DVGVLQRHVS RHNHRNEDEE NTLSVDCTRI SFEYDLRLVL YQHWSLHDSL CNTSYTAARF
KLWSVHGQKR LQEFLADMGL PLKQVKQKFQ AMDISLKENL REMIEESANK FGMKDMRVQT
FSIHFGFKHK FLASDVVFAT MSLMESPEKD GSGTDHFIQA LDSLSRSNLD KLYHGLELAK
KGLRATQQTI ASCLCTNLVI SQGPFLYCSL MEGTPDVMLF SRPASLSLLS KHLLKSFVCS
TKNRRCKLLP LVMAAPLSME HGTVTVVGIP PETDSSDRKN FFGRAFEKAA ESTSSRMLHN
HFDLSVIELK AEDRSKFLDA LISLLS



Seq ID NO: 254 DNA sequence 
Nucleic Acid Accession #: NM_022337
Coding sequence: 48..683



1620 
1680 
1740 
1800 
1860 
1920 



60 
120 
180 
240 
300 
360 
420 
480 
540 






1 
I 

GGCTGCGCTT 
ACAAGGAGCA 
TCATCAAGCG 
ACTTCGCGCT 
ATATCGCAGG 
GTGCATTTAT 
AAAATGATTT 
TGGCCAACAA 
AGTTCTGCAA 
ACATTGATGA 
TGGAGTCTAT 
GCTCTGGCTG 
TTGTTCCACA 
CACATGTGGC 
GTTCTTTCTA 
TCTGTTACAA 
TTATTTGCTT 
AACTAGCTGT 
AATATATTCT 
GACCTCCATT 
ACAGGTGTGC 
AACTGAATAT 
CTCAAGCTGT 
GCAAGTGAAC 



11 
I 

CCCTGGTCAG 
CCTGTACAAG 
CTACGTGCAC 
CAAGGTGCTC 
TCAAGAAAGA 
TGTCTTCGAT 
GGACTCCAAG 
ATGTGACCAG 
GGAGCACGGT 
AGCCTCCAGA 
TGAGCCGGAC 
TGCCAAATCC 
AATTGTGCCT 
AAGCCAAAGA 
TGCTTTCCTC 
ACTTCTGTCA 
CTTTTAATCA 
CAAGTCAAGG 
CTGATGGCCT 
CTCGGCAGAC 
TATATTGTCC 
TGTATGAAAA 
GGGGCTCCTC 
AATAAAACAT 



21 
I 

GCACGGCACG 
TTGCTGGTGA 
CAGAACTTCT 
CACTGGGACC 
TTTGGAAACA 
GTCACCAGGC 
TTAAGTCTCC 
GGGAAGGATG 
TTCGTAGGAT 
TGCCTGGTGA 
GTCGTGAAGC 
TAGTAGGCAC 
CTATTTTTAC 
TCTATGCCTC 
ACCATCATCA 
TGTAGCTGAC 
GCAAAGGCCT 
ACTGGCTTTC 
GACAGGCCTA 
CTAAGAGTTG 
TTGTCCTAAC 
GACATGCCTC 
TATACATGCT 
TAAAAGATAA 



31 
I 

TCTGGCCGGC 
TTGGCGACCT 
CCTCGCACTA 
CGGAGACTGT 
TGACGAGGGT 
CAGCCACATT 
CTAATGGCAA 
TGCTCATGAA 
GGTTTGAAAC 
AACACATACT 
CCCATCTCAC 
CTTTGCTGGT 
CATTTTGGGT 
TGTTTTTTCA 
CAGTGTTTAC 
CAAAATCCTG 
CAAGTCTTAA 
ACCTTGCCCT 
TTAAGTAGAT 
CCTCTGAGTT 
TGTCACTTGC 
CATATGTGCC 
ATACATGTAA 
AA 



41 
I 

CGCCAGGATG 
GGGCGTGGGG 
CCGGGCCACA 
GGTGCGCCTG 
CTATTACCGA 
TGAAGCAGTG 
ACCGGTTTCA 
CAATGGCCTC 
ATCAGCAAAG 
TGCAAATGAG 
ATCAACCAAG 
GTCTGGTAGG 
AAACGTCAGG 
ATGAGAGAGA 
AAACTTTTGA 
CAGGGCCACA 
AATAAAAGGG 
GGTGTCTTTT 
GTGATATTTT 
AGCTCTTTGG 
CATGGCCTGA 
TTTCTGTTAG 
TATATATTAT 



51 
I 

CAGGCCCCGC 
AAGACCAGTA 
ATCGGCGTGG 
CAGCTCTGGG 
GAAGCTATGG 
GCAAAGTGGA 
GTGGTTTTGT 
AAGATGGACC 
GAAAATATAA 
TGTGACCTAA 
GTTGCCAGCT 
AATGACCTCA 
ATAGATATAC 
AATAGCAAAT 
AAATATTTAG 
GTCGGCACTG 
GAGAAGAACA 
TCCAGATTTC 
CTTCCAAGAT 
AATCGTGAAC 
ATGTTGGCTT 
CTCTCTTTGA 
ATATATTTTT 



Seq ID NO: 255 Protein sequence: 
Protein Accession #: NP_071732



11 



21 



31 



41 • 51 

,1111 
MQAPHKEHLY KLLVIGDLGV GKTSIIKRYV HQNFSSHYRA TIGVDFALKV LHWDPETWR 
LQLWDIAGQE RFGNMTRVYY REAMGAFIVF DVTRPATFEA VAKWKNDLDS KLSLPNGKPV 
SWLLANKCD QGKDVLMNNG LKMDQFCKEH GFVGWFETSA KEN IN IDEAS RCLVKHILAN 
ECDLMESIEP DWKPHLTST KVASCSGCAK S 

Seq ID NO: 256 DNA sequence
Nucleic Acid Accession #: NM_016321
Coding sequence: 25..1464

1 11 21 31 41 51
| | | | | |
GGAACCGCCC GCTGCCAGCC CGGCCAGGCA CCCCTGCAGC ATGGCCTGGA ACACCAACCT 60
CCGCTGGCGG CTGCCGCTCA CCTGCCTGCT CCTGCAGGTG ATTATGGTGA TTCTCTTCGG 120
GGTGTTCGTG CGCTACGACT TCGAGGCCGA CGCCCACTGG TGGTCAGAGA GGACGCACAA 180
GAACTTGAGC GACATGGAGA ACGAATTCTA CTATCGCTAC CCAAGCTTCC AGGACGTGCA 240
CGTGATGGTC TTCGTGGGCT TCGGCTTCCT CATGACTTTC CTGCAGCGCT ACGGCTTCAG 300
CGCCGTGGGC TTCAACTTCC TGTTGGCAGC CTTCGGCATC CAGTGGGCGC TGCTCATGCA 360
GGGCTGGTTC CACTTCTTAC AAGACCGCTA CATCGTCGTG GGCGTGGAGA ACCTCATCAA 420
CGCTGACTTC TGCGTGGCCT CTGTCTGCGT GGCCTTTGGG GCAGTTCTGG GTAAAGTCAG 480
CCCCATTCAG CTGCTCATCA TGACTTTCTT CCAAGTGACC CTCTTCGCTG TGAATGAGTT 540
CATTCTCCTT AACCTGCTAA AGGTGAAGGA TGCAGGAGGC TCCATGACCA TCCACACATT 600
TGGCGCCTAC TTTGGGCTCA CAGTGACCCG GATCCTCTAC CGACGCAACC TAGAGCAGAG 660
CAAGGAGAGA CAGAATTCTG TGTACCAGTC GGACCTCTTT GCCATGATTG GCACCCTCTT 720
CCTGTGGATG TACTGGCCCA GCTTCAACTC AGCCATATCC TACCATGGGG ACAGCCAGCA 780
CCGAGCCGCC ATCAACACCT ACTGCTCCTT GGCAGCCTGC GTGCTTACCT CGGTGGCAAT 840

ATCCAGTGCC CTGCACAAGA AGGGCAAGCT GGACATGGTG CACATCCAGA ATGCCACGCT 900 

CGCAGGAGGG GTGGCCGTGG GTACCGCTGC TGAGATGATG CTCATGCCTT ACGGTGCCCT 960 

CATCATCGGC TTCGTCTGCG GCATCATCTC CACCCTGGGT TTTGTATACC TGACCCCATT 1020 

CCTGGAGTCC CGGCTGCACA TCCAGGACAC ATGTGGCATT AACAATCTGC ATGGCATTCC 1080 

TGGCATCATA GGCGGCATCG TGGGTGCTGT GACAGCGGCC TCOGCCAGCC TTGAAGTCTA 1140 

TGGAAAAGAA GGGCTTGTCC ATTCCTTTGA CTTTCAAGGT TTCAACGGGG ACTGGACCGC 1200 

AAGAACACAG GGAAAGTTCC AGATTTATGG TCTCTTGGTG ACCCTGGCCA TGGCCCTGAT 1260 

GGGTGGCATC ATTGTGGGGC TCATTTTGAG ATTACCATTC TGGGGAGAAC CTTCAGATGA 1320 

GAACTGCTTT GAGGATGCGG TCTACTGGGA GATGCCTGAA GGGAACAGCA CTGTCTACAT 1380 

CCCTGAGGAC CCCACCTTCA AGCCCTCAGG ACCCTCAGTA CCCTCAGTAC CCATGGTGTC 1440 

CCCACTACCC ATGGCTTCCT CGGTACCCTT GGTACCCTAG GCTCCCAGGG CAGGTGAGGA 1500 

GCAGGCTCCA CAGACTSTCC TGGGGCCCAG AGGAGCTGGT GCTGACCTAG CTAGGGATGC 1560 

AAGAGTGAGC AAGCAGCACC CCCACCTGCT GGCTTGGCCT CAAGGTGCCT CCACCCCTGC 1620 

CCTCCCCTTC ATCCCAGGGG GTCTGMCTGA GAATGGAGAA GGAGAAGCTA CAAAGTGGGC 1680 

ATCCAAGCCG GGTTCTGGCT GCAGAAGTTC TGCCTCTGCC TGGGGTCTTG GCCACATTGG 1740 

AGAAAAACAG GCTCAAAGTG GGGCTGGGAC CTGGTGGGTG AACCTGAGCT CTCCCAGGAG 1800 

ACAACTTAGC TGCCAGTCAC CACCTATGAG GCTCTTCTAC CCCGTGCCTG CACCTCGGCC 1860 

AGCATCTCCT ATGCTCCCTG GGTCCCCCAG ACCTCTCTGT GTTGTGTGCG TGGCAGCCTC 1920 
CAGGAATAAA CATTCTTGTT GTCCTTTGTA AAAAAAAAAA AAAAAAAA 



Seq ID NO: 257 Protein sequence:
Protein Accession #: NP_057405

1 11 21 31 41 51
| | | | | |
MAWNTNLRWR LPLTCLLLQV IMVILFGVFV RYDFEADAHW WSERTHKNLS DMENEFYYRY 60
PSFQDVHVMV FVGFGFLMTF LQRYGFSAVG FNFLLAAFGI QWALLMQGWF HFLQDRYIVV 120
GVENLINADF CVASVCVAFG AVLGKVSPIQ LLIMTFFQVT LFAVNEFILL NLLKVKDAGG 180
SMTIHTFGAY FGLTVTRILY RRNLEQSKER QNSVYQSDLF AMIGTLFLWM YWPSFNSAIS 240
YHGDSQHRAA INTYCSLAAC VLTSVAISSA LHKKGKLDMV HIQNATLAGG VAVGTAAEMM 300
LMPYGALIIG FVCGIISTLG FVYLTPFLES RLHIQDTCGI NNLHGIPGII GGIVGAVTAA 360
SASLEVYGKE GLVHSFDFQG FNGDWTARTQ GKFQIYGLLV TLAMALMGGI IVGLILRLPF 420
WGQPSDENCF EDAVYWEMPE GNSTVYIPED PTFKPSGPSV PSVPMVSPLP MASSVPLVP



Seq ID NO: 258 DNA sequence 

Nucleic Acid Accession #: NM_002358.2

Coding sequence: 75..692



1 11 21 31 41 51 

I I I I I I 

GGGAAGTGCT GTTGGAGCCG CTGTGGTTGC TGTCCGCGGA GTGGAAGCGC GTGCTTTTGT 60 

TTGTGTCCCT GGCCATGGCG CTGCAGCTCT CCCGGGAGCA GGGAATCACC CTGCGCGGGA 120 

GCGCCGAAAT CGTGGCCGAG TTCTTCTCAT TCGGCATCAA CAGCATTTTA TATCAGCGTG 180 

GCATATATCC ATCTGAAACC TTTACTCGAG TGCAGAAATA CGGACTCACC TTGCTTGTAA 240 

CTACTGATCT TGAGCTCATA AAATACCTAA ATAATGTGGT GGAACAACTG AAAGATTGGT 300 

TATACAAGTG TTCAGTTCAG AAACTGGTTG TAGTTATCTC AAATATTGAA AGTGGTGAGG 360 

TCCTGGAAAG ATGGCAGTTT GATATTGAGT GTGACAAGAC TGCAAAAGAT GACAGTGCAC 420 

CCAGAGAAAA GTCTCAGAAA GCTATCCAGG ATGAAATCCG TTCAGTGATC AGACAGATCA 480

CAGCTACGGT GACATTTCTG CCACTGTTGG AAGTTTCTTG TTCATTTGAT CTGCTGATTT 540 

ATACAGACAA AGATTTGGTT GTACCTGAAA AATGGGAAGA GTCGGGACCA CAGTTTATTA 600 

CCAATTCTGA GGAAGTCCGC CTTCGTTCAT TTACTACTAC AATCCACAAA GTAAATAGCA 660 

TGGTGGCCTA CAAAATTCCT GTCAATGACT GAGGATGACA TGAGGAAAAT AATGTAATTG 720 

TAATTTTGAA ATGTGGTTTT CCTGAAATCA GGTCATCTAT AGTTGATATG TTTTATTTCA 780 

TTGGTTAATT TTTACATGGA GAAAACCAAA ATGATACTTA CTGAACTGTG TGTAATTGTT 840

CCTTTATTTT TTTGGTACCT ATTTGACTTA CCATGGAGTT AACATCATGA ATTTATTGCA 900 

CATTGTTCAA AAGGAACCAG GAGGTTTTTT TGTCAACATT GTGATGTATA TTCCTTTGAA 960 

GATAGTAACT GTAGATGGAA AAACTTGTGC TATAAAGCTA GATGCTTTCC TAAATCAGAT 1020 

GTTTTGGTCA AGTAGTTTGA CTCAGTATAG GTAGGGAGAT ATTTAAGTAT AAAATACAAC 1080 

AAAGGAAGTC TAAATATTCA GAATCTTTGT TAAGGTCCTG AAAGTAACTC ATAATCTATA 1140 

AACAATGAAA TATTGCTGTA TAGCTCCTTT TGACCTTCAT TTCATGTATA GTTTTCCCTA 1200 

TTGAATCAGT TTCCAATTAT TTGACTTTAA TTTATGTAAC TTGAACCTAT GAAGCAATGG 1260 

ATATTTGTAC TGTTTAATGT TCTGTGATAC AGAACTCTTA AAAATGTTTT TTCATGTGTT 1320 

TTATAAAATC AAGTTTTAAG TGAAAGTGAG GAAATAAAGT TAAGTTTGTT TTAAAAAAAA 1380
AAAAAAAAAA 



Seq ID NO: 259 Protein sequence: 
Protein Accession #: NP_002349.1 

1 11 21 31 41 51 

I I I I I I 

MALQLSREQG ITLRGSAEIV AEFFSFGINS ILYQRGIYPS ETFTRVQKYG LTLLVTTDLE 60
LIKYLNNVVE QLKDWLYKCS VQKLVVVISN IESGEVLERW QFDIECDKTA KDDSAPREKS 120
QKAIQDEIRS VIRQITATVT FLPLLEVSCS FDLLIYTDKD LVVPEKWEES GPQFITNSEE 180
VRLRSFTTTI HKVNSMVAYK IPVND 
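
Each DNA entry in this listing gives a "Coding sequence" range, and the protein entry that follows is the translation of that range. The sketch below illustrates the relationship on the first two rows of Seq ID NO: 258. It assumes the coordinates are 1-based and inclusive and that the standard genetic code applies; the helper names (`clean_rows`, `extract_cds`, `translate`) are hypothetical and are not part of the listing.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}


def clean_rows(listing: str) -> str:
    """Join listing rows into one string, dropping spaces and position counts."""
    return "".join(ch for ch in listing.upper() if ch.isalpha())


def extract_cds(seq: str, start: int, end: int) -> str:
    """Return the coding region for 1-based, inclusive coordinates (assumed)."""
    return seq[start - 1:end]


def translate(cds: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First two rows of Seq ID NO: 258 as printed in the listing.
ROWS = """
GGGAAGTGCT GTTGGAGCCG CTGTGGTTGC TGTCCGCGGA GTGGAAGCGC GTGCTTTTGT 60
TTGTGTCCCT GGCCATGGCG CTGCAGCTCT CCCGGGAGCA GGGAATCACC CTGCGCGGGA 120
"""
seq = clean_rows(ROWS)

# The listing states "Coding sequence: 75..692"; only 120 nt are pasted here,
# so this translates the 15 complete codons in positions 75-119.
prefix = translate(extract_cds(seq, 75, 119))
print(prefix)  # MALQLSREQGITLRG
```

The printed prefix agrees with the first 15 residues of Seq ID NO: 259. The stated range 75..692 spans 618 nucleotides, or 206 codons, which is consistent with a 205-residue protein followed by one stop codon.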



Seq ID NO: 260 DNA sequence 
Nucleic Acid Accession #: NM_001211 
Coding sequence: 43..3195



1 11 21 31 41 51 

I I I I I I 

AAAGGCCTGC AGCAGGACGA GGACCTGAGC CAGGAATGCA GGATGGCGGC GGTGAAGAAG 60 

GAAGGGGGTG CTCTGAGTGA AGCCATGTCC CTGGAGGGAG ATGAATGGGA ACTGAGTAAA 120 

GAAAATGTAC AACCTTTAAG GCAAGGGCGG ATCATGTCCA CGCTTCAGGG AGCACTGGCA 180 

CAAGAATCTG CCTGTAACAA TACTCTTCAG CAGCAGAAAC GGGCATTTGA ATATGAAATT 240 




CGATTTTACA CTGGAAATGA CCCTCTGGAT GTTTGGGATA GGTATATCAG CTGGACAGAG 300 

CAGAACTATC CTCAAGGTGG GAAAGAGAGT AATATGTCAA CGTTATTAGA AAGAGCTGTA 360 

GAAGCACTAC AAGGAGAAAA ACGATATTAT AGTGATCCTC GATTTCTCAA TCTCTGGCTT 420 

AAATTAGGGC GTTTATGCAA TGAGCCTTTG GATATGTACA GTTACTTGCA CAACCAAGGG 480 

ATTGGTGTTT CACTTGCTCA GTTCTATATC TCATGGGCAG AAGAATATGA AGCTAGAGAA 540 

AACTTTAGGA AAGCAGATGC GATATTTCAG GAAGGGATTC AACAGAAGGC TGAACCACTA 600 

GAAAGACTAC AGTCCCAGCA CCGACAATTC CAAGCTCGAG TGTCTCGGCA AACTCTGTTG 660 

GCACTTGAGA AAGAAGAAGA GGAGGAAGTT TTTGAGTCTT CTGTACCACA ACGAAGCACA 720 

CTAGCTGAAC TAAAGAGCAA AGGGAAAAAG ACAGCAAGAG CTCCAATCAT CCGTGTAGGA 780 

GGTGCTCTCA AGGCTCCAAG CCAGAACAGA GGACTCCAAA ATCCATTTCC TCAACAGATG 840 

CAAAATAATA GTAGAATTAC TGTTTTTGAT GAAAATGCTG ATGAGGCTTC TACAGCAGAG 900 

TTGTCTAAGC CTACAGTCCA GCCATGGATA GCACCCCCCA TGCCCAGGGC CAAAGAGAAT 960 

GAGCTGCAAG CAGGCCCTTG GAACACAGGC AGGTCCTTGG AACACAGGCC TCGTGGCAAT 1020

ACAGCTTCAC TGATAGCTGT ACCCGCTGTG CTTCCCAGTT TCACTCCATA TGTGGAAGAG 1080 

ACTGCACAAC AGCCAGTTAT GACACCATGT AAAATTGAAC CTAGTATAAA CCACATCCTA 1140 

AGCACCAGAA AGCCTGGAAA GGAAGAAGGA GATCCTCTAC AAAGGGTTCA GAGCCATCAG 1200 

CAAGCGTCTG AGGAGAAGAA AGAGAAGATG ATGTATTGTA AGGAGAAGAT TTATGCAGGA 1260 

GTAGGGGAAT TCTCCTTTGA AGAAATTCGG GCTGAAGTTT TCCGGAAGAA ATTAAAAGAG 1320 

CAAAGGGAAG CCGAGCTATT GACCAGTGCA GAGAAGAGAG CAGAAATGCA GAAACAGATT 1380 

GAAGAGATGG AGAAGAAGCT AAAAGAAATC CAAACTACTC AGCAAGAAAG AACAGGTGAT 1440 

CAGCAAGAAG AGACGATGCC TACAAAGGAG ACAACTAAAC TGCAAATTGC TTCCGAGTCT 1500 

CAGAAAATAC CAGGAATGAC TCTATCCAGT TCTGTTTGTC AAGTAAACTG TTGTGCCAGA 1560 

GAAACTTCAC TTGCGGAGAA CATTTGGCAG GAACAACCTC ATTCTAAAGG TCCCAGTGTA 1620 

CCTTTCTCCA TTTTTGATGA GTTTCTTCTT TCAGAAAAGA AGAATAAAAG TCCTCCTGCA 1680 

GATCCCCCAC GAGTTTTAGC TCAACGAAGA CCCCTTGCAG TTCTCAAAAC CTCAGAAAGC 1740 

ATCACCTCAA ATGAAGATGT GTCTCCAGAT GTTTGTGATG AATTTACAGG AATTGAACCC 1800 

TTGAGCGAGG ATGCCATTAT CACAGGCTTC AGAAATGTAA CAATTTGTCC TAACCCAGAA 1860 

GACACTTGTG ACTTTGCCAG AGCAGCTCGT TTTGTATCCA CTCCTTTTCA TGAGATAATG 1920 

TCCTTGAAGG ATCTCCCTTC TGATCCTGAG AGACTGTTAC CGGAAGAAGA TCTAGATGTA 1980 

AAGACCTCTG AGGACCAGCA GACAGCTTGT GGCACTATCT ACAGTCAGAC TCTCAGCATC 2040 

AAGAAGCTGA GCCCAATTAT TGAAGACAGT CGTGAAGCCA CACACTCCTC TGGCTTCTCT 2100 

GGTTCTTCTG CCTCGGTTGC AAGCACCTCC TCCATCAAAT GTCTTCAAAT TCCTGAGAAA 2160 

CTAGAACTTA CTAATGAGAC TTCAGAAAAC CCTACTCAGT CACCATGGTG TTCACAGTAT 2220 

CGCAGACAGC TACTGAAGTC CCTACCAGAG TTAAGTGCCT CTGCAGAGTT GTGTATAGAA 2280 

GACAGACCAA TGCCTAAGTT GGAAATTGAG AAGGAAATTG AATTAGGTAA TGAGGATTAC 2340 

TGCATTAAAC GAGAATACCT AATATGTGAA GATTACAAGT TATTCTGGGT GGCGCCAAGA 2400 

AACTCTGCAG AATTAACAGT AATAAAGGTA TCTTCTCAAC CTGTCCCATG GGACTTTTAT 2460 

ATCAACCTCA AGTTAAAGGA ACGTTTAAAT GAAGATTTTG ATCATTTTTG CAGCTGTTAT 2520 

CAATATCAAG ATGGCTGTAT TGTTTGGCAC CAATATATAA ACTGCTTCAC CCTTCAGGAT 2580 

CTTCTCCAAC ACAGTGAATA TATTACCCAT GAAATAACAG TGTTGATTAT TTATAACCTT 2640 

TTGACAATAG TGGAGATGCT ACACAAAGCA GAAATAGTCC ATGGTGACTT GAGTCCAAGG 2700 

TGTCTGATTC TCAGAAACAG AATCCACGAT CCCTATGATT GTAACAAGAA CAATCAAGCT 2760 

TTGAAGATAG TGGACTTTTC CTACAGTGTT GACCTTAGGG TGCAGCTGGA TGTTTTTACC 2820 

CTCAGCGGCT TTCGGACTGT ACAGATCCTG GAAGGACAAA AGATCCTGGC TAACTGTTCT 2880 

TCTCCCTACC AGGTAGACCT GTTTGGTATA GCAGATTTAG CACATTTACT ATTGTTCAAG 2940 

GAACACCTAC AGGTCTTCTG GGATGGGTCC TTCTGGAAAC TTAGCCAAAA TATTTCTGAG 3000 

CTAAAAGATG GTGAATTGTG GAATAAATTC TTTGTGCGGA TTCTGAATGC CAATGATGAG 3060 

GCCACAGTGT CTGTTCTTGG GGAGCTTGCA GCAGAAATGA ATGGGGTTTT TGACACTACA 3120 

TTCCAAAGTC ACCTGAACAA AGCCTTATGG AAGGTAGGGA AGTTAACTAG TCCTGGGGCT 3180 

TTGCTCTTTC AGTGAGCTAG GCAATCAAGT CTCACAGATT GCTGCCTCAG AGCAATGGTT 3240 

GTATTGTGGA ACACTGAAAC TGTATGTGCT GTAATTTAAT TTAGGACACA TTTAGATGCA 3300 

CTACCATTGC TGTTCTACTT TTTGGTACAG GTATATTTTG ACGTCACTGA TATTTTTTAT 3360 

ACAGTGATAT ACTTACTCAT GGCCTTGTCT AACTTTTGTG AAGAACTATT TTATTCTAAA 3420 

CAGACTCATT ACAAATGGTT ACCTTGTTAT TTAACCCATT TGTCTCTACT TTTCCCTGTA 3480 

CTTTTCCCAT TTGTAATTTG TAAAATGTTC TCTTATGATC ACCATGTATT TTGTAAATAA 3540 
TAAAATAGTA TCTGTTAAAA AAAAAAAAAA AAAAAAAAAA AAA

Seq ID NO: 261 Protein sequence: 
Protein Accession #: NP_001202 

1 11 21 31 41 51 

I I I I I I 

MAAVKKEGGA LSEAMSLEGD EWELSKENVQ PLRQGRIMST LQGALAQESA CNNTLQQQKR 60 

AFEYEIRFYT GNDPLDVWDR YISWTEQNYP QGGKESNMST LLERAVEALQ GEKRYYSDPR 120 

FLNLWLKLGR LCNEPLDMYS YLHNQGIGVS LAQFYISWAE EYEARENFRK ADAIFQEGIQ 180 

QKAEPLERLQ SQHRQFQARV SRQTLLALEK EEEEEVFESS VPQRSTLAEL KSKGKKTARA 240

PIIRVGGALK APSQNRGLQN PFPQQMQNNS RITVFDENAD EASTAELSKP TVQPWIAPPM 300

PRAKENELQA GPWNTGRSLE HRPRGNTASL IAVPAVLPSF TPYVEETAQQ PVMTPCKIEP 360

SINHILSTRK PGKEEGDPLQ RVQSHQQASE EKKEKMMYCK EKIYAGVGEF SFEEIRAEVF 420

RKKLKEQREA ELLTSAEKRA EMQKQIEEME KKLKEIQTTQ QERTGDQQEE TMPTKETTKL 480

QIASESQKIP GMTLSSSVCQ VNCCARETSL AENIWQEQPH SKGPSVPFSI FDEFLLSEKK 540

NKSPPADPPR VLAQRRPLAV LKTSESITSN EDVSPDVCDE FTGIEPLSED AIITGFRNVT 600

ICPNPEDTCD FARAARFVST PFHEIMSLKD LPSDPERLLP EEDLDVKTSE DQQTACGTIY 660

SQTLSIKKLS PIIEDSREAT HSSGFSGSSA SVASTSSIKC LQIPEKLELT NETSENPTQS 720

PWCSQYRRQL LKSLPELSAS AELCIEDRPM PKLEIEKEIE LGNEDYCIKR EYLICEDYKL 780

FWVAPRNSAE LTVIKVSSQP VPWDFYINLK LKERLNEDFD HFCSCYQYQD GCIVWHQYIN 840

CFTLQDLLQH SEYITHEITV LIIYNLLTIV EMLHKAEIVH GDLSPRCLIL RNRIHDPYDC 900

NKNNQALKIV DFSYSVDLRV QLDVFTLSGF RTVQILEGQK ILANCSSPYQ VDLFGIADLA 960

HLLLFKEHLQ VFWDGSFWKL SQNISELKDG ELWNKFFVRI LNANDEATVS VLGELAAEMN 1020
GVFDTTFQSH LNKALWKVGK LTSPGALLFQ 

Seq ID NO: 262 DNA sequence
Nucleic Acid Accession #: NM_003784
Coding sequence: 365..1507

1 11 21 31 41 51
| | | | | |
GTCTACTTAT CAATAAGCAG CTGCCTGTGC AGAGTGCAGG CTGCACCTTT GGACAGCCTT 60
TAAAACTGAA TTCTCAGAAT TTTAGAACAA ATTTTTGTCT AGAAATGCTG ACTTTGGTTC 120
ATTAGGTAGT GGTAAAACAG GCTCCCTTCG AAGCTCTCCT TCATCACCTT CCTAAGTGCA 180
TGTACAGGGA AGCTCTCCTT CATCACCTTC CTAAGTGCAT GGGGGAAAAT ACCTAGGGCT 240
CAACAGTCTT GAGAAGTGTG GAAACATTTT CTTTGTGAGT GAGAACAGAT CACCTAGAGA 300
AAGGAAACCA GATTCCCATC ACTGCTTCTG GGTATCAGAT GCTAGCGCTG CACTCCATTT 360
TGCAATGGCC TCCCTTGCTG CAGCAAATGC AGAGTTTTGC TTCAACCTGT TCAGAGAGAT 420
GGATGACAAT CAAGGAAATG GAAATGTGTT CTTTTCCTCT CTGAGCCTCT TCGCTGCCCT 480
GGCCCTGGTC CGCTTGGGCG CTCAAGATGA CTCCCTCTCT CAGATTGATA AGTTGCTTCA 540
TGTTAACACT GCCTCAGGAT ATGGAAACTC TTCTAATAGT CAGTCAGGGC TCCAGTCTCA 600
ACTGAAAAGA GTTTTTTCTG ATATAAATGC ATCCCACAAG GATTATGATC TCAGCATTGT 660
GAATGGGCTT TTTGCTGAAA AAGTGTATGG CTTTCATAAG GACTACATTG AGTGTGCCGA 720
AAAATTATAC GATGCCAAAG TGGAGCGAGT TGACTTTACG AATCATTTAG AAGACACTAG 780
ACGTAATATT AATAAGTGGG TTGAAAATGA AACACATGGC AAAATCAAGA ACGTGATTGG 840
TGAAGGTGGC ATAAGCTCAT CTGCTGTAAT GGTGCTGGTG AATGCTGTGT ACTTCAAAGG 900
CAAGTGGCAA TCAGCCTTCA CCAAGAGCGA AACCATAAAT TGCCATTTCA AATCTCCCAA 960
GTGCTCTGGG AAGGCAGTCG CCATGATGCA TCAGGAACGG AAGTTCAATT TGTCTGTTAT 1020
TGAGGACCCA TCAATGAAGA TTCTTGAGCT CAGATACAAT GGTGGCATAA ACATGTACGT 1080
TCTGCTGCCT GAGAATGACC TCTCTGAAAT TGAAAACAAA CTGACCTTTC AGAATCTAAT 1140
GGAATGGACC AATCCAAGGC GAATGACCTC TAAGTATGTT GAGGTATTTT TTCCTCAGTT 1200
CAAGATAGAG AAGAATTATG AAATGAAACA ATATTTGAGA GCCCTAGGGC TGAAAGATAT 1260
CTTTGATGAA TCCAAAGCAG ATCTCTCTGG GATTGCTTCG GGGGGTCGTC TGTATATATC 1320
AAGGATGATG CACAAATCTT ACATAGAGGT CACTGAGGAG GGCACCGAGG CTACTGCTGC 1380
CACAGGAAGT AATATTGTAG AAAAGCAACT CCCTCAGTCC ACGCTGTTTA GAGCTGACCA 1440
CCCATTCCTA TTTGTTATCA GGAAGGATGA CATCATCTTA TTCAGTGGCA AAGTTTCTTG 1500
CCCTTGAAAA TCCAATTGGT TTCTGTTATA GCAGTCCCCA CAACATCAAA GRACCACCAC 1560
AAGTCAATAG ATYTGRGTTT AATTGGAAAA ATGTGGTGTT TCCTTTGAGT TTATTTCTTC 1620
CTAACATTGG TCAGCAGATG ACACTGGTGA CTTGACCCTT CCTAGACACC TGGTTGATTG 1680
TCCTGATCCC TGCTCTTAGC ATTCTACCAC CATGTGTCTC ACCCATTTCT AATTTCATTG 1740
TCTTTCTTCC CACGCTCATT TCTATCATTC TCCCCCATGA CCCGTCTGGA AATTATGGAG 1800
RGTGCTCAAC TGGTAAGGAG AACGTAGAAG TAGCCCTAGG GATCCTTTTT GAAACTCTAC 1860
AGTTATCGCA GATATTCTAG CTTCATTGTA AGCAATCTAG GAAATAAGCC CTGCTGCTTT 1920
CTAGAAATAA GTGTGAAGGA TAAATTTTCT TTGTTGACCT ATGAAGATTT TAGAGTTTAC 1980
CTTCATATGT TTGATTTTAA ATCAGTGTAT AATCTAGATG GTAAAAAATG TGAAATTGGG 2040
ATTAGGGACC TACCAAAATA TTTCATTAAT GCTTTCAATT GACAAATTTT GGCCTTTCTT 2100
TGATAAGACA ATATGTACAT GTTTTTTCAA ATATTAAAGA TCTTTTAACT GTTGGCAGTT 2160
GTTATCTACA GAATCATATT TCATATGCTG TGTAGTTTAT AAGTTTTTCC TCTATTTATC 2220
AGAATAAAGA AATACAACAT ACCTGTAAA



Seq ID NO: 263 Protein sequence:
Protein Accession #: NP_003775

1 11 21 31 41 51
| | | | | |
MASLAAANAE FCFNLFREMD DNQGNGNVFF SSLSLFAALA LVRLGAQDDS LSQIDKLLHV 60
NTASGYGNSS NSQSGLQSQL KRVFSDINAS HKDYDLSIVN GLFAEKVYGF HKDYIECAEK 120
LYDAKVERVD FTNHLEDTRR NINKWVENET HGKIKNVIGE GGISSSAVMV LVNAVYFKGK 180
WQSAFTKSET INCHFKSPKC SGKAVAMMHQ ERKFNLSVIE DPSMKILELR YNGGINMYVL 240
LPENDLSEIE NKLTFQNLME WTNPRRMTSK YVEVFFPQFK IEKNYEMKQY LRALGLKDIF 300
DESKADLSGI ASGGRLYISR MMHKSYIEVT EEGTEATAAT GSNIVEKQLP QSTLFRADHP 360
FLFVIRKDDI ILFSGKVSCP



Seq ID NO: 264 DNA sequence
Nucleic Acid Accession #: AB052906
Coding sequence: 74-814

1 11 21 31 41 51
| | | | | |
AAAACCTTGA GGTGATTCAT CTTCCAGGCT CTCCTTCCAT CAAGTCTCTC CTCCCTAGCG 60
CTCTGGGTCC TTAATGGCAG CAGCCGCCGC TACCAAGATC CTTCTGTGCC TCCCGCTTCT 120
GCTCCTGCTG TCCGGCTGGT CCCGGGCTGG GCGAGCCGAC CCTCACTCTC TTTGCTATGA 180
CATCACCGTC ATCCCTAAGT TCAGACCTGG ACCACGGTGG TGTGCGGTTC AAGGCCAGGT 240
GGATGAAAAG ACTTTTCTTC ACTATGACTG TGGCAACAAG ACAGTCACAC CTGTCAGTCC 300
CCTGGGGAAG AAACTAAATG TCACAACGGC CTGGAAAGCA CAGAACCCAG TACTGAGAGA 360
GGTGGTGGAC ATACTTACAG AGCAACTGCG TGACATTCAG CTGGAGAATT ACACACCCAA 420
GGAACCCCTC ACCCTGCAGG CCAGGATGTC TTGTGAGCAG AAAGCTGAAG GACACAGCAG 480
TGGATCTTGG CAGTTCAGTT TCGATGGGCA GATCTTCCTC CTCTTTGACT CAGAGAAGAG 540
AATGTGGACA ACGGTTCATC CTGGAGCCAG AAAGATGAAA GAAAAGTGGG AGAATGACAA 600
GGTTGTGGCC ATGTCCTTCC ATTACTTCTC AATGGGAGAC TGTATAGGAT GGCTTGAGGA 660
CTTCTTGATG GGCATGGACA GCACCCTGGA GCCAAGTGCA GGAGCACCAC TCGCCATGTC 720
CTCAGGCACA ACCCAACTCA GGGCCACAGC CACCACCCTC ATCCTTTGCT GCCTCCTCAT 780
CATCCTCCCC TGCTTCATCC TCCCTGGCAT CTGAGGAGAG TCCTTTAGAG TGACAGGTTA 840
AAGCTGATAC CAAAAGGCTC CTGTGAGCAC GGTCTTGATC AAACTCGCCC TTCTGTCTGG 900
CCAGCTGCCC ACGACCTACG GTGTATGTCC AGTGGCCTCC AGCAGATCAT GATGACATCA 960
TGGACCCAAT AGCTCATTCA CTGCCTTGAT TCCTTTTGCC AACAATTTTA CCAGCAGTTA 1020
TACCTAACAT ATTATGCAAT TTTCTCTTGG TGCTACCTGA TGGAATTCCT GCACTTAAAG 1080
TTCTGGCTGA CTAAACAAGA TATATCATTT TCTTTCTTCT CTTTTTGTTT GGAAAATCAA 1140
GTACTTCTTT GAATGATGAT CTCTTTCTTG CAAATGATAT TGTCAGTAAA ATAATCACGT 1200
TAGACTTCAG ACCTCTGGGG ATTCTTTCCG TGTCCTGAAA GAGAATTTTT AAATTATTTA 1260
ATAAGAAAAA ATTTATATTA ATGATTGTTT CCTTTAGTAA TTTATTGTTC TGTACTGATA 1320
TTTAAATAAA GAGTTCTATT TCCCAAAAAA AAAAAAAAAA A



Seq ID NO: 265 Protein sequence:
Protein Accession #: BAB61048.1

1 11 21 31 41 51
| | | | | |
MAAAAATKIL LCLPLLLLLS GWSRAGRADP HSLCYDITVI PKFRPGPRWC AVQGQVDEKT 60
FLHYDCGNKT VTPVSPLGKK LNVTTAWKAQ NPVLREVVDI LTEQLRDIQL ENYTPKEPLT 120
LQARMSCEQK AEGHSSGSWQ FSFDGQIFLL FDSEKRMWTT VHPGARKMKE KWENDKVVAM 180
SFHYFSMGDC IGWLEDFLMG MDSTLEPSAG APLAMSSGTT QLRATATTLI LCCLLIILPC 240
FILPGI

Seq ID NO: 266 DNA sequence
Nucleic Acid Accession #: XM_084853.1
Coding sequence: 127-444

1 11 21 31 41 51
| | | | | |
ATTGATGATA TATTTAACGA AATCAAATTT GGTGAATATG TGGACACTGG AAAGCTAATC 60
GACAAGATCA ACTTACCAGA TTTCCTAAAA GTGTACCTTA ACCACAAGCC ACCTTTTGGT 120
AACACCATGA GTGGCATCCA CAAGAGCTTT GAGGTGCTCG GTTATACCAA CTCCAAAGGG 180
AAAAAGGCCA TTCGAAGAGA GGACTTCCTG AGACTGCTCG TTACTAAAGG TGAGCATATG 240
ACGGAGGAGG AGATGTTGGA TTGCTTTGCT TCACTGTTTG GCCTGAATCC CGAGGGATGG 300
AAATCCGAGC CTGCAACCTG CTCCGTCAAA GGTTCAGAAA TTTGCCTTGA AGAAGAACTT 360
CCAGACGAAA TCACTGCAGA AATATTCGCG ACTGAAATTC TTGGCTTAAC CATTTCAGAA 420
GATTCCGGCC AGGATGGTCA GTGAAGTTAC CAGGAATGTT TAAAGCACAA AGGACTTTGG 480
GTGTGTGTGC ATGCACATGT GTGTGTTTTC CATGAGGCAC TGCTTTTTAT GCATTTCCCT 540
CCCCCCTCTC ATCTTTAGAA CATTTAGACA TTAAAGCAAG TTTCTGGTGA GCAATG



Seq ID NO: 267 Protein sequence: 
Protein Accession #: XP_084853.1 



1 11 21 31 41 51

I I I I I I 

MSGIHKSFEV LGYTNSKGKK AIRREDFLRL LVTKGEHMTE EEMLDCFASL FGLNPEGWKS 60 
EPATCSVKGS EICLEEELPD EITAEIFATE ILGLTISEDS GQDGQ 

Seq ID NO: 268 DNA sequence

Nucleic Acid Accession #: NM_001898
Coding sequence: 57-482

1 11 21 31 41 51

| | | | | |

GGCTCTCACC CTCCTCTCCT GCAGCTCCAG CTTTGTGCTC TGCCTCTGAG GAGACCATGG 60 

CCCAGTATCT GAGTACCCTG CTGCTCCTGC TGGCCACCCT AGCTGTGGCC CTGGCCTGGA 120 

GCCCCAAGGA GGAGGATAGG ATAATCCCGG GTGGCATCTA TAACGCAGAC CTCAATGATG 180 

AGTGGGTACA GCGTGCCCTT CACTTCGCCA TCAGCGAGTA TAACAAGGCC ACCAAAGATG 240 

ACTACTACAG ACGTCCGCTG CGGGTACTAA GAGCCAGGCA ACAGACCGTT GGGGGGGTGA 300

ATTACTTCTT CGACGTAGAG GTGGGCCGCA CCATATGTAC CAAGTCCCAG CCCAACTTGG 360 

ACACCTGTGC CTTCCATGAA CAGCCAGAAC TGCAGAAGAA ACAGTTGTGC TCTTTCGAGA 420 

TCTACGAAGT TCCCTGGGAG AACAGAAGGT CCCTGGTGAA ATCCAGGTGT CAAGAATCCT 480 

AGGGATCTGT GCCAGGCCAT TCGCACCAGC CACCACCCAC TCCCACCCCC TGTAGTGCTC 540

CCACCCCTGG ACTGGTGGCC CCCACCCTGC GGGAGGCCTC CCCATGTGCC TGCGCCAAGA 600

GACAGACAGA GAAGGCTGCA GGAGTCCTTT GTTGCTCAGC AGGGCGCTCT GCCCTCCCTC 660 

CTTCCTTCTT GCTTCTAATA GCCCTGGTAC ATGGTACACA CCCCCCCACC TCCTGCAATT 720 
AAACAGTAGC ATCGCC 

Seq ID NO: 269 Protein sequence:
Protein Accession #: NP_001889.1

1 11 21 31 41 51
| | | | | |
MAQYLSTLLL LLATLAVALA WSPKEEDRII PGGIYNADLN DEWVQRALHF AISEYNKATK 60
DDYYRRPLRV LRARQQTVGG VNYFFDVEVG RTICTKSQPN LDTCAFHEQP ELQKKQLCSF 120
EIYEVPWENR RSLVKSRCQE S

Seq ID NO: 270 DNA sequence 
Nucleic Acid Accession #: XM_093210
Coding sequence: 13-1854

1 11 21 31 41 51

| | | | | |

ATGGCAAGCG CCGGAATCTC CTCAGCTGCC GTTTCACAAA AGAGGTACCA GGTCCGCACC 60

AAACGAGCAC ACAAGCAGCA CCAGGAGCTG CAGAAGAAGG AGGCGGCAGC GATGGACCAG 120 

GGCAGAGGGA ATGGGGAGGG GGCATCCTAC CCCATATCTG AGGTGCGACT GCGGGACGTA 180 

GAGCGGACTG GGCCTTTCCC GTTGGCGCGT GGCCTCAATC AGGACTTCTT GCCCACGTGC 240 

GCCTTCAAAA CGGTAAGAGC TGCAACTGAA CGTGTGAGAC ATGGTGCAGA TAGGCTGAGA 300 

GGCGGCGGGA GAGATGCCCA TGAACTCAAG TACCCGGACA CGCCCTCCAC TTCTACCACC 360

ACGAGTAACA CCGCCCCCAC GGGACCGCTC TCGAGGTCCC CCAAGCCAAG GACGCAAGGA 420 

GGAACGCCCC GGCGCGCGGC CAGCAGCGGC GGGCACCGGC CCAATGGCCA CGGAACTCAG 480 

CACTGGCAGT CGGCCCTCCT CACACCGCAG GCGTGCAGTG TGGCCGACGG AGCCTCCCGG 540 

GCCGAGGACC CAGCTAGGCC GTCACCCCGG TTGCTCCCAC GGGAAGGGGC ACCAGGCAAA 600 

CTGCCCAAGG CCCCGAGCCC AGGCTCCCTG GCGGAGGCCT CCGCTGGTCC CGCCCAGATC 660 

ATGGCCGCCA CCAGGCTCCC GAGCCATGGC TTCCTGTCCG GGAACGGCCC GGCGTCCTGG 720 
CTGTCCAGCT AG 



Seq ID NO: 271 Protein sequence:
Protein Accession #: XP_093210

1 11 21 31 41 51
| | | | | |
MLRHGEQKRK RARKKWDFLP TCAFKTVRAA TERVRHGADR LRGGGRDAHE LKYPDTPSTS 60
TTTSNTAPTG PLSRSPKPRT QGGTPRRRPA AAGTRANGHG TQHWQSALLT PQACSVADGA 120
SRAEDPARPS PRLLPREGAP GKLPKAPSPG SLAEASAGLL AHVRLQNADA QRVSISQALP 180
PNSSVGRKEE RPGAGQQRRA PAPMATELST GSRPSSHRRR AVWPTEPPGP RTQLEPSPRL 240
LPREGAPGKL PKAPSPGSLA EASAGPAQIM AATRLPSRGF LSGNGPASWL SS



Seq ID NO: 272 DNA sequence

Nucleic Acid Accession #: Eos sequence

Coding sequence: 1..732

1 11 21 31 41 51
| | | | | |
GGATACTGTG TCACTCAAAG TAATGGGAGG GAGAGAGAAC AGGGAGGGTA GGGATGCTTT 60
TGAAAAAGCT TTTTTTCCCA CTTTTAACTT GCTTTAGCGT TAAGAGTACT TACCAGCTAA 120
TAATGTGGAG GAAATTATTC TTTCTCATTG GAGATTACAG AATATATCTA TTCATCTTGA 180
ATACCCACTT GAAGCCTCTG TAGAAATGTC TCGTCCTCCG GTTGTATTTC TAAAACCTAC 240
ATGATTTTGT CTTGTTTCTG CAGTGAGAAA TTACATCCAT AGCAAAGACA AAAGTCTTTT 300
TAAATTATTT TTATTTATCT TTCATATAGT TCTTACAATT TCTAAAAAAT TAACACTCAT 360
TTAGTATCAC AATTTATGGG AGAGGGTTTT TTGTATTTTT AAGCATATGT GGCTTATATA 420
AAAATTGCAG AAGTCATAGG ACTGTCATGT ATTGCAGCTC TGAGAACCAA TGCCTGAAAC 480
TTAAGCC



Seq ID NO: 273 Protein sequence: 
Protein Accession #: Eos sequence 

1 11 21 31

| | | |

MGGRENREGR DAFEKAFFPT FNLL 

Seq ID NO: 274 DNA sequence

Nucleic Acid Accession #: NM_003976.2

Coding sequence: 299-961

1 11 21 31 41 51
| | | | | |
CTCTGAGCTT CTCTGAGCCT TGTTTGCTCA TCTGGAAAAA GGGGATTAAA CCATTTACCT 60
CATGGAGTTG TGAAAGAATA GCTGCAAAGC ACCTAACACA TAGTAAGGTT CCCAGTGCAG 120
CTACTTCTGC TGGGTTGAGT CTAGCTGTGT AGGCCCCTTG TTCCTCACCT GGAGAAACTG 180
GGGTGGCAGG CCGGTCCCCC ACAAAAGATA ACTCATCTCT TAATTTGCAA GCTGCCTCAA 240
CAGGAGGGTG GGGGAACAGC TCAACAATGG CTGATGGGCG CTCCTGGTGT TGATAGAGAT 300
GGAACTTGGA CTTGGAGGCC TCTCCACGCT GTCCCACTGC CCCTGGCCTA GGCGGCAGCC 360
TGCCCTGTGG CCCACCCTGG CCGCTCTGGC TCTGCTGAGC AGCGTCGCAG AGGCCTCCCT 420
GGGCTCCGCG CCCCGCAGCC CTGCCCCCCG CGAAGGCCCC CCGCCTGTCC TGGCGTCCCC 480
CGCCGGCCAC CTGCCGGGGG GACGCACGGC CCGCTGGTGC AGTGGAAGAG CCCGGCGGCC 540
GCCGCCGCAG CCTTCTCGGC CCGCGCCCCC GCCGCCTGCA CCCCCATCTG CTCTTCCCCG 600
CGGGGGCCGC GCGGCGCGGG CTGGGGGCCC GGGCAGCCGC GCTCGGGCAG CGGGGGCGCG 660
GGGCTGCCGC CTGCGCTCGC AGCTGGTGCC GGTGCGCGCG CTCGGCCTGG GCCACCGCTC 720
CGACGAGCTG GTGCGTTTCC GCTTCTGCAG CGGCTCCTGC CGCCGCGCGC GCTCTCCACA 780
CGACCTCAGC CTGGCCAGCC TACTGGGCGC CGGGGCCCTG CGACCGCCCC CGGGCTCCCG 840
GCCCGTCAGC CAGCCCTGCT GCCGACCCAC GCGCTACGAA GCGGTCTCCT TCATGGACGT 900
CAACAGCACC TGGAGAACCG TGGACCGCCT CTCCGCCACC GCCTGCGGCT GCCTGGGCTG 960
AGGGCTCGCT CCAGGGCTTT GCAGACTGGA CCCTTACCGG TGGCTCTTCC TGCCTGGGAC 1020
CCTCCCGCAG AGTCCCACTA GCCAGCGGCC TCAGCCAGGG ACGAAGGCCT CAAAGCTGAG 1080
AGGCCCCTAC CGGTGGGTGA TGGATATCAT CCCCGAACAG GTGAAGGGAC AACTGACTAG 1140
CAGCCCCAGA GCCCTCACCC TGCGGATCCC AGCCTAAAAG ACACCAGAGA CCTCAGCTAT 1200
GGAGCCCTTC GGACCCACTT CTCACAGACT CTGGCACTGG CCAGGCCTCG AACCTGGGAC 1260
CCCTCCTCTG ATGAACACTA CAGTGGCTGA GGCATCAGCC CCCGCCCAGG CCCTGTAGGG 1320
ACAGCATTTG AAGGACACAT ATTGCAGTTG CTTGGTTGAA AGTGCCTGTG CTGGAACTGG 1380
CCTGTACTCA CTCATGGGAG CTGGCCCC



Seq ID NO: 275 Protein sequence:
Protein Accession #: NP_003967.1

1 11 21 31 41 51
| | | | | |
MELGLGGLST LSHCPWPRRQ PALWPTLAAL ALLSSVAEAS LGSAPRSPAP REGPPPVLAS 60
PAGHLPGGRT ARWCSGRARR PPPQPSRPAP PPPAPPSALP RGGRAARAGG PGSRARAAGA 120
RGCRLRSQLV PVRALGLGHR SDELVRFRFC SGSCRRARSP HDLSLASLLG AGALRPPPGS 180
RPVSQPCCRP TRYEAVSFMD VNSTWRTVDR LSATACGCLG

Seq ID NO: 276 DNA sequence

Nucleic Acid Accession #: NM_057091.1

Coding sequence: 783-1445

1 11 21 31 41 51
| | | | | |
ACTGGCCGCT GAGAGAAGAA TCGGGTGGAG CAGAGAGCAG CTGCTGCAGG GCAGACAGCC 60
GGACCCCCAA ATCTGCACGT ACCAGCAGTC AGCCGCCCCA CGCAGGGACC GGCTTACCCC 120
TCGCTCCCCG CCCTCACTCA CTTTCTCCCG CCCTCGGCCC GGCCTCCCAG CTCTCTACTT 180
CGCGTGTCTA CAAACTCAAC [illegible] CGTGCCTCTC CACCGCTCGA GTTCTCTACT 240
CTCCATATCC GAGGGGCCCC TCCCAGCATC TACCCCCCTC CCAACCTCGG GGGACCTAGC 300
CAAGCTAGGG GGGACTGGAT CCGACGGGTG GAGCAGCCAG GTGAGCCCCG AAAGGTGGGG 360
CGGGGCAGGG GCGCTCCCAG CCCCACCCCG GGATCTGGTG ACGCTGGGGC TGGAATTTGA 420
CACCGGACGG CTGCGGCGGC GGGCAGGAGG CTGCTGAGGG ATGGAGTTGG GCCCGGCCCC 480
CAGACAAGGC CCGGGGGCTC CGCCAGCAGC AGGTCCCTCG GGCCCCAGCC CTCGCTGCCA 540

CCCGGGCCTG GAGCCCCACA CCCGAGGGTG CAGACTGGCT GCCAAGGCCA CACTTTTGGC 600 

TAAAAGAGGC ACTGCCAGGT GTACAGTCCT GGGCATGCGC TGTTTGAGCT TCGGGGGAGA 660 

GCCCAGCACT GGTCCCCGGA AAGGTGCCTA GAAGAACAAG GTGCAGGACC CCGTGCTGCC 720 

TCAACAGGAG GGTGGGGGAA CAGCTCAACA ATGGCTGATG GGCGCTCCTG GTGTTGATAG 780 

AGATGGAACT TGGACTTGGA GGCCTCTCCA CGCTGTCCCA CTGCCCCTGG CCTAGGCGGC 840 

AGCCTGCCCT GTGGCCCACC CTGGCCGCTC TGGCTCTGCT GAGCAGCGTC GCAGAGGCCT 900 

CCCTGGGCTC CGCGCCCCGC AGCCCTGCCC CCCGCGAAGG CCCCCCGCCT GTCCTGGCGT 960 

CCCCCGCCGG CCACCTGCCG GGGGGACGCA CGGCCCGCTG GTGCAGTGGA AGAGCCCGGC 1020 

GGCCGCCGCC GCAGCCTTCT CGGCCCGCGC CCCCGCCGCC TGCACCCCCA TCTGCTCTTC 1080 

CCCGCGGGGG CCGCGCGGCG CGGGCTGGGG GCCCGGGCAG CCGCGCTCGG GCAGCGGGGG 1140 

CGCGGGGCTG CCGCCTGCGC TCGCAGCTGG TGCCGGTGCG CGCGCTCGGC CTGGGCCACC 1200 

GCTCCGACGA GCTGGTGCGT TTCCGCTTCT GCAGCGGCTC CTGCCGCCGC GCGCGCTCTC 1260 

CACACGACCT CAGCCTGGCC AGCCTACTGG GCGCCGGGGC CCTGCGACCG CCCCCGGGCT 1320 

CCCGGCCCGT CAGCCAGCCC TGCTGCCGAC CCACGCGCTA CGAAGCGGTC TCCTTCATGG 1380 

ACGTCAACAG CACCTGGAGA ACCGTGGACC GCCTCTCCGC CACCGCCTGC GGCTGCCTGG 1440 

GCTGAGGGCT CGCTCCAGGG CTTTGCAGAC TGGACCCTTA CCGGTGGCTC TTCCTGCCTG 1500 

GGACCCTCCC GCAGAGTCCC ACTAGCCAGC GGCCTCAGCC AGGGACGAAG GCCTCAAAGC 1560 

TGAGAGGCCC CTACCGGTGG GTGATGGATA TCATCCCCGA ACAGGTGAAG GGACAACTGA 1620 

CTAGCAGCCC CAGAGCCCTC ACCCTGCGGA TCCCAGCCTA AAAGACACCA GAGACCTCAG 1680 

CTATGGAGCC CTTCGGACCC ACTTCTCACA GACTCTGGCA CTGGCCAGGC CTCGAACCTG 1740 

GGACCCCTCC TCTGATGAAC ACTACAGTGG CTGAGGCATC AGCCCCCGCC CAGGCCCTGT 1800 

AGGGACAGCA TTTGAAGGAC ACATATTGCA GTTGCTTGGT TGAAAGTGCC TGTGCTGGAA 1860 
CTGGCCTGTA CTCACTCATG GGAGCTGGCC CC 

Seq ID NO: 277 Protein sequence:
Protein Accession #: NP_003967.1

1 11 21 31 41 51
| | | | | |
MELGLGGLST LSHCPWPRRQ PALWPTLAAL ALLSSVAEAS LGSAPRSPAP REGPPPVLAS 60
PAGHLPGGRT ARWCSGRARR PPPQPSRPAP PPPAPPSALP RGGRAARAGG PGSRARAAGA 120
RGCRLRSQLV PVRALGLGHR SDELVRFRFC SGSCRRARSP HDLSLASLLG AGALRPPPGS 180
RPVSQPCCRP TRYEAVSFMD VNSTWRTVDR LSATACGCLG

Seq ID NO: 278 DNA sequence
Nucleic Acid Accession #: NM_057160.1
Coding sequence: 1-714

1 11 21 31 41 51
| | | | | |
ATGCCCGGCC TGATCTCAGC CCGAGGACAG CCCCTCCTTG AGGTCCTTCC TCCCCAAGCC 60
CACCTGGGTG CCCTCTTTCT CCCTGAGGCT CCACTTGGTC TCTCCGCGCA GCCTGCCCTG 120
TGGCCCACCC TGGCCGCTCT GGCTCTGCTG AGCAGCGTCG CAGAGGCCTC CCTGGGCTCC 180
GCGCCCCGCA GCCCTGCCCC CCGCGAAGGC CCCCCGCCTG TCCTGGCGTC CCCCGCCGGC 240
CACCTGCCGG GGGGACGCAC GGCCCGCTGG TGCAGTGGAA GAGCCCGGCG GCCGCCGCCG 300
CAGCCTTCTC GGCCCGCGCC CCCGCCGCCT GCACCCCCAT CTGCTCTTCC CCGCGGGGGC 360
CGCGCGGCGC GGGCTGGGGG CCCGGGCAGC CGCGCTCGGG CAGCGGGGGC GCGGGGCTGC 420
CGCCTGCGCT CGCAGCTGGT GCCGGTGCGC GCGCTCGGCC TGGGCCACCG CTCCGACGAG 480
CTGGTGCGTT TCCGCTTCTG CAGCGGCTCC TGCCGCCGCG CGCGCTCTCC ACACGACCTC 540
AGCCTGGCCA GCCTACTGGG CGCCGGGGCC CTGCGACCGC CCCCGGGCTC CCGGCCCGTC 600
AGCCAGCCCT GCTGCCGACC CACGCGCTAC GAAGCGGTCT CCTTCATGGA CGTCAACAGC 660
ACCTGGAGAA CCGTGGACCG CCTCTCCGCC ACCGCCTGCG GCTGCCTGGG CTGAGGGCTC 720
GCTCCAGGGC TTTGCAGACT GGACCCTTAC CGGTGGCTCT TCCTGCCTGG GACCCTCCCG 780
CAGAGTCCCA CTAGCCAGCG GCCTCAGCCA GGGACGAAGG CCTCAAAGCT GAGAGGCCCC 840
TACCGGTGGG TGATGGATAT CATCCCCGAA CAGGTGAAGG GACAACTGAC TAGCAGCCCC 900
AGAGCCCTCA CCCTGCGGAT CCCAGCCTAA AAGACACCAG AGACCTCAGC TATGGAGCCC 960
TTCGGACCCA CTTCTCACAG ACTCTGGCAC TGGCCAGGCC TCGAACCTGG GACCCCTCCT 1020
CTGATGAACA CTACAGTGGC TGAGGCATCA GCCCCCGCCC AGGCCCTGTA GGGACAGCAT 1080
TTGAAGGACA CATATTGCAG TTGCTTGGTT GAAAGTGCCT GTGCTGGAAC TGGCCTGTAC 1140
TCACTCATGG GAGCTGGCCC C

Seq ID NO: 279 Protein sequence:
Protein Accession #: NP_476501.1

1 11 21 31 41 51
| | | | | |
MPGLISARGQ PLLEVLPPQA HLGALFLPEA PLGLSAQPAL WPTLAALALL SSVAEASLGS 60
APRSPAPREG PPPVLASPAG HLPGGRTARW CSGRARRPPP QPSRPAPPPP APPSALPRGG 120
RAARAGGPGS RARAAGARGC RLRSQLVPVR ALGLGHRSDE LVRFRFCSGS CRRARSPHDL 180
SLASLLGAGA LRPPPGSRPV SQPCCRPTRY EAVSFMDVNS TWRTVDRLSA TACGCLG

Seq ID NO: 280 DNA sequence
Nucleic Acid Accession #: NM_057090.1
Coding sequence: 29-715

1 11 21 31 41 51
| | | | | |
CTGATGGGCG CTCCTGGTGT TGATAGAGAT GGAACTTGGA CTTGGAGGCC TCTCCACGCT 60
GTCCCACTGC CCCTGGCCTA GGCGGCAGGC TCCACTTGGT CTCTCCGCGC AGCCTGCCCT 120
GTGGCCCACC CTGGCCGCTC TGGCTCTGCT GAGCAGCGTC GCAGAGGCCT CCCTGGGCTC 180
CGCGCCCCGC AGCCCTGCCC CCCGCGAAGG CCCCCCGCCT GTCCTGGCGT CCCCCGCCGG 240
CCACCTGCCG GGGGGACGCA CGGCCCGCTG GTGCAGTGGA AGAGCCCGGC GGCCGCCGCC 300
GCAGCCTTCT CGGCCCGCGC CCCCGCCGCC TGCACCCCCA TCTGCTCTTC CCCGCGGGGG 360
CCGCGCGGCG CGGGCTGGGG GCCCGGGCAG CCGCGCTCGG GCAGCGGGGG CGCGGGGCTG 420
CCGCCTGCGC TCGCAGCTGG TGCCGGTGCG CGCGCTCGGC CTGGGCCACC GCTCCGACGA 480
GCTGGTGCGT TTCCGCTTCT GCAGCGGCTC CTGCCGCCGC GCGCGCTCTC CACACGACCT 540
CAGCCTGGCC AGCCTACTGG GCGCCGGGGC CCTGCGACCG CCCCCGGGCT CCCGGCCCGT 600
CAGCCAGCCC TGCTGCCGAC CCACGCGCTA CGAAGCGGTC TCCTTCATGG ACGTCAACAG 660

CACCTGGAGA ACCGTGGACC GCCTCTCCGC CACCGCCTGC GGCTGCCTGG GCTGAGGGCT 720 

CGCTCCAGGG CTTTGCAGAC TGGACCCTTA CCGGTGGCTC TTCCTGCCTG GGACCCTCCC 780 

GCAGAGTCCC ACTAGCCAGC GGCCTCAGCC AGGGACGAAG GCCTCAAAGC TGAGAGGCCC 840 

CTACCGGTGG GTGATGGATA TCATCCCCGA ACAGGTGAAG GGACAACTGA CTAGCAGCCC 900 

CAGAGCCCTC ACCCTGCGGA TCCCAGCCTA AAAGACACCA GAGACCTCAG CTATGGAGCC 960 

CTTCGGACCC ACTTCTCACA GACTCTGGCA CTGGCCAGGC CTCGAACCTG GGACCCCTCC 1020 

TCTGATGAAC ACTACAGTGG CTGAGGCATC AGCCCCCGCC CAGGCCCTGT AGGGACAGCA 1080 

TTTGAAGGAC ACATATTGCA GTTGCTTGGT TGAAAGTGCC TGTGCTGGAA CTGGCCTGTA 1140 
CTCACTCATG GGAGCTGGCC CC 

Seq ID NO: 281 Protein sequence: 
Protein Accession #: NP_476431.1

1 11 21 31 41 51
| | | | | |
MELGLGGLST LSHCPWPRRQ APLGLSAQPA LWPTLAALAL LSSVAEASLG SAPRSPAPRE 60
GPPPVLASPA GHLPGGRTAR WCSGRARRPP PQPSRPAPPP PAPPSALPRG GRAARAGGPG 120
SRARAAGARG CRLRSQLVPV RALGLGHRSD ELVRFRFCSG SCRRARSPHD LSLASLLGAG 180
ALRPPPGSRP VSQPCCRPTR YEAVSFMDVN STWRTVDRLS ATACGCLG

Seq ID NO: 282 DNA sequence 

Nucleic Acid Accession #: Eos sequence

1 11 21 31 41 51

| | | | | |

CTACTGCACC TGCCCTCTGT TTCCTTTGGA AATCTCTTAC CTTTCATTAG GGTTTCTTTC 60 

ATAGCAATTT CCTTTGGTTT TTAAGACTTC TACATTGCTT TTTCTTTTAT TATCTGTGCT 120 

CCGTGAACCT TATGAATGCT GCTTAAAAAT AATGTCAAAA TATGTTTTAG CTGCCTACTC 180 

AGGTAACGTT TTCTTTTGCT CTCATCTTGG TTTCCATATA CTATTTTTGG TTTTTTGTGA 240 

GATCTAATCA ATGATCTAGT CAGAAGCTAC TTCACTGGCT AACAGTGATC ATGTTCATGT 300 

GCTAAAAATG AACTTGAAAC ACGGAAGTAG TGGTTGGTCC AGTTTGAAAG CTCTTATTAG 360 

TATTCTTCAT CCTGGCTGTA ATAATAGCCA TTATTTGTTA TGCCTTTGTT ATGTAGCAGA 420 

CACTCTTAAG GATTTTATGT GTATTATTCA AATTGCTATT ACTGTTCTTT TTATAGTTGA 480 
GAATCTCAGG ATACCTACAT TTATCACTTT TTCAATATAT ATGTATTTCT TATT 

Seq ID NO: 283 DNA sequence 

Nucleic Acid Accession #: Eos sequence

Coding sequence: 564-1461

1 11 21 31 41 51

I I I I I I

GAGACTTTTA ATCATCTATC CCTTGTGCTT TACGCAGACC CTACAATACA CTAGAGGCTT 60 

CAAAGAGGTC AAAAATTCAC ATGTGTAGAC AAATTAGGTC CCTTAAGATG CCAGGCAAAC 120 

GAAGTGCTAC CAAAACACGC AATGACTGTC CTAAAAGTGC GTTCTGGGAT ACACCTGTAA 180 

ACTTGGATCA AGTTCCCTCC CCTCTCCTCA AAATATATCG ACTTGTGCTG AAAGAAATCA 240 

CGACCGATGC TCACAATTCT GACCTCGTAA TTATATAGGG GGTGGTTTTG GTTTCTGCGT 300 

CTTTCCCTGA TTCAGTGGCA GGTAACATAT TTCATGTACA AAATGAACTG CAACACCACG 360 

GCAAACAAGG GACAGGCCCT CAAAGTTGTC GGTAGGGAGC CAGGACCCCG CCAGTGGCGT 420 

GGGGAGACAC CGTACTAAAC AAGCTTGCAA ACAGCAGGCA CCTTCCTGCC ACTGAGGAGG 480 

AAGGGCTGGC TAAGGGAGGC CGGGGCGGAG GAAGCCAAGC TCTGCAGGCC CTGACAAAGT 540 

CCTCCCGGCC TCCACGCGTC GCCATGGCAA CGCGGGGTCT GTGCTGGCCG GGATTGGCCG 600 

GCCTGGCGCG CGCAGGGCCC GCTGGGAAAG CGCGTCCCCG CCGCGGCTCC GCCAGTTTGA 660 

ACTTGGCGGG CCAGATGTGG GCGGCGGGGC GCTGGGGGCC TACTTTTCCC TCTTCCTACG 720 

CCGGTTTCTC TGCTGACTGC AGACCCAGGT CTCGGCCCTC CTCGGACTCC TGCTCAGTCC 780 

CTATGACGGG CGCACGTGGG CAGGGGCTGG AGGTGGTGCG CTCGCCGTCG CCGCCGCTGC 840 

CGCTGAGCTG CAGCAATTCC ACCAGGTCGC TGTTGTCTCC CCTTGGCCAC CAGAGCTTCC 900 

AGTTTGACGA GGACGACGGT GACGGGGAGG ATGAGGAAGA CGTGGATGAT GAGGAAGACG 960 

TGGATGAAGA TGCCCATGAT TCAGAGGCCA AAGTGGCGAG CCTGAGAGGA ATGGAGTTAC 1020 

AGGGGTGCGC CAGCACTCAG GTTGAATCAG AAAATAACCA AGAAGAACAG AAACAGGTGC 1080 

GCTTACCAGA AAGCCGCCTG ACACCATGGG AGGTGTGGTT TATTGGCAAA GAAAAAGAAG 1140 

AACGTGACCG GCTGCAACTG AAAGCTCTAG AGGAATTAAA TCAACAACTA GAAAAAAGAA 1200 

AAGAAATGGA AGAACGTGAA AAAAGAAAGA TAATTGCTGA AGAAAAGCAC AAGGAATGGG 1260 

TTCAGAAAAA GAATGAGCAA AAAAGAAAAG AAAGAGAACA AAAAATTAAT AAAGAAATGG 1320 

AGGAAAAAGC AGCAAAGGAA CTGGAGAAAG AATACTTGCA AGAAAAAGCA AAAGAAAAAT 1380 

ATCAAGAATG GTTAAAGAAA AAAAATGCTG AAGAATGTGA GAGGAAGAAG AAAGAAAAGA 1440 

AAAACAACAG CAAGCTGAAA TACAGGAGAA AAAGGAAATA GCAGAAAAAA AGTTTCAAGA 1500 

ATGGTTGGAA AATGCGAAAC ATAAACCTCG TCCAGCTGCA AAGAGCTATG GTTATGCCAA 1560 

TGGAAAACTT ACAGGTTTTT ACAGTGGAAA TTCCTATCCA GAACCAGCCT TTTATAATCC 1620 

AATTCCGTGG AAACCAATTC ATATGCCACC TCCCAAAGAA GCTAAGGATC TATCAGGAAG 1680 

GAAGAGTAAA AGACCTGTGA TAAGTCAGCC ACACAAGTCA TCATCTCTGG TAATTCATAA 1740 

AGCCAGGAGC AATCTTTGCC TTGGAACTCT GTGCAGAATA CAAAGATAGC GTATGTGGAA 1800 

AATAACATGC TTTTATCTGG AGCTATTTAA TTTAAAAATC AGAAATTGTT TTTTACTGCT 1860 

CAGTCAATAA CTCAACACTT AATGTGATTA TTGACAAATA GCAATTTTTG CATTTGTATA 1920 

TGGAGTCCTT AGAGTTGAGG AAGATATTTT CTGGATTTTG GTTTTTATAA ACTTTTTAAG 1980 

GTTGATCTTG GCATGTTGTT TTGCAGAATA AGTGGCTGAA TATGTAAGAA TTGTGTTTGT 2040 
ATTTAGCTTG TATTAAAAGT ACACTGTAAT ACCAATAAAA CTAACAATTT TTCTTG 

Seq ID NO: 284 Protein sequence: 
Protein Accession #: Eos sequence

1 11 21 31 41 51 

I I I I I I 

MATRGLCWPG LAGLARAGPA GKARPRRGSA SLNLAGQMWA AGRWGPTFPS SYAGFSADCR 60 

PRSRPSSDSC SVPMTGARGQ GLEVVRSPSP PLPLSCSNST RSLLSPLGHQ SFQFDEDDGD 120

GEDEEDVDDE EDVDEDAHDS EAKVASLRGM ELQGCASTQV ESENNQEEQK QVRLPESRLT 180

PWEVWFIGKE KEERDRLQLK ALEELNQQLE KRKEMEEREK RKIIAEEKHK EWVQKKNEQK 240

RKEREQKINK EMEEKAAKEL EKEYLQEKAK EKYQEWLKKK NAEECERKKK EKKNNSKLKY 300

Seq ID NO: 285 DNA sequence
Nucleic Acid Accession #: Eos sequence
Coding sequence: 1-1746

1 11 21 31 41 51 

I I I I I I 

ATGCCACTGA AGCATTATCT CCTTTTGCTG GTGGGCTGCC AAGCCTGGGG TGCAGGGTTG 60 

GCCTACCATG GCTGCCCTAG CGAGTGTACC TGCTCCAGGG CCTCCCAGGT GGAGTGCACC 120

GGGGCACGCA TTGTGGCGGT GCCCACCCCT CTGCCCTGGA ACGCCATGAG CCTGCAGATC 180

CTCAACACGC ACATCACTGA ACTCAATGAG TCCCCGTTCC TCAATATCTC AGCCCTCATC 240 

GCCCTGAGGA TTGAGAAGAA TGAGCTGTCG CGCATCACGC CTGGGGCCTT CCGAAACCTG 300 

GGCTCGCTGC GCTATCTCAG CCTCGCCAAC AACAAGCTGC AGGTTCTGCC CATCGGCCTC 360 

TTCCAGGGCC TGGACAGCCT TGAGTCTCTC CTTCTGTCCA GTAACCAGCT GTTGCAGATC 420

CAGCCGGCCC ACTTCTCCCA GTGCAGCAAC CTCAAGGAGC TGCAGTTGCA CGGCAACCAC 480 

CTGGAATACA TCCCTGACGG AGCCTTCGAC CACCTGGTAG GACTCACGAA GCTCAATCTG 540 

GGCAAGAATA GCCTCACCCA CATCTCACCC AGGGTCTTCC AGCACCTGGG CAATCTCCAG 600 

GTCCTCCGGC TGTATGAGAA CAGGCTCACG GATATCCCCA TGGGCACTTT TGATGGGCTT 660 

GTTAACCTGC AGGAACTGGC TCTACAGCAG AACCAGATTG GACTGCTCTC CCCTGGTCTC 720

TTCCACAACA ACCACAACCT CCAGAGACTC TACCTGTCCA ACAACCACAT CTCCCAGCTG 780 

CCACCCAGCA TCTTCATGCA GCTGCCCCAG CTCAACCGTC TTACTCTCTT TGGGAATTCC 840 

CTGAAGGAGC TCTCTCTGGG GATCTTCGGG CCCATGCCCA ACCTGCGGGA GCTTTGGCTC 900 

TATGACAACC ACATCTCTTC TCTACCCGAC AATGTCTTCA GCAACCTCCG CCAGTTGCAG 960 

GTCCTGATTC TTAGCCGCAA TCAGATCAGC TTCATCTCCC CGGGTGCCTT CAACGGGCTA 1020

ACGGAGCTTC GGGAGCTGTC CCTCCACACC AACGCACTGC AGGACCTGGA CGGGAATGTC 1080 

TTCCGCATGT TGGCCAACCT GCAGAACATC TCCCTGCAGA ACAATCGCCT CAGACAGCTC 1140 

CCAGGGAATA TCTTCGCCAA CGTCAATGGC CTCATGGCCA TCCAGCTGCA GAACAACCAG 1200 

CTGGAGAACT TGCCCCTCGG CATCTTCGAT CACCTGGGGA AACTGTGTGA GCTGCGGCTG 1260 

TATGACAATC CCTGGAGGTG TGACTCAGAC ATCCTTCCGC TCCGCAACTG GCTCCTGCTC 1320

AACCAGCCTA GGTTAGGGAC GGACACTGTA CCTGTGTGTT TCAGCCCAGC CAATGTCCGA 1380 

GGCCAGTCCC TCATTATCAT CAATGTCAAC GTTGCTGTTC CAAGCGTCCA TGTCCCTGAG 1440 

GTGCCTAGTT ACCCAGAAAC ACCATGGTAC CCAGACACAC CCAGTTACCC TGACACCACA 1500 

TCCGTCTCTT CTACCACTGA GCTAACCAGC CCTGTGGAAG ACTACACTGA TCTGACTACC 1560 

ATTCAGGTCA CTGATGACCG CAGCGTTTGG GGCATGACCC AGGCCCAGAG CGGGCTGGCC 1620

ATTGCCGCCA TTGTAATTGG CATTGTCGCC CTGGCCTGCT CCCTGGCTGC CTGCGTCGGC 1680 

TGTTGCTGCT GCAAGAAGAG GAGCCAAGCT GTCCTGATGC AGATGAAGGC ACCCAATGAG 1740 

TGTTAAAGAG GCAGGCTGGA GCAGGGCTGG GGAATGATGG GACTGGAGGA CCTGGGAATT 1800 

TCATCTTTCT GCCTCCACCC CTGGGTCCAT GGAGCTTTCC CGTGATTGCT CTTTCTGGCC 1860 

CTAGATAAAG GTGTGCCTAC CTCTTCCTGA CTTGCCTGAT TCTCCCGTAG AGAAGCAGGT 1920

CGTGCCGGAC CTTCCTACAA TCAGGAAGAT AGATCCAACT GGCCATGGCA AAAGCCCTGG 1980 

GGATTTCCGA TTCATACCCC TGGGCTTCCT TCGAGAGGGC TCTTCCTCCA AATCCTCCCC 2040 

ACCTGTCCTC CAAGAACAGC CTTCCCTGCG CCCAGGCCCC CTCCGGGCCT CTGTAGACTC 2100 

AGTTAGTCCA CAGCCTGCTC ACTTCGTGGG AATAGTTCTC CGCTGAGATA GCCCCTCTCG 2160 

CCTAAGTATT ATGTAAGTTG ATTTCCCTTC TTTTGTTTCT CTTGTTTGTG CTATGGCTTG 2220

ACCCAGCATG TCCCCTCAAA TGAAAGTTCT CCCCTTGATT TTCTGCTCCT GAAGGCAGGG 2280 

TGAGTTCTCT CCTCAAAGAA GACTTCAAAC CATTTAACTG GTTTCTTAAG AGCCGTCAAT 2340 

CAGCCTGGTT TTGGGGATGC TATGAAAGAG AGAAGGAAAA TCATGCCGCT CAGTTCCTGG 2400 

AGACAGAAGA GCCGTCATCA GTGTCTCACT TGTGATTTTT ATCTGGAAAA GGAAGAAACA 2460 

CCCCAGCACA GCAAGCTCAG CCTTTTAGAG AAGGATATTT CCAAACTGCA AACTTTGCTT 2520

TGAAAAGTTT AGCCCTTTAA GGAATGAAAT CATGTAGAAT TTTGGACTTC TAAAAACATT 2580 

AAAATCAGCT TATTAATACG GGATAGAGAA AGAAATCTGG TGCCTGGGGG TCCCTGTGTT 2640 

CACCCCTAGA GTTTGTTTTA AAATTTTTAA TTGAAGCATG TGAAGTGTAC STGCAGAAAA 2700 

GTGGGAACAT GATAGTGTAT GGCTTGGTGG ATTTTCACAA ACTGAACATA CCTGTGTAAT 2760 

CAGCATCTAG ACCCAGACCC AGAGCATCAC AAATATCCCC CATCCTGGGC TTTTCCCAGA 2820

GGAGATGGGG GCTTCTGAAG ATGGACTTAC CTGGGACCTG CCCCCCATGA GCCAGGACGG 2880 

TCCCCCCACA GTCAGCCTGT GCAAAGGCCC CGTGGCCAGG GGTGGAGGAG AATATGTGGG 2940 

TGTGGACAGG ATGGGAGACT GTGGCCTGAA CAGGAGATTT TATTATATCT GGAGACCCTG 3000 

AGAGACCCTG AGACCTGGGG CACCATGGCT GGCCAGGTCA GAAGCATCCT GACTGCAGAG 3060 

GTCCGTGCAG CCACACCCTC TTCCCTGCCA GCAAGTTGTC TGCGGCTCAT CGGAGGCCCC 3120

TCCGCCTGGA GCCTTCTATG GACGTGATAT GCCTGTATCT GTTTTTAATT TTCATTCTTC 3180 

ACTTAGGGGA AGTGAAATCG CTCAGAGATG AGATCCTTTA ATTGAAAACG AAGTGTAACG 3240 

GAATCTAGTG TCTTTCTAAT GTGGTAAAAT TCTCCATCAA CATCACAGTC AGCTGGCAGC 3300 

TGAACTTCAG AATCTCACTT ACAGCAGGCG ACACGGGGGT ACACCGATGG GTCACACTGG 3360 

GTCTGGGGGC TCCCTGGAGC TCCTCCTGCG TGTGGTCTGG TTAGGAGTTG AGTTGTTTGC 3420

TCCAGGGTTA TTCTCCTCCT CGAGTCACAG TCACACGAAT ACCTGCCTTC TCTGGCTTTC 3480 

CTGCTATACA CATATTCACA TGGCGCTCAA GAAGTTAGGC TCATGGCAAC GTGTGTCTTT 3540 

CTCTGGACAA CTGGCCCAGT TTACAGTGAA ATGGAGAATT TCAGGTCTCC ACGTCTGCCC 3600 

AGGAAAGAAC TTCAGCTGAC TCCACGGGGA TCTGGAAATC CACGACCAAT CCCGATCGGC 3660 

TCTTATTAGC TCCCCGCTCC ACAAGACACC TGTGCTTTGG AAATCCACCA CCAATCCCGA 3720

TCGGCTCTTA TTAGCTCCCC GCTCCACAAG ACACCTGTGA TCTGGAAATC TACCACCAAT 3780 

CCCGATCGGC TCTTATTAGC TCCCCGCTCC ACAAGACACC TGTGACATCC TCCAGGGCCA 3840 

CAGGAGCACG TGCTGACCAG TTTTCCCTTC CAGTTCCTGC ACAAAAAGTG TCCAGAGGGC 3900 

TGTTTGCAAA CACTAGTGCA CTTTGTAGCT TTTCACCCTC TGTCCCAGGG AATCTAGGAG 3960 

AGATGAGGCC CGTCAGAGTC AAGAGATGTC ATCCCCCCAG GGTCTCCAAG GCATTTCCAC 4020

ACTATTGGTG GCACCTGGAG GACATGCACC AAGGCTTGCC AGAGCCAACA GGAAGTGAGC 4080 

CCAGAGCATG GCACATGAGC ATCACCCGCT GATGGTGGCC TGCTGTGCCT GGTGCCAACA 4140 

GGGGCATCCC GGCCCGTACC CCTCCAGACA GGAAGCATGG GTTTGCCCAC AGACCTGTCG 4200 

GGTGCTCCTG TGAGTGGCCT CCAGATGTCT TTGTGCATAG GCACAAGTGG GCCAGGGCTG 4260 

GAGGGAGGTG GGAAACCTCA TCATCCGGTG GGCCCTGCCA ATCTTAACCC AGAACCCTTA 4320

GGTATTCCTG GCAGTAGCCA TGACATTGGA GCACCTTCCT CTCCAGCCAG AGGCTGACCT 4380 

GAGGGCCACT GTCCTCAGAT GACACCACCC AGGAGCACCC TAGGTGAGGG GTGAGGGCCC 4440 

CCTTATGTGA ACCTCTTGCC TCTTCCTTTC TCCCATCAGA GTGGTTGGAT GGAGCCATTG 4500 

GCCTCCTTTT CTTCAGCGGG CCCTTCAACC TCTCTGCACC ATGTTGTCTG GCTGAGGAGC 4560

TACTAGAAAA GCTGAGTGGA GTCTCCTTTC CAACAGGATG ATGCATTTGC TCAATTCTCA 4620

GGGCTGGAAT GAGCCGGCTG GTCCCCCAGA AAGCTGGAGT GGGGTACAGA GTTCAGTTTT 4680 

CCTCTCTGTT TACAGCTCCT TGACAGTCCC ACGCCCATCT GGAGTGGGAG CTGGGAGTTA 4740
GTGTTGGAGA AGAAACAACA
TGCACAGATA CTCTTCAAGC 
GGTAGGAGTG COGCCTCTAC 
GGTGTTCAAT AGGCTGGGAG 
TTGTCTTGGG CTTTCGTCAT 
TTAGTCTTGG TCATCAGAAC 
GGAAAAAATA AACTCTTCCA 
TGGGCTGTAT GTATATTGTT 
AACTTTTCAT GGACACAATT 
GAACTTCCAA ACTCAGGAAG 
AGTTGGTCGA CAGATGTTAG 
GCCCCCAGAT CCCACAGTCA 
GGAAGGAAGC CATGGCTGTG 
CTCCTTCCGC CCCAGGTTTC 
CTTCATGCTG CCTTCAAAGC 
GCCCCAGTGC TTGGCGATGC 
AGCCCTGGTG GGCAGGGTTG 
GGTGTACAGA ATCAACAATA 






AAAGCCAATT 
ACTGGACGTG 
CCACTTGTGA 
TTTTATTTAT 
TAAACCAAAG 
CTCACTTGGT 
TCCCTTAAAG 
CTTCCTCCTT 
TCCACAACCT 
TTTGCAGAGA 
ATGTATCCTA 
GAACTGAATC 
GTTCAGAGAG 
TTCTTCTCTT 
TAGATCATGT 
ATTTACAGAT 



AATAATATAC 



AGAACCACTA 
GATTCTCTCT 
TGGGGTACAG 
CTCTTCAAAC 
GAAATGGAAG 
ACCATATAGA 
AATAGAATAG 
AGAATTTAGA 
TTCAGATGCT 
GCAGACAGCT 
GCTTTTAGCC 
TGCGTTGTTG 
GGTGGGCTGG 
AAGGAGAGAT 
TTGCCTTGCT 
TTCTAGGCCC 
CTTCTGCT6G 
ATGTAT 



TTTTTAAAAA 
CTAGCCCTCA 
AGGCACTTGC 
TTTGTACAAG 
CCATTCCCCT 
TCAAAAGCTT 
TTTGTCCCTC 
GATACAAGAG 
GATGTAGAGC 
AGAGATAACT 
ATAAACCACT 
GGAAGCCAGC 
CAAGCCACTT 
TGTTCTCACC 
TAGAGAATTA 
TCAGGGTTTT 
ATGCTGCTTG 



GTGCTTACTG 
GCACCCCTGC 
TCTTCTGCAT 
AGCTCATGGC 
GTTGCTCTCC 
TGTAACCACA 
TCATGGGAAT 
TTCTACTTAG 
TATTGGGAAA 
CGGGACCCAG 
CAAAGATTCA 
AGTGGCCTTG 
CCGGGGAAAA 
AACCOGCTGC 
CTGCAAATCA 
GTAGAGTGTG 
TAATCCATTT 



4800
4860
4920
4980
5040
5100
5160
5220
5280
5340
5400
5460
5520
5580
5640
5700
5760

Seq ID NO: 286 Protein sequence:
Protein Accession #: NP_570843.1

1 11 21 31 41 51

I I I I I I

MPLKHYLLLL VGCQAWGAGL AYHGCPSECT CSRASQVECT GARIVAVPTP LPWNAMSLQI 60
LNTHITELNE SPFLNISALI ALRIEKNELS RITPGAFRNL GSLRYLSLAN NKLQVLPIGL 120
FQGLDSLESL LLSSNQLLQI QPAHFSQCSN LKELQLHGNH LEYIPDGAFD HLVGLTKLNL 180
GKNSLTHISP RVFQHLGNLQ VLRLYENRLT DIPMGTFDGL VNLQELALQQ NQIGLLSPGL 240
FHNNHNLQRL YLSNNHISQL PPSIFMQLPQ LNRLTLFGNS LKELSLGIFG PMPNLRELWL 300
YDNHISSLPD NVFSNLRQLQ VLILSRNQIS FISPGAFNGL TELRELSLHT NALQDLDGNV 360
FRMLANLQNI SLQNNRLRQL PGNIFANVNG LMAIQLQNNQ LENLPLGIFD HLGKLCELRL 420
YDNPWRCDSD ILPLRNWLLL NQPRLGTDTV PVCFSPANVR GQSLIIINVN VAVPSVHVPE 480
VPSYPETPWY PDTPSYPDTT SVSSTTELTS PVEDYTDLTT IQVTDDRSVW GMTQAQSGLA 540
IAAIVIGIVA LACSLAACVG CCCCKKRSQA VLMQMKAPNE C

Seq ID NO: 287 DNA sequence
Nucleic Acid Accession #: NM_002362
Coding sequence: 1..954

1 11 21 31 41 51

I I I I I I

ATGTCTTCTG AGCAGAAGAG TCAGCACTGC AAGCCTGAGG AAGGCGTTGA GGCCCAAGAA 60
GAGGCCCTGG GCCTGGTGGG TGCACAGGCT CCTACTACTG AGGAGCAGGA GGCTGCTGTC 120
TCCTCCTCCT CTCCTCTGGT CCCTGGCACC CTGGAGGAAG TGCCTGCTGC TGAGTCAGCA 180
GGTCCTCCCC AGAGTCCTCA GGGAGCCTCT GCCTTACCCA CTACCATCAG CTTCACTTGC 240
TGGAGGCAAC CCAATGAGGG TTCCAGCAGC CAAGAAGAGG AGGGGCCAAG CACCTCGCCT 300
GACGCAGAGT CCTTGTTCCG AGAAGCACTC AGTAACAAGG TGGATGAGTT GGCTCATTTT 360
CTGCTCCGCA AGTATCGAGC CAAGGAGCTG GTCACAAAGG CAGAAATGCT GGAGAGAGTC 420
ATCAAAAATT ACAAGCGCTG CTTTCCTGTG ATCTTCGGCA AAGCCTCCGA GTCCCTGAAG 480
ATGATCTTTG GCATTGACGT GAAGGAAGTG GACCCCGCCA GCAACACCTA CACCCTTGTC 540
ACCTGCCTGG GCCTTTCCTA TGATGGCCTG CTGGGTAATA ATCAGATCTT TCCCAAGACA 600
GGCCTTCTGA TAATCGTCCT GGGCACAATT GCAATGGAGG GCGACAGCGC CTCTGAGGAG 660
GAAATCTGGG AGGAGCTGGG TGTGATGGGG GTGTATGATG GGAGGGAGCA CACTGTCTAT 720
GGGGAGCCCA GGAAACTGCT CACCCAAGAT TGGGTGCAGG AAAACTACCT GGAGTACCGG 780
CAGGTACCCG GCAGTAATCC TGCGCGCTAT GAGTTCCTGT GGGGTCCAAG GGCTCTGGCT 840
GAAACCAGCT ATGTGAAAGT CCTGGAGCAT GTGGTCAGGG TCAATGCAAG AGTTCGCATT 900
GCCTACCCAT CCCTGCGTGA AGCAGCTTTG TTAGAGGAGG AAGAGGGAGT CTGA

Seq ID NO: 288 Protein sequence:
Protein Accession #: NP_002353.1

1 11 21 31 41 51

I I I I I I

MSSEQKSQHC KPEEGVEAQE EALGLVGAQA PTTEEQEAAV SSSSPLVPGT LEEVPAAESA 60
GPPQSPQGAS ALPTTISFTC WRQPNEGSSS QEEEGPSTSP DAESLFREAL SNKVDELAHF 120
LLRKYRAKEL VTKAEMLERV IKNYKRCFPV IFGKASESLK MIFGIDVKEV DPASNTYTLV 180
TCLGLSYDGL LGNNQIFPKT GLLIIVLGTI AMEGDSASEE EIWEELGVMG VYDGREHTVY 240
GEPRKLLTQD WVQENYLEYR QVPGSNPARY EFLWGPRALA ETSYVKVLEH VVRVNARVRI 300
AYPSLREAAL LEEEEGV

Seq ID NO: 289 DNA sequence
Nucleic Acid Accession #: NM_002362
Coding sequence: 46..1344

1 11 21 31 41 51

I I I I I I

CGGCGGCCGC GCCCTGGTTG GGTCCCCACT GCTCTCGGGG GCGCCATGGA CGAGGCCGTG 60

GGCGACCTGA AGCAGGCGCT TCCCTGTGTG GCCGAGTCGC CAACGGTCCA CGTGGAGGTG 120

CATCAGCGCG GCAGCAGCAC TGCAAAGAAA GAAGACATAA ACCTGAGTGT TAGAAAGCTA 180

CTCAACAGAC ATAATATTGT GTTTGGTGAT TACACATGGA CTGAGTTTGA TGAACCTTTT 240

TTGACCAGAA ATGTGCAGTC TGTGTCTATT ATTGACACAG AATTAAAGGT TAAAGACTCA 300

CAGCCCATCG ATTTGAGTGC ATGCACTGTT GCACTTCACA TTTTCCAGCT GAATGAAGAT 360

GGCCCCAGCA GTGAAAATCT GGAGGAAGAG ACAGAAAACA TAATTGCAGC AAATCACTGG 420

GTTCTACCTG CAGCTGAATT CCATGGGCTT TGGGACAGCT TGGTATACGA TGTGGAAGTC 480

AAATCCCATC TCCTCGATTA TGTGATGACA ACTTTACTGT TTTCAGACAA GAACGTCAAC 540

AGCAACCTCA TCACCTGGAA CCGGGTGGTG CTGCTCCACG GTCCTCCTGG CACTGGAAAA 600

ACATCCCTGT GTAAAGCGTT AGCCCAGAAA TTGACAATTA GACTTTCAAG CAGGTACCGA 660 

TATGGCCAAT TAATTGAAAT AAACAGCCAC AGCCTCTTTT CTAAGTGGTT TTCGGAAAGT 720 

GGCAAGCTGG TAACCAAGAT GTTTCAGAAG ATTCAGGATT TGATTGATGA TAAAGACGCC 780 

CTGGTGTTCG TGCTGATTGA TGAGGTGGAG AGTCTCACAG CCGCCCGAAA TGCCTGCAGG 840 

GCGGGCACCG AGCCATCAGA TGCCATCCGC GTGGTCAATG CTGTCTTGAC CCAAATTGAT 900 

CAGATTAAAA GGCATTCCAA TGTTGTGATT CTGACCACTT CTAACATCAC CGAGAAGATC 960 

GACGTGGCCT TCGTGGACAG GGCTGACATC AAGCAGTACA TTGGGCCACC CTCTGCAGCA 1020 

GCCATCTTCA AAATCTACCT CTCTTGTTTG GAAGAACTGA TGAAGTGTCA GATCATATAC 1080 

CCTCGCCAGC AGCTGCTGAC CCTCCGAGAG CTAGAGATGA TTGGCTTCAT TGAAAACAAC 1140 

GTGTCAAAAT TGAGCCTTCT TTTGAATGAC ATTTCAAGGA AGAGCGAGGG CCTCAGCGGC 1200 

CGGGTCCTGA GAAAACTCCC CTTTCTGGCT CATGCGCTGT ATGTCCAGGC CCCCACCGTC 1260

ACCATAGAGG GGTTCCTCCA GGCCCTGTCT CTGGCAGTGG ACAAGCAGTT TGAAGAGAGA 1320 

AAGAAGCTTG CAGCTTACAT CTGATCCTGG GCTTCCCCAT CTGGTGCTTT TCCCATGGAG 1380 

AACACACAAC CAGTAAGTGA GGTTGCCCCA CACAGCCGTC TCCCAGGGAA TCCCTTCTGC 1440 

AAACCAAACG TTACTTAGAC TGCAAGCTAG AAAGCCACCA AGGCCAGGCT TTGTTAAAAG 1500 

AAGTGTATTC TATTTATGTT GTTTTAAAAT GCATACTGAG AGACAAACAT CTTGTCATTT 1560 

TCACTGTTTG TAAAAGATAA TTCAGATTGT TTGTCTCCTT GTGAAGAACC ATCGAAACCT 1620 

GTTTGTTCCC AGCCCACCCC CAGTGGATGG GATGCATAAT GCCAGCAAGT TTTGTTTAAC 1680 

AGCAAAAAAG GAAGATTAAT GCAGGTGTTA TAGAAGCCAG AAGAGAAACT GTGTCACCCT 1740 

AAAGAAGCAT ATAATCATAG CATTAAAAAT GCACACATTA CTCCAGGTGG AAGGTGGCAA 1800 

TTGCTTTCTG ATATCAGCTC GTTTGATTTA GTGCAAAAAT GTTTTCAAGA CTATTTAATG 1860 

GATGTAAAAA AGCCTATTTC TACATTATAC CAACTGAGAA AAAAATGGTC GGTAAAGTGT 1920 

TCTTTCATAA TAAATAATCA AGACATGGTC CCATTTGCAG GAAAAGTGCA GACTCTGAGT 1980 

GTTCCAGGGA AACACATGCT GGACATCCCT TGTAACCCGG TATGGGCGCC CCTGCATTGC 2040 

TGGGATGTTT CTGCCCACGG TTTTGTTTGT GCAATAACGT TATCACATTT CTAATGAGGA 2100 

TTCACATTAA TATAATATAA AATAAATAGG TCAGTTACTG GTCTCTTTCT GCCGAATGTT 2160 
ATGTTTTGCT TTTATCTCAC AGTAAAATAA ATATAATTAA AAA 



Seq ID NO: 290 Protein sequence: 
Protein Accession #: NP_004228

1 11 21 31 41 51 

I I I I I I 

MDEAVGDLKQ ALPCVAESPT VHVEVHQRGS STAKKEDINL SVRKLLNRHN IVFGDYTWTE 60 
FDEPFLTRNV QSVSIIDTEL KVKDSQPIDL SACTVALHIF QLNEDGPSSE NLEEETENII 120
AANHWVLPAA EFHGLWDSLV YDVEVKSHLL DYVMTTLLFS DKNVNSNLIT WNRVVLLHGP 180
PGTGKTSLCK ALAQKLTIRL SSRYRYGQLI EINSHSLFSK WFSESGKLVT KMFQKIQDLI 240
DDKDALVFVL IDEVESLTAA RNACRAGTEP SDAIRVVNAV LTQIDQIKRH SNVVILTTSN 300
ITEKIDVAFV DRADIKQYIG PPSAAAIFKI YLSCLEELMK CQIIYPRQQL LTLRELEMIG 360 
FIENNVSKLS LLLNDISRKS EGLSGRVLRK LPFLAHALYV QAPTVTIEGF LQALSLAVDK 420 
QFEERKKLAA YI 



Seq ID NO: 291 DNA sequence 

Nucleic Acid Accession #: NM_002658.1

Coding sequence: 77-1372

1 11 21 31 41 51

I I I I I I 

GTCCCCGCAG CGCCGTCGCG CCCTCCTGCC GCAGGCCACC GAGGCCGCCG CCGTCTAGCG 60 

CCCCGACCTC GCCACCATGA GAGCCCTGCT GGCGCGCCTG CTTCTCTGCG TCCTGGTCGT 120 

GAGCGACTCC AAAGGCAGCA ATGAACTTCA TCAAGTTCCA TCGAACTGTG ACTGTCTAAA 180 

TGGAGGAACA TGTGTGTCCA ACAAGTACTT CTCCAACATT CACTGGTGCA ACTGCCCAAA 240 

GAAATTCGGA GGGCAGCACT GTGAAATAGA TAAGTCAAAA ACCTGCTATG AGGGGAATGG 300 

TCACTTTTAC CGAGGAAAGG CCAGCACTGA CACCATGGGC CGGCCCTGCC TGCCCTGGAA 360 

CTCTGCCACT GTCCTTCAGC AAACGTACCA TGCCCACAGA TCTGATGCTC TTCAGCTGGG 420 

CCTGGGGAAA CATAATTACT GCAGGAACCC AGACAACCGG AGGCGACCCT GGTGCTATGT 480 

GCAGGTGGGC CTAAAGCCGC TTGTCCAAGA GTGCATGGTG CATGACTGCG CAGATGGAAA 540 

AAAGCCCTCC TCTCCTCCAG AAGAATTAAA ATTTCAGTGT GGCCAAAAGA CTCTGAGGCC 600 

CCGCTTTAAG ATTATTGGGG GAGAATTCAC CACCATCGAG AACCAGCCCT GGTTTGCGGC 660 

CATCTACAGG AGGCACCGGG GGGGCTCTGT CACCTACGTG TGTGGAGGCA GCCTCATCAG 720 

CCCTTGCTGG GTGATCAGCG CCACACACTG CTTCATTGAT TACCCAAAGA AGGAGGACTA 780 

CATCGTCTAC CTGGGTCGCT CAAGGCTTAA CTCCAACACG CAAGGGGAGA TGAAGTTTGA 840

GGTGGAAAAC CTCATCCTAC ACAAGGACTA CAGCGCTGAC ACGCTTGCTC ACCACAACGA 900 

CATTGCCTTG CTGAAGATCC GTTCCAAGGA GGGCAGGTGT GCGCAGCCAT CCCGGACTAT 960 

ACAGACCATC TGCCTGCCCT CGATGTATAA CGATCCCCAG TTTGGCACAA GCTGTGAGAT 1020 

CACTGGCTTT GGAAAAGAGA ATTCTACCGA CTATCTCTAT CCGGAGCAGC TGAAAATGAC 1080 

TGTTGTGAAG CTGATTTCCC ACCGGGAGTG TCAGCAGCCC CACTACTACG GCTCTGAAGT 1140 

CACCACCAAA ATGCTATGTG CTGCTGACCC CCAATGGAAA ACAGATTCCT GCCAGGGAGA 1200 

CTCAGGGGGA CCCCTCGTCT GTTCCCTCCA AGGCCGCATG ACTTTGACTG GAATTGTGAG 1260 

CTGGGGCCGT GGATGTGCCC TGAAGGACAA GCCAGGCGTC TACACGAGAG TCTCACACTT 1320 

CTTACCCTGG ATCCGCAGTC ACACCAAGGA AGAGAATGGC CTGGCCCTCT GAGGGTCCCC 1380 

AGGGAGGAAA CGGGCACCAC CCGCTTTCTT GCTGGTTGTC ATTTTTGCAG TAGAGTCATC 1440

TCCATCAGCT GTAAGAAGAG ACTGGGAAGA TAGGCTCTGC ACAGATGGAT TTGCCTGTGG 1500 

CACCACCAGG GTGAACGACA ATAGCTTTAC CCTCACGGAT AGGCCTGGGT GCTGGCTGCC 1560 

CAGACCCTCT GGCCAGGATG GAGGGGTGGT CCTGACTCAA CATGTTACTG ACCAGCAACT 1620 

TGTCTTTTTC TGGACTGAAG CCTGCAGGAG TTAAAAAGGG CAGGGCATCT CCTGTGCATG 1680 

GGCTCGAAGG GAGAGCCAGC TCCCCCGACC GGTGGGCATT TGTGAGGCCC ATGGTTGAGA 1740 

AATGAATAAT TTCCCAATTA GGAAGTGTAA GCAGCTGAGG TCTCTTGAGG GAGCTTAGCC 1800 

AATGTGGGAG CAGCGGTTTG GGGAGCAGAG ACACTAACGA CTTCAGGGCA GGGCTCTGAT 1860 

ATTCCATGAA TGTATCAGGA AATATATATG TGTGTGTATG TTTGCACACT TGTTGTGTGG 1920 

GCTGTGAGTG TAAGTGTGAG TAAGAGCTGG TGTCTGATTG TTAAGTCTAA ATATTTCCTT 1980 

AAACTGTGTG GACTGTGATG CCACACAGAG TGGTCTTTCT GGAGAGGTTA TAGGTCACTC 2040 

CTGGGGCCTC TTGGGTCCCC CACGTGACAG TGCCTGGGAA TGTACTTATT CTGCAGCATG 2100 

ACCTGTGACC AGCACTGTCT CAGTTTCACT TTCACATAGA TGTCCCTTTC TTGGCCAGTT 2160 

ATCCCTTCCT TTTAGCCTAG TTCATCCAAT CCTCACTGGG TGGGGTGAGG ACCACTCCTT 2220 

ACACTGAATA TTTATATTTC ACTATTTTTA TTTATATTTT TGTAATTTTA AATAAAAGTG 2280 




ATCAATAAAA TGTGATTTTT CTGA 

Seq ID NO: 292 Protein sequence: 
Protein Accession #: NP_002649.1

1 11 21 31 41 51 

I I I I I I 

MRALLARLLL CVLVVSDSKG SNELHQVPSN CDCLNGGTCV SNKYFSNIHW CNCPKKFGGQ 60

HCEIDKSKTC YEGNGHFYRG KASTDTMGRP CLPWNSATVL QQTYHAHRSD ALQLGLGKHN 120 

YCRNPDNRRR PWCYVQVGLK PLVQECMVHD CADGKKPSSP PEELKFQCGQ KTLRPRFKII 180

GGEFTTIENQ PWFAAIYRRH RGGSVTYVCG GSLISPCWVI SATHCFIDYP KKEDYIVYLG 240 

RSRLNSNTQG EMKFEVENLI LHKDYSADTL AHHNDIALLK IRSKEGRCAQ PSRTIQTICL 300

PSMYNDPQFG TSCEITGFGK ENSTDYLYPE QLKMTVVKLI SHRECQQPHY YGSEVTTKML 360

CAADPQWKTD SCQGDSGGPL VCSLQGRMTL TGIVSWGRGC ALKDKPGVYT RVSHFLPWIR 420 
SHTKEENGLA L 

Seq ID NO: 293 DNA sequence 
Nucleic Acid Accession #: NM_001498 
Coding sequence: 93.. 2006 

1 11 21 31 41 51

I I I I I I 

GGCACGAGGC TGAGTGTCCG TCTCGCGCCC GGAAGCGGGC GACCGCCGTC AGCCCGGAGG 60 

AGGAGGAGGA GGAGGAGGAG GAGGGGGCGG CCATGGGGCT GCTGTCCCAG GGCTCGCCGC 120 

TGAGCTGGGA GGAAACCAAG CGCCATGCCG ACCACGTGCG GCGGCACGGG ATCCTCCAGT 180 

TCCTGCACAT CTACCACGCC GTCAAGGACC GGCACAAGGA CGTTCTCAAG TGGGGCGATG 240 

AGGTGGAATA CATGTTGGTA TCTTTTGATC ATGAAAATAA AAAAGTCCGG TTGGTCCTGT 300 

CTGGGGAGAA AGTTCTTGAA ACTCTGCAAG AGAAGGGGGA AAGGACAAAC CCAAACCATC 360 

CTACCCTTTG GAGACCAGAG TATGGGAGTT ACATGATTGA AGGGACACCA GGACAGCCCT 420 

ACGGAGGAAC AATGTCCGAG TTCAATACAG TTGAGGCCAA CATGCGAAAA CGCCGGAAGG 480 

AGGCTACTTC TATATTAGAA GAAAATCAGG CTCTTTGCAC AATAACTTCA TTTCCCAGAT 540 

TAGGCTGTCC TGGGTTCACA CTGCCCGAGG TCAAACCCAA CCCAGTGGAA GGAGGAGCTT 600 

CCAAGTCCCT CTTCTTTCCA GATGAAGCAA TAAACAAGCA CCCTCGCTTC AGTACCTTAA 660 

CAAGAAATAT CCGACATAGG AGAGGAGAAA AGGTTGTCAT CAATGTACCA ATATTTAAGG 720 

ACAAGAATAC ACCATCTCCA TTTATAGAAA CATTTACTGA GGATGATGAA GCTTCAAGGG 780 

CTTCTAAGCC GGATCATATT TACATGGATG CCATGGGATT TGGAATGGGC AATTGCTGTC 840 

TCCAGGTGAC ATTCCAAGCC TGCAGTATAT CTGAGGCCAG ATACCTTTAT GATCAGTTGG 900 

CTACTATCTG TCCAATTGTT ATGGCTTTGA GTGCTGCATC TCCCTTTTAC CGAGGCTATG 960 

TGTCAGACAT TGATTGTCGC TGGGGAGTGA TTTCTGCATC TGTAGATGAT AGAACTCGGG 1020 

AGGAGCGAGG ACTGGAGCCA TTGAAGAACA ATAACTATAG GATCAGTAAA TCCCGATATG 1080 

ACTCAATAGA CAGCTATTTA TCTAAGTGTG GTGAGAAATA TAATGACATC GACTTGACGA 1140 

TAGATAAAGA GATCTACGAA CAGCTGTTGC AGGAAGGCAT TGATCATCTC CTGGCCCAGC 1200 

ATGTTGCTCA TCTCTTTATT AGAGACCCAC TGACACTGTT TGAAGAGAAA ATACACCTGG 1260 

ATGATGCTAA TGAGTCTGAC CATTTTGAGA ATATTCAGTC CACAAATTGG CAGACAATGA 1320 

GATTTAAGCC CCCTCCTCCA AACTCAGACA TTGGATGGAG AGTAGAATTT CGACCCATGG 1380 

AGGTGCAATT AACAGACTTT GAGAACTCTG CCTATGTGGT GTTTGTGGTA CTGCTCACCA 1440 

GAGTGATCCT TTCCTACAAA TTGGATTTTC TCATTCCACT GTCAAAGGTT GATGAGAACA 1500 

TGAAGGTAGC ACAGAAAAGA GATGCTGTCT TGCAGGGAAT GTTTTATTTC AGGAAAGATA 1560 

TTTGCAAAGG TGGCAATGCA GTGGTGGATG GTTGTGGCAA GGCCCAGAAC AGCACGGAGC 1620 

TCGCTGCAGA GGAGTACACC CTCATGAGCA TAGACACCAT CATCAATGGG AAGGAAGGTG 1680 

TGTTTCCTGG ACTGATCCCA ATTCTGAACT CTTACCTTGA AAACATGGAA GTGGATGTGG 1740 

ACACCAGATG TAGTATTCTG AACTACCTAA AGCTAATTAA GAAGAGAGCA TCTGGAGAAC 1800 

TAATGACAGT TGCCAGATGG ATGAGGGAGT TTATCGCAAA CCATCCTGAC TACAAGCAAG 1860 

ACAGTGTCAT AACTGATGAA ATGAATTATA GCCTTATTTT GAAGTGTAAC CAAATTGCAA 1920 

ATGAATTATG TGAATGCCCA GAGTTACTTG GATCAGCATT TAGGAAAGTA AAATATAGTG 1980 

GAAGTAAAAC TGACTCATCC AACTAGACAT TCTACAGAAA GAAAAATGCA TTATTGACGA 2040 

ACTGGCTACA GTACCATGCC TCTCAGCCCG TGTGTATAAT ATGAAGACCA AATGATAGAA 2100 

CTGTACTGTT TTCTGGGCCA GTGAGCCAGA AATTGATTAA GGCTTTCTTT GGTAGGTAAA 2160 

TCTAGAGTTT ATACAGTGTA CATGTACATA GTAAAGTATT TTTGATTAAC AATGTATTTT 2220 

AATAACATAT CTAAAGTCAT CATGAACTGG CTTGTACATT TTTAAATTCT TACTCTGGAG 2280 

CAACCTACTG TCTAAGCAGT TTTGTAAATG TACTGGTAAT TGTACAATAC TTGCATTCCA 2340 

GAGTTAAAAT GTTTACTGTA AATTTTTGTT CTTTTAAAGA CTACCTGGGA CCTGATTTAT 2400 

TGAAATTTTT CTCTTTAAAA ACATTTTCTC TCGTTAATTT TCCTTTGTCA TTTCCTTTGT 2460 

TGTCTACATT AAATCACTTG AATCCATTGA AAGTGCTTCA AGGGTAATCT TGGGTTTCTA 2520 

GCACCTTATC TATGATGTTT CTTTTGCAAT TGGAATAATC ACTTGGTCAC CTTGCCCCAA 2580 
GCTTTCCCCT CTGAATAAAT ACCCATTGAA CTCTGAAAAA AAAAAAAAAA AAAA 



Seq ID NO: 294 Protein sequence:
Protein Accession #: NP_001489



1 11 21 31 41 51 

I I I I I I 

MGLLSQGSPL SWEETKRHAD HVRRHGILQF LHIYHAVKDR HKDVLKWGDE VEYMLVSFDH 60

ENKKVRLVLS GEKVLETLQE KGERTNPNHP TLWRPEYGSY MIEGTPGQPY GGTMSEFNTV 120 

EANMRKRRKE ATSILEENQA LCTITSFPRL GCPGFTLPEV KPNPVEGGAS KSLFFPDEAI 180 

NKHPRFSTLT RNIRHRRGEK VVINVPIFKD KNTPSPFIET FTEDDEASRA SKPDHIYMDA 240

MGFGMGNCCL QVTFQACSIS EARYLYDQLA TICPIVMALS AASPFYRGYV SDIDCRWGVI 300 

SASVDDRTRE ERGLEPLKNN NYRISKSRYD SIDSYLSKCG EKYNDIDLTI DKEIYEQLLQ 360 

EGIDHLLAQH VAHLFIRDPL TLFEEKIHLD DANESDHFEN IQSTNWQTMR FKPPPPNSDI 420 

GWRVEFRPME VQLTDFENSA YVVFVVLLTR VILSYKLDFL IPLSKVDENM KVAQKRDAVL 480

QGMFYFRKDI CKGGNAVVDG CGKAQNSTEL AAEEYTLMSI DTIINGKEGV FPGLIPILNS 540

YLENMEVDVD TRCSILNYLK LIKKRASGEL MTVARWMREF IANHPDYKQD SVITDEMNYS 600 
LILKCNQIAN ELCECPELLG SAFRKVKYSG SKTDSSN

Seq ID NO: 295 DNA sequence

Nucleic Acid Accession #: Eos sequence

Coding sequence: 247-816

1 11 21 31 41 51

I I I I I I

AGTGTTCGGC TGGGGCAGGC ACGCTGTGGC TGGCTACTTC CCTTCCTCCC ATCCCCCTTG 60
GGCCAAACGG GATCGGTGCT TCTGGTGAGA CGCCTCCCCA TGCACATCAC TCCCAGGTGC 120
CCTAGGGGGC ACATTTCCCA CAACTCCCAG AGGGCAGGTT TCTAGAAAGT GCCACCAGTG 180
GGGAGGCGCC ACAACTTCAC TGCCATTTTG TGAGGTGCCG CCGTCTCTCC TCCAGCAAGG 240
GAAACAATGA CCGATAAAAC AGAGAAGGTG GCTGTAGATC CTGAAACTGT GTTTAAACGT 300
CCCAGGGAAT GTGACAGTCC TTCGTATCAG AAAAGGCAGA GGATGGCCCT GTTGGCAAGG 360
AAACAAGGAG CAGGAGACAG CCTTATTGCA GGCTCTGCCA TGTCCAAAGA AAAGAAGCTT 420
ATGACAGGAC ATGCTATTCC ACCCAGCCAA TTGGATTCTC AGATTGATGA CTTCACTGGT 480
TTCAGCAAAG ATAGGATGAT GCAGAAACCT GGTAGCAATG CACCTGTGGG AGGAAACGTT 540
ACCAGCAGTT TCTCTGGAGA TGACCTAGAA TGCAGAGAAA CAGCCTCCTC TCCCAAAAGC 600
CAACGAGAAA TTAATGCTGA TATAAAACGT AAATTAGTGA AGGAACTCCG ATGCGTTGGA 660
CAAAAATATG AAAAAATCTT CGAAATGCTT GAAGGAGTGC AAGGACCTAC TGCAGTCAGG 720
AAGCGATTTT TTGAATCCAT CATCAAGGAA GCAGCAAGAT GTATGAGACG AGACTTTGTT 780
AAGCACCTTA AGAAGAAACT GAAACGTATG ATTTGAGAAT ACTTGTCCCT GGAGGATTAT 840
CACACCCCAA ATGCATAATC TCGTTAATGA TTGAGGAGAG AAAAGGATCA GATTGCTGTT 900
TTCTACAATG GAGCAGGATA TTGCTGAAGT CTCCTGGCAT ATGTTACCGA ATCAAATAGC 960
CTTCCAGAGG CTAAGAAATT TCTGTTAGTA AAAGATGTTC TTTTTCCCAA AGCATTTTAT 1020
TTGAAAGGAT AACTTGTGTT TTGGTTATTT TGTATTCCCA CCTGTGCTGG TAGATATTAT 1080
TAACCCATTA GGTAAATACT ATTACAGTCG TGGTTTCTGC A

Seq ID NO: 296 Protein sequence:
Protein Accession #: Eos sequence

1 11 21 31 41 51

I I I I I I

MTDKTEKVAV DPETVFKRPR ECDSPSYQKR QRMALLARKQ GAGDSLIAGS AMSKEKKLMT 60
GHAIPPSQLD SQIDDFTGFS KDRMMQKPGS NAPVGGNVTS SFSGDDLECR ETASSPKSQR 120
EINADIKRKL VKELRCVGQK YEKIFEMLEG VQGPTAVRKR FFESIIKEAA RCMRRDFVKH 180
LKKKLKRMI



Seq ID NO: 297 DNA sequence

Nucleic Acid Accession #: Eos sequence

Coding sequence: 247-815

1 11 21 31 41 51

I I I I I I

AGTGTTCGGC TGGGGCAGGC ACGCTGTGGC TGGCTACTTC CCTTCCTCCC ATCCCCCTTG 60
GGCCAAACGG GATCGGTGCT TCTGGTGAGA CGCCTCCCCA TGCACATCAC TCCCAGGTGC 120
CCTAGGGGGC ACATTTCCCA CAACTCCCAG AGGGCAGGTT TCTAGAAAGT GCCACCAGTG 180
GGGAGGCGCC ACAACTTCAC TGCCATTTTG TGAGGTGCCG CCGTCTCTCC TCCAGCAAGG 240
GAAACAATGA CCGATAAAAC AGAGAAGGTG GCTGTAGATC CTGAAACTGT GTTTAAACGT 300
CCCAGGGAAT GTGACAGTCC TTCGTATCAG AAAAGGCAGA GGATGGCCCT GTTGGCAAGG 360
AAACAAGGAG CAGGAGACAG CCTTATTGCA GGCTCTGCCA TGTCCAAAGA AAAGAAGCTT 420
ATGACAGGAC ATGCTATTCC ACCCAGCCAA TTGGATTCTC AGATTGATGA CTTCACTGGT 480
TTCAGCAAAG ATAGGATGAT GCAGAAACCT GGTAGCAATG CACCTGTGGG AGGAAACGTT 540
ACCAGCAGTT TCTCTGGAGA TGACCTAGAA TGCAGAGAAA CAGCCTCCTC TCCCAAAAGC 600
CAACAAGAAA TTAATGCTGA TATAAAACGT AAATTAGTGA AGGAACTCCG ATGCGTTGGA 660
CAAAAATATG AAAAAATCTT CGAAATGCTT GAAGGAGTGC AAGGACCTAC TGCAGTCAGG 720
AAACGATTTT TTGAATCCAT CATCAAGGAA GCAGCAAGAT GTATGAGACG AGACTTTGTT 780
AAGCACCTTA AGAAGAAACT GAAACGTATG ATTTGAGAAT ACTTGTCCCT GGAGGATTAT 840
CACACCCCAA ATGCATAATC TCATTAATGA TTGAGGAGAG AAAAGGATCA GATTGCTGTT 900
TTCTACAATG GAGCAGGATA TTGCTGAAGT CTCCTGGCAT ATGTTACCGA ATCAACTGGC 960
CTTCCAGAGG CTAAGAAATT TCTGTTAGTA AAAGATGTTC TTTTTCCCAA AGCGTTTTAT 1020
TTGAAAGGAT AACTTGTGTT TTGGTTATTT TGTATTCCCA CCTGTGCTGG TAGATATTAT 1080
TAACCCATTA GGTAAATACT ATTACAGTCG TGGTTTCTGC A

Seq ID NO: 298 Protein sequence:
Protein Accession #: Eos sequence

1 11 21 31 41 51

I I I I I I

MTDKTEKVAV DPETVFKRPR ECDSPSYQKR QRMALLARKQ GAGDSLIAGS AMSKEKKLMT 60

GHAIPPSQLD SQIDDFTGFS KDRMMQKPGS NAPVGGNVTS SFSGDDLECR ETASSPKSQQ 120

EINADIKRKL VKELRCVGQK YEKIFEMLEG VQGPTAVRKR FFESIIKEAA RCMRRDFVKH 180
LKKKLKRMI



Seq ID NO: 299 DNA sequence

Nucleic Acid Accession #: Eos sequence

Coding sequence: 247-815

1 11 21 31 41 51

I I I I I I

AGTGTTCGGC TGGGGCAGGC ACGCTGTGGC TGGCTACTTC CCTTCCTCCC ATCCCCCTTG 60
GGCCAAACGG GATCGGTGCT TCTGGTGAGA CGCCTCCCCA TGCACATCAC TCCCAGGTGC 120
CCTAGGGGGC ACATTTCCCA CAACTCCCAG AGGGCAGGTT TCTAGAAAGT GCCACCAGTG 180
GGGAGGCGCC ACAACTTCAC TGCCATTTTG TGAGGTGCCG CCGTCTCTCC TCCAGCAAGG 240
GAAACAATGA CCGATAAAAC AGAGAAGGTG GCTGTAGATC CTGAAACTGT GTTTAAACGT 300
CCCAGGGAAT GTGACAGTCC TTCGTATCAG AAAAGGCAGA GGATGGCCCT GTTGGCAAGG 360
AAACAAGGAG CAGGAGACAG CCTTATTGCA GGCTCTGCCA TGTCCAAAGC AAAGAGCTTA 420
TGACAGGACA TGCTATTCCA CCCAGCCAAT TGGATTCTCA GATTGATGAC TTCACTGGTT 480
TCAGCAAAGA TAGGATGATG CAGAAACCTG GTAGCAATGC ACCTGTGGGA GGAAACGTTA
CCAGCAGTTT CTCTGGAGAT GACCTAGAAT GCAGAGAAAC AGCCTCCTCT CCCAAAAGCC
AACAAGAAAT TAATGCTGAT ATAAAACGTA AATTAGTGAA GGAACTCCGA TGCGTTGGAC
AAAAATATGA AAAAATCTTC GAAATGCTTG AAGGAGTGCA AGGACCTACT GCAGTCAGGA
AACGATTTTT TGAATCCATC ATCAAGGAAG CAGCAAGATG TATGAGACGA GACTTTGTTA
AGCACCTTAA GAAGAAACTG AAACGTATGA TTTGAGAATA CTTGTCCCTG GAGGATTATC
ACACCCCAAA TGCATAATCT CATTAATGAT TGAGGAGAGA AAAGGATCAG ATTGCTGTTT
TCTACAATGG AGCAGGATAT TGCTGAAGTC TCCTGGCATA TGTTACCGAA TCAACTGGCC
TTCCAGAGGC TAAGAAATTT CTGTTAGTAA AAGATGTTCT TTTTCCCAAA GCGTTTTATT
TGAAAGGATA ACTTGTGTTT TGGTTATTTT GTATTCCCAC CTGTGCTGGT AGATATTATT
AACCCATTAG GTAAATACTA TTACAGTCGT GGTTTCTGCA



Seq ID NO: 300 Protein sequence: 
Protein Accession #: Eos sequence

1 11 21 31 41 51

I I I I I I

MTDKTEKVAV DPETVFKRPR ECDSPSYQKR QRMALLARKQ GAGDSLIAGS AMSKAKKLMT
GHAIPPSQLD SQIDDFTGFS KDSMMQKPGS NAPVGGNVTS SFSGDDLECR ETASSPKSQQ
EINADIKRKL VKELRCVGQK YEKIFEMLEG VQGPTAVRKR FFESIIKEAA RCMRRDFVKH
LKKKLKRMI 

Seq ID NO: 301 DNA sequence

Nucleic Acid Accession #: Eos sequence

Coding sequence: 247-812

1 11 21 31 41 51

I I I I I I

AGTGTTCGGC TGGGGCAGGC ACGCTGTGGC TGGCTACTTC CCTTCCTCCC ATCCCCCTTG
GGCCAAACGG GATCGGTGCT TCTGGTGAGA CGCCTCCCCA TGCACATCAC TCCCAGGTGC
CCTAGGGGGC ACATTTCCCA CAACTCCCAG AGGGCAGGTT TCTAGAAAGT GCCACCAGTG
GGGAGGCGCC ACAACTTCAC TGCCATTTTG TGAGGTGCCG CCGTCTCTCC TCCAGCAAGG
GAAACAATGA CCGATAAAAC AGAGAAGGTG GCTGTAGATC CTGAAACTGT GTTTAAACGT
CCCAGGGAAT GTGACAGTCC TTCGTATCAG AAAAGGCAGA GGATGGCCCT GTTGGCAAGG
AAACAAGGAG CAGGAGACAG CCTTATTGCA GGCTCTGCCA TGTCCAAAGA AAAGAGCTTA
TGACAGGACA TGCTATTCCA CCCAGCCAAT TGGATTCTCA GATTGATGAC TTCACTGGTT
TCAGCAAAGA TGGGATGATG CAGAAACCTG GTAGCAATGC ACCTGTGGGA GGAAATGTTA
CCAGCAATTT CTCTGGAGAT GACCTAGAAT GCAGAGGAAT AGCCTCCTCT CCCAAAAGCC
AACAAGAAAT TAATGCTGAT ATAAAATGTC AAGTAGTGAA GGAAATCCGA TGCCTTGGAC
AATATGAAAA AATCTTCGAA ATGCTTGAAG GAGTGCAAGG ACCTACTGCA GTCAGGAAAC
GATTTTTTGA ATCCATCATC AAGGAAGCAG CAAGATGTAT GAGACGAGAC TTTGTTAAGC
ACCTTAAGAA GAAACTGAAA CGTATGATTT GAGAATACTT GTCCCTGGAG GATTATCACA
CCCCAAATGC ATAATCTCAT TAATGATTGA GGAGAGAAAA GGATCAGATT GCTGTTTTCT
ACAATGGAGC AGGATATTGC TGAAGTCTCC TGGCATATGT TACCGAATCA ACTGGCCTTC
CAGAGGCTAA GAAATTTCTG TTAGTAAAAG ATGTTCTTTT TCCCAAAGCG TTTTATTTGA
AAGGATAACT TGTGTTTTGG TTATTTTGTA TTCCCACCTG TGCTGGTAGA TATTATTAAC
CCATTAGGTA AATACTATTA CAGTCGTGGT TTCTGCA



Seq ID NO: 302 Protein sequence: 
Protein Accession #: Eos sequence 



1 11 21 31 41 51

I I I I I I

MTDKTEKVAV DPETVFKRPR ECDSPSYQKR QRMALLARKQ GAGDSLIAGS AMSKEKKLMT 60
GHAIPPSQLD SQIDDFTGFS KDGMMQKPGS NAPVGGNVTS NFSGDDLECR GIASSPKSQQ 120
EINADIKCQV VKEIRCLGQY EKIFEMLEGV QGPTAVRKRF FESIIKEAAR CMRRDFVKHL 180
KKKLKRMI

Seq ID NO: 303 DNA sequence 

Nucleic Acid Accession #: Eos sequence

Coding sequence: 247-815

1 11 21 31 41 51

I I I I I I

AGTGTTCGGC TGGGACAGGC ACGCTGTGGC TGGCTACTTC CCTTCCTTCC ATCCCCCTTG 60
GGCCAAACAG GATCGGTGCT TCTGGTGAGA CGTCTCCCCA TGCACATCAC TCCCAGATGC 120
CCTAGGGGGC ACATTTCCCA CAACTCCCAG AGGGCAGGTT TCTAGAAAGT GCCACCAGTG 180
GGGAGGCGCC ACAACTTCAC TGCCATTTTG TGAGGTGCCG CCGTCTCTCC TCCAGCAAGG 240
GAAACAATGA CCGATAAAAC AGAGAAGGTG GCTGTAGATC CTGAAACTGT GTTTAAACGT 300
CCCAGGGAAT GTGACAGTCC TTCGTATCAG AAAAGGCAGA GGATGGCCCT GTTGGCAAGG 360
AAACAAGGAG CAGGAGACAG CCTTATTGCA GGCTCTGCCA TGTCCAAAGC AAAGAGCTTA 420
TGACAGGACA TGCTATTCCA CCCAGCCAAT TGGATTCTCA GATTGATGAC TTCACTGGTT 480
TCAGCAAAGA TAGGATGATG CAGAAACCTG GTAGCAATGC ACCTGTGGGA GGAAACGTTA 540
CCAGCAGTTT CTCTGGAGAT GACCTAGAAT GCAGAGAAAC AGCCTCCTCT CCCAAAAGCC 600
AACAAGAAAT TAATGCTGAT ATAAAACGTA AATTAGTGAA GGAACTCCGA TGCGTTGGAC 660
AAAAATATGA AAAAATCTTC GAAATGCTTG AAGGAGTGCA AGGACCTACT GCAGTCAGGA 720
AACGATTTTT TGAATCCATC ATCAAGGAAG CAGCAAGATG TATGAGACGA GACTTTGTTA 780
AGCACCTTAA GAAGAAACTG AAACGTATGA TTTGAGAATA CTTGTCCCTG GAGGATTATC 840
ACACCCCAAA TGCATAATCT CGTTAATGAT TGAGGAGAGA AAAGGATCAG ATTGCTGTTT 900
TCTACAATGG AGCAGGATAT TGCTGAAGTC TCCTGGCATA TGTTACCGAA TCAACTGGCC 960
TTCCAGAGGC TAAGAAATTT CTGTTAGTAA AAGATGTTCT TTTTCCCAAA GCGTTTTATT 1020
TGAAAGGATA ACTTGTGTTT TGGTTATTTT GTATTCCCAC CTGTGCTGGT AGATATTATT 1080
AACCCATTAG GTAAATACTA TTACAGTCGT GGTTTCTGCA






PCT/US02/12476 






Seq ID NO: 304 Protein sequence: 
Protein Accession #: Eos sequence

1 11 21 31 41 51

I I I I I I

MTDKTEKVAV DPETVFKRPR ECDSPSYQKR QRMALLARKQ GAGDSLIAGS AMSKAKKLMT 60
GHAIPPSQLD SQIDDFTGFS KDRMMQKPGS NAPVGGNVTS SFSGDDLECR ETASSPKSQQ 120
EINADIKRKL VKELRCVGQK YEKIFEMLEG VQGPTAVRKR FFESIIKEAA RCMRRDFVKH 180
LKKKLKRMI

Seq ID NO: 305 DNA sequence 

Nucleic Acid Accession #: Eos sequence

Coding sequence: 87-689 






CGTGGAGGCA 
CCAGACTAGC 
AGATGTCCGC 
CAGAGGTCCC 
TGTCCGGGAA 
ATCGGGAAAT 
CTCCCAAAAG 
AATCCACAAA 
ATAATTTAAA 
AGTATGAGAA 
CTGCTAAAGT 
AGGAGGAGGA 
TTAGAGTAGG 
ATTAGGTTTA 
AATTGTCAGT 
AAGTTGTACA 
CTGTGCACTT 
ATTTGTAAGG 
TATCTATAGT 
GCGTTGAGGC 
GAGGCTGGAC 
GTATATAGTG 
CATGAGAATA 
TATAGAACTC 
CTCCTGTACT 
GAAATGTTTT 
AGTCAATTTC 
CTCCCTATAA 
TGAAGGAGAG 
ATGAAGTCTG 
AGGAAGGTGG 
CCTATTTTGT 
AAATTAAGGC 
ACATTATTTG 
AGTGAGGGTA 
TTGGAAACAC 
AGGGCAGGCT 
GCCTGCTCAT 
GAAGGAGCTT 
GTTGGGGTGA 
CTGATGTGTA 
GTGAGTGTTG 
CGCAGGAGTG 
GTTGAGAAAC 
TACGAGTTAT 
CATCAGAACT 
AAAACTCGGT 
GCCATGTCCT 
CTAGGCCAAG 
GGTCAAAAGG 
CAGGGAAGGG 
GTGGGGGAGC 
TGTCAAGTTG 
CACACTGTGG 
AAGTGGCATG 
TTCTTAAGAT 
CTACTCCCTC 
AGGCTGTGAT 
GTATTTGGGG 



11 
I 

GCTAGCGCGA 
GAACAATACA 
TTATGCCTTC 
TGTCAATTTT 
AGAGAAATCT 
GAAGGATTAT 
GCCACCGTCT 
CCCCGGCATC 
TGACAGTGAA 
GGATGTTGCT 
TGCCCGGAAA 
GGAGGAGGAG 
GGAGCGCCGT 
ATTACAAAAT 
GGTTTACATG 
TATTTCCAAA 
TGCTGTTGGT 
TGGTGGTAAC 
TTGTAAAAAG 
TGTGGGGAAG
CTGTTGACTC 
ACATAGCATT 

TTCATTGTCA 
TAAACACGAT 
TGAAGTTAAA 
TGACTCACAG 
ATGTGGTAGC 
GGCTACTTGA 
GAGGAGTTAG 
GTGATTAGGA 
GGGGCCAAAT 
CTTATTGTTT 
TGGTGCCCAA 
TGTGGGATGG 
CAAACACCCC 
AATGGAATCA 
AAGTTTAGCT 
GGTTTGTGTG 
GGGGAGATGG 
TATACATCAT 
CTATTGCCCA 
TTTTTGTGCT 
TTGCATGTCT 
GGTCACGGTC 
GTTTGTCCTG 
TGTGAGGTTT 
TGTCACTTGG 
ATTCGGGAGC 
GAGTGATTTG 
CCAAGGATGG 
AGTTTAGCCA 
AGGCCACTTG 
CAAGATTGCT 
ATGTTACCTA 
GCCAACCTGT 
TAACCACCTC 
GTAGTGACTA 
ACGTTGGATG 



21 

I 

GGCTGGGGAG 
GTCAGGATGG 
TTTGTGCAGA 
GCGGAATTTT 
AAATTTGATG 
GGACCAGCTA 
GGATTCTTCC 
TCTATTGGAG 
AAGCAGCCTT 
GACTATAAGT 
AAGGTGGAAG 
GATGAATAAA 
AATTGACACA 
TTGATCACGA 
AAGTGGCCAT 
CATTTTTAAA 
GTGACAAGGC 
TATGGTTATT 
AACAAAACAA 
ATGCCTTTTG 
TGCAGGGGGC 
CTGCTGCC^T 
TAAGTGCGGT 
GCAAAGCAAA 
TCGCAACGTT 
TAAACAGTAT 
CAGTGAACAA 
TTCTTTTATT 
AGCTACTGTG 
GAGAACGACA 
CTGAGGCTAT 
GCATTGCTAA 
TTCTCTTTCA 
CATTTGGGGT 



AAGGAAGATG 
ACCATTTCTG 
CATTCACTGG 
TCAGTGGTTA 
CCACAGTAGC 
TACTGTCCGT 
GCATTAATAT 
ATTAATTTTA 
GGAGGCGGTG 
ACAGCCTGAT 
AATGTGTTCC 
GCCCAGAGGC 
CATTCTAAGC 
TGTTGCCAGC 
TTAAGTGGTG 
AAAGGGGTAA 
GATGATCTTT 
GTCCATTAGC 
CTTCTAGTGG 
AGGCTTAGGC 
TGCTTTTTTT 
ACCCCATTCT 
TTGTCTGTGT 
CATTCATTTT 



31 

I 

CGCTGAGCCG 
CTAAAGGTGA 
CATGCAGAGA 
CCAAGAAGTG 
AAATGGCAAA 
AGGGAGGCAA 
TGTTCTGTTC 
ACGTGGCAAA 
ACATCACTAA 
CGAAAGGAAA 
AGGAAGATGA 
GAAACTGTTT 
TCTCTTATTT 
TCATATTGTA 
GGGTGTCTGG 
ATGAAAAGGC 
ATTTAAAGAT 
GGCTAGAAAT 
CCGAGACAAA 
GGAGAGGCTG 
ATCCATTTAG 
CTTAGCTGTG 
AGTTTTTAAA 
GAGTCACTGC 
CTGTTATTTT 
TACATTTTTA 
ACCCCCACTC 
ACTCAGTGGC 
TGATTTTGTT 
TAGGCAAGGT 
CTAGGTTTAA 
ACAGCAATTT 
CCCCTACCCC 
CTTGAGCCTG 
TAGGGGACGG 
ATAGGCTCCA 
AGCACTAAAT 
AAATGTAGAT 
TATTAGTGGG 
AAGTGGTGAC 
AGCAATGAAG 
TTGGGTGTGT 
AGAGAAAGCA 
TCCTCTCCGC 
CTCTTATGTG 
TCTAGTTCTA 
ACTTGTTCCA 
TAAAGCTTTA 
CTCGTCAAAT 
CGCGTCTATC 
CTTTTGTGCT 
GATTAGGCAA 
TGGGGCAGCA 
AATAATGCCC 
TTAGCTTGAT 
TTTTTTTTCC 
TGAATGACAT 
CTCCTGTGTG 
CTGTAATAAA 



41 
I 

CGCGTCGTGC 
CCCCAAGAAA 
AGAACATAAG 
CTCTGAGAGG 
GGCAGATAAA 
GAAGAAGAAG 
AGAATTCCGC 
AAAGCTGGGT 
GGCGGCAAAG 
GTTTGATGGT 
AGAAGAGGAG 
ATCTGTCTCC 
GAGAAGTGTC 
GTCTCTCAAA 
AGCACCCTGA 
ACTCTCGTGT 
GTTTCTGGCA 
CCTGAGTTTT 
CCCTTGATGC 
TAGCTCAGGG 
CTTCAGGTTG 
GACAAAGGGG 
CTGTTTGTTT 
ATCAATGAAA 
TTTTGTATGT 
AAACTCTTCT 
CATTGTATTT 
CAGCTCACTT 
TGTGTCTGAG 
TCAGCAGCCT 
CTTTTGTCCC 
CAGAGTGTAT 
CCGTGCTCCT 
CTGCTGGTCT 
TATCCTTTTT 
TCTTGGGCCA 
GTATCATGAA 
TGATGTTCAA 
TAGTGTAACA 
ACTAAATACC 
GATACAGTAC 
ATGTTTGAGG 
GCTTTTTCTT 
CCTGTCGGGT 
TTCATAGCCA 
GAAAATGACC 
GAATTTCCCC 
GCTTCCCAAT 
ATGGAAGAGA 
TCATAACTAG 
TCCAAAGTAG 
ACATTGAGTT 
AGATCACTAC 
TAGTTTCTCT 
TTCTGGGCCC 
CCCATTTAAA 
TTTATCCTTC 



51 

I 

CCTGCGCTGC 
CCAAAGGGCA 
AAGAAAAACC 
TGGAAGACGA 
GTGCGCTATG 
GATCCTAATG 
CCCAAGATCA 
GAGATGTGGA 
CTGAAGGAGA 
GCAAAGGGTC 
GAGGAAGAAG 
TTGTGAATAC 
TGTTGCCCTC 
GTGCTCTAGA 
AACTGTATCA 
TCTCCTCACT 
TTTTCTTTTT 
CAACTGTATA 
TCCTTGCTCG 
CGTGCACTGT 
TCTTGTTTCT 
GGTCAGCTGG 
TTAAACAAAC 
GTTCAAGAAC 
TTAGAATGCT 
CTATTATAAC 
GGAGACTGGC 
AGGGCTGAGA 
TGGCATTCAG 
TCCAAGGTAT 
ACCTCCACCC 
GGTGTGTCAA 
GGCACATATC 
CCTGGATGCC 
TTGCTCCTAC 
CCTGAGCTAT 
AAGTTGAATG 
TGTTAAACTG 
TTTTATCCAG 
ATTTTGAAGG 
TGTGTTGTGG 
CTATGAAACA 
AAAATTCACT 
CCTGGATGAG 
TTCGCTCTCC 
ACTAATTTAA 
TCCTGCTTCA 
TCGTGATGTG 
AACAACCTGC 
ATGTACCAAC 
CTAAGCAGAA 
TTAAAGAGGC 
TCAACGTTTT 
GAGATGATGT 
ACTGTCTGTG 
AGGATAGTAC 
GGAAAGAACA 
TGTCACAAAT 



Seq ID NO: 306 Protein sequence: 
Protein Accession #: NP_005333.1

1 11 21 31 41 51

I I I I I I

MAKGDPKKPK GKMSAYAFFV QTCREEHKKK NPEVPVNFAE FSKKCSERWK TMSGKEKSKF 60
DEMAKADKVR YDREMKDYGP AKGGKKKKDP NAPKRPPSGF FLFCSEFRPK IKSTNPGISI 120
GDVAKKLGEM WNNLNDSEKQ PYITKAAKLK EKYEKDVADY KSKGKFDGAK GPAKVARKKV 180



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 






Seq ID NO: 307 DNA sequence 
Nucleic Acid Accession #: NM_022342
Coding sequence: 1..2178 






1 11 21 31 41 51 

I I I I I I 

ATGGGTACTA GGAAAAAAGT TCATGCATTT GTCCGTGTCA AACCCACCGA TGACTTTGCT 60 

CATGAAATGA TCAGATACGG AGATGACAAA AGAAGCATTG ATATTCACTT AAAAAAAGAC 120 

ATTCGGAGAG GAGTTGTCAA TAACCAACAG ACAGACTGGT CGTTTAAGTT GGATGGAGTT 180 

TTCACGATG CCTCCCAGGA CTTGGTTTAT GAGACAGTTG CAAAGGATGT GGTTTCTCAG 240 

CCCTCGATG GCTATAATGG CACCATCATG TGTTATGGGC AGACGGGAGC TGGCAAGACA 300 

ACACCATGA TGGGGGCAAC TGAGAATTAC AAGCACCGGG GGATCCTCCC TCGTGCCCTG 360 

AGCAGGTTT TTAGGATGAT OGAAGAAOGC CCCACACATG CCATCACTGT GCGTGTTTCC 420 

ACTTGGAAA TCTATAATGA GAGCCTGTTT GATCTCCTGT CCACTCTGCC CTATGTTGGA 480 

CCTCAGTCA CACCAATGAC CATCGTGGAA AACCCTCAAG GAGTCTTCAT TAAGGGCTTG 540 

CAGTTCACC TCACAAGTCA GGAGGAGGAT GCATTCAGCC TCCTTTTTGA GGGTGAGACC 600 

ACAGGATTA TAGCCTCCCA CACTATGAAC AAAAACTCTT CCAGATCACA CTGCATTTTC 660 

CCATCTACT TAGAGGCCCA TTCCCGGACC TTATCAGAGG AAAAGTACAT CACTTCCAAA 720 

TTAACTTGG TGGATCTGGC AGGCTCAGAG AGGCTGGGGA AGTCTGGGTC TGAGGGCCAA 780 

TCCTGAAGG AAGCCACCTA CATCAACAAA TCGCTCTCAT TCCTGGAGCA GGCCATCATT 840 

CCCTTGGGG ACCAGAAGCG GGACCACATC CCCTTTCGGC AGTGCAAGCT CACCCACGCT 900 

TGAAGGACT CGTTAGGGGG AAACTGCAAT ATGGTCCTCG TGACAAACAT CTATGGAGAA 960 

CTGCCCAGT TAGAAGAAAC GCTATCTTGA CTGAGATTTG CCAGCAGGAT GAAGCTAGTC 1020 

CCACTGAGC CTGCCATCAA TGAAAAGTAT GATGCTGAGA GAATGGTCAA GAACCTGGAG 1080 

AGGAACTAG CACTACTCAA GCAGGAGCTG GCTATCCATG ACAGCCTGAC CAACCGCACC 1140 

TTGTGACCT ATGACCCCAT GGATGAAATC CAGATTGCTG AGATCAACTC CCAGGTGCGG 1200 

GGTACCTGG AGGGGACACT GGACGAGATC GACATAATCA GCCTTAGACA GATCAAGGAG 1260

TGTTCAACC AGTTCCGGGT GGTTCTGAGC CAACAGGAAC AGGAAGTGGA GTCCACTTTG 1320 

GCAGGAAGT ACACCCTCAT TGACAGGAAT GACTTTGCAG CCATTTCTGC TATCCAGAAG 1380 

CGGGGCTTG TGGATGTTGA TGGCCACCTA GTGGGTGAGC CTGAAGGACA AAACTTTGGA 1440 

TCGGAGTOG CCCCTTTCTC TACCAAACCT GGGAAGAAAG CCAAGTCCAA GAAGACATTC 1500 

AAGAGCCAC TCAGGCCCGA CACCCCACCC TCCAAACCAG TGGCCTTTGA GGAGTTTAAG 1560 

ATGAGCAAG GTAGTGAGAT CAACCGAATT TTCAAAGAAA ACAAATCCAT CTTGAATGAA 1620 

GGAGGAAAA GGGCCAGCGA GACCACACAG CACATCAATG CCATCAAGCG GGAGATTGAT 1680 

TGACCAAGG AGGCCCTGAA TTTCCAGAAG TCACTACGGG AGAAGCAAGG CAAGTACGAA 1740 

ACAAGGGGC TGATGATCAT CGATGAGGAA GAATTCCTGC TGATCCTCAA GCTCAAAGAC 1800 

TCAAGAAGC AGTACCGCAG CGAGTACCAG GACCTGCGTG ACCTCAGGGC TGAGATCCAG 1860 

ATTGCCAGC ACCTAGTGGA TCAGTGTCGC CACCGCCTGC TCATGGAATT TGACATCTGG 1920 

ACAATGAGT CCTTTGTCAT CCCTGAGGAC ATGCAGATGG CACTGAAGCC AGGCGGCAGC 1980 

TCCGGCCAG GCATGGTCCC TGTGAACAGG ATTGTGTCTC TGGGAGAAGA TGACCAGGAC 2040 

AATTCAGCC AGCTGCAGCA GAGGGTGCTT CCTGAGGGCC CTGATTCCAT CTCCTTCTAC 2100 

ATGCCAAAG TCAAGATAGA GCAGAAGCAT AATTACTTGA AAACCATGAT GGGCCTCCAG 2160 
AGGCACATA GAAAATAG 



Seq ID NO: 308 Protein sequence: 
Protein Accession #: NP_071737



1 11 21 31 41 51 

I I I I I I

MGTRKKVHAF VRVKPTDDFA HEMIRYGDDK RSIDIHLKKD IRRGVVNNQQ TDWSFKLDGV 60

LHDASQDLVY ETVAKDVVSQ ALDGYNGTIM CYGQTGAGKT YTMMGATENY KHRGILPRAL 120

QQVFRMIEER PTHAITVRVS YLEIYNESLF DLLSTLPYVG PSVTPMTIVE NPQGVFIKGL 180

SVHLTSQEED AFSLLFEGET NRIIASHTMN KNSSRSHCIF TIYLEAHSRT LSEEKYITSK 240

INLVDLAGSE RLGKSGSEGQ VLKEATYINK SLSFLEQAII ALGDQKRDHI PFRQCKLTHA 300

LKDSLGGNCN MVLVTNIYGE AAQLEETLSS LRFASRMKLV TTEPAINEKY DAERMVKNLE 360

KELALLKQEL AIHDSLTNRT FVTYDPMDEI QIAEINSQVR RYLEGTLDEI DIISLRQIKE 420

VFNQFRVVLS QQEQEVESTL RRKYTLIDRN DFAAISAIQK AGLVDVDGHL VGEPEGQNFG 480

LGVAPFSTKP GKKAKSKKTF KEPLRPDTPP SKPVAFEEFK NEQGSEINRI FKENKSILNE 540

RRKRASETTQ HINAIKREID VTKEALNFQK SLREKQGKYE NKGLMIIDEE EFLLILKLKD 600

LKKQYRSEYQ DLRDLRAEIQ YCQHLVDQCR HRLLMEFDIW YNESFVIPED MQMALKPGGS 660

IRPGMVPVNR IVSLGEDDQD KFSQLQQRVL PEGPDSISFY NAKVKIEQKH NYLKTMMGLQ 720 
QAHRK 
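
As a rough consistency check on the entry above, the annotated coding range of Seq ID NO: 307 can be compared with the number of residues listed for Seq ID NO: 308. The sketch below is illustrative only and is not part of the original listing: the helper name `expected_protein_length` is hypothetical, and it assumes that coordinates are 1-based and inclusive and that the range includes the terminal stop codon, neither of which the listing states explicitly.

```python
def expected_protein_length(cds_start: int, cds_end: int) -> int:
    """Return the residue count implied by a coding-sequence range.

    Assumptions (not stated in the listing): coordinates are 1-based and
    inclusive, and the range ends with a stop codon that is not translated.
    """
    n_nt = cds_end - cds_start + 1
    if n_nt < 6 or n_nt % 3 != 0:
        raise ValueError(
            f"range {cds_start}..{cds_end} is not a whole number of codons"
        )
    return n_nt // 3 - 1  # drop the stop codon


# Seq ID NO: 307 is annotated "Coding sequence: 1..2178".
print(expected_protein_length(1, 2178))  # prints 725
```

Under these assumptions the result, 725, agrees with the 725 residues listed for Seq ID NO: 308. A range whose length is not a multiple of three raises an error, which may indicate a different coordinate convention or a transcription error rather than a problem with the sequence itself.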



Seq ID NO: 309 DNA sequence 

Nucleic Acid Accession #: CAT cluster



1 11 21 31 41 51 

I I I I I I

TTTTTTTTTT TTTTTTTTAA TGCCTGCTGT CATGCTCTGT CTACCAGGGT GAATTTCCAA 60 

AAATTTCTGC ATAGCAATTT TAGCCAAAAC TATATATGTT CTGGGGAGGA TAGGCATAGG 120 

CACATTGAAG ACCAAAGGAA AGAGTGAAGA AGTGTAGTTG GGTCATTGTG AATGGATGTT 180 

TAGATTGTCA AGAAAAGTGG GCCAGAGGCC CCACCTCACA CTAGGAGGGC AATTGCCTCT 240 

CATTAGTATC TCAGGCACCA TGGGTCTTAT TTGGTGTCAT AAGAAACACC CTCAACAAAG 300 

TAATGAACCC TCAGCCTCCA GCTTCTCTTC TTCGGGATTC TTCTTAGGGC CTCCTTTTTC 360 

CTTTTATGTT TCCAGTACCC TGAATTTCTT ATTCCCATCC CCCATTAAAA TCTGCTTCAA 420 

AGAAAAAACA AGAAGGACAC ATTCACTTTA AGATCCAAAT GAATGATAAG AGCTTAAAAC 480 
ATTATACTTA TCAGTATTAT TTGCATTTTT ATAGAAACCA AAACCATATT TCAACAAC 



Seq ID NO: 310 DNA sequence 

Nucleic Acid Accession #: NM_018622.2 

Coding sequence: 1-1140 



1 11 21 31 41 51 

I I I I I I 

ATGGCGTGGC GAGGCTGGGC GCAGAGAGGC TGGGGCTGCG GCCAGGCGTG GGGTGCGTCG 60 

GTGGGCGGCC GCAGCTGCGA GGAGCTCACT GCGGTCCTAA CCCCGCCGCA GCTCCTCGGA 120 

CGCAGGTTTA ACTTCTTTAT TCAACAAAAA TGCGGATTCA GAAAAGCACC CAGGAAGGTT 180 

GAACCTCGAA GATCAGACCC AGGGACAAGT GGTGAAGCAT ACAAGAGAAG TGCTTTGATT 240 

CCTCCTGTGG AAGAAACAGT CTTTTATCCT TCTCCCTATC CTATAAGGAG TCTCATAAAA 300 

CCTTTATTTT TTACTGTTGG GTTTACAGGC TGTGCATTTG GATCAGCTGC TATTTGGCAA 360 




TATGAATCAC TGAAATCCAG GGTCCAGAGT TATTTTGATG GTATAAAAGC TGATTGGTTG 420 

GATAGCATAA GACCACAAAA AGAAGGAGAC TTCAGAAAGG AGATTAACAA GTGGTGGAAT 480 

AACCTAAGTG ATGGCCAGCG GACTGTGACA GGTATTATAG CTGCAAATGT CCTTGTATTC 540 

TGTTTATGGA GAGTACCTTC TCTGCAGCGG ACAATGATCA GATATTTCAC ATCGAATCCA 600 

GCCTCAAAGG TCCTTTGTTC TCCAATGTTG CTGTCAACAT TCAGTCACTT CTCCTTATTT 660 

CACATGGCAG CAAATATGTA TGTTTTGTGG AGCTTCTCTT CCAGCATAGT GAACATTCTG 720 

GGTCAAGAGC AGTTCATGGC AGTGTACCTA TCTGCAGGTG TTATTTCCAA TTTTGTCAGT 780 

TACCTGGGTA AAGTTGCCAC AGGAAGATAT GGACCATCAC TTGGTGCATC TGGTGCCATC 840 

ATGACAGTCC TCGCAGCTGT CTGCACTAAG ATCCCAGAAG GGAGGCTTGC CATTATTTTC 900 

CTTCCGATGT TCACGTTCAC AGCAGGGAAT GCCCTGAAAG CCATTATCGC CATGGATACA 960 

GCAGGAATGA TCCTGGGATG GAAATTTTTT GATCATGCGG CACATCTTGG GGGAGCTCTT 1020 

TTTGGAATAT GGTATGTTAC TTACGGTCAT GAACTGATTT GGAAGAACAG GGAGCCGCTA 1080 
GTGAAAATCT GGCATGAAAT AAGGACTAAT GGCCCCAAAA AAGGAGGTGG CTCTAAGTAA 



Seq ID NO: 311 Protein sequence: 
Protein Accession #: NP_061092.2

1 11 21 31 41 51

I I I I I I

MAWRGWAQRG WGCGQAWGAS VGGRSCEELT AVLTPPQLLG RRFNFFIQQK CGFRKAPRKV 60
EPRRSDPGTS GEAYKRSALI PPVEETVFYP SPYPIRSLIK PLFFTVGFTG CAFGSAAIWQ 120
YESLKSRVQS YFDGIKADWL DSIRPQKEGD FRKEINKWWN NLSDGQRTVT GIIAANVLVF 180
CLWRVPSLQR TMIRYFTSNP ASKVLCSPML LSTFSHFSLF HMAANMYVLW SFSSSIVNIL 240
GQEQFMAVYL SAGVISNFVS YLGKVATGRY GPSLGASGAI MTVLAAVCTK IPEGRLAIIF 300
LPMFTFTAGN ALKAIIAMDT AGMILGWKFF DHAAHLGGAL FGIWYVTYGH ELIWKNREPL 360
VKIWHEIRTN GPKKGGGSK



Seq ID NO: 312 DNA sequence 
Nucleic Acid Accession #: NM_000625 
Coding sequence: 195..3656



1 11 21 31 41 51

I I I I I I

CTCTCGGCCA CCTTTGATGA GGGGACTGGG CAGTTCTAGA CAGTCCCGAA GTTCTCAAGG 60
CACAGGTCTC TTCCTGGTTT GACTGTCCTT ACCCCGGGGA GGCAGTGCAG CCAGCTGCAA 120
GCCCCACAGT GAAGAACATC TGAGCTCAAA TCCAGATAAG TGACATAAGT GACCTGCTTT 180
GTAAAGCCAT AGAGATGGCC TGTCCTTGGA AATTTCTGTT CAAGACCAAA TTCCACCAGT 240
ATGCAATGAA TGGGGAAAAA GGCATCAACA ACAATGTGGA GAAAGCCCCC TGTGCCACCT 300
CCAGTCCAGT GACACAGGAT GACCTTCAGT ATCACAACCT CAGCAAGCAG CAGAATGAGT 360
CCCCGCAGCC CCTCGTGGAG ACGGGAAAGA AGTCTCCAGA ATCTCTGGTC AAGCTGGATG 420
CAACCCCATT GTCCTCCCCA CGGCATGTGA GGATCAAAAA CTGGGGCAGC GGGATGACTT 480
TCCAAGACAC ACTTCACCAT AAGGCCAAAG GGATTTTAAC TTGCAGGTCC AAATCTTGCC 540
TGGGGTCCAT TATGACTCCC AAAAGTTTGA CCAGAGGACC CAGGGACAAG CCTACCCCTC 600
CAGATGAGCT TCTACCTCAA GCTATCGAAT TTGTCAACCA ATATTACGGC TCCCTCAAAG 660
AGGCAAAAAT AGAGGAACAT CTGGCCAGGG TGGAAGCGGT AACAAAGGAG ATAGAAACAA 720
CAGTAACCTA CCAACTGACG GGAGATGAGC TCATCTTCGC CACCAAGCAG GCCTGGCGCA 780
ATGCCCCACG CTGCATTGGG AGGATCCAGT GGTCCAACCT GCAGGTCTTC GATGCCCGCA 840
GCTGTTCCAC TGCCCGGGAA ATGTTTGAAC ACATCTGCAG ACACGTGCGT TACTCCACCA 900
ACAATGGCAA CATCAGGTCG GCCATCACCG TGTTCCCCCA GCGGAGTGAT GGCAAGCACG 960
ACTTCCGGGT GTGGAATGCT CAGCTCATCC GCTATGCTGG CTACCAGATG CCAGATGGCA 1020
GCATCAGAGG GGACCCTGCC AACGTGGAAT TCACTCAGCT GTGCATCGAC CTGGGCTGGA 1080
AGCCCAAGTA CGGCCGCTTC GATGTGGTCC CCCTGGTCCT GCAGGCCAAT GGCCGTGACC 1140
CTGAGCTCTT CGAAATCCCA CCTGACCTTG TGCTTGAGGT GGCCATGGAA CATCCCAAAT 1200
ACGAGTGGTT TCGGGAACTG GAGCTAAAGT GGTACGCCCT GCCTGCAGTG GCCAACATGC 1260
TGCTTGAGGT GGGCGGCCTG GAGTTCCCAG GGTGCCCCTT CAATGGCTGG TACATGGGCA 1320
CAGAGATCGG AGTCCGGGAC TTCTGTGATG TCCAGCGCTA CAACATCCTG GAGGAAGTGG 1380
GCAGGAGAAT GGGCCTGGAA ACGCACAAGC TGGCCTCGCT CTGGAAAGAC CAGGCTGTOG 1440
TTGAGATCAA CATTGCTGTG CTCCATAGTT TCCAGAAGCA GAATGTGACC ATCATGGACC 1500
ACCACTCGGC TGCAGAATCC TTCATGAAGT ACATGCAGAA TGAATACCGG TCCCGTGGGG 1560
GCTGCCCGGC AGACTGGATT TGGCTGGTCC CTCCCATGTC TGGGAGCATC ACCCCCGTGT 1620
TTCACCAGGA GATGCTGAAC TACGTCCTGT CCCCTTTCTA CTACTATCAG GTAGAGGCCT 1680
GGAAAACCCA TGTCTGGCAG GACGAGAAGC GGAGACCCAA GAGAAGAGAG ATTCCATTGA 1740
AAGTCTTGGT CAAAGCTGTG CTCTTTGCCT GTATGCTGAT GCGCAAGACA ATGGCGTCCC 1800
GAGTCAGAGT CACCATCCTC TTTGCGACAG AGACAGGAAA ATCAGAGGCG CTGGCCTGGG 1860
ACCTGGGGGC CTTATTCAGC TGTGCCTTCA ACCCCAAGGT TGTCTGCATG GATAAGTACA 1920
GGCTGAGCTG CCTGGAGGAG GAACGGCTGC TGTTGGTGGT GACCAGTACG TTTGGCAATG 1980
GAGACTGCCC TGGCAATGGA GAGAAACTGA AGAAATCGCT CTTCATGCTG AAAGAGCTCA 2040
ACAACAAATT CAGGTACGCT GTGTTTGGCC TCGGCTCCAG CATGTACCCT CGGTTCTGCG 2100
CCTTTGCTCA TGACATTGAT CAGAAGCTGT CCCACCTGGG GGCCTCTCAG CTCACCCCGA 2160
TGGGAGAAGG GGATGAGCTC AGTGGGCAGG AGGACGCCTT CCGCAGCTGG GCCGTGCAAA 2220
CCTTCAAGGC AGCCTGTGAG ACGTTTGATG TCCGAGGCAA ACAGCACATT CAGATCCCCA 2280
AGCTCTACAC CTCCAATGTG ACCTGGGACC CGCACCACTA CAGGCTCGTG CAGGACTCAC 2340
AGCCTTTGGA CCTCAGCAAA GCCCTCAGCA GCATGCATGC CAAGAACGTG TTCACCATGA 2400
GGCTCAAATC TCGGCAGAAT CTACAAAGTC CGACATCCAG CCGTGCCACC ATCCTGGTGG 2460
AACTCTCCTG TGAGGATGGC CAAGGCCTGA ACTACCTGCC GGGGGAGCAC CTTGGGGTTT 2520
GCCCAGGCAA CCAGCCGGCC CTGGTCCAAG GCATCCTGGA GCGAGTGGTG GATGGCCCCA 2580
CACCCCACCA GGCAGTGCGC CTGGAGGCCC TGGATGAGAG TGGCAGCTAC TGGGTCAGTG 2640
ACAAGAGGCT GCCCCCCTGC TCACTCAGCC AGGCCCTCAC CTACTTCCTG GACATCACCA 2700
CACCCCCAAC CCAGCTGCTG CTCCAAAAGC TGGCCCAGGT GGCCACAGAA GAGCCTGAGA 2760
GACAGAGGCT GGAGGCCCTG TGCCAGCCCT CAGAGTACAG CAAGTGGAAG TTCACCAACA 2820
GCCCCACATT CCTGGAGGTG CTAGAGGAGT TCCCGTCCCT GCGGGTGTCT GCTGGCTTCC 2880
TGCTTTCCCA GCTCCCCATT CTGAAGCCCA GGTTCTACTC CATCAGCTCC CCCCGGGATC 2940
ACACGCCCAC GGAGATCCAC CTGACTGTGG CCGTGGTCAC CTACCACACC CGAGATGGCC 3000
AGGGTCCCCT GCACCACGGC GTCTGCAGCA CATGGCTCAA CAGCCTGAAG CCCCAAGACC 3060
CAGTGCCCTG CTTTGTGCGG AATGCCAGCG GCTTCCACCT CCCCGAGGAT CCCTCCCATC 3120
CTTGCATCCT CATCGGGCCT GGCACAGGCA TCGCGCCCTT CCGCAGTTTC TGGCAGCAAC 3180
GGCTCCATGA CTCCCAGCAC AAGGGAGTGC GGGGAGGCCG CATGACCTTG GTGTTTGGGT 3240
GCCGCCGCCC AGATGAGGAC CACATCTACC AGGAGGAGAT GCTGGAGATG GCCCAGAAGG 3300
GGGTGCTGCA TGCGGTGCAC ACAGCCTATT CCCGCCTGCC TGGCAAGCCC AAGGTCTATG 3360
TTCAGGACAT CCTGCGGCAG CAGCTGGCCA GCGAGGTGCT CCGTGTGCTC CACAAGGAGC 3420
CAGGCCACCT CTATGTTTGC GGGGATGTGC GCATGGCCCG GGACGTGGCC CACACCCTGA 3480
AGCAGCTGGT GGCTGCCAAG CTGAAATTGA ATGAGGAGCA GGTCGAGGAC TATTTCTTTC 3540
AGCTCAAGAG CCAGAAGCGC TATCACGAAG ATATCTTTGG TGCTGTATTT CCTTACGAGG 3600
CGAAGAAGGA CAGGGTGGCG GTGCAGCCCA GCAGCCTGGA GATGTCAGCG CTCTGAGGGC 3660
CTACAGGAGG GGTTAAAGCT GCCGGCACAG AACTTAAGGA TGGAGCCAGC TCTGCATTAT 3720
CTGAGGTCAC AGGGCCTGGG GAGATGGAGG AAAGTGATAT CCCCCAGCCT CAAGTCTTAT 3780
TTCCTCAACG TTGCTCCCCA TCAAGCCCTT TACTTGACCT CCTAACAAGT AGCACCCTGG 3840
ATTGATCGGA GCCTC

Seq ID NO: 313 Protein sequence:
Protein Accession #: NP_000616



1 11 21 31 41 51

I I I I I I

MACPWKFLFK TKFHQYAMNG EKGINNNVEK APCATSSPVT QDDLQYHNLS KQQNESPQPL 60
VETGKKSPES LVKLDATPLS SPRHVRIKNW GSGMTFQDTL HHKAKGILTC RSKSCLGSIM 120
TPKSLTRGPR DKPTPPDELL PQAIEFVNQY YGSLKEAKIE EHLARVEAVT KEIETTVTYQ 180
LTGDELIFAT KQAWRNAPRC IGRIQWSNLQ VFDARSCSTA REMFEHICRH VRYSTNNGNI 240
RSAITVFPQR SDGKHDFRVW NAQLIRYAGY QMPDGSIRGD PANVEFTQLC IDLGWKPKYG 300
RFDVVPLVLQ ANGRDPELFE IPPDLVLEVA MEHPKYEWFR ELELKWYALP AVANMLLEVG 360
GLEFPGCPFN GWYMGTEIGV RDFCDVQRYN ILEEVGRRMG LETHKLASLW KDQAVVEINI 420
AVLHSFQKQN VTIMDHHSAA ESFMKYMQNE YRSRGGCPAD WIWLVPPMSG SITPVFHQEM 480
LNYVLSPFYY YQVEAWKTHV WQDEKRRPKR REIPLKVLVK AVLFACMLMR KTMASRVRVT 540
ILFATETGKS EALAWDLGAL FSCAFNPKVV CMDKYRLSCL EEERLLLVVT STFGNGDCPG 600
NGEKLKKSLF MLKELNNKFR YAVFGLGSSM YPRFCAFAHD IDQKLSHLGA SQLTPMGEGD 660
ELSGQEDAFR SWAVQTFKAA CETFDVRGKQ HIQIPKLYTS NVTWDPHHYR LVQDSQPLDL 720
SKALSSMHAK NVFTMRLKSR QNLQSPTSSR ATILVELSCE DGQGLNYLPG EHLGVCPGNQ 780
PALVQGILER VVDGPTPHQA VRLEALDESG SYWVSDKRLP PCSLSQALTY FLDITTPPTQ 840
LLLQKLAQVA TEEPERQRLE ALCQPSEYSK WKFTNSPTFL EVLEEFPSLR VSAGFLLSQL 900
PILKPRFYSI SSPRDHTPTE IHLTVAVVTY HTRDGQGPLH HGVCSTWLNS LKPQDPVPCF 960
VRNASGFHLP EDPSHPCILI GPGTGIAPFR SFWQQRLHDS QHKGVRGGRM TLVFGCRRPD 1020
EDHIYQEEML EMAQKGVLHA VHTAYSRLPG KPKVYVQDIL RQQLASEVLR VLHKEPGHLY 1080
VCGDVRMARD VAHTLKQLVA AKLKLNEEQV EDYFFQLKSQ KRYHEDIFGA VFPYEAKKDR 1140
VAVQPSSLEM SAL



Seq ID NO: 314 DNA sequence 
Nucleic Acid Accession #: XM_087254 
Coding sequence: 47..2332



1 11 21 31 41 51

I I I I I I

AGAGTACGTG TTTACAGATA AAACTGGTAC ACTGACAGAA AATGAGATGC AGTTTCGGGA 60
ATGTTCAATT AATGGCATGA AATACCAAGA AATTAATGGT AGACTTGTAC CCGAAGGACC 120
AACACCAGAC TCTTCAGAAG GAAACTTATC TTATCTTAGT AGTTTATCCC ATCTTAACAA 180
CTTATCCCAT CTTACAACCA GTTCCTCTTT CAGAACCAGT CCTGAAAATG AAACTGAACT 240
AATTAAAGAA CATGATCTCT TCTTTAAAGC AGTCAGTCTC TGTCACACTG TACAGATTAG 300
CAATGTTCAA ACTGACTGCA CTGGTGATGG TCCCTGGCAA TCCAACCTGG CACCATCGCA 360
GTTGGAGTAC TATGCATCTT CACCAGATGA AAAGGCTCTA GTAGAAGCTG CTGCAAGGAT 420
TGGTATTGTG TTTATTGGCA ATTCTGAAGA AACTATGGAG GTTAAAACTC TTGGAAAACT 480
GGAACGGTAC AAACTGCTTC ATATTCTGGA ATTTGATTCA GATCGTAGGA GAATGAGTGT 540
AATTGTTCAG GCACCTTCAG GTGAGAAGTT ATTATTTGCT AAAGGAGCTG AGTCATCAAT 600
TCTCCCTAAA TGTATAGGTG GAGAAATAGA AAAAACCAGA ATTCATGTAG ATGAATTTGC 660
TTTGAAAGGG CTAAGAACTC TGTGTATAGC ATATAGAAAA TTTACATCAA AAGAGTATGA 720
GGAAATAGAT AAACGCATAT TTGAAGCCAG GACTGCCTTG CAGCAGCGGG AAGAGAAATT 780
GGCAGCTGTT TTCCAGTTCA TAGAGAAAGA CCTGATATTA CTTGGAGCCA CAGCAGTAGA 840
AGACAGACTA CAAGATAAAG TTCGAGAAAC TATTGAAGCA TTGAGAATGG CTGGTATCAA 900
AGTATGGGTA CTTACTGGGG ATAAACATGA AACAGCTGTT AGTGTGAGTT TATCATGTGG 960
CCATTTTCAT AGAACCATGA ACATCCTTGA ACTTATAAAC CAGAAATCAG ACAGCGAGTG 1020
TGCTGAACAA TTGAGGCAGC TTGCCAGAAG AATTACAGAG GATCATGTGA TTCAGCATGG 1080
GCTGGTAGTG GATGGGACCA GCCTATCTCT TGCACTCAGG GAGCATGAAA AACTATTTAT 1140
GGAAGTTTGC AGAAATTGTT CAGCTGTATT ATGCTGTCGT ATGGCTCCAC TGCAGAAAGC 1200
AAAAGTAATA AGACTAATAA AAATATCACC TGAGAAACCT ATAACATTGG CTGTTGGTGA 1260
TGGTGCTAAT GACGTAAGCA TGATACAAGA AGCCCATGTT GGCATAGGAA TCATGGGTAA 1320
AGAAGGAAGA CAGGCTGCAA GAAACAGTGA CTATGCAATA GCCAGATTTA AGTTCCTCTC 1380
CAAATTGCTT TTTGTTCATG GTCATTTTTA TTATATTAGA ATAGCTACCC TTGTACAGTA 1440
TTTTTTTTAT AAGAATGTGT GCTTTATCAC ACCCCAGTTT TTATATCAGT TCTACTGTTT 1500
GTTTTCTCAG CAAACATTGT ATGACAGCGT GTACCTGACT TTATACAATA TTTGTTTTAC 1560
TTCCCTACCT ATTCTGATAT ATAGTCTTTT GGAACAGCAT GTAGACCCTC ATGTGTTACA 1620
AAATAAGCCC ACCCTTTATC GAGACATTAG TAAAAACCGC CTCTTAAGTA TTAAAACATT 1680
TCTTTATTGG ACCATCCTGG GCTTCAGTCA TGCCTTTATT TTCTTTTTTG GATCCTATTT 1740
ACTAATAGGG AAAGATACAT CTCTGCTTGG AAATGGCCAG ATGTTTGGAA ACTGGACATT 1800
TGGCACTTTG GTCTTCACAG TCATGGTTAT TACAGTCACA GTAAAGATGG CTCTGGAAAC 1860
TCATTTTTGG ACTTGGATCA ACCATCTCGT TACCTGGGGA TCTATTATAT TTTATTTTGT 1920
ATTTTCCTTG TTTTATGGAG GGATTCTCTG GCCATTTTTG GGCTCCCAGA ATATGTATTT 1980
TGTGTTTATT CAGCTCCTGT CAAGTGGTTC TGCTTGGTTT GCCATAATCC TCATGGTTGT 2040
TACATGTCTA TTTCTTGATA TCATAAAGAA GGTCTTTGAC CGACACCTCC ACCCTACAAG 2100
TACTGAAAAG GCACAGCTTA CTGAAACAAA TGCAGGTATC AAGTGCTTGG ACTCCATGTG 2160
CTGTTTCCCG GAAGGAGAAG CAGCGTGTGC ATCTGTTGGA AGAATGCTGG AACGAGTTAT 2220
AGGAAGATGT AGTCCAACCC ACATCAGCAG ATCATGGAGT GCATCGGATC CTTTCTATAC 2280
CAACGACAGG AGCATCTTGA CTCTCTCCAC AATGGACTCA TCTACTTGTT AAAGGGGCAG 2340



TAGTACTTTG TGGGAGCCAG TTCACCTCCT TTCCTAAAAT TCAGTGTGAT CACCCTGTTA 2400

ATGGCCACAC TAGCTCTGAA ATTAATTTCC AAAATCTTTG TAGTAGTTCA TACCCACTCA 2460 

GAGTTATAAT GGCAAACAAA CAGAAAGCAT TAGTACAAGC CCCTCCCAAC ACCCTTAATT 2520 

TGAATCTGAA CATGTTAAAA TTTGAGAATA AAGAGACATT TTTCATCTCT TTGTCTGGTT 2580 

TGTCCCTTGT GCTTATGGGA CTCCTAATGG CATTTCAGTC TGTTGCTGAG GCCATTATAT 2640 

TTTAATATAA ATGTAGAAAA AAGAGAGAAA TCTTAGTAAA GAGTATTTTT TAGTATTAGC 2700 

TTGATTATTG ACTCTTCTAT TTAAATCTGC TTCTGTAAAT TATGCTGAAA GTTTGCCTTG 2760 

AGAACTCTAT TTTTTTATTA GAGTTATATT TAAAGCTTTT CATGGGAAAA GTTAATGTGA 2820 

ATACTGAGGA ATTTTGGTCC CTCAGTGACC TGTGTTGTTA ATTCATTAAT GCATTCTGAG 2880 

TTCACAGAGC AAATTAGGAG AATCATTTCC AACCATTATT TACTGCAGTA TGGGGAGTAA 2940 

ATTTATACCA ATTCCTCTAA CTGTACTGTA ACACAGCCTG TAAAGTTAGC CATATAAATG 3000 

CAAGGGTATA TCATATATAC AAATCAGGAA TCAGGTCCGT TCACCGAACT TCAAATTGAT 3060 

GTTTACTAAT ATTTTTGTGA CAGAGTATAA AGACCCTATA GTGGGTAAAT TAGATACTAT 3120 

TAGCATATTA TTAATTTAAT GTCTTTATCA TTGGATCTTT TGCATGCTTT AATCTGGTTA 3180

ACATATTTAA ATTTGCTTTT TTTCTCTTTA CCTGAAGGCT CTGTGTATAG TATTTCATGA 3240 

CATCGTTGTA CAGTTTAACT ATATCAATAA AAAGTTTGGA CAGTATTTAA ATATTGCAAA 3300 

TATGTTTAAT TATACAAATC AGAATAGTAT GGGTAATTAA ATGAATACAA AAAGAAGAGC 3360 

CTCTTTCTGC AGCCGACTTA GACATGCTCT TCCCTTTCTA TAAGCTAGAT TTTAGAATAA 3420 

AGGGTTTCAG TTAATAATCT TATTTTCAGG TTATGTCATC TAACTTATAG CAAACTACCA 3480 

CAATACAGTG AGTTCTGCCA GTGTCCCAGT ACAAGGCATA TTTCAGGTGT GGCTGTGGAA 3540 

TGTAAAAATG CTCAACTTGT ATCAGGTAAT GTTAGCAATA AATTAAATGC TAAGAATGAT 3600 

TAATCGGGTA CATGTTACTG TAATTAACTC ATTGCACTTC AAAACCTAAC TTCCATCCTG 3660 

AATTTATCAA GTAGTTCAGT ATTGTCATTT GTTTTTGTTT TATTGAAAAG TAATGTTGTC 3720 

TTAAGATTTA GAAGTGATTA TTAGCTTGAG AACTATTACC CAGCTCTAAG CAAATAATGA 3780 

TTGTATACAT ATTAAGATAA TGGTTAAATG CGGTTTTACC AAGTTTTCCC TTGAAAATGT 3840 

AATTCCTTTA TGGAGATTTA TTGTGCAGCC CTAAGCTTCC TTCCCATTTC ATGAATATAA 3900 

GGCTTCTAGA ATTGGACTGG CAGGGGAAAG AATGGTAGAG ACAGAAATTA AGACTTTATC 3960 

CTTGTTTGCT TGTAAACTAT TATTTTCTTG CTAATGTAAC ATTTGTCTGT TCCAGTGATG 4020 

TAAGGATATT AAGTTATTAA GCTAAATATT AATTTTCAAA AATAGTCCTT CTTTAACTTA 4080 

GATATTTCAT AGCTGGATTT AGGAAGATCT GTTATTCTGG AAGTACTAAA AAGAATAATA 4140 

CAACGTACAA TGTCTGCATT CACTAATTCA TGTTCCAGAA GAGGAAATAA TGAAGATATA 4200 

CTCAGTAGAG TACTAGGTGG GAGGATATGG AAATTTGCTC ATAAAATCTC TTATAAAACG 4260 

TGCATATAAC AAAATGACAC CCAGTAGGCC TGCATTACAT TTACATGACC GTGTTTATTT 4320 

GCCATCAAAT AAACTGAGTA CTGACACCAG ACAAAGACTC CAAAGTCATA AAATAGCCTA 4380 

TGACCAACTG CAGCAAGACA GGAGGTCAGC TCGCCTATAA TGGTGCTTAA AGTGTGATTG 4440 

ATGTAATTTT CTGTACTCAC CATTTGAAGT TAGTTAAGGA GAACTTTATT TTTTTAAAAA 4500 

AAGTAAATGG CAACCACTAG TGTGCTCATC CTGAACTGTT ACTCCAAATC CACTCCGTTT 4560 

TTAAAGCAAA ATTATCTTGT GATTTTAAGA AAAGAGTTTT CTATTTATTT AAGAAAGTAA 4620 

CAATGCAGTC TGCAAGCTTT CAGTAGTTTT CTAGTGCTAT ATTCATCCTG TAAAACTCTT 4680 

ACTACGTAAC CAGTAATCAC AAGGAAAGTG TCCCCTTTGC ATATTTCTTT AAAATTCTTT 4740 

CTTTGGAAAG TATGATGTTG ATAATTAACT TACCCTTATC TGCCAAAACC AGAGCAAAAT 4800 

GCTAAATACG TTATTGCTAA TCAGTGGTCT CAAATCGATT TGCCTCCCTT TGCCTCGTCT 4860 

GAGGGCTGTA AGCCTGAAGA TAGTGGCAAG CACCAAGTCA GTTTCCAAAA TTGCCCCTCA 4920 

GCTGCTTTAA GTGACTCAGC ACCCTGCCTC AGCTTCAGCA GGCGTAGGCT CACCCTGGGC 4980 

GGAGCAAAGT ATGGGCCAGG GAGAACTACA GCTACGAAGA CCTGCTGTCG AGTTGAGAAA 5040 

AGGGGAGAAT TTATGGTCTG AATTTTCTAA CTGTCCTCTT TCTTGGGTCT AAAGCTCATA 5100 

ATACACAAAG GCTTCCAGAC CTGAGCCACA CCCAGGCCCT ATCCTGAACA GGAGACTAAA 5160 

CAGAGGCAAA TCAACCCTAG GAAATACTTG CATTCTGCCC TACGGTTAGT ACCAGGACTG 5220 

AGGTCATTTC TACTGGAAAA GATTGTGAGA TTGAACTTAT CTGATCGCTT GAGACTCCTA 5280 

ATAGGCAGGA GTCAAGGCCA CTAGAAAATT GACAGTTAAG AGCCAAAAGT TTTTAAAATA 5340 

TGCTACTCTG AAAAATCTCG TGAAGGCTGT AGGAAAAGGG AGAATCTTCC ATGTTGGTGT 5400 

TTTTCCTGTA AAGATCAGTT TGGGGTATGA TATAAGCAGG TATTAATAAA AATAACACAC 5460

CAAAGAGTTA CGTAAAACAT GTTTTATTAA TTTTGGTCCC CACGTACAGA CATTTTATTT 5520 

CTATTTTGAA ATGAGTTATC TATTTTCATA AAAGTAAAAC ACTATTAAAG TGCTGTTTTA 5580 

TGTGAAATAA CTTGAATGTT GTTCCTATAA AAAATAGATC ATAACTCATG ATATGTTTGT 5640 

AATCATGGTA ATTTAGATTT TTATGAGGAA TGAGTATCTG GAAATATTGT AGCAATACTT 5700 

GGTTTAAAAT TTTGGACCTG AGACACTGTG GCTGTCTAAT GTAATCCTTT AAAAATTCTC 5760 

TGCATTGTCA GTAAATGTAG TATATTATTG TACAGCTACT CATAATTTTT TAAAGTTTAT 5820 
GAAGTTATAT TTATCAAATA AAAACTTTCC TATAT 

Seq ID NO: 315 Protein sequence: 
Protein Accession #: XP_087254



1 11 21 31 41 51 

I I I I I I 

MQFRECSING MKYQEINGRL VPEGPTPDSS EGNLSYLSSL SHLNNLSHLT TSSSFRTSPE 60 

NETELIKEHD LFFKAVSLCH TVQISNVQTD CTGDGPWQSN LAPSQLEYYA SSPDEKALVE 120 

AAARIGIVFI GNSEETMEVK TLGKLERYKL LHILEFDSDR RRMSVIVQAP SGEKLLFAKG 180

AESSILPKCI GGEIEKTRIH VDEFALKGLR TLCIAYRKFT SKEYEEIDKR IFEARTALQQ 240

REEKLAAVFQ FIEKDLILLG ATAVEDRLQD KVRETIEALR MAGIKVWVLT GDKHETAVSV 300

SLSCGHFHRT MNILELINQK SDSECAEQLR QLARRITEDH VIQHGLVVDG TSLSLALREH 360

EKLFMEVCRN CSAVLCCRMA PLQKAKVIRL IKISPEKPIT LAVGDGANDV SMIQEAHVGI 420

GIMGKEGRQA ARNSDYAIAR FKFLSKLLFV HGHFYYIRIA TLVQYFFYKN VCFITPQFLY 480

QFYCLFSQQT LYDSVYLTLY NICFTSLPIL IYSLLEQHVD PHVLQNKPTL YRDISKNRLL 540

SIKTFLYWTI LGFSHAFIFF FGSYLLIGKD TSLLGNGQMF GNWTFGTLVF TVMVITVTVK 600

MALETHFWTW INHLVTWGSI IFYFVFSLFY GGILWPFLGS QNMYFVFIQL LSSGSAWFAI 660

ILMVVTCLFL DIIKKVFDRH LHPTSTEKAQ LTETNAGIKC LDSMCCFPEG EAACASVGRM 720
LERVIGRCSP THISRSWSAS DPFYTNDRSI LTLSTMDSST C 

Seq ID NO: 316 DNA sequence 
Nucleic Acid Accession #: NM_004473 
Coding sequence: 661..1791

1 11 21 31 41 51 

I I I I I I

CTCGCCAGCG GTCCGCGGGG CTGGAGACCC ACGCCGTGGA GAGGACCAGC CTCAGGTCGC 60 
CCCGCCTGGG CCCGCGCCCC GACCTCGCTG
TACCGCCACC TTGGCCTCGG GGGCAGGGCA 
AGTACTAACT GCCCTCGCTC TGGCCTTCGA 
GGCAGTAATC CTAAACTAGC GGGCACCACA 
CACTTCCGGA CCCCTCGACC GCCCGGCACC 
AGAGTCCAGT CCCGGTCGCG AGGCCACCGC 
CTGAGCCGTC GGCTAGCGGG TCACTCCCGA 
CACGGACGCT GAGCCTCCAG CGCGCGCCAG 
GCGACGATCC CCTGAGCTCT CCGCAGAAGG 
CGCCCCCGCC CCCCGACAGC CGCGGGGATC 
ATGACTGCCG AGAGCGGGCC GCCGCCGCCG 
GAGCGOGGCG AGACGGCAGC AGGGGCCGGG 
GGCGGGCGGC GCCGCAAGCG CCCCCTGCAG 
CTCATCGCCA TGGCCATCGC GCACGCGCCC 
AAGTTCATCA CCGAGCGCTT CCCCTTCTAC 
ATCCGCCACA ACCTCACACT CAACGACTGC 
CCGGGTAAGG GCAACTACTG GGCGCTCGAC 
AGCTTCCTGC GCCGCCGCAA GCGCTTCAAG 
ATGCACGACG CGGCGGCTGC CGCAGCCGCC 
GCCGCCATCT TCCCAGGCX3C GGTGCCCGCC 
GCAGGCTACG CGCCGCCGTC GCTGGCCGCG 
CCCGGCCCTT GCCGCGTCTT CGGCCTGGTT 
CCCGCACCGT CGGGGCCCGG CGGCTCTTGC 
ACCACCGGCT ACCAGCCCGC AGGCTGCACC 
GCGGCTGCCT ACGCGGGCCC CGACGGCGCG 
GCCGCTGCTG GCCGCCTGGC GGGACCCGCT 
GTGGAGACCA CGGTGGACTT CTACGGGCGC 
GCCTGCTACA ACCCTGGCGG GCAGCTCGGA 
CATGCTGCCG CTTATCCCGG TGGGATAGAT 
GGGACGAAAA CTCATAGACA CATCGGCTGT 
AGGAATGGAG AGAGGACTCA ACTGGGACCC 
CTCGGTCTCC CCGCGCACAG CGTAGGCACC 
GCGAGGCAGC CAGAGCCCTT GGACTGGCAC 
CGGGATGCTT TCTGGCATTC TATCGGGGAG 
GTGTCAACAC CAGAGACCAG GATCCAAATT 
TGCCGGAACT CGGGCCTTTT TACGCGGTTC 
AATAAAAGGC TGCGGCAGCG CCTTTCTTCT 
ATAGGCTTTT CTTCTTTTTT AAATTGGAGA 
TTCCCTGTCT CTGAGAACTT GAGACCTAGC 
TCCCATCACC TGAACCTTCA GTCTCCCCCA 
TCCACCGCCC CCGGGTTATC ATTCAGGGCC 
GGCAGCAATG GTGGGCCACC CAGGGCCTCT 
TAGGGAACGG AAAAGAGTTC ATGGCCAAGC 
GGCAGCAATT ATATCATAAC TTATTGAACT 
CTGTTACTGC CCAGTCATAA ATCTGCTTTT 
TCATTTGTCC ACTGTTTCTT GTCATCACGC 
TAAGGAGATG AGAGGATTCA AGGAGCCCGT 
TCTTCCATCC CCAGCACCCC CTGCTACACC 
AGAAAAATTA AGTTAGGGCA GTCAGTAAAG 
TTCATTTTGT ATCTTGCTTA AGTAGCAGGC 
TCCCCCTTTC CCTTGAGAAA TCTTTAAGTT 
AATATTTCAG AGGCTCCATT CACAGCTTTC 
TTAATAAAAA TCTTTGAGCA AGTGAGTTAT 
AACTTAACAG GGAAGGGCTG GGGTGTGAAA 
TTTATTTTTT ATTTTTGGGA CTGCACTATC 
GTCCAAATGG GGATTTGTAT AAACCAGTGC 
ATGTAATTTT AAATATTCTA GTAGCCACAT 
AA 

Seq ID NO: 317 Protein sequence : 
Protein Accession # : NP 004464 
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i 

I 

FKHLTHYROI 
GAGVPGEATG 
PFYRDNPKKW 
RFKRSDLSTY 
LAAPPPVYYP 
GCTGARPANP 
YGRTSPGQFG 



11 
I 

DT RAN SCR IP 
RGAGGRRRKR 
QNSIRHNLTL 
PAYMHDAAAA 
AASPGPCRVF 
SAYAAAYAGP 
ALGACYNPGG 



21 
I 

TIONFACTOR 
PLQRGKPPYS 
NDCFLKIPRE 
AAAAAAAAAA 
GLVPERPLSP 
DGAYPQGAGS 
QLGGASAGAY 



CCCCCGCCTC 
TGGGCGGCCC 
GCCCGAAGCC 
GACCAGCTGC 
AGCGCGCAAG 
CGCTGCCCGC 
GCCTCTGTCT 
CCTGGGCCGC 
GCCGAGCGTC 
CAGAGCCCGG 
CAGCCGGAGG 
GTCCCAGGGG 
CGCGGGAAGC 
GAGCGCCGCC 
CGCGACAACC 
TTCCTCAAGA 
CCCAACGCGG 
CGCTCGGACC 
GCTGCCGCAG 
GCGCGCCCCC 
CCGCCTCCAG 
CCTGAGCGGC 
GCCTTTGCCT 
GGGGCCCGGC 
TACCCGCAGG 
TCGCCCCCAG 
ACGTCGCCCG 
GGGGCCAGTG 



TCACACGTTC 
ACGTGGAAAA 
CTGTGTACTC 
AGGGACCCTC 
GGTCCTTGGC 
GTGGGGAATC 
GTCCTCTAGT 
TAAAGTGAGG 
AATCTCTGCT 
TCCGAGTTGA 
TCTGTTACAC 
CCATCATCTT 
GAGTAGCCAC 
GTCTAACCTA 
TTTGAGCAGG 
CCATTATAAG 
AGCCCTGGAC 
TGGTGACGCC 
TCAGCAGCCT 
TGAGCTTTAG 
TCACTAAAAT 
TCGATTCTGG 
AGATAAACTG 
GGCAAGAGAA 
AAGAAGATTG 
CTGTTCACGA 
TCTCCATTAG 
TAATAAAGTN 



31 
I 

TTFMTAESGP 
YIALIAMAIA 
AGRPGKGNYW 
AAAAAIFPGA 
ELGPAPSGPG 
AIFAAAGRLA 
HARHAAAYPG 



GCCTCTCTGC 
CCGCCAGATC 
TCTTCTGCGC 
AGCCACCCCA 
GGACCCTTCA 
CTCGAGAAGC 
GCACCGCGCC 
TGGGCTCTCC 
CGTTCCGGGG 
GGGTGCGGGA 
TGCTGGCTAC 
AGGCCACGGG 
CGCCCTACAG 
TCACGCTGGG 
CCAAAAAGTG 
TCCCGCGCGA 
AGGACATGTT 
TCTCCACCTA 
COGCCGCCGC 
CCTACCCGGG 
TCTACTACCC 
CGCTCAGCCC 
CCGCCGGCGC 
CGGCCAACCC 
GCGCCGGCAG 
CGGGCGGCAG 
GCCAGTTCGG 
CAGGCGCCTA 
CCGCCATGTG 
CCCGCAACCT 
GACCGAGCAG 
TGTAAACGGG 
GATGGAGCGA 
GGTAACCAGA 
AGTTTCAGCC 
GCCTTTAACT 
AGGACAAATT 
CTGGTTGACC 
ACTGTGCGTC 
TAGAGGGCTG 
GGATGCTGCC 
CCAAAGCCTA 
AAGTCCCAGG 
ACGTGCTGGT 
GCAGAGAGAA 
CCAAAGGGTG 
TTTCAGTAGC 
CCCCCATGCA 
AAAGAAACTG 
TAGAGAAAGT 
AGCAAAAACT 
GAGTGTTCAG 
ACTCAGCCTC 
TATGAAAACC 
AGACATGTGA 
AAATATGGTG 
AAAAGAAACA 



41 

I 

PPPQPEVLAT 
HAPERRLTLG 
ALDPNAEDMF 
VPAARPPYPG 
GSCAFASAGA 
GPASPPAGGS 
GIDRFVSAM 



CCGTGGCGCT 
GCCCAGCGCC 
GCACAACCTA 
ACCCAGGGAT 
GCCGGAGACC 
ACAACGCGGG 
AGCCCCAGAC 
GGGCCAGCCC 
ACGCCAGGCC 
CGCCCGCGCC 
CGTGAAGGAA 
CCGCGGGGCG 
CTACATCGCG 
CGGCATCTAC 
GCAGAACAGC 
GGCCGGCCGC 
CGAGAGCGGC 
CCCGG CTTAC 
CGCCGCCGCC 
CGCOGTCTAT 
CGCGGCGTCG 
AGAGCTGGGG 
CCCCGCTACC 
CTCTGCCTAT 
TGCGATCTTT 
CAGTGGCGGC 
AGCGCTGGGA 
CCATGCTCGC 
AGCCAGCGTA 
GAGAACGAAC 
GCCACAGAGG 
AGGAGGTGGG 
AGCCCTCAAA 
GGGCAGCGTA 
TTCCATGTGC 
GCGTTACTAC 
TGCAAAAGAA 
TGGGCTGGTT 
AGCACTCCAG 
CAGGACTCTA 
CTGCGTATTT 
GCCGCTGTTC 
ATTGGCTCCA 
AATTTCATGG 
GTACATTCGT 
AACTAAAGTT 
TGGGGAGGGC 
AAAAGGAAAG 
GAATTTTAAC 
CCAATAACTC 
TTCAGCATTA 
ATGGACTGTT 
TTTCTGTATA 
ATTGGTAATT 
ACTTGGTTCA 
CAAGCCACAT 
AAAAAAAAAA 



51 
I 

VKEERGETAA 
GIYKFITERF 
ESGSFLRRRK 
AVYAGYAPPS 
PATTTGYQPA 
SGGVETTVDF 



120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 



60 
120 
180 
240 
300 
360 



Seq ID NO: 318 DNA sequence 
Nucleic Acid Accession #: NM_005688 
Coding sequence: 126..4439 

1 11 21 31 41 51 

I I I I I I 

CCGGGCAGGT GGCTCATGCT CGGGAGCGTG GTTGAGCGGC TGGCGCGGTT GTCCTGGAGC 60 

AGGGGCGCAG GAATTCTGAT GTGAAACTAA CAGTCTGTGA GCCCTGGAAC CTCCGCTCAG 120 

AGAAGATGAA GGATATCGAC ATAGGAAAAG AGTATATCAT CCCCAGTCCT GGGTATAGAA 180 

GTGTGAGGGA GAGAACCAGC ACTTCTGGGA CGCACAGAGA CCGTGAAGAT TCCAAGTTCA 240 

GGAGAACTCG ACCGTTGGAA TGCCAAGATG CCTTGGAAAC AGCAGCCCGA GCCGAGGGCC 300 

TCTCTCTTGA TGCCTCCATG CATTCTCAGC TCAGAATCCT GGATGAGGAG CATCCCAAGG 360 

GAAAGTACCA TCATGGCTTG AGTGCTCTGA AGCCCATCCG GACTACTTCC AAACACCAGC 420 

ACCCAGTGGA CAATGCTGGG CTTTTTTCCT GTATGACTTT TTCGTGGCTT TCTTCTCTGG 480 

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CCCGTGTGGC CCACAAGAAG GGGGAGCTCT CAATGGAAGA CGTGTGGTCT CTGTCCAAGC 540 

ACGAGTCTTC TGACGTGAAC TGCAGAAGAC TAGAGAGACT GTGGCAAGAA GAGCTGAATG 600 

AAGTTGGGCC AGACGCTGCT TCCCTGCGAA GGGTTGTGTG GATCTTCTGC CGCACCAGGC 660 

TCATCCTGTC CATCGTGTGC CTGATGATCA CGCAGCTGGC TGGCTTCAGT GGACCAGCCT 720 

TCATGGTGAA ACACCTCTTG GAGTATACCC AGGCAACAGA GTCTAACCTG CAGTACAGCT 780 

TGTTGTTAGT GCTGGGCCTC CTCCTGACGG AAATCGTGCG GTCTTGGTCG CTTGCACTGA 840 

CTTGGGCATT GAATTACCGA ACCGGTGTCC GCTTGCGGGG GGCCATCCTA ACCATGGCAT 900 

TTAAGAAGAT CCTTAAGTTA AAGAACATTA AAGAGAAATC CCTGGGTGAG CTCATCAACA 960 

TTTGCTCCAA CGATGGGCAG AGAATGTTTG AGGCAGCAGC CGTTGGCAGC CTGCTGGCTG 1020 

GAGGACCCGT TGTTGCCATC TTAGGCATGA TTTATAATGT AATTATTCTG GGACCAACAG 1080 

GCTTCCTGGG ATCAGCTGTT TTTATCCTCT TTTACCCAGC AATGATGTTT GCATCACGGC 1140 

TCACAGCATA TTTCAGGAGA AAATGCGTGG CCGCCACGGA TGAACGTGTC CAGAAGATGA 1200 

ATGAAGTTCT TACTTACATT AAATTTATCA AAATGTATGC CTGGGTCAAA GCATTTTCTC 1260 

AGAGTGTTCA AAAAATCCGC GAGGAGGAGC GTCGGATATT GGAAAAAGCC GGGTACTTCC 1320 

AGGGTATCAC TGTGGGTGTG GCTCCCATTG TGGTGGTGAT TGCCAGCGTG GTGACCTTCT 1380 

CTGTTCATAT GACCCTGGGC TTCGATCTGA CAGCAGCACA GGCTTTCACA GTGGTGACAG 1440 

TCTTCAATTC CATGACTTTT GCTTTGAAAG TAACACCGTT TTCAGTAAAG TCCCTCTCAG 1500 

AAGCCTCAGT GGCTGTTGAC AGATTTAAGA GTTTGTTTCT AATGGAAGAG GTTCACATGA 1560 

TAAAGAACAA ACCAGCCAGT CCTCACATCA AGATAGAGAT GAAAAATGCC ACCTTGGCAT 1620 

GGGACTCCTC CCACTCCAGT ATCCAGAACT CGCCCAAGCT GACCCCCAAA ATGAAAAAAG 1680 

ACAAGAGGGC TTCCAGGGGC AAGAAAGAGA AGGTGAGGCA GCTGCAGCGC ACTGAGCATC 1740 

AGGCGGTGCT GGCAGAGCAG AAAGGCCACC TCCTCCTGGA CAGTGACGAG CGGCCCAGTC 1800 

CCGAAGAGGA AGAAGGCAAG CACATCCACC TGGGCCACCT GCGCTTACAG AGGACACTGC 1860 

ACAGCATCGA TCTGGAGATC CAAGAGGGTA AACTGGTTGG AATCTGCGGC AGTGTGGGAA 1920 

GTGGAAAAAC CTCTCTCATT TCAGCCATTT TAGGCCAGAT GACGCTTCTA GAGGGCAGCA 1980 

TTGCAATCAG TGGAACCTTC GCTTATGTGG CCCAGCAGGC CTGGATCCTC AATGCTACTC 2040 

TGAGAGACAA CATCCTGTTT GGGAAGGAAT ATGATGAAGA AAGATACAAC TCTGTGCTGA 2100 

ACAGCTGCTG CCTGAGGCCT GACCTGGCCA TTCTTCCCAG CAGCGACCTG ACGGAGATTG 2160 

GAGAGCGAGG AGCCAACCTG AGCGGTGGGC AGCGCCAGAG GATCAGCCTT GCCCGGGCCT 2220 

TGTATAGTGA CAGGAGCATC TACATCCTGG ACGACCCCCT CAGTGCCTTA GATGCCCATG 2280 

TGGGCAACCA CATCTTCAAT AGTGCTATCC GGAAACATCT CAAGTCCAAG ACAGTTCTGT 2340 

TTGTTACCCA CCAGTTACAG TACCTGGTTG ACTGTGATGA AGTGATCTTC ATGAAAGAGG 2400 

GCTGTATTAC GGAAAGAGGC ACCCATGAGG AACTGATGAA TTTAAATGGT GACTATGCTA 2460 

CCATTTTTAA TAACCTGTTG CTGGGAGAGA CACCGCCAGT TGAGATCAAT TCAAAAAAGG 2520 

AAACCAGTGG TTCACAGAAG AAGTCACAAG ACAAGGGTCC TAAAACAGGA TCAGTAAAGA 2580 

AGGAAAAAGC AGTAAAGCCA GAGGAAGGGC AGCTTGTGCA GCTGGAAGAG AAAGGGCAGG 2640 

GTTCAGTGCC CTGGTCAGTA TATGGTGTCT ACATCCAGGC TGCTGGGGGC CCCTTGGCAT 2700 

TCCTGGTTAT TATGGCCCTT TTCATGCTGA ATGTAGGCAG CACCGCCTTC AGCACCTGGT 2760 

GGTTGAGTTA CTGGATCAAG CAAGGAAGCG GGAACACCAC TGTGACTCGA GGGAACGAGA 2820 

CCTCGGTGAG TGACAGCATG AAGGACAATC CTCATATGCA GTACTATGCC AGCATCTACG 2880 

CCCTCTCCAT GGCAGTCATG CTGATCCTGA AAGCCATTCG AGGAGTTGTC TTTGTCAAGG 2940 

GCACGCTGCG AGCTTCCTCC CGGCTGCATG ACGAGCTTTT CCGAAGGATC CTTCGAAGCC 3000 

CTATGAAGTT TTTTGACACG ACCCCCACAG GGAGGATTCT CAACAGGTTT TCCAAAGACA 3060 

TGGATGAAGT TGACGTGCGG CTGCCGTTCC AGGCCGAGAT GTTCATCCAG AACGTTATCC 3120 

TGGTGTTCTT CTGTGTGGGA ATGATCGCAG GAGTCTTCCC GTGGTTCCTT GTGGCAGTGG 3180 

GGCCCCTTGT CATCCTCTTT TCAGTCCTGC ACATTGTCTC CAGGGTCCTG ATTCGGGAGC 3240 

TGAAGCGTCT GGACAATATC ACGCAGTCAC CTTTCCTCTC CCACATCACG TCCAGCATAC 3300 

AGGGCCTTGC CACCATCCAC GCCTACAATA AAGGGCAGGA GTTTCTGCAC AGATACCAGG 3360 

AGCTGCTGGA TGACAACCAA GCTCCTTTTT TTTTGTTTAC GTGTGCGATG CGGTGGCTGG 3420 

CTGTGCGGCT GGACCTCATC AGCATCGCCC TCATCACCAC CACGGGGCTG ATGATCGTTC 3480 

TTATGCACGG GCAGATTCCC CCAGCCTATG CGGGTCTCGC CATCTCTTAT GCTGTCCAGT 3540 

TAACGGGGCT GTTCCAGTTT ACGGTCAGAC TGGCATCTGA GACAGAAGCT CGATTCACCT 3600 

CGGTGGAGAG GATCAATCAC TACATTAAGA CTCTGTCCTT GGAAGCACCT GCCAGAATTA 3660 

AGAACAAGGC TCCCTCCCCT GACTGGCCCC AGGAGGGAGA GGTGACCTTT GAGAACGCAG 3720 

AGATGAGGTA CCGAGAAAAC CTCCCTCTTG TCCTAAAGAA AGTATCCTTC ACGATCAAAC 3780 

CTAAAGAGAA GATTGGCATT GTGGGGCGGA CAGGATCAGG GAAGTCCTCG CTGGGGATGG 3840 

CCCTCTTCCG TCTGGTGGAG TTATCTGGAG GCTGCATCAA GATTGATGGA GTGAGAATCA 3900 

GTGATATTGG CCTTGCCGAC CTCCGAAGCA AACTCTCTAT CATTCCTCAA GAGCCGGTGC 3960 

TGTTCAGTGG CACTGTCAGA TCAAATTTGG ACCCCTTCAA CCAGTACACT GAAGACCAGA 4020 

TTTGGGATGC CCTGGAGAGG ACACACATGA AAGAATGTAT TGCTCAGCTA CCTCTGAAAC 4080 

TTGAATCTGA AGTGATGGAG AATGGGGATA ACTTCTCAGT GGGGGAACGG CAGCTCTTGT 4140 

GCATAGCTAG AGCCCTGCTC CGCCACTGTA AGATTCTGAT TTTAGATGAA GCCACAGCTG 4200 

CCATGGACAC AGAGACAGAC TTATTGATTC AAGAGACCAT CCGAGAAGCA TTTGCAGACT 4260 

GTACCATGCT GACCATTGCC CATCGCCTGC ACACGGTTCT AGGCTCCGAT AGGATTATGG 4320 

TGCTGGCCCA GGGACAGGTG GTGGAGTTTG ACACCCCATC GGTCCTTCTG TCCAACGACA 4380 

GTTCCCGATT CTATGCCATG TTTGCTGCTG CAGAGAACAA GGTCGCTGTC AAGGGCTGAC 4440 

TCCTCCCTGT TGACGAAGTC TCTTTTCTTT AGAGCATTGC CATTCCCTGC CTGGGGCGGG 4500 

CCCCTCATCG CGTCCTCCTA CCGAAACCTT GCCTTTCTCG ATTTTATCTT TCGCACAGCA 4560 

GTTCCGGATT GGGTTGTGTG TTTCACTTTT AGGGAGAGTC ATATTTTGAT TATTGTATTT 4620 

ATTCCATATT CATGTAAACA AAATTTAGTT TTTGTTCTTA ATTGCACTCT AAAAGGTTCA 4680 

GGGAACCGTT ATTATAATTG TATCAGAGGC CTATAATGAA GCTTTATACG TGTAGCTATA 4740 

TCTATATATA ATTCTGTACA TAGCCTATAT TTACAGTGAA AATGTAAGCT GTTTATTTTA 4800 

TATTAAAATA AGCACTGTGC TAATAACAGT GCATATTCCT TTCTATCATT TTTGTACAGT 4860 

TTGCTGTACT AGAGATCTGG TTTTGCTATT AGACTGTAGG AAGAGTAGCA TTTCATTCTT 4920 

CTCTAGCTGG TGGTTTCACG GTGCCAGGTT TTCTGGGTGT CCAAAGGAAG ACGTGTGGCA 4980 

ATAGTGGGCC CTCCGACAGC CCCCTCTGCC GCCTCCCCAC AGCCGCTCCA GGGGTGGCTG 5040 

GAGACGGGTG GGCGGCTGGA GACCATGCAG AGCGCCGTGA GTTCTCAGGG CTCCTGCCTT 5100 

CTGTCCTGGT GTCACTTACT GTTTCTGTCA GGAGAGCAGC GGGGCGAAGC CCAGGCCCCT 5160 

TTTCACTCCC TCCATCAAGA ATGGGGATCA CAGAGACATT CCTCCGAGCC GGGGAGTTTC 5220 

TTTCCTGCCT TCTTCTTTTT GCTGTTGTTT CTAAACAAGA ATCAGTCTAT CCACAGAGAG 5280 

TCCCACTGCC TCAGGTTCCT ATGGCTGGCC ACTGCACAGA GCTCTCCAGC TCCAAGACCT 5340 

GTTGGTTCCA AGCCCTGGAG CCAACTGCTG CTTTTTGAGG TGGCACTTTT TCATTTGCCT 5400 

ATTCCCACAC CTCCACAGTT CAGTGGCAGG GCTCAGGATT TCGTGGGTCT GTTTTCCTTT 5460 

CTCACCGCAG TCGTCGCACA GTCTCTCTCT CTCTCTCCCC TCAAAGTCTG CAACTTTAAG 5520 

CAGCTCTTGC TAATCAGTGT CTCACACTGG CGTAGAAGTT TTTGTACTGT AAAGAGACCT 5580 

ACCTCAGGTT GCTGGTTGCT GTGTGGTTTG GTGTGTTCCC GCAAACCCCC TTTGTGCTGT 5640 

GGGGCTGGTA GCTCAGGTGG GCGTGGTCAC TGCTGTCATC AGTTGAATGG TCAGCGTTGC 5700 
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ATGTCGTGAC CAACTAGACA TTCTGTCGCC TTAGCATGTT TGCTGAACAC CTTGTGGAAG 5760 
CAAAAATCTG AAAATGTGAA TAAAATTATT TTGGATTTTG TAAAAAAAAA AAAAAAAAAA 5820 
AAAAAAAAAA AAAAAAAA 



Seq ID NO: 319 Protein sequence: 
Protein Accession #: NP_005679 

1 11 21 31 41 51 

I I I I I I 

MKDIDIGKEY IIPSPGYRSV RERTSTSGTH RDREDSKFRR TRPLECQDAL ETAARAEGLS 60 

LDASMHSQLR ILDEEHPKGK YHHGLSALKP IRTTSKHQHP VDNAGLFSCM TFSWLSSLAR 120 

VAHKKGELSM EDVWSLSKHE SSDVNCRRLE RLWQEELNEV GPDAASLRRV VWIFCRTRLI 180 

LSIVCLMITQ LAGFSGPAFM VKHLLEYTQA TESNLQYSLL LVLGLLLTEI VRSWSLALTW 240 

ALNYRTGVRL RGAILTMAFK KILKLKNIKE KSLGELINIC SNDGQRMFEA AAVGSLLAGG 300 

PVVAILGMIY NVIILGPTGF LGSAVFILFY PAMMFASRLT AYFRRKCVAA TDERVQKMNE 360 

VLTYIKFIKM YAWVKAFSQS VQKIREEERR ILEKAGYFQG ITVGVAPIVV VIASVVTFSV 420 

HMTLGFDLTA AQAFTVVTVF NSMTFALKVT PFSVKSLSEA SVAVDRFKSL FLMEEVHMIK 480 

NKPASPHIKI EMKNATLAWD SSHSSIQNSP KLTPKMKKDK RASRGKKEKV RQLQRTEHQA 540 

VLAEQKGHLL LDSDERPSPE EEEGKHIHLG HLRLQRTLHS IDLEIQEGKL VGICGSVGSG 600 

KTSLISAILG QMTLLEGSIA ISGTFAYVAQ QAWILNATLR DNILFGKEYD EERYNSVLNS 660 

CCLRPDLAIL PSSDLTEIGE RGANLSGGQR QRISLARALY SDRSIYILDD PLSALDAHVG 720 

NHIFNSAIRK HLKSKTVLFV THQLQYLVDC DEVIFMKEGC ITERGTHEEL MNLNGDYATI 780 

FNNLLLGETP PVEINSKKET SGSQKKSQDK GPKTGSVKKE KAVKPEEGQL VQLEEKGQGS 840 

VPWSVYGVYI QAAGGPLAFL VIMALFMLNV GSTAFSTWWL SYWIKQGSGN TTVTRGNETS 900 

VSDSMKDNPH MQYYASIYAL SMAVMLILKA IRGVVFVKGT LRASSRLHDE LFRRILRSPM 960 

KFFDTTPTGR ILNRFSKDMD EVDVRLPFQA EMFIQNVILV FFCVGMIAGV FPWFLVAVGP 1020 

LVILFSVLHI VSRVLIRELK RLDNITQSPF LSHITSSIQG LATIHAYNKG QEFLHRYQEL 1080 

LDDNQAPFFL FTCAMRWLAV RLDLISIALI TTTGLMIVLM HGQIPPAYAG LAISYAVQLT 1140 

GLFQFTVRLA SETEARFTSV ERINHYIKTL SLEAPARIKN KAPSPDWPQE GEVTFENAEM 1200 

RYRENLPLVL KKVSFTIKPK EKIGIVGRTG SGKSSLGMAL FRLVELSGGC IKIDGVRISD 1260 

IGLADLRSKL SIIPQEPVLF SGTVRSNLDP FNQYTEDQIW DALERTHMKE CIAQLPLKLE 1320 

SEVMENGDNF SVGERQLLCI ARALLRHCKI LILDEATAAM DTETDLLIQE TIREAFADCT 1380 
MLTIAHRLHT VLGSDRIMVL AQGQVVEFDT PSVLLSNDSS RFYAMFAAAE NKVAVKG 



Seq ID NO: 320 DNA sequence 

Nucleic Acid Accession #: AK022089.1 

Coding sequence: 181-1488 

1 11 21 31 41 51 

I I I I I I 

AGCAGTTGCA CAACTTCCAG CAACTTTCTC AGCCGGCTAC TAATGAGCTG AAAGCCAGGA 60 

ACATCCGAGG AGAAGAGAAA GCTTCCAGCC CTCCTCCCTT CACCCTGGAA ATCCAGACAC 120 

CCCCACCCCC ACCCTCAGAT CACTTTAAGA TAATTTCTTT ATTCGTTTGC CCGACAGACC 180 

ATGGCTCCCT TTGGAAGAAA CTTGCTAAAG ACTCGGCATA AAAACAGATC TCCAACTAAA 240 

GACATGGATT CAGAAGAGAA GGAAATTGTG GTTTGGGTTT GCCAAGAAGA GAAGCTTGTC 300 

TGTGGGCTGA CTAAACGCAC CACCTCTGCT GATGTCATCC AGGCTTTGCT TGAGGAACAT 360 

GAGGCTACGT TTGGAGAGAA ACGATTTCTT CTGGGGAAGC CCAGTGATTA CTGCATCATA 420 

GAGAAGTGGA GAGGCTCCGA AAGGGTTCTT CCTCCACTAA CTAGAATCCT GAAGCTTTGG 480 

AAAGCGTGGG GAGATGAGCA GCCCAATATG CAATTTGTTT TGGTTAAAGC AGATGCTTTT 540 

CTTCCAGTTC CTTTGTGGCG GACAGCTGAA GCCAAATTAG TGCAAAACAC AGAAAAATTG 600 

TGGGAGCTCA GCCCAGCAAA CTACATGAAG ACTTTACCAC CAGATAAACA AAAAAGAATA 660 

GTCAGGAAAA CTTTCCGGAA ACTGGCTAAA ATTAAGCAGG ACACAGTTTC TCATGATCGA 720 

GATAATATGG AGACATTAGT TCATCTGATC ATTTCCCAGG ACCATACTAT TCATCAGCAA 780 

GTCAAGAGAA TGAAAGAGCT GGATCTGGAA ATTGAAAAGT GTGAAGCTAA GTTCCATCTT 840 

GATCGAGTAG AAAATGATGG AGAAAACTAT GTTCAGGATG CATATTTAAT GCCCAGTTTC 900 

AGTGAAGTTG AGCAAAATCT AGACTTGCAG TATGAGGAAA ACCAGACTCT GGAGGACCTG 960 

AGCGAAAGTG ATGGAATTGA ACAGCTGGAA GAACGACTGA AATATTACCG AATACTCATT 1020 

GATAAGCTCT CTGCTGAAAT AGAAAAAGAG GTAAAAAGTG TTTGCATTGA TATAAATGAA 1080 

GATGCGGAAG GGGAAGCTGC AAGTGAACTG GAAAGCTCTA ATTTAGAGAG TGTTAAGTGT 1140 

GATTTGGAGA AAAGCATGAA AGCTGGTTTG AAAATTCACT CTCATTTGAG TGGCATCCAG 1200 

AAAGAGATTA AATACAGTGA CTCATTGCTT CAGATGAAAG CAAAAGAATA TGAACTCCTG 1260 

GCCAAGGAAT TCAATTCACT TCACATTAGC AACAAAGATG GGTGCCAGTT AAAGGAAAAC 1320 

AGAGCGAAGG AATCTGAGGT TCCCAGTAGC AATGGGGAGA TTCCTCCCTT TACTCAAAGA 1380 

GTATTTAGCA ATTACACAAA TGACACAGAC TCGGACACTG GTATCAGTTC TAACCACAGT 1440 

CAGGACTCCG AAACAACAGT AGGAGATGTG GTGCTGTTGT CAACATAGTT CCAATGGCTC 1500 

CTTTCTGACC TGCTTTCATG TTTTAATGTT TGTTTAATTT AATAGGAAAC CTCATTTTAA 1560 

ATATAACACT CAAAAAAATG TAAATCATAT TGTAGTATTC AATAGTTAAT AAAAACTCGA 1620 
GAAATGTGTT GTTTCTG 

Seq ID NO: 321 Protein sequence: 
Protein Accession #: NP_005438.1 

1 11 21 31 41 51 

I I I I I I 

MAPFGRNLLK TRHKNRSPTK DMDSEEKEIV VWVCQEEKLV CGLTKRTTSA DVIQALLEEH 60 

EATFGEKRFL LGKPSDYCII EKWRGSERVL PPLTRILKLW KAWGDEQPNM QFVLVKADAF 120 

LPVPLWRTAE AKLVQNTEKL WELSPANYMK TLPPDKQKRI VRKTFRKLAK IKQDTVSHDR 180 

DNMETLVHLI ISQDHTIHQQ VKRMKELDLE IEKCEAKFHL DRVENDGENY VQDAYLMPSF 240 

SEVEQNLDLQ YEENQTLEDL SESDGIEQLE ERLKYYRILI DKLSAEIEKE VKSVCIDINE 300 

DAEGEAASEL ESSNLESVKC DLEKSMKAGL KIHSHLSGIQ KEIKYSDSLL QMKAKEYELL 360 

AKEFNSLHIS NKDGCQLKEN RAKESEVPSS NGEIPPFTQR VFSNYTNDTD SDTGISSNHS 420 
QDSETTVGDV VLLST 

Seq ID NO: 322 DNA sequence 

Nucleic Acid Accession #: NM_030920.1 



Coding sequence: 317-1123 



1 11 21 31 41 51 

I I I I I I 

AGCATTGAAG GGGAAGGAAC TGCGGGTGTG GTGTGTGTAT GTGTGTGTGT ATGTGTGTGC 60 

GGCGCGTGCG TGCGTGTGTG TGCGCGCGCT AGTGTGTGGA CAAGGAGGTG GGGGCAGCTG 120 

AGTTAGAGTC CCAACTCTTG GACTCCATTT GCTATTCTCT TCTTTCTCCC CCACACCTAT 180 

CTGGTGGTGG TAGTGGGCGT TTATATTTGC GTTCCTTTTC ATTCATTTCT AAATCTCTTA 240 

AAAATTTTGG GTTGGGGGTA TTGGGGAAGG CAGGAAAGGG AAAAGGAGAG TAGTAGCTGA 300 

AGAGCAAGAG GAGGACATGG AGATGAAGAA GAAGATTAAC CTGGAGTTAA GGAACAGATC 360 

CCCGGAGGAG GTGACAGAGT TAGTCCTTGA TAATTGCCTG TGTGTCAATG GGGAAATTGA 420 

AGGCCTGAAT GATACTTTCA AAGAACTAGA ATTTCTGAGT ATGGCTAATG TGGAACTAAG 480 

TTCGCTGGCC CGGCTTCCCA GCTTAAATAA ACTTCGAAAA TTGGAGCTTA GTGATAATAT 540 

AATTTCTGGA GGCTTGGAAG TCCTGGCAGA GAAATGTCCA AATCTTACCT ACCTCAATCT 600 

GAGTGGAAAC AAAATAAAAG ATCTCAGTAC AGTAGAAGCT CTGCAAAATC TTAAAAATTT 660 

GAAAAGTCTT GACCTGTTTA ACTGTGAGAT CACAAACCTG GAAGATTATA GAGAAAGTAT 720 

TTTTGAACTA CTGCAGCAAA TCACATACTT AGATGGATTT GATCAGGAGG ATAATGAAGC 780 

GCCGGACTCT GAAGAGGAGG ATGATGAGGA TGGAGATGAA GATGATGAAG AGGAAGAGGA 840 

AAATGAAGCT GGTCCACCGG AAGGATATGA GGAAGAGGAG GAGGAAGAGG AAGAGGAGGA 900 

TGAGGATGAG GATGAAGATG AAGATGAAGC AGGTTCAGAG TTGGGAGAGG GAGAAGAGGA 960 

AGTGGGCCTC TCATACTTAA TGAAAGAAGA AATTCAGGAT GAAGAAGATG ATGATGACTA 1020 

TGTTGAAGAA GGGGAAGAAG AGGAAGAAGA GGAAGAAGGA GGTCTTCGAG GGGAGAAGAG 1080 

GAAACGAGAT GCTGAAGACG ATGGAGAGGA AGAAGATGAC TAGATCATTC TAAGACCAGA 1140 

TTCTCTAATG TTTCTGGGTG TGCAATAGAG TGATCACATC TTTGTTTCTT CATGTACGAT 1200 

AGCTATCCCT ACAGAAGATA ATGTGTAACT TTTTATAGGA AAAGTGTGGT TTTACTATTT 1260 

TTGCCTTATC ATTCCAAATA AGAACTAGTC TGTTAATGAT CATATTGTAT GTAGAGAAAA 1320 

ATTTTCATTG ACTCCCATTG TGGAATTCCC TAGCAATTTA TTTAGACTTA ATTTTTTAAA 1380 

TTCAAGCTTA CTGTATTAGT CATTTTTAGC CCATAATTAA AACATGATCA CTTTTAAACA 1440 

GGTGTAGTAT GGTGCATTTC ATTCCTTATT TATAGATTAA CTGAAATTAC AGTTTGCTAT 1500 

AATATAAAAT GACAATAGTC TCTTGAGTGG TAAGTTGGTT ATTTTTTTAG AGGTGATCCA 1560 

GGAATCTTTA GTTTGAAGGC AGTTACCTTT TTTTTTTTTT TTTTTTTTTG ACTAAGAGTG 1620 

TTTGGTTGCT TTTTTGTCAC AAGTAACTTG GAAAATAGAA GCAGAATAGT AAAGGTTCTA 1680 

TTCAGCAACA TAGTTCATGG ATTTTGTGGA GGTTCTATTC AGTAATATGG TTCATCGATT 1740 

TAGTGGTGAC TGATAAGATT TTATTTTTGA AGGAAAAATT GCTTATACTA AGTCCAGAGA 1800 

CATGCAGGTG AGCCCTTTTG TCAGGCTGCA AATCATGACA TGCCGATGGT TGTTTATTTT 1860 

GTTTTTAGGT GTGCATTCTT TTTCTTCTTA GCAATTCCTT TATGATCACC TTCCCTTCTT 1920 

GTTTCACTCC [illegible] CTCAAAAGGA ACTTGGGAAA CTTGTGAAAC CCAGGAAAAC 1980 

CTTTAGTCTT ATACCTCAAC TACGTTTCAG TCCTGTCTGG GTTTTAAATA AGTGAAGTAG 2040 

AAGAAATTGA GTATTTTCTG ACATAAGAAT ATATTATCAA TACAGTTTTA TGCAGTAAGC 2100 

TCTCCTTACC ATAAATGTTT CTTGGTTGAC AACATCTAAG ACAATATTAG TGGGATGAAG 2160 

AAAGAAAAGC AGGGGTGCTT TTGGAAGCAG TGTTAGTGTT CCTCAAAAGT CGGAACAATT 2220 

GCCTGTTGAT ATATTAATAA GACATTAAAG TCAAATTTTA ATGTTGGCCT CTCAAATGAT 2280 

TTGGATACCA CTCTGCAAAG TATTTCTAAC CTTTAATTCC CAGTTTTAAA ACAGATATAA 2340 

TAATAGCATT TAATTGGAAT ATACTAGGCA GCTGGAAAAG TATTTGAAAC TAAATTGACA 2400 

TTAAAATTAA GATTTGTTTT CAAGTGGATG TCCATTAAAA GTAGAAAAAT ATTTGGGATA 2460 

AGTGAGTGTG TGTTTCCTTA CATGGCTACT AAATAAAATA TAATGAGTAT ACAAGTATAT 2520 

CTCCTCTTTT GCTATGGAGG CTCCATGTTC AAGGCAATGG CTTTTTAAAT CTTGGCTATC 2580 

TAAAATTTTT TCCCTTTGTT TTGAATATTT GTAAGTTTTT AAGAAGTTAG TGTCAGCAAA 2640 

TTAATTGAAG TTATGCTTCT ATACTGGGAC ATATTTAAAT ACTGAGTATA GTACTGCTGC 2700 

TACTGCTTCT ACAATGTAAA ATGTATGACT TGGTGTTTTA AAGTAAAAAT TATGATGTTA 2760 

CTTGTGGAGA AACTAAAAAT GTTGTACAAC TGACCGAAAG AAAACCCTTG GGGATAAGTT 2820 

TAGTGAGGGG ATTGGAATCC CCAAAAAGAT AACATTTTTC TTCTGCTTTT AAAAACTGAA 2880 

ATTCCCTGTT CTAGTTCCTA ACAATTCTCA TTACATACTA TGCCAGATTA CAAAATACTT 2940 

ATTTTTAAAA TGAAATCTAT ATATTGACTT TCTTATCAAT CATCTTACTG TGCAATCAAA 3000 

ATTAGAGTAC TTTGGTTTGA AAACAACACT TAGAGCCTCC AGATAACTTT TAAGACTTAT 3060 

TTAGCTTTGT GGGTGGTATT TTCATGCAAA TAAGTAAGGG TGGGTTTTAT ATTTTGTAGA 3120 

AGTTTTCGGT CCTATTTTAA TGCTCTTTGT ATGGCAGTAT GTATATATTG TGTTAAGTTC 3180 

CTCAAGAATC TCCTTAAAAA CTTTGAAGTT AATACTTTTG TGCAACTGTG TTTTGAATAA 3240 

AGCCATGACA GTGTTAAAAA CAAAC 

Seq ID NO: 323 Protein sequence: 
Protein Accession #: NP_112182.1 

1 11 21 31 41 51 

I I I I I I 

MEMKKKINLE LRNRSPEEVT ELVLDNCLCV NGEIEGLNDT FKELEFLSMA NVELSSLARL 60 
PSLNKLRKLE LSDNIISGGL EVLAEKCPNL TYLNLSGNKI KDLSTVEALQ NLKNLKSLDL 120 
FNCEITNLED YRESIFELLQ QITYLDGFDQ EDNEAPDSEE EDDEDGDEDD EEEEENEAGP 180 
PEGYEEEEEE EEEEDEDEDE DEDEAGSELG EGEEEVGLSY LMKEEIQDEE DDDDYVEEGE 240 
EEEEEEEGGL RGEKRKRDAE DDGEEEDD 

Seq ID NO: 324 DNA sequence 
Nucleic Acid Accession #: NM_003812 
Coding sequence: 224..2722 

1 11 21 31 41 51 

I I I I I I 

TCCTCTGCGT CCCGCCCCGG GAGTGGCTGC GAGGCTAGGC GAGCCGGGAA AGGGGGCGCC 60 

GCCCAGCCCC GAGCCCCGCG CCCCGTGCCC CGAGCCCGGA GCCCCCTGCC CGCGGCGGCA 120 

CCATGCGCGC CGAGCCGGCG TGACCGGCTC CGCCCGCGGC CGCCCCGCAG CTAGCCCGGC 180 

GCTCTCGCCG GCCACACGGA GCGGCGCCCG GGAGCTATGA GCCATGAAGC CGCCCGGCAG 240 

CAGCTCGCGG CAGCCGCCCC TGGCGGGCTG CAGCCTTGCC GGCGCTTCCT GCGGCCCCCA 300 

ACGCGGCCCC GCCGGCTCGG TGCCTGCCAG CGCCCCGGCC CGCACGCCGC CCTGCCGCCT 360 

GCTTCTCGTC CTTCTCCTGC TGCCTCCGCT CGCCGCCTCG TCCCGGCCCC GCGCCTGGGG 420 

GGCTGCTGCG CCCAGCGCTC CGCATTGGAA TGAAACTGCA GAAAAAAATT TGGGAGTCCT 480 

GGCAGATGAA GACAATACAT TGCAACAGAA TAGCAGCAGT AATATCAGTT ACAGCAATGC 540 

AATGCAGAAA GAAATCACAC TGCCTTCAAG ACTCATATAT TACATCAACC AAGACTCGGA 600 

AAGCCCTTAT CACGTTCTTG ACACAAAGGC AAGACACCAG CAAAAACATA ATAAGGCTGT 660 

CCATCTGGCC CAGGCAAGCT TCCAGATTGA AGCCTTCGGC TCCAAATTCA TTCTTGACCT 720 

CATACTGAAC AATGGTTTGT TGTCTTCTGA TTATGTGGAG ATTCACTACG AAAATGGGAA 780 

ACCACAGTAC TCTAAGGGTG GAGAGCACTG TTACTACCAT GGAAGCATCA GAGGCGTCAA 840 

AGACTCCAAG GTGGCTCTGT CAACCTGCAA TGGACTTCAT GGCATGTTTG AAGATGATAC 900 

CTTCGTGTAT ATGATAGAGC CACTAGAGCT GGTTCATGAT GAGAAAAGCA CAGGTCGACC 960 

ACATATAATC CAGAAAACCT TGGCAGGACA GTATTCTAAG CAAATGAAGA ATCTCACTAT 1020 

GGAAAGAGGT GACCAGTGGC CCTTTCTCTC TGAATTACAG TGGTTGAAAA GAAGGAAGAG 1080 

AGCAGTGAAT CCATCACGTG GTATATTTGA AGAAATGAAA TATTTGGAAC TTATGATTGT 1140 

TAATGATCAC AAAACGTATA AGAAGCATCG CTCTTCTCAT GCACATACCA ACAACTTTGC 1200 

AAAGTCCGTG GTCAACCTTG TGGATTCTAT TTACAAGGAG CAGCTCAACA CCAGGGTTGT 1260 

CCTGGTGGCT GTAGAGACCT GGACTGAGAA GGATCAGATT GACATCACCA CCAACCCTGT 1320 

GCAGATGCTC CATGAGTTCT CAAAATACCG GCAGCGCATT AAGCAGCATG CTGATGCTGT 1380 

GCACCTCATC TCGCGGGTGA CATTTCACTA TAAGAGAAGC AGTCTGAGTT ACTTTGGAGG 1440 

TGTCTGTTCT CGCACAAGAG GAGTTGGTGT GAATGAGTAT GGTCTTCCAA TGGCAGTGGC 1500 

ACAAGTATTA TCGCAGAGCC TGGCTCAAAA CCTTGGAATC CAATGGGAAC CTTCTAGCAG 1560 

AAAGCCAAAA TGTGACTGCA CAGAATCCTG GGGTGGCTGC ATCATGGAGG AAACAGGGGT 1620 

GTCCCATTCT CGAAAATTTT CAAAGTGCAG CATTTTGGAG TATAGAGACT TTTTACAGAG 1680 

AGGAGGTGGA GCCTGCCTTT TCAACAGGCC AACAAAGCTA TTTGAGCCCA CGGAATGTGG 1740 

AAATGGATAC GTGGAAGCTG GGGAGGAGTG TGATTGTGGT TTTCATGTGG AATGCTATGG 1800 

ATTATGCTGT AAGAAATGTT CCCTCTCCAA CGGGGCTCAC TGCAGCGACG GGCCCTGCTG 1860 

TAACAATACC TCATGTCTTT TTCAGCCACG AGGGTATGAA TGCCGGGATG CTGTGAACGA 1920 

GTGTGATATT ACTGAATATT GTACTGGAGA CTCTGGTCAG TGCCCACCAA ATCTTCATAA 1980 

GCAAGACGGA TATGCATGCA ATCAAAATCA GGGCCGCTGC TACAATGGCG AGTGCAAGAC 2040 

CAGAGACAAC CAGTGTCAGT ACATCTGGGG AACAAAGGCT GCAGGGTCTG ACAAGTTCTG 2100 

CTATGAAAAG CTGAATACAG AAGGCACTGA GAAGGGAAAC TGCGGGAAGG ATGGAGACCG 2160 

GTGGATTCAG TGCAGCAAAC ATGATGTGTT CTGTGGATTC TTACTCTGTA CCAATCTTAC 2220 

TCGAGCTCCA CGTATTGGTC AACTTCAGGG TGAGATCATT CCAACTTCCT TCTACCATCA 2280 

AGGCCGGGTG ATTGACTGCA GTGGTGCCCA TGTAGTTTTA GATGATGATA CGGATGTGGG 2340 

CTATGTAGAA GATGGAACGC CATGTGGCCC GTCTATGATG TGTTTAGATC GGAAGTGCCT 2400 

ACAAATTCAA GCCCTAAATA TGAGCAGCTG TCCACTCGAT TCCAAGGGTA AAGTCTGTTC 2460 

GGGCCATGGG GTGTGTAGTA ATGAAGCCAC CTGCATTTGT GATTTCACCT GGGCAGGGAC 2520 

AGATTGCAGT ATCCGGGATC CAGTTAGGAA CCTTCACCCC CCCAAGGATG AAGGACCCAA 2580 

GGGTCCTAGT GCCACCAATC TCATAATAGG CTCCATCGCT GGTGCCATCC TGGTAGCAGC 2640 

TATTGTCCTT GGGGGCACAG GCTGGGGATT TAAAAATGTC AAGAAGAGAA GGTTCGATCC 2700 

TACTCAGCAA GGCCCCATCT GAATCAGCTG CGCTGGATGG ACACCGCCTT GCACTGTTGG 2760 

ATTCTGGGTA TGACATACTC GCAGCAGTGT TACTGGAACT ATTAAGTTTG TAAACAAAAC 2820 

CTTTGGGTGG TAATGACTAC GGAGCTAAAG TTGGGGTGAC AAGGATGGGG TAAAAGAAAA 2880 

CTGTCTCTTT TGGAAATAAT GTCAAAGAAC ACCTTTCACC ACCTGTCAGT AAACGGGGGA 2940 

GGGGGCAAAA GACCATGCTA TAAAAAGAAC TGTTCCAGAA TCTTTTTTTT TCCCTAATGG 3000 

ACGAAGGAAC AACACACACA CAAAAATTAA ATGCAATAAA GGAATCATTA AAAA 



Seq ID NO: 325 Protein sequence: 
Protein Accession #: NP_003803 



1 11 21 31 41 51 

I I I I I I 

MKPPGSSSRQ PPLAGCSLAG ASCGPQRGPA GSVPASAPAR TPPCRLLLVL LLLPPLAASS 60 

RPRAWGAAAP SAPHWNETAE KNLGVLADED NTLQQNSSSN ISYSNAMQKE ITLPSRLIYY 120 

INQDSESPYH VLDTKARHQQ KHNKAVHLAQ ASFQIEAFGS KFILDLILNN GLLSSDYVEI 180 

HYENGKPQYS KGGEHCYYHG SIRGVKDSKV ALSTCNGLHG MFEDDTFVYM IEPLELVHDE 240 

KSTGRPHIIQ KTLAGQYSKQ MKNLTMERGD QWPFLSELQW LKRRKRAVNP SRGIFEEMKY 300 

LELMIVNDHK TYKKHRSSHA HTNNFAKSVV NLVDSIYKEQ LNTRVVLVAV ETWTEKDQID 360 

ITTNPVQMLH EFSKYRQRIK QHADAVHLIS RVTFHYKRSS LSYFGGVCSR TRGVGVNEYG 420 

LPMAVAQVLS QSLAQNLGIQ WEPSSRKPKC DCTESWGGCI MEETGVSHSR KFSKCSILEY 480 

RDFLQRGGGA CLFNRPTKLF EPTECGNGYV EAGEECDCGF HVECYGLCCK KCSLSNGAHC 540 

SDGPCCNNTS CLFQPRGYEC RDAVNECDIT EYCTGDSGQC PPNLHKQDGY ACNQNQGRCY 600 

NGECKTRDNQ CQYIWGTKAA GSDKFCYEKL NTEGTEKGNC GKDGDRWIQC SKHDVFCGFL 660 

LCTNLTRAPR IGQLQGEIIP TSFYHQGRVI DCSGAHVVLD DDTDVGYVED GTPCGPSMMC 720 

LDRKCLQIQA LNMSSCPLDS KGKVCSGHGV CSNEATCICD FTWAGTDCSI RDPVRNLHPP 780 
KDEGPKGPSA TNLIIGSIAG AILVAAIVLG GTGWGFKNVK KRRFDPTQQG PI 



Seq ID NO: 326 DNA sequence 

Nucleic Acid Accession #: AK074418.1 

Coding sequence: 244-1515 

1 11 21 31 41 51 

I I I I I I 

CTTTCTCCAA GACGGCCGGC CATGCTCTCC TCCTCTGCCA GTCTCCTCCA CCACTCTCTA 60 

ACCTGAGAGC CTGTGGAACC TGCCCGTCTC CCCTCCTCCA TCAGACACAC CTGCCTAGGA 120 

AACAGATGGA AAAAGTGAGG GACCGGTGAG TGACTTGCTG CTAAAGTTTA TACCAGATGC 180 

AAATGACAGA GCTGGAGTTC TGCTGTGCCT GGAAAGGACC TCGGAAGTCT TCTAAGGAGA 240 

GTCATGGCGT ATTACCAGGA GCCTTCAGTG GAGACCTCCA TCATCAAGTT CAAAGACCAG 300 

GACTTTACCA CCTTGCGGGA TCACTGCCTG AGCATGGGCC GGACGTTTAA GGATGAGACA 360 

TTCCCCGCAG CAGATTCTTC CATAGGCCAG AAGCTGCTCC AGGAAAAACG CCTCTCCAAT 420 

GTGATATGGA AGCGGCCACA GGATCTACCA GGGGGTCCTC CTCACTTCAT CCTGGATGAT 480 

ATAAGCAGAT TTGACATCCA ACAAGGAGGC GCAGCTGACT GCTGGTTCCT GGCAGCACTG 540 

GGATCCTTGA CTCAGAACCC ACAGTACAGG CAGAAGATCC TGATGGTCCA AAGCTTTTCA 600 

CACCAGTATG CTGGCATTTT CCGTTTCCGG TTCTGGCAAT GTGGCCAGTG GGTGGAAGTG 660 

GTGATTGATG ACCGCCTACC TGTCCAGGGA GATAAATGCC TCTTTGTGCG TCCTCGCCAC 720 

CAAAACCAAG AGTTCTGGCC CTGCCTGCTG GAGAAGGCCT ATGCCAAGCT GCTCGGATCC 780 

TATTCCGATC TGCACTATGG CTTCCTCGAG GATGCCCTGG TGGACCTCAC AGGAGGCGTG 840 

ATCACCAACA TCCATCTGCA CTCTTCCCCT GTGGACCTGG TGAAGGCAGT GAAGACAGCG 900 

ACCAAGGCAG GCTCCCTGAT AACCTGTGCC ACTCCAAGTG GGCCAACAGA TACAGCACAG 960 

GCGATGGAGA ATGGGCTGGT GAGTCTCCAT GCCTACACTG TGACTGGGGC TGAGCAGATT 1020 

CAATACCGAA GGGGCTGGGA AGAAATTATC TCCCTGTGGA ACCCCTGGGG CTGGGGCGAG 1080 

ACCGAATGGA GAGGGCGCTG GAGTGATGGG TCTCAGGAGT GGGAGGAAAC CTGTGATCCG 1140 

CGGAAAAGCC AGCTACATAA GAAACGGGAA GATGGCGAGT TTTGGATGTC GTGTCAAGAT 1200 

TTCCAACAGA AATTCATCGC CATGTTTATA TGTAGCGAAA TTCCAATTAC CCTGGACCAT 1260 

GGAAACACAC TCCACGAAGG ATGGTCCCAA ATAATGTTTA GGAAGCAAGT GATTCTAGGA 1320 

AACACTGCAG GAGGACCTCG GAATGATGCT CAATTCAACT TCTCTGTGCA AGAGCCAATG 1380 

GAAGGCACCA ATGTTGTCGT GTGCGTCACA GTTGCTGTCA CACCATCAAA TTTGAAAGCA 1440 

GAAGATGCAA AATTTCCACT CGATTTCCAA GTGATTCTGG CTGGCTCACA GAAACACTGT 1500 

CCAAAGCTCA AATAATAAAT TCCGCCGCAA CTTCACCATG ACTTACCATC TGAGCCCTGG 1560 

GAACTATGTT GTGGTTGCAC AGACACGGAG AAAATCAGCG GAGTTCTTGC TCCGAATCTT 1620 

CCTGAAAATG CCAGACAGTG ACAGGCACCT GAGCAGCCAT TTCAACCTCA GAATGAAGGG 1680 

AAGCCCTTCA GAACATGGCT CCCAACAAAG CATTTTCAAC AGATATGCTC AGCAGGTATG 1740 

GTACCTAGCA CCCAGGGGCC TTACGTGGGA TTGGAGAAAG GGGACCTGAG GGAGGGACAG 1800 

CCCTCACAGG CCCTTACTGG GATGCAGAGA GGAGAAGTGA CTTGATGGAC TATTTTACCT 1860 

GCCTCTCTTC CTGGATCGTC TCCAGAACTG CTGTGGCTGC CAAGCTCGGT AGAGACGTGG 1920 

CGCCCCACCC AGTCTCATCC GGGGGACTTC AAGCTGGAAT GCAGAGCTTA GAAAGGGAGG 1980 

GGATAATTAT GGGGTGTGAG GTGCATTGCC CTCTAAATCT TTAAACAAGC AATTGGCAGT 2040 

ACCCCGTGAA ACCTTTCCTT CTCCTACTCG GCCACCTCCC ACCAACCTGG CATCGTTCCT 2100 

CCCGGGAGCT AGCCAGCTTC AGAAAGCACA TACAGCATCC TTGCTGCCAA ACCACCTATG 2160 

TGCACACAGG ATTTCCTTAA TGGCTTAATA AACTGTTATA AAGAACTCCT TGACTTGTCA 2220 

GAATAAAATA GCTGCCAGGG GCTCTGCACA ATGAGCCTCT TACCGTTAAA AAAAAAAAAA 2280 

AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 

Seq ID NO: 327 Protein sequence: 
Protein Accession #: BAB85075.1 

1 11 21 31 41 51 

I I I I I I 

MAYYQEPSVE TSIIKFKDQD FTTLRDHCLS MGRTFKDETF PAADSSIGQK LLQEKRLSNV 60 

IWKRPQDLPG GPPHFILDDI SRFDIQQGGA ADCWFLAALG SLTQNPQYRQ KILMVQSFSH 120 

QYAGIFRFRF WQCGQWVEVV IDDRLPVQGD KCLFVRPRHQ NQEFWPCLLE KAYAKLLGSY 180 

SDLHYGFLED ALVDLTGGVI TNIHLHSSPV DLVKAVKTAT KAGSLITCAT PSGPTDTAQA 240 

MENGLVSLHA YTVTGAEQIQ YRRGWEEIIS LWNPWGWGET EWRGRWSDGS QEWEETCDPR 300 

KSQLHKKRED GEFWMSCQDF QQKFIAMFIC SEIPITLDHG NTLHEGWSQI MFRKQVILGN 360 

TAGGPRNDAQ FNFSVQEPME GTNVVVCVTV AVTPSNLKAE DAKFPLDFQV ILAGSQKHCP 420 

KLK 



Seq ID NO: 328 DNA sequence 

Nucleic Acid Accession ft: BC0 17490.1 

Coding sequence: 74-2788 



GTGGGTCACG 
TCGTGGTACT 
GCGTCGGCGA 
TGCCCTCACC 
CCTAGGCACA 
CATGGAAAGG 
TCTGGATGAT 
CATGCGGCAG 
GTATGACAGC 
GGCCACGGAG 
TCTCAAAGGC 
CCACCGCTTC 
GGAGCGCATC 
CTTGGCAGCC 
GCAGATCTTT 
CATCACCAAC 
GCTGAGGCAG 
TGGCGTCCTG 
GGGTCCTTTC 
GTCGGCCGGC 
CCGAATCCAG 
CATTCTCCTC 
CATCTATCAC 
CACTGTCATC 
GACCGATGAA 
GATCTTTGCC 
TCTGGCCCTG 
TATCAACGTG 
TGAGAAAGTG 
CACGGCGTAT 
GGTTCTGGCT 
CAGAACCAGC 
CGTCACCTCC 
CTACGACCCC 
CTTTGACATC 
CCGCTTCGTG 
GGCCAATGGC 
CCAGGAGGTC 
CCAGATGGAC 
GACAGGCAGC 
CCACGOGCGC 
CGTGATGCTG 
GACTTTTGCC 
GAAGCAGTTA 
CACTATTGAG 



11 
I 

TGAACCACTT 
GCTATGGCGG 
GGCAATGATC 
TCCAGCCCTG 
GAGGGGCCCC 
GACTACCGCG 
GAGGACGTAG 
CGTGACCGGG 
GATGAGGAGG 
GACGGCGAGG 
CACTCTGTGC 
AAGAACTTCC 
AGOGACATGT 
AGGGAGCACG 
GATGAGGCTG 
CACATCCATG 
CTGCATCTGA 
CCCCAGCTCA 
TGCCAGTCCC 
CCCTTTGAGG 
GAGAGTCCAG 
GCAGATCTGG 
AACAACTATG 
CTAGCCAACC 
GATGTGAAGA 
AGCATTGCTC 
TTCGGAGGGG 
CTCTTGTGCG 
TCCAGCCGAG 
GTCCAGCGGC 
GACCGAGGAG 
ATCCATGAGG 
CTGCAGGCTC 
TCGCTGACTT 
CTGTGTGTGG 
GTGGGCAGCC 
AGCGCTGCTG 
CTGAAGAAGT 
CAGGACAAGG 
ATCCCCATTA 
ATCCATCTGC 
GAGAGCTTCA 
CGCTACCTTT 
GTGGCAGAGC 
GTCCCTGAGA 



21 
I 

TTCGCGCGAA 
AATCATCGGA 
CTCTCACCTC 
GCCGTGACCT 
TGGAGGAAGA 
CCATCCCAGA 
AGGAGCTGAC 
AGGCTGGCCG 
ACGAGGAGCG 
AGGACGAGGA 
GCGAGTGGGT 
TGCGCACTCA 
GCAAAGAGAA 
TGCTGGCCTA 
CCCTGGAGGT 
TCCGCATCTC 
ACCAGCTGAT 
GCATGGTCAA 
AGAACCAGGA 
TCAACATGGA 
GCAAAGTGGC 
TGGACAGCTG 
ATGGCTCCCT 
ACGTGGCCAA 
TGATCACTAG 
CTTCCATCTA 
AGCCCAAAAA 
GAGACCCTGG 
CCATCTTCAC 
ACCCTGTCAG 
TGTGTCTCAT 
CCATGGAGCA 
GCTGCACGGT 
TCTCTGAGAA 
TGAGGGACAC 
ACGTCAGACA 
AGCCCGCCAT 
ACATCATCTA 
TGGCCAAGAT 
CGGTGCGGCA 
GGGACTATGT 
TAGACACACA 
CATTCCGGCG 
AGGTGACATA 
AGGACTTGGT 



31 
I 

ACCTGGTTGT 
ATCCTTCACC 
CAGCCCTGGC 
TCCACCATTT 
AGAGGATGGA 
GCTGGACGCC 
GGCCAGTCAG 
GGGCCTGGGC 
CCCTGCCCGC 
GATGATCGAG 
GAGCATGGCG 
CGTCGACAGC 
CCGTGAGAGC 
CTTCCTGCCT 
GGTACTGGCC 
CCACCTGCCT 
CCGCACCAGT 
GTACAACTGC 
GGTGAAACCA 
GGAGACCATC 
GGCTGGCCGG 
CAAGCCAGGA 
CAACACTGCC 
GAAGGACAAC 
CCTCTCCAAG 
TGGTCATGAA 
CCCAGGTGGC 
CACAGCGAAG 
CACTGGCCAG 
CAGGGAGTGG 
TGATGAATTT 
ACAGAGCATC 
CATTGCTGCC 
CGTGGACCTC 
CGTGGACCCA 
CCACCCCAGC 
GCCCAACACG 
CGCCAAGGAG 
GTACAGTGAC 
CATCGAGTCC 
GATCGAAGAC 
GAAGTTCAGC 
TGACAACAAT 
TCAGCGCAAC 
GGATAAGGCT 



41 
I 

TGCTGTAGTG 
ATGGCATCCA 
CGAAGCTCCC 
GAGGATGAGT 
GAGGAGCTCA 
TATGAGGCCG 
AGGGAGGCAG 
CGCATGCGCC 
AAGCGCCGCC 
AGCATCGAGA 
GGCCCCCGGC 
CACGGCCACA 
CTGGTGGTGA 
GAGGCACCGG 
ATGTACCCCA 
CTGGTGGAGG 
GGGGTGGTGA 
AACAAGTGCA 
GGCTCCTGTC 
TATCAGAACT 
CTGCCCCGCT 
GACGAGATAG 
AATGGCTTCC 
AAGGTTGCTG 
GATCAGCAGA 
GACATCAAGA 
AAGCACAAGG 
TCGCAGTTTC 
GGGGCGTCGG 
ACCTTGGAGG 
GACAAGATGA 
TCCATCTCGA 
GCCAACCCCA 
ACAGAGCCCA 
GTCCAGGACG 
AACAAGGAGG 
TATGGCGTGG 
AGGGTCCACC 
CTGAGGAAAG 
ATGATCCGCA 
GACGTCAACA 
GTCATGCGCA 
GAGCTGTTGC 
CGCTTTGGGG 
CGTCAGATCA 



51 
I 

GCGGAGAGGA 
GCCCGGCCCA 
GGCGTACTGA 
CCGAGGGGCT 
TTGGAGATGG 
AGGGACTGGC 
CAGAGCGGGC 
GTGGGCTCCT 
AGGTGGAGCG 
ACCTGGAGGA 
TGGAGATCCA 
ACGTCTTCAA 
ACTATGAGGA 
CGGAGCTGCT 
AGTACGACCG 
AGCTGCGCTC 
CCAGCTGCAC 
ATTTCGTCCT 
CTGAGTGCCA 
ACCAGCGTAT 
CCAAGGACGC 
AGCTGACTGG 
CTGTCTTTGC 
TAGGGGAACT 
TCGGAGAGAA 
GAGGCCTGGC 
TACGTGGTGA 
TCAAGTATAT 
CTGTGGGCCT 
CTGGGGCCCT 
ATGACCAGGA 
AGGCTGGCAT 
TAGGAGGGCG 
TCATCTCACG 
AGATGCTGGC 
AGGAGGGGCT 
AGCCCCTGCC 
CGAAGCTCAA 
AATCTATGGC 
TGGCGGAGGC 
TGGCCATCCG 
GCATGCGCAA 
TCTTCATACT 
CCCAGCAGGA 
ACATCCACAA 
CCTCTCTGCA TTTTATGACA GTGAGCTCTT CAGGATGAAC AAGTTCAGCC ACGACCTGAA 2760 

AAGGAAAATG ATCCTGCAGC AGTTCTGAGG CCCTATGCCA TCCATAAGGA TTCCTTGGGA 2820 

TTCTGGTTTG GGGTGGTCAG TGCCCTCTGT GCTTTATGGA CACAAAACCA GAGCACTTGA 2880 

TGAACTCGGG GTACTAGGGT CAGGGCTTAT AGCAGGATGT CTGGCTGCAC CTGGCATGAC 2940 

TGTTTGTTTC TCCAAGCCTG CTTTGTGCTT CTCACCTTTG GGTGGGATGC CTTGCCAGTG 3000 

TGTCTTACTT GGTTGCTGAA CATCTTGCCA CCTCCGAGTG CTTTGTCTCC ACTCAGTACC 3060 

TTGGATCAGA GCTGCTGAGT TCAGGATGCC TGCGTGTGGT TTAGGTGTTA GCCTTCTTAC 3120 

ATGGATGTCA GGAGAGCTGC TGCCCTCTTG GCGTGAGTTG CGTATTCAGG CTGCTTTTGC 3180 

TGCCTTTGGC CAGAGAGCTG GTTGAAGATG TTTGTAATCG TTTTCAGTCT CCTGCAGGTT 3240 

TCTGTGCCCC TGTGGTGGAA GAGGGCACGA CAGTGCCAGC GCAGCGTTCT GGGCTCCTCA 3300 

GTCGCAGGGG TGGGATGTGA GTCATGCGGA TTATCCACTC GCCACAGTTA TCAGCTGCCA 3360 

TTGCTCCCTG TCTGTTTCCC CACTCTCTTA TTTGTGCATT CGGTTTGGTT TCTGTAGTTT 3420 
TAATTTTTAA TAAAGTTGAA TAAAATATAA AAAAAAAAAA AAAAAA 

Seq ID NO: 329 Protein sequence: 
Protein Accession #: AAH17490.1 

1 11 21 31 41 51 

I I I I I I 

MAESSESFTM ASSPAQRRRG NDPLTSSPGR SSRRTDALTS SPGRDLPPFE DESEGLLGTE 60 

GPLEEEEDGE ELIGDGMERD YRAIPELDAY EAEGLALDDE DVEELTASQR EAAERAMRQR 120 

DREAGRGLGR MRRGLLYDSD EEDEERPARK RRQVERATED GEEDEEMIES IENLEDLKGH 180 

SVREWVSMAG PRLEIHHRFK NFLRTHVDSH GHNVFKERIS DMCKENRESL VVNYEDLAAR 240 

EHVLAYFLPE APAELLQIFD EAALEVVLAM YPKYDRITNH IHVRISHLPL VEELRSLRQL 300 

HLNQLIRTSG VVTSCTGVLP QLSMVKYNCN KCNFVLGPFC QSQNQEVKPG SCPECQSAGP 360 

FEVNMEETIY QNYQRIRIQE SPGKVAAGRL PRSKDAILLA DLVDSCKPGD EIELTGIYHN 420 

NYDGSLNTAN GFPVFATVIL ANHVAKKDNK VAVGELTDED VKMITSLSKD QQIGEKIFAS 480 

IAPSIYGHED IKRGLALALF GGEPKNPGGK HKVRGDINVL LCGDPGTAKS QFLKYIEKVS 540 

SRAIFTTGQG ASAVGLTAYV QRHPVSREWT LEAGALVLAD RGVCLIDEFD KMNDQDRTSI 600 

HEAMEQQSIS ISKAGIVTSL QARCTVIAAA NPIGGRYDPS LTFSENVDLT EPIISRFDIL 660 

CVVRDTVDPV QDEMLARFVV GSHVRHHPSN KEEEGLANGS AAEPAMPNTY GVEPLPQEVL 720 

KKYIIYAKER VHPKLNQMDQ DKVAKMYSDL RKESMATGSI PITVRHIESM IRMAEAHARI 780 

HLRDYVIEDD VNMAIRVMLE SFIDTQKFSV MRSMRKTFAR YLSFRRDNNE LLLFILKQLV 840 

AEQVTYQRNR FGAQQDTIEV PEKDLVDKAR QINIHNLSAF YDSELFRMNK FSHDLKRKMI 900 
LQQF 

Seq ID NO: 330 DNA sequence 
Nucleic Acid Accession #: M17254 
Coding sequence: 257-1645 

1 11 21 31 41 51 

I I I I I I 

GTCCGCGCGT GTCCGCGCCC GCGTGTGCCA GCGCGCGTGC CTTGGCCGTG CGCGCCGAGC 60 

CGGGTCGCAC TAACTCCCTC GGCGCCGACG GCGGCGCTAA CCTCTCGGTT ATTCCAGGAT 120 

CTTTGGAGAC CCGAGGAAAG CCGTGTTGAC CAAAAGCAAG ACAAATGACT CACAGAGAAA 180 

AAAGATGGCA GAACCAAGGG CAACTAAAGC CGTCAGGTTC TGAACAGCTG GTAGATGGGC 240 

TGGCTTACTG AAGGACATGA TTCAGACTGT CCCGGACCCA GCAGCTCATA TCAAGGAAGC 300 

CTTATCAGTT GTGAGTGAGG ACCAGTCGTT GTTTGAGTGT GCCTACGGAA CGCCACACCT 360 

GGCTAAGACA GAGATGACCG CGTCCTCCTC CAGCGACTAT GGACAGACTT CCAAGATGAG 420 

CCCACGCGTC CCTCAGCAGG ATTGGCTGTC TCAACCCCCA GCCAGGGTCA CCATCAAAAT 480 

GGAATGTAAC CCTAGCCAGG TGAATGGCTC AAGGAACTCT CCTGATGAAT GCAGTGTGGC 540 

CAAAGGCGGG AAGATGGTGG GCAGCCCAGA CACCGTTGGG ATGAACTACG GCAGCTACAT 600 

GGAGGAGAAG CACATGCCAC CCCCAAACAT GACCACGAAC GAGCGCAGAG TTATCGTGCC 660 

AGCAGATCCT ACGCTATGGA GTACAGACCA TGTGCGGCAG TGGCTGGAGT GGGCGGTGAA 720 

AGAATATGGC CTTCCAGACG TCAACATCTT GTTATTCCAG AACATCGATG GGAAGGAACT 780 

GTGCAAGATG ACCAAGGACG ACTTCCAGAG GCTCACCCCC AGCTACAACG CCGACATCCT 840 

TCTCTCACAT CTCCACTACC TCAGAGAGAC TCCTCTTCCA CATTTGACTT CAGATGATGT 900 

TGATAAAGCC TTACAAAACT CTCCACGGTT AATGCATGCT AGAAACACAG ATTTACCATA 960 

TGAGCCCCCC AGGAGATCAG CCTGGACCGG TCACGGCCAC CCCACGCCCC AGTCGAAAGC 1020 

TGCTCAACCA TCTCCTTCCA CAGTGCCCAA AACTGAAGAC CAGCGTCCTC AGTTAGATCC 1080 

TTATCAGATT CTTGGACCAA CAAGTAGCCG CCTTGCAAAT CCAGGCAGTG GCCAGATCCA 1140 

GCTTTGGCAG TTCCTCCTGG AGCTCCTGTC GGACAGCTCC AACTCCAGCT GCATCACCTG 1200 

GGAAGGCACC AACGGGGAGT TCAAGATGAC GGATCCCGAC GAGGTGGCCC GGCGCTGGGG 1260 

AGAGCGGAAG AGCAAACCCA ACATGAACTA CGATAAGCTC AGCCGCGCCC TCCGTTACTA 1320 

CTATGACAAG AACATCATGA CCAAGGTCCA TGGGAAGCGC TACGCCTACA AGTTCGACTT 1380 

CCACGGGATC GCCCAGGCCC TCCAGCCCCA CCCCCCGGAG TCATCTCTGT ACAAGTACCC 1440 

CTCAGACCTC CCGTACATGG GCTCCTATCA CGCCCACCCA CAGAAGATGA ACTTTGTGGC 1500 

GCCCCACCCT CCAGCCCTCC CCGTGACATC TTCCAGTTTT TTTGCTGCCC CAAACCCATA 1560 

CTGGAATTCA CCAACTGGGG GTATATACCC CAACACTAGG CTCCCCACCA GCCATATGCC 1620 

TTCTCATCTG GGCACTTACT ACTAAAGACC TGGCGGAGGC TTTTCCCATC AGCGTGCATT 1680 

CACCAGCCCA TCGCCAGAAA CTCTATCGGA GAACATGAAT CAAAAGTGCC TCAAGAGGAA 1740 

TGAAAAAAGC TTTACTGGGG CTGGGGAAGG AAGCCGGGGA AGAGATCCAA AGACTCTTGG 1800 

GAGGGAGTTA CTGAAGTCTT ACTACAGAAA TGAGGAGGAT GCTAAAAATG TCACGAATAT 1860 

GGACATATCA TCTGTGGACT GACCTTGTAA AAGACAGTGT ATGTAGAAGC ATGAAGTCTT 1920 

AAGGACAAAG TGCCAAAGAA AGTGGTCTTA AGAAATGTAT AAACTTTAGA GTAGAGTTTG 1980 

AATCCCACTA ATGCAAACTG GGATGAAACT AAAGCAATAG AAACAACACA GTTTTGACCT 2040 

AACATACCGT TTATAATGCC ATTTTAAGGA AAACTACCTG TATTTAAAAA TAGTTTCATA 2100 

TCAAAAACAA GAGAAAAGAC ACGAGAGAGA CTGTGGCCCA TCAACAGACG TTGATATGCA 2160 

ACTGCATGGC ATGTGCTGTT TTGGTTGAAA TCAAATACAT TCCGTTTGAT GGACAGCTGT 2220 

CAGCTTTCTC AAACTGTGAA GATGACCCAA AGTTTCCAAC TCCTTTACAG TATTACCGGG 2280 

ACTATGAACT AAAAGGTGGG ACTGAGGATG TGTATAGAGT GAGCGTGTGA TTGTAGACAG 2340 

AGGGGTGAAG AAGGAGGAGG AAGAGGCAGA GAAGGAGGAG ACCAGGCTGG GAAAGAAACT 2400 

TCTCAAGCAA TGAAGACTGG ACTCAGGACA TTTGGGGACT GTGTACAATG AGTTATGGAG 2460 

ACTCGAGGGT TCATGCAGTC AGTGTTATAC CAAACCCAGT GTTAGGAGAA AGGACACAGC 2520 

GTAATGGAGA AAGGGAAGTA GTAGAATTCA GAAACAAAAA TGCGCATCTC TTTCTTTGTT 2580 

TGTCAAATGA AAATTTTAAC TGGAATTGTC TGATATTTAA GAGAAACATT CAGGACCTCA 2640 

TCATTATGTG GGGGCTTTGT TCTCCACAGG GTCAGGTAAG AGATGGCCTT CTTGGCTGCC 2700 

ACAATCAGAA ATCACGCAGG CATTTTGGGT AGGCGGCCTC CAGTTTTCCT TTGAGTCGCG 2760 

AACGCTGTGC GTTTGTCAGA ATGAAGTATA CAAGTCAATG TTTTTCCCCC TTTTTATATA 2820 

ATAATTATAT AACTTATGCA TTTATACACT ACGAGTTGAT CTCGGCCAGC CAAAGACACA 2880 

CGACAAAAGA GACAATCGAT ATAATGTGGC CTTGAATTTT AACTCTGTAT GCTTAATGTT 2940 

TACAATATGA AGTTATTAGT TCTTAGAATG CAGAATGTAT GTAATAAAAT AAGCTTGGCC 3000 

TAGCATGGCA AATCAGATTT ATACAGGAGT CTGCATTTGC ACTTTTTTTA GTGACTAAAG 3060 

TTGCTTAATG AAAACATGTG CTGAATGTTG TGGATTTTGT GTTATAATTT ACTTTGTCCA 3120 

GGAACTTGTG CAAGGGAGAG CCAAGGAAAT AGGATGTTTG GCACCC 

Seq ID NO: 331 Protein sequence 
Protein Accession #: AAA52398 

1 11 21 31 41 51 

I I I I I I 

MIQTVPDPAA HIKEALSVVS EDQSLFECAY GTPHLAKTEM TASSSSDYGQ TSKMSPRVPQ 60 

QDWLSQPPAR VTIKMECNPS QVNGSRNSPD ECSVAKGGKM VGSPDTVGMN YGSYMEEKHM 120 

PPPNMTTNER RVIVPADPTL WSTDHVRQWL EWAVKEYGLP DVNILLFQNI DGKELCKMTK 180 

DDFQRLTPSY NADILLSHLH YLRETPLPHL TSDDVDKALQ NSPRLMHARN TDLPYEPPRR 240 

SAWTGHGHPT PQSKAAQPSP STVPKTEDQR PQLDPYQILG PTSSRLANPG SGQIQLWQFL 300 

LELLSDSSNS SCITWEGTNG EFKMTDPDEV ARRWGERKSK PNMNYDKLSR ALRYYYDKNI 360 

MTKVHGKRYA YKFDFHGIAQ ALQPHPPESS LYKYPSDLPY MGSYHAHPQK MNFVAPHPPA 420 

LPVTSSSFFA APNPYWNSPT GGIYPNTRLP TSHMPSHLGT YY 462 



Seq ID NO: 332 DNA sequence 
Nucleic Acid Accession #: NM_000020 
Coding sequence: 283-1794 



1 11 21 31 41 51 

I I I I I I 

AGGAAACGGT TTATTAGGAG GGAGTGGTGG AGCTGGGCCA GGCAGGAAGA CGCTGGAATA 60 

AGAAACATTT TTGCTCCAGC CCCCATCCCA GTCCCGGGAG GCTGCCGCGC CAGCTGCGCC 120 

GAGCGAGCCC CTCCCCGGCT CCAGCCCGGT CCGGGGCCGC GCCGGACCCC AGCCCGCCGT 180 

CCAGCGCTGG CGGTGCAACT GCGGCCGCGC GGTGGAGGGG AGGTGGCCCC GGTCCGCCGA 240 

AGGCTAGCGC CCCGCCACCC GCAGAGCGGG CCCAGAGGGA CCATGACCTT GGGCTCCCCC 300 

AGGAAAGGCC TTCTGATGCT GCTGATGGCC TTGGTGACCC AGGGAGACCC TGTGAAGCCG 360 

TCTCGGGGCC CGCTGGTGAC CTGCACGTGT GAGAGCCCAC ATTGCAAGGG GCCTACCTGC 420 

CGGGGGGCCT GGTGCACAGT AGTGCTGGTG CGGGAGGAGG GGAGGCACCC CCAGGAACAT 480 

CGGGGCTGCG GGAACTTGCA CAGGGAGCTC TGCAGGGGGC GCCCCACCGA GTTCGTCAAC 540 

CACTACTGCT GCGACAGGCA CCTCTGCAAC CACAACGTGT CCCTGGTGCT GGAGGCCACC 600 

CAACCTCCTT CGGAGCAGCC GGGAACAGAT GGCCAGCTGG CCCTGATCCT GGGCCCCGTG 660 

CTGGCCTTGC TGGCCCTGGT GGCCCTGGGT GTCCTGGGCC TGTGGCATGT CCGACGGAGG 720 

CAGGAGAAGC AGCGTGGCCT GCACAGCGAG CTGGGAGAGT CCAGTCTCAT CCTGAAAGCA 780 

TCTGAGCAGG GCGACACGAT GTTGGGGGAC CTCCTGGACA GTGACTGCAC CACAGGGAGT 840 

GGCTCAGGGC TCCCCTTCCT GGTGCAGAGG ACAGTGGCAC GGCAGGTTGC CTTGGTGGAG 900 

TGTGTGGGAA AAGGCCGCTA TGGCGAAGTG TGGCGGGGCT TGTGGCACGG TGAGAGTGTG 960 

GCCGTCAAGA TCTTCTCCTC GAGGGATGAA CAGTCCTGGT TCCGGGAGAC TGAGATCTAT 1020 

AACACAGTAT TGCTCAGACA CGACAACATC CTAGGCTTCA TCGCCTCAGA CATGACCTCC 1080 

CGCAACTCGA GCACGCAGCT GTGGCTCATC ACGCACTACC ACGAGCACGG CTCCCTCTAC 1140 

GACTTTCTGC AGAGACAGAC GCTGGAGCCC CATCTGGCTC TGAGGCTAGC TGTGTCCGCG 1200 

GCATGCGGCC TGGCGCACCT GCACGTGGAG ATCTTCGGTA CACAGGGCAA ACCAGCCATT 1260 

GCCCACCGCG ACTTCAAGAG CCGCAATGTG CTGGTCAAGA GCAACCTGCA GTGTTGCATC 1320 

GCCGACCTGG GCCTGGCTGT GATGCACTCA CAGGGCAGCG ATTACCTGGA CATCGGCAAC 1380 

AACCCGAGAG TGGGCACCAA GCGGTACATG GCACCCGAGG TGCTGGACGA GCAGATCCGC 1440 

ACGGACTGCT TTGAGTCCTA CAAGTGGACT GACATCTGGG CCTTTGGCCT GGTGCTGTGG 1500 

GAGATTGCCC GCCGGACCAT CGTGAATGGC ATCGTGGAGG ACTATAGACC ACCCTTCTAT 1560 

GATGTGGTGC CCAATGACCC CAGCTTTGAG GACATGAAGA AGGTGGTGTG TGTGGATCAG 1620 

CAGACCCCCA CCATCCCTAA CCGGCTGGCT GCAGACCCGG TCCTCTCAGG CCTAGCTCAG 1680 

ATGATGCGGG AGTGCTGGTA CCCAAACCCC TCTGCCCGAC TCACCGCGCT GCGGATCAAG 1740 

AAGACACTAC AAAAAATTAG CAACAGTCCA GAGAAGCCTA AAGTGATTCA ATAGCCCAGG 1800 

AGCACCTGAT TCCTTTCTGC CTGCAGGGGG CTGGGGGGGT GGGGGGCAGT GGATGGTGCC 1860 

CTATCTGGGT AGAGGTAGTG TGAGTGTGGT GTGTGCTGGG GATGGGCAGC TGCGCCTGCC 1920 

TGCTCGGCCC CCAGCCCACC CAGCCAAAAA TACAGCTGGG CTGAAACCTG ATCCCCTGCT 1980 

GTCTGGCCTG CTCAAAGCGG CAGGCTCCCT GACGCCTGGC TCTCTCCCCA CCCCTATGGC 2040 

CAGCATGGTG CACCCCCTAC CACTCCCGGG ACAGGATGCA AAAGAGGCTC CAGAGTCAGA 2100 

GTGCCAAGCC AGGGAATCCC AGTCCCAGAC TCAGAGCCCG GGCCTGCACT 2160 

CCCTTGATCA ACCCCACTGC CCCACCAGAG CTGCCAGGGT GGCACAGGGC CCTGTCCAGC 2220 

CCCTGGCACA CACTTCCCTG CCAGGCCTCA GCCTCTAGCA TAAGCTCCAG AGAGCCAGGG 2280 

CCCATCAGTT TCTCTCTGTG GATTTGTATC TCAGCTCCAT GATGCCTTGG GCTTTCTGTC 2340 

TCCTCAACAA GAGTGCAGCT TGCTGAATGT CAGCTGCCTG AGAGAGCTGG GGCCTGACTT 2400 

ACTAGGGCAT TAAATCCTAA GAGGTCCTAC TGAGGTGTGG CAGGATCACA GGCCAGTGGA 2460 

AAAAGGGCAG GTCAGATGGG CAAGGCCCAG GACTTTCAGA TTAACTGAGA GGATATCGAG 2520 

GCCAAGCATG GCAGGGGGAA GGTCAGTGGG TGTCAAGAGA CCCAGGTCTG ACCCCGGATG 2580 

TTTGCTCCAT GTGACAAAAG CAGGCCTGTC TCAGGACCTT TTCTTTTCTT TTTTCCTTCT 2640 

TTTTTTTTTT GACACGGAGT TTCGCTCTTG TTGTCCAGGC TAGAGTGCAA TGGCATGATC 2700 

CCAGCTCACC GCAACGTCTA CCTCCCAGGT TCAAATCATT CTCTTGCCTC AGACTCCCGA 2760 

GTAGCTGGGA TTACAGGCAC ATGCCACCAT GCCTGGCTAA TTTTGTATAT TTAGTAGAAA 2820 

CAGGGTTTCA CCATGCTGGC CATGCTGGTT CTCGAACTCC TGACCTCAGG TGTTCCACCT 2880 

ACCTCAGCCT CCCAAAGTGC TGGGGTTACA GGTGTGAGCC ATCGCGCCTG GCCAGGACCT 2940 

TTGTTTCTTA TCTACATATT GGAAGATTTG GTCCTGATGT CCTTTGAGGC TTCTTTAGCT 3000 

CTAGTTCTCT GACACTTCAG CCTATATCAC AGCTAACTTC YTCAGTCTCA TCTATTCCTT 3060 

ATGCTCCAGC CCCTGGCAAT TTGCCTCAAG ATGGGGGTTT GAAAATAACT TTACCTGACT 3120 

CAAGGAGTGT CTGGAGCACC TCCTAGTCTA AGTCTGCAAG CTCCAGTTCT TGCCTAAAAC 3180 

CATGCCAGTG GCCACCCTTG GGCTCAGACA GCTCTGGGCC TTTTGACCAC AAGCCAGCCC 3240 

CTCGCCCTCT CTGTGGCATA GTCTTCTCTG CCCCAGGACT GCAGGGCGGC TTCCTCCAAG 3300 

GCTTCCAAGG CTCAAAAGAA ATTTGGCTCC ATCCAAGAAG GCTCCAGCTC CCCTACTGGC 3360 

CCCTGGCTTC AGGCCCACAC CCCTGGGCCA GGSCCAGAGA GTGTGTCTCA GGAGAATTCA 3420 

ATGGGCTCTA GAGAGACACA CAGAAAGTTT GGGCATTTGG GAAATTTTCA AGGRTGTATG 3480 

TATGGYTCAC GTATGGWGCA GGTTGTCCTG GTCCYKGGGT GCAGGGAAGT GGGCTGCAGG 3540 

GAAGTGGATT GGAGGGGAGC TTGAGGAATA TAAGGAGCGG GGGTGGAGAC TCAGGCTATG 3600 

GACAAGGACA GCCCCAAGGT TGGGAAGACC TGGCCTTAGT CGTCCTCAGC CTAGGGCAGG 3660 

GCAGTGAAGA AAGCTCTCCC CGCTCCTGCT GTAATGACCC AGAGTAGCCT CCCCAGGCCG 3720 

GCATCTTATG TGTGTCTTCC ACCATCCTCA TGGTGGCACT TTTCTAGGCC TGTCTCCCAG 3780 

CATTGTGCAA GGCTGGGAAG AGAACCAGGA AGTGAAACTG GGTGAAAACA GAAAGCTCAA 3840 

TGGATGGGCT AGGTTCCCAG ATCATTAGGG CAGAGTTTGC ACGTCCTCTG GTTCACTGGG 3900 

AATCCACCCA GCCCACGAAT CATCTCCCTC TTTGAAGGAT TTTWATTTCT ACTGGGTTTT 3960 

GGAACAAACT CCTGCTGAGA CCCCACAGCC AGAAACTGAA AGCAGCAGCT CCCCAAAGCC 4020 

TGGAAAATCC CTAAGAGAAG GCCTGGGGGA MAGGAAKTGG AGTGACAGGG GACAGGTAGA 4080 

GAGAAGGGGG CCCAATGGCC AGGGAGTGAA GGAGGTGGCG TTGCTGAGAG CAGTCTGCAC 4140 

ATGCTTCTGT CTGAGTGCAG GAAGGTGTTC CAGGGTCGAA ATTACACTTC TCGTACCTGG 4200 

AGACGCTGTT TGTGGGAGCA CTGGGCTCAT GCCTGGCACA CAATAGGTCT GCAATAAACC 4260 
ATGGTTAAAT CCTGAAAAAA AAAAAAAAA 



Seq ID NO: 333 Protein sequence 
Protein Accession #: NP_000011 



1 11 21 31 41 51 

I I I I I I 

MTLGSPRKGL LMLLMALVTQ GDPVKPSRGP LVTCTCESPH CKGPTCRGAW CTVVLVREEG 60 

RHPQEHRGCG NLHRELCRGR PTEFVNHYCC DSHLCNHNVS LVLEATQPPS EQPGTDGQLA 120 

LILGPVLALL ALVALGVLGL WHVRRRQEKQ RGLHSELGES SLILKASEQG DTMLGDLLDS 180 

DCTTGSGSGL PFLVQRTVAR QVALVECVGK GRYGEVWRGL WHGESVAVKI FSSRDEQSWF 240 

RETEIYNTVL LRHDNILGFI ASDMTSRNSS TQLWLITHYH EHGSLYDFLQ RQTLEPHLAL 300 

RLAVSAACGL AHLHVEIFGT QGKPAIAHRD FKSRNVLVKS NLQCCIADLG LAVMHSQGSD 360 

YLDIGNNPRV GTKRYMAPEV LDEQIRTDCF ESYKWTDIWA FGLVLWEIAR RTIVNGIVED 420 

YRPPFYDVVP NDPSFEDMKK VVCVDQQTPT IPNRLAADPV LSGLAQMMRE CWYPNPSARL 480 

TALRIKKTLQ KISNSPEKPK VIQ 



Seq ID NO: 334 DNA sequence 

Nucleic Acid Accession #: NM_004126.1 

Coding sequence: 108-329 

1 11 21 31 41 51 

I I I I I I 

GGCACGAGCT CGTGCCGGCC TTCAGTTGTT TCGGGACGCG CCGAGCTTCG CCGCTCTTCC 60 
AGCGGCTCCG CTGCCAGAGC TAGCCCGAGC CCGGTTCTGG GGCGAAAATG CCTGCCCTTC 120 
ACATCGAAGA TTTGCCAGAG AAGGAAAAAC TGAAAATGGA AGTTGAGCAG CTTCGCAAAG 180 
AAGTGAAGTT GCAGAGACAA CAAGTGTCTA AATGTTCTGA AGAAATAAAG AACTATATTG 240 
AAGAACGTTC TGGAGAGGAT CCTCTAGTAA AGGGAATTCC AGAAGACAAG AACCCCTTTA 300 
AAGAAAAAGG CAGCTGTGTT ATTTCATAAA TAACTTGGGA GAAACTGCAT CCTAAGTGGA 360 
AGAACTAGTT TGTTTTAGTT TTCCCAGATA AAACCAACAT GCTTTTTAAG GAAGGAAGAA 420 
TGAAATTAAA AGGAGACTTT CTTAAGCACC ATATAGATAG GGTTATGTAT AAAAGCATAT 480 
GTGCTACTCA TCTTTGCTCA CTATGCAGTC TTTTTTAAGA GAGCAGAGAG TATCAGATGT 540 
ACAATTATGG AAATAAGAAC ATTACTTGAG CATGACACTT CTTTCAGTAT ATTGCTTGAT 600 
GCTTCAAATA AAGTTTTGTC TT 



Seq ID NO: 335 Protein sequence 
Protein Accession #: NP_004117.1 



1 11 21 31 41 51 

I I I I I I 

MPALHIEDLP EKEKLKMEVE QLRKEVKLQR QQVSKCSEEI KNYIEERSGE DPLVKGIPED 60 
KNPFKEKGSC VIS 
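
The relationship between a "Coding sequence" annotation and the protein entry that follows it can be checked mechanically. The sketch below is illustrative only and is not part of the original listing: it takes the first 360 nucleotides of Seq ID NO: 334 as printed above, extracts positions 108-329 (assumed to be 1-based and inclusive), and translates them with the standard genetic code. The helper names clean and translate_cds are hypothetical, chosen for this example.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# First 360 nucleotides of Seq ID NO: 334, copied from the listing
# (block spacing and position counters left in place on purpose).
SEQ_334 = """
GGCACGAGCT CGTGCCGGCC TTCAGTTGTT TCGGGACGCG CCGAGCTTCG CCGCTCTTCC 60
AGCGGCTCCG CTGCCAGAGC TAGCCCGAGC CCGGTTCTGG GGCGAAAATG CCTGCCCTTC 120
ACATCGAAGA TTTGCCAGAG AAGGAAAAAC TGAAAATGGA AGTTGAGCAG CTTCGCAAAG 180
AAGTGAAGTT GCAGAGACAA CAAGTGTCTA AATGTTCTGA AGAAATAAAG AACTATATTG 240
AAGAACGTTC TGGAGAGGAT CCTCTAGTAA AGGGAATTCC AGAAGACAAG AACCCCTTTA 300
AAGAAAAAGG CAGCTGTGTT ATTTCATAAA TAACTTGGGA GAAACTGCAT CCTAAGTGGA 360
"""


def clean(listing: str) -> str:
    """Drop spaces, line breaks, and position counters; keep only A/C/G/T."""
    return "".join(ch for ch in listing.upper() if ch in "ACGT")


def translate_cds(dna: str, start: int, end: int) -> str:
    """Translate the 1-based, inclusive range start..end, stopping at a stop codon."""
    cds = dna[start - 1:end]
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the open reading frame
            break
        protein.append(aa)
    return "".join(protein)


protein_335 = translate_cds(clean(SEQ_334), 108, 329)
print(protein_335)
```

Under these assumptions the result is the 73-residue sequence listed as Seq ID NO: 335. The last codon of the annotated range (TAA at positions 327-329) is the stop codon and is not translated.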



Seq ID NO: 336 DNA sequence 
Nucleic Acid Accession #: NM_005795 
Coding sequence: 555-1940 



1 11 21 31 41 51 

I I I I I I 

GCACGAGGGA ACAACCTCTC TCTCTSCAGC AGAGAGTGTC ACCTCCTGCT TTAGGACCAT 60 

CAAGCTCTGC TAACTGAATC TCATCCTAAT TGCAGGATCA CATTGCAAAG CTTTCACTCT 120 

TTCCCACCTT GCTTGTGGGT AAATCTCTTC TGCGGAATCT CAGAAAGTAA AGTTCCATCC 180 

TGAGAATATT TCACAAAGAA TTTCCTTAAG AGCTGGACTG GGTCTTGACC CCTGGAATTT 240 

AAGAAATTCT TAAAGACAAT GTCAAATATG ATCCAAGAGA AAATGTGATT TGAGTCTGGA 300 

GACAATTGTG CATATCGTCT AATAATAAAA ACCCATACTA GCCTATAGAA AACAATATTT 360 

GAATAATAAA AACCCATACT AGCCTATAGA AAACAATATT TGAAAGATTG CTACCACTAA 420 

AAAGAAAACT ACTACAACTT GACAAGACTG CTGCAAACTT CAATTGGTCA CCACAACTTG 480 

ACAAGGTTGC TATAAAACAA GATTGCTACA ACTTCTAGTT TATGTTATAC AGCATATTTC 540 

ATTTGGGCTT AATGATGGAG AAAAAGTGTA CCCTGTATTT TCTGGTTCTC TTGCCTTTTT 600 

TTATGATTCT TGTTACAGCA GAATTAGAAG AGAGTCCTGA GGACTCAATT CAGTTGGGAG 660 

TTACTAGAAA TAAAATCATG ACAGCTCAAT ATGAATGTTA CCAAAAGATT ATGCAAGACC 720 

CCATTCAACA AGCAGAAGGC GTTTACTGCA ACAGAACCTG GGATGGATGG CTCTGCTGGA 780 

ACGATGTTGC AGCAGGAACT GAATCAATGC AGCTCTGCCC TGATTACTTT CAGGACTTTG 840 

ATCCATCAGA AAAAGTTACA AAGATCTGTG ACCAAGATGG AAACTGGTTT AGACATCCAG 900 

CAAGCAACAG AACATGGACA AATTATACCC AGTGTAATGT TAACACCCAC GAGAAAGTGA 960 

AGACTGCACT AAATTTGTTT TACCTGACCA TAATTGGACA CGGATTGTCT ATTGCATCAC 1020 

TGCTTATCTC GCTTGGCATA TTCTTTTATT TCAAGAGCCT AAGTTGCCAA AGGATTACCT 1080 

TACACAAAAA TCTGTTCTTC TCATTTGTTT GTAACTCTGT TGTAACAATC ATTCACCTCA 1140 

CTGCAGTGGC CAACAACCAG GCCTTAGTAG CCACAAATCC TGTTAGTTGC AAAGTGTCCC 1200 

AGTTCATTCA TCTTTACCTG ATGGGCTGTA ATTACTTTTG GATGCTCTGT GAAGGCATTT 1260 

ACCTACACAC ACTCATTGTG GTGGCCGTGT TTGCAGAGAA GCAACATTTA ATGTGGTATT 1320 

ATTTTCTTGG CTGGGGATTT CCACTGATTC CTGCTTGTAT ACATGCCATT GCTAGAAGCT 1380 

TATATTACAA TGACAATTGC TGGATCAGTT CTGATACCCA TCTCCTCTAC ATTATCCATG 1440 

GCCCAATTTG TGCTGCTTTA CTGGTGAATC TTTTTTTCTT GTTAAATATT GTACGCGTTC 1500 

TCATCACCAA GTTAAAAGTT ACACACCAAG CGGAATCCAA TCTGTACATG AAAGCTGTGA 1560 

GAGCTACTCT TATCTTGGTG CCATTGCTTG GCATTGAATT TGTGCTGATT CCATGGCGAC 1620 

CTGAAGGAAA GATTGCAGAG GAGGTATATG ACTACATCAT GCACATCCTT ATGCACTTCC 1680 

AGGGTCTTTT GGTCTCTACC ATTTTCTGCT TCTTTAATGG AGAGGTTCAA GCAATTCTGA 1740 

GAAGAAACTG GAATCAATAC AAAATCCAAT TTGGAAACAG CTTTTCCAAC TCAGAAGCTC 1800 

TTCGTAGTGC GTCTTACACA GTGTCAACAA TCAGTGATGG TCCAGGTTAT AGTCATGACT 1860 

GTCCTAGTGA ACACTTAAAT GGAAAAAGCA TCCATGATAT TGAAAATGTT CTCTTAAAAC 1920 

CAGAAAATTT ATATAATTGA AAATAGAAGG ATGGTTGTCT CACTGTTTGG TGCTTCTCCT 1980 

AACTCAAGGA CTTGGACCCA TGACTCTGTA GCCAGAAGAC TTCAATATTA AATGACTTTG 2040 

GGGAATGTCA TAAAGAAGAG CCTTCACATG AAATTAGTAG TGTGTTGATA AGAGTGTAAC 2100 

ATCCAGCTCT ATGTGGGAAA AAAGAAATCC TGGTTTGTAA TGTTTGTCAG TAAATACTCC 2160 

CACTATGCCT GATGTGACGC TACTAACCTG ACATCACCAA GTGTGGAATT GGAGAAAAGC 2220 

ACAATCAACT TTTCTGAGCT GGTGTAAGCC AGTTCCAGCA CACCATTGAT GAATTCAAAC 2280 

AAATGGCTGT AAAACTAAAC ATACATGTTG GGCATGATTC TACCCTTATT CSCCCCAAGA 2340 

GACCTAGCTA AGGTCTATAA ACATGAAGGG AAAATTAGCT TTTAGTTTTA AAACTCTTTA 2400 

TCCCATCTTG ATTGGGGCAG TTGACTTTTT TTTTTTCCCA GAGTGCCGTA GTCCTTTTTG 2460 

TAACTACCCT CTCAAATGGA CAATACCAGA AGTGAATTAT CCCTGCTGGC TTTCTTTTCT 2520 

CTATGAAAAG CAACTGAGTA CAATTGTTAT GATCTACTCA TTTGCTGACA CATCAGTTAT 2580 

ATCTTGTGGC ATATCCATTG TGGAAACTGG ATGAACAGGA TGTATAATAT GCAATCTTAC 2640 

TTCTATATCA TTAGGAAAAC ATCTTAGTTG ATGCTACAAA ACACCTTGTC AACCTCTTCC 2700 

TGTCTTACCA AACAGTGGGA GGGAATTCCT AGCTGTAAAT ATAAATTTTG CCCTTCCATT 2760 

TCTACTGTAT AAACAAATTA GCAATCATTT TATATAAAGA AAATCAATGA AGGATTTCTT 2820 

ATTTTCTTGG AATTTTGTAA AAAGAAATTG TGAAAAATGA GCTTGTAAAT ACTCCATTAT 2880 

TTTATTTTAT AGTCTCAAAT CAAATACATA CAACCTATGT AATTTTTAAA GCAAATATAT 2940 

AATGCAACAA TGTGTGTATG TTAATATCTG ATACTGTATC TGGGCTGATT TTTTAAATAA 3000 

AATAGAGTCT GGAATGCT 

Seq ID NO: 337 Protein sequence 
Protein Accession #: NP_005786.1 

1 11 21 31 41 51 

I I I I I I 

MEKKCTLYFL VLLPFFMILV TAELEESPED SIQLGVTRNK IMTAQYECYQ KIMQDPIQQA 60 

EGVYCNRTWD GWLCWNDVAA GTESMQLCPD YFQDFDPSEK VTKICDQDGN WFRHPASNRT 120 

WTNYTQCNVN THEKVKTALN LFYLTIIGHG LSIASLLISL GIFFYFKSLS CQRITLHKNL 180 

FFSFVCNSVV TIIHLTAVAN NQALVATNPV SCKVSQFIHL YLMGCNYFWM LCEGIYLHTL 240 

IVVAVFAEKQ HLMWYYFLGW GFPLIPACIH AIARSLYYND NCWISSDTHL LYIIHGPICA 300 

ALLVNLFFLL NIVRVLITKL KVTHQAESNL YMKAVRATLI LVPLLGIEFV LIPWRPEGKI 360 

AEEVYDYIMH ILMHFQGLLV STIFCFFNGE VQAILRRNWN QYKIQFGNSF SNSEALRSAS 420 

YTVSTISDGP GYSHDCPSEH LNGKSIHDIE NVLLKPENLY 



Seq ID NO: 338 DNA sequence 
Nucleic Acid Accession #: NM_001795 
Coding sequence: 25-2379 



1 11 21 31 41 51 

I I I I I I 

GCACGATCTG TTCCTCCTGG GAAGATGCAG AGGCTCATGA TGCTCCTCGC CACATCGGGC 60 

GCCTGCCTGG GCCTGCTGGC AGTGGCAGCA GTGGCAGCAG CAGGTGCTAA CCCTGCCCAA 120 

CGGGACACCC ACAGCCTGCT GCCCACCCAC CGGCGCCAAA AGAGAGATTG GATTTGGAAC 180 

CAGATGCACA TTGATGAAGA GAAAAACACC TCACTTCCCC ATCATGTAGG CAAGATCAAG 240 

TCAAGCGTGA GTCGCAAGAA TGCCAAGTAC CTGCTCAAAG GAGAATATGT GGGCAAGGTC 300 

TTCCGGGTCG ATGCAGAGAC AGGAGACGTG TTCGCCATTG AGAGGCTGGA CCGGGAGAAT 360 

ATCTCAGAGT ACCACCTCAC TGCTGTCATT GTGGACAAGG ACACTGGTGA AAACCTGGAG 420 

ACTCCTTCCA GCTTCACCAT CAAAGTTCAT GACGTGAACG ACAACTGGCC TGTGTTCACG 480 

CATCGGTTGT TCAATGCGTC CGTGCCTGAG TCGTCGGCTG TGGGGACCTC AGTCATCTCT 540 

GTGACAGCAG TGGATGCAGA CGACCCCACT GTGGGAGACC ACGCCTCTGT CATGTACCAA 600 

ATCCTGAAGG GGAAAGAGTA TTTTGCCATC GATAATTCTG GACGTATTAT CACAATAACG 660 

AAAAGCTTGG ACCGAGAGAA GCAGGCCAGG TATGAGATCG TGGTGGAAGC GCGAGATGCC 720 

CAGGGCCTCC GGGGGGACTC GGGCACGGCC ACCGTGCTGG TCACTCTGCA AGACATCAAT 780 

GACAACTTCC CCTTCTTCAC CCAGACCAAG TACACATTTG TCGTGCCTGA AGACACCCGT 840 

GTGGGCACCT CTGTGGGCTC TCTGTTTGTT GAGGACCCAG ATGAGCCCCA GAACCGGATG 900 

ACCAAGTACA GCATCTTGCG GGGCGACTAC CAGGACGCTT TCACCATTGA GACAAACCCC 960 

GCCCACAACG AGGGCATCAT CAAGCCCATG AAGCCTCTGG ATTATGAATA CATCCAGCAA 1020 

TACAGCTTCA TCGTCGAGGC CACAGACCCC ACCATCGACC TCCGATACAT GAGCCCTCCC 1080 

GCGGGAAACA GAGCCCAGGT CATTATCAAC ATCACAGATG TGGACGAGCC CCCCATTTTC 1140 

CAGCAGCCTT TCTACCACTT CCAGCTGAAG GAAAACCAGA AGAAGCCTCT GATTGGCACA 1200 

GTGCTGGCCA TGGACCCTGA TGCGGCTAGG CATAGCATTG GATACTCCAT CCGCAGGACC 1260 

AGTGACAAGG GCCAGTTCTT CCGAGTCACA AAAAAGGGGG ACATTTACAA TGAGAAAGAA 1320 

CTGGACAGAG AAGTCTACCC CTGGTATAAC CTGACTGTGG AGGCCAAAGA ACTGGATTCC 1380 

ACTGGAACCC CCACAGGAAA AGAATCCATT GTGCAAGTCC ACATTGAAGT TTTGGATGAG 1440 

AATGACAATG CCCCGGAGTT TGCCAAGCCC TACCAGCCCA AAGTGTGTGA GAACGCTGTC 1500 

CATGGCCAGC TGGTCCTGCA GATCTCCGCA ATAGACAAGG ACATAACACC ACGAAACGTG 1560 

AAGTTCAAAT TCACCTTGAA TACTGAGAAC AACTTTACCC TCACGGATAA TCACGATAAC 1620 

ACGGCCAACA TCACAGTCAA GTATGGGCAG TTTGACCGGG AGCATACCAA GGTCCACTTC 1680 

CTACCCGTGG TCATCTCAGA CAATGGGATG CCAAGTCGCA CGGGCACCAG CACGCTGACC 1740 

GTGGCCGTGT GCAAGTGCAA CGAGCAGGGC GAGTTCACCT TCTGCGAGGA TATGGCCGCC 1800 

CAGGTGGGCG TGAGCATCCA GGCAGTGGTA GCCATCTTAC TCTGCATCCT CACCATCACA 1860 

GTGATCACCC TGCTCATCTT CCTGCGGCGG CGGCTCCGGA AGCAGGCCCG CGCGCACGGC 1920 

AAGAGCGTGC CGGAGATCCA CGAGCAGCTG GTCACCTACG ACGAGGAGGG CGGCGGCGAG 1980 

ATGGACACCA CCAGCTACGA TGTGTCGGTG CTCAACTCGG TGCGCCGCGG CGGGGCCAAG 2040 

CCCCCGCGGC CCGCGCTGGA CGCCCGGCCT TCCCTCTATG CGCAGGTGCA GAAGCCACCG 2100 

AGGCACGCGC CTGGGGCACA CGGAGGGCCC GGGGAGATGG CAGCCATGAT CGAGGTGAAG 2160 

AAGGACGAGG CGGACCACGA CGGCGACGGC CCCCCCTACG ACACGCTGCA CATCTACGGC 2220 

TACGAGGGCT CCGAGTCCAT AGCCGAGTCC CTCAGCTCCC TGGGCACCGA CTCATCCGAC 2280 

TCTGACGTGG ATTACGACTT CCTTAACGAC TGGGGACCCA GGTTTAAGAT GCTGGCTGAG 2340 

CTGTACGGCT CGGACCCCCG GGAGGAGCTG CTGTATTAGG CGGCCGAGGT CACTCTGGGC 2400 

CTGGGGACCC AAACCCCCTG CAGCCCAGGC CAGTCAGACT CCAGGCACCA CAGCCTCCAA 2460 

AAATGGCAGT GACTCCCCAG CCCAGCACCC CTTCCTCGTG GGTCCCAGAG ACCTCATCAG 2520 

CCTTGGGATA GCAAACTCCA GGTTCCTGAA ATATCCAGGA ATATATGTCA GTGATGACTA 2580 

TTCTCAAATG CTGGCAAATC CAGGCTGGTG TTCTGTCTGG GCTCAGACAT CCACATAACC 2640 

CTGTCACCCA CAGACCGCCG TCTAACTCAA AGACTTCCTC TGGCTCCCCA AGGCTGCAAA 2700 

GCAAAACAGA CTGTGTTTAA CTGCTGCAGG GTCTTTTTCT AGGGTCCCTG AACGCCCTGG 2760 

TAAGGCTGGT GAGGTCCTGG TGCCTATCTG CCTGGAGGCA AAGGCCTGGA CAGCTTGACT 2820 

TGTGGGGCAG GATTCTCTGC AGCCCATTCC CAAGGGAGAC TGACCATCAT GCCCTCTCTC 2880 

GGGAGCCCTA GCCCTGCTCC AACTCCATAC TCCACTCCAA GTGCCCCACC ACTCCCCAAC 2940 

CCCTCTCCAG GCCTGTCAAG AGGGAGGAAG GGGCCCCATG GCAGCTCCTG ACCTTGGGTC 3000 

CTGAAGTGAC CTCACTGGCC TGCCATGCCA GTAACTGTGC TGTACTGAGC ACTGAACCAC 3060 

ATTCAGGGAA ATGCTTATTA AACCTTGAAG CAACTGTGAA TTCATTCTGG AGGGGCAGTG 3120 

GAGATCAGGA GTGACAGATC ACAGGGTGAG GGCCACCTCC ACACCCACCC CCTCTGGAGA 3180 

AGGCCTGGAA GAGCTGAGAC CTTGCTTTGA GACTCCTCAG CACCCCTCCA GTTTTGCCTG 3240 

AGAAGGGGCA GATGTTCCCG GAGATCAGAA GACGTCTCCC CTTCTCTGCC TCACCTGGTC 3300 

GCCAATCCAT GCTCTCTTTC TTTTCTCTGT CTACTCCTTA TCCCTTGGTT TAGAGGAACC 3360 

CAAGATGTGG CCTTTAGCAA AACTGACAAT GTCCAAACCC ACTCATGACT GCATGACGGA 3420 

GCCGAGCATG TGTCTTTACA CCTCGCTGTT GTCACATCTC AGGGAACTGA CCCTCAGGCA 3480 

CACCTTGCAG AAGGAAGGCC CTGCCCTGCC CAACCTCTGT GGTCACCCAT GCATCATTCC 3540 

ACTGGAACGT TTCACTGCAA ACACACCTTG GAGAAGTGGC ATCAGTCAAC AGAGAGGGGC 3600 

AGGGAAGGAG ACACCAAGCT CACCCTTCGT CATGGACCGA GGTTCCCACT CTGGCAAAGC 3660 

CCCTCACACT GCAAGGGATT GTAGATAACA CTGACTTGTT TGTTTTAACC AATAACTAGC 3720 

TTCTTATAAT GATTTTTTTA CTAATGATAC TTACAAGTTT CTAGCTCTCA CAGACATATA 3780 

GAATAAGGGT TTTTGCATAA TAAGCAGGTT GTTATTTAGG TTAACAATAT TAATTCAGGT 3840 

TTTTTAGTTG GAAAAACAAT TCCTGTAACC TTCTATTTTC TATAATTGTA GTAATTGCTC 3900 

TACAGATAAT GTCTATATAT TGGCCAAACT GGTGCATGAC AAGTACTGTA TTTTTTTATA 3960 

CCTAAATAAA GAAAAATCTT TAGCCTGGGC AACAAAAAAA 

Seq ID NO: 339 Protein sequence 
Protein Accession #: NP_001786 

1 11 21 31 41 51 

I I I I I I 

MQRLMMLLAT SGACLGLLAV AAVAAAGANP AQRDTHSLLP THRRQKRDWI WNQMHIDEEK 60 

NTSLPHHVGK IKSSVSRKNA KYLLKGEYVG KVFRVDAETG DVFAIERLDR ENISEYHLTA 120 

VIVDKDTGEN LETPSSFTIK VHDVNDNWPV FTHRLFNASV PESSAVGTSV ISVTAVDADD 180 

PTVGDHASVM YQILKGKEYF AIDNSGRIIT ITKSLDREKQ ARYEIVVEAR DAQGLRGDSG 240 

TATVLVTLQD INDNFPFFTQ TKYTFVVPED TRVGTSVGSL FVEDPDEPQN RMTKYSILRG 300 

DYQDAFTIET NPAHNEGIIK PMKPLDYEYI QQYSFIVEAT DPTIDLRYMS PPAGNRAQVI 360 

INITDVDEPP IFQQPFYHFQ LKENQKKPLI GTVLAMDPDA ARHSIGYSIR RTSDKGQFFR 420 

VTKKGDIYNE KELDREVYPW YNLTVEAKEL DSTGTPTGKE SIVQVHIEVL DENDNAPEFA 480 

KPYQPKVCEN AVHGQLVLQI SAIDKDITPR NVKFKFTLNT ENNFTLTDNH DNTANITVKY 540 

GQFDREHTKV HFLPVVISDN GMPSRTGTST LTVAVCKCNE QGEFTFCEDM AAQVGVSIQA 600 

VVAILLCILT ITVITLLIFL RRRLRKQARA HGKSVPEIHE QLVTYDEEGG GEMDTTSYDV 660 

SVLNSVRRGG AKPPRPALDA RPSLYAQVQK PPRHAPGAHG GPGEMAAMIE VKKDEADHDG 720 

DGPPYDTLHI YGYEGSESIA ESLSSLGTDS SDSDVDYDFL NDWGPRFKML AELYGSDPRE 780 

ELLY 



Seq ID NO: 340 DNA sequence 
Nucleic Acid Accession #: NM_003088 
Coding sequence: 112-1593 



1 11 21 31 41 51 

I I I I I I 

GCGGAGGGTG CGTGCGGGCC GCGGCAGCCG AACAAAGGAG CAGGGGCGCC GCCGCAGGGA 60 

CCCGCCACCC ACCTCCCGGG GCCGCGCAGC GGCCTCTCGT CTACTGCCAC CATGACCGCC 120 

AACGGCACAG CCGAGGCGGT GCAGATCCAG TTCGGCCTCA TCAACTGCGG CAACAAGTAC 180 

CTGACGGCCG AGGCGTTCGG GTTCAAGGTG AACGCGTCCG CCAGCAGCCT GAAGAAGAAG 240 

CAGATCTGGA CGCTGGAGCA GCCCCCTGAC GAGGCGGGCA GCGCGGCCGT GTGCCTGCGC 300 

AGCCACCTGG GCCGCTACCT GGCGGCGGAC AAGGACGGCA ACGTGACCTG CGAGCGCGAG 360 

GTGCCCGGTC CCGACTGCCG TTTCCTCATC GTGGCGCACG ACGACGGTCG CTGGTCGCTG 420 

CAGTCCGAGG CGCACCGGCG CTACTTCGGC GGCACCGAGG ACCGCCTGTC CTGCTTCGCG 480 

CAGACGGTGT CCCCCGCCGA GAAGTGGAGC GTGCACATCG CCATGCACCC TCAGGTCAAC 540 

ATCTACAGTG TCACCCGTAA GCGCTACGCG CACCTGAGCG CGCGGCCGGC CGACGAGATC 600 

GCCGTGGACC GCGACGTGCC CTGGGGCGTC GACTCGCTCA TCACCCTCGC CTTCCAGGAC 660 

CAGCGCTACA GCGTGCAGAC CGCCGACCAC CGCTTCCTGC GCCACGACGG GCGCCTGGTG 720 

GCGCGCCCCG AGCCGGCCAC TGGCTACACG CTGGAGTTCC GCTCCGGCAA GGTGGCCTTC 780 

CGCGACTGCG AGGGCCGTTA CCTGGCGCCG TCGGGGCCCA GCGGCACGCT CAAGGCGGGC 840 

AAGGCCACCA AGGTGGGCAA GGACGAGCTC TTTGCTCTGG AGCAGAGCTG CGCCCAGGTC 900 

GTGCTGCAGG CGGCCAACGA GAGGAACGTG TCCACGCGCC AGGGTATGGA CCTGTCTGCC 960 

AATCAGGACG AGGAGACCGA CCAGGAGACC TTCCAGCTGG AGATCGACCG CGACACCAAA 1020 

AAGTGTGCCT TCCGTACCCA CACGGGCAAG TACTGGACGC TGACGGCCAC CGGGGGCGTG 1080 

CAGTCCACCG CCTCCAGCAA GAATGCCAGC TGCTACTTTG ACATCGAGTG GCGTGACCGG 1140 

CGCATCACAC TGAGGGCGTC CAATGGCAAG TTTGTGACCT CCAAGAAGAA TGGGCAGCTG 1200 

GCCGCCTCGG TGGAGACAGC AGGGGACTCA GAGCTCTTCC TCATGAAGCT CATCAACCGC 1260 

CCCATCATCG TGTTCCGCGG GGAGCATGGC TTCATCGGCT GCCGCAAGGT CACGGGCACC 1320 

CTGGACGCCA ACCGCTCCAG CTATGACGTC TTCCAGCTGG AGTTCAACGA TGGCGCCTAC 1380 

AACATCAAAG ACTCCACAGG CAAATACTGG ACGGTGGGCA GTGACTCCGC GGTCACCAGC 1440 

AGCGGCGACA CTCCTGTGGA CTTCTTCTTC GAGTTCTGCG ACTATAACAA GGTGGCCATC 1500 

AAGGTGGGCG GGCGCTACCT GAAGGGCGAC CACGCAGGCG TCCTGAAGGC CTCGGCGGAA 1560 

ACCGTGGACC CCGCCTCGCT CTGGGAGTAC TAGGGCCGGC CCGTCCTTCC CCGCCCCTGC 1620 

CCACATGGCG GCTCCTGCCA ACCCTCCCTG CTAACCCCTT CTCCGCCAGG TGGGCTCCAG 1680 

GGCGGGAGGC AAGCCCCCTT GCCTTTCAAA CTGGAAACCC CAGAGAAAAC GGTGCCCCCA 1740 

CCTGTCGCCC CTATGGACTC CCCACTCTCC CCTCCGCCCG GGTTCCCTAC TCCCCTCGGG 1800 

TCAGCGGCTG CGGCCTGGCC CTGGGAGGGA TTTCAGATGC CCCTGCCCTC TTGTCTGCCA 1860 

CGGGGCGAGT CTGGCACCTC TTTCTTCTGA CCTCAGACGG CTCTGAGCCT TATTTCTCTG 1920 

GAAGCGGCTA AGGGACGGTT GGGGGCTGGG AGCCCTGGGC GTGTAGTGTA ACTGGAATCT 1980 

TTTGCCTCTC CCAGCCACCT CCTCCCAGCC CCCCAGGAGA GCTGGGCACA TGTCCCAAGC 2040 

CTGTCAGTGG CCCTCCCTGG TGCACTGTCC CCGAAACCCC TGCTTGGGAA GGGAAGCTGT 2100 

CGGGAGGGCT AGGACTGACC CTTGTGGTGT TTTTTTGGGT GGTGGCTGGA AACAGCCCCT 2160 

CTCCCACGTG GGAGAGGCTC AGCCTGGCTC CCTTCCCTGG AGCGGCAGGG CGTGACGGCC 2220 

ACAGGGTCTG CCCGCTGCAC GTTCTGCCAA GGTGGTGGTG GCGGGCGGGT AGGGGTGTGG 2280 

GGGCCGTCTT CCTCCTGTCT CTTTCCTTTC ACCCTAGCCT GACTGGAAGC AGAAAATGAC 2340 

CAAATCAGTA TTTTTTTTAA TGAAATATTA TTGCTGGAGG CGTCCCAGGC AAGCCTGGCT 2400 

GTAGTAGCGA GTGATCTGGC GGGGGGCGTC TCAGCACCCT CCCCAGGGGG TGCATCTCAG 2460 

CCCCCTCTTT CCGTCCTTCC CGTCCAGCCC CAGCCCTGGG CCTGGGCTGC CGACACCTGG 2520 

GCCAGAGCCC CTGCTGTGAT TGGTGCTCCC TGGGCCTCCC GGGTGGATGA AGCCAGGCGT 2580 

CGCCCCCTCC GGGAGCCCTG GGGTGAGCCG CCGGGGCCCC CCTGCTGCCA GCCTCCCCCG 2640 

TCCCCAACAT GCATCTCACT CTGGGTGTCT TGGTCTTTTA TTTTTTGTAA GTGTCATTTG 2700 

TATAACTCTA AACGCCCATG ATAGTAGCTT CAAACTGGAA ATAGCGAAAT AAAATAACTC 2760 
AGTCTGC 

Seq ID NO: 341 Protein sequence 
Protein Accession #: NP_003079 

1 11 21 31 41 51 

I I I I I I 

MTANGTAEAV QIQFGLINCG NKYLTAEAFG FKVNASASSL KKKQIWTLEQ PPDEAGSAAV 60 

CLRSHLGRYL AADKDGNVTC EREVPGPDCR FLIVAHDDGR WSLQSEAHRR YFGGTEDRLS 120 

CFAQTVSPAE KWSVHIAMHP QVNIYSVTRK RYAHLSARPA DEIAVDRDVP WGVDSLITLA 180 

FQDQRYSVQT ADHRFLRHDG RLVARPEPAT GYTLEFRSGK VAFRDCEGRY LAPSGPSGTL 240 

KAGKATKVGK DELFALEQSC AQVVLQAANE RNVSTRQGMD LSANQDEETD QETFQLEIDR 300 

DTKKCAFRTH TGKYWTLTAT GGVQSTASSK NASCYFDIEW RDRRITLRAS NGKFVTSKKN 360 

GQLAASVETA GDSELFLMKL INRPIIVFRG EHGFIGCRKV TGTLDANRSS YDVFQLEFND 420 

GAYNIKDSTG KYWTVGSDSA VTSSGDTPVD FFFEFCDYNK VAIKVGGRYL KGDHAGVLKA 480 
SAETVDPASL WEY 



Seq ID NO: 342 DNA sequence 

Nucleic Acid Accession #: FGENESH predicted 

Coding sequence: 660-1705 

1 11 21 31 41 51 

I I I I I I 

CGCTCCGCAC ACATTTCCTG TCGCGGCCTA AGGGAAACTG TTGGCCGCTG GGCCCGCGGG 60 

GGGATTCTTG GCAGTTGGGG GGTCCGTCGG GAGCGAGGGC GGAGGGGAAG GGAGGGGGAA 120 

CCGGGTTGGG GAAGCCAGCT GTAGAGGGCG GTGACCGCGC TCCAGACACA GCTCTGCGTC 180 

CTCGAGCGGG ACAGATCCAA GTTGGGAGCA GCTCTGCGTG CGGGGCCTCA GAGAATGAGG 240 

CCGGCGTTCG CCCTGTGCCT CCTCTGGCAG GCGCTCTGGC CCGGGCCGGG CGGCGGCGAA 300 

CACCCCACTG CCGACCGTGC TGGCTGCTCG GCCTCGGGGG CCTGCTACAG CCTGCACCAC 360 

GCTACCATGA AGCGGCAGGC GGCCGAGGAG GCCTGCATCC TGCGAGGTGG GGCGCTCAGC 420 

ACCGTGCGTG CGGGCGCCGA GCTGCGCGCT GTGCTCGCGC TCCTGCGGGC AGGCCCAGGG 480 

CCCGGAGGGG GCTCCAAAGA CCTGCTGTTC TGGGTCGCAC TGGAGCGCAG GCGTTCCCAC 540 

TGCACCCTGG AGAACGAGCC TTTGCGGGGT TTCTCCTGGC TGTCCTCCGA CCCCGGCGGT 600 

CTCGAAAGCG ACACGCTGCA GTGGGTGGAG GAGCCCCAAC GCTCCTGCAC CGCGCGGAGA 660 

TGCGCGGTAC TCCAGGCCAC CGGTGGGGTC GAGCCCGCAG CTGGAAGGAG ATGCGATGCC 720 

ACCTGCGCGC CAACGGCTAC CTGTGCAAGT ACCAGTTTGA GGTCTTGTGT CCTGCGCCGC 780 

GCCCCGGGGC CGCCTCTAAC TTGAGCTATC GCGCGCCCTT CCAGCTGCAC AGCGCCGCTC 840 

TGGACTTCAG TCCACCTGGG ACCGAGGTGA GTGCGCTCTG CCGGGGACAG CTCCCGATCT 900 

CAGTTACTTG CATCGCGGAC GAAATCGGCG CTCGCTGGGA CAAACTCTCG GGCGATGTGT 960 

TGTGTCCCTG CCCCGGGAGG TACCTCCGTG CTGGCAAATG CGCAGAGCTC CCTAACTGCC 1020 

TAGACGACTT GGGAGGCTTT GCCTGCGAAT GTGCTACGGG CTTCGAGCTG GGGAAGGACG 1080 

GCCGCTCTTG TGTGACCAGT GGGGAAGGAC AGCCGACCCT TGGGGGGACC GGGGTGCCCA 1140 

CCAGGCGCCC GCCGGCCACT GCAACCAGCC CCGTGCCGCA GAGAACATGG CCAATCAGGG 1200 

TCGACGAGAA GCTGGGAGAG ACACCACTTG TCCCTGAACA AGACAATTCA GTAACATCTA 1260 

TTCCTGAGAT TCCTCGATGG GGATCACAGA GCACGATGTC TACCCTTCAA ATGTCCCTTC 1320 

AAGCCGAGTC AAAGGCCACT ATCACCCCAT CAGGGAGCGT GATTTCCAAG TTTAATTCTA 1380 

CGACTTCCTC TGCCACTCCT CAGGCTTTCG ACTCCTCCTC TGCCGTGGTC TTCATATTTG 1440 

TGAGCACAGC AGTAGTAGTG TTGGTGATCT TGACCATGAC AGTACTGGGG CTTGTCAAGC 1500 

TCTGCTTTCA CGAAAGCCCC TCTTCCCAGC CAAGGAAGGA GTCTATGGGC CCGCCGGGCC 1560 

TGGAGAGTGA TCCTGAGCCC GCTGCTTTGG GCTCCAGTTC TGCACATTGC ACAAACAATG 1620 

GGGTGAAAGT CGGGGACTGT GATCTGCGGG ACAGAGCAGA GGGTGCCTTG CTGGCGGAGT 1680 
CCCCTCTTGG CTCTAGTGAT GCATAG

Seq ID NO: 343 Protein sequence 
Protein Accession #: FGENESH predicted 

1 11 21 31 41 51 

I I I I I I 

MGKDFMTKTP KAFATKAKID KWDLIKLKSF CTAKETIIRV NSQPTDWQKT FAIYPSDKGV 60 

IARIYKELEQ IYKKKKPTKT LRTHFLSRPK GNCWPLGPRG DSWQLGGPSG ARAEGKGGGT 120 

GLGKPAVEGG DRAPDTALRP RAGQIQVGSS SACGASENEA GVRPVPPLAG ALARAGRRRT 180 

PHCRPCWLLG LGGLLQPAPR YHEAAGGRGG LHPARWGAQH RACGRRAARC ARAPAGRPRA 240 

RRGLQRPAVL GRTGAQAFPL HPGERAFAGF LLAVLRPRRS RKRHAAVGGG APTLLHRAEM 300

RGTPGHRWGR ARSWKEMRCH LRANGYLCKY QFEVLCPAPR PGAASNLSYR APFQLHSAAL 360 

DFSPPGTEVS ALCRGQLPIS VTCIADEIGA RWDKLSGDVL CPCPGRYLRA GKCAELPNCL 420 

DDLGGFACEC ATGFELGKDG RSCVTSGEGQ PTLGGTGVPT RRPPATATSP VPQRTWPIRV 480 

DEKLGETPLV PEQDNSVTSI PEIPRWGSQS TMSTLQMSLQ AESKATITPS GSVISKFNST 540

TSSATPQAFD SSSAVVFIFV STAVVVLVIL TMTVLGLVKL CFHESPSSQP RKESMGPPGL 600
ESDPEPAALG SSSAHCTNNG VKVGDCDLRD RAEGALLAES PLGSSDA 






Seq ID NO: 344 DNA sequence 
Nucleic Acid Accession #: NM_012072
Coding sequence: 149-2107 

1 11 21 31 41 51

I I I I I I

AAAGCCCTCA GCCTTTGTGT CCTTCTCTGC GCCGGAGTGG CTGCAGCTCA CCCCTCAGCT 60 

CCCCTTGGGG CCCAGCTGGG AGCCGAGATA GAAGCTCCTG TCGCCGCTGG GCTTCTCGCC 120 

TCCCGCAGAG GGCCACACAG AGACCGGGAT GGCCACCTCC ATGGGCCTGC TGCTGCTGCT 180

GCTGCTGCTC CTGACCCAGC CCGGGGCGGG GACGGGAGCT GACACGGAGG CGGTGGTCTG 240 

CGTGGGGACC GCCTGCTACA CGGCCCACTC GGGCAAGCTG AGCGCTGCCG AGGCCCAGAA 300 

CCACTGCAAC CAGAACGGGG GCAACCTGGC CACTGTGAAG AGCAAGGAGG AGGCCCAGCA 360 

CGTCCAGCGA GTACTGGCCC AGCTCCTGAG GCGGGAGGCA GCCCTGACGG CGAGGATGAG 420 

CAAGTTCTGG ATTGGGCTCC AGCGAGAGAA GGGCAAGTGC CTGGACCCTA GTCTGCCGCT 480

GAAGGGCTTC AGCTGGGTGG GCGGGGGGGA GGACACGCCT TACTCTAACT GGCACAAGGA 540 

GCTCCGGAAC TCGTGCATCT CCAAGCGCTG TGTGTCTCTG CTGCTGGACC TGTCCCAGCC 600 

GCTCCTTCCC AACCGCCTGC CCAAGTGGTC TGAGGGCCCC TGTGGGAGCC CAGGCTCCCC 660 

CGGAAGTAAC ATTGAGGGCT TCGTGTGCAA GTTCAGCTTC AAAGGCATGT GCCGGCCTCT 720 

GGCCCTGGGG GGCCCAGGTC AGGTGACCTA CACCACCCCC TTCCAGACCA CCAGTTCCTC 780

CTTGGAGGCT GTGCCCTTTG CCTCTGCGGC CAATGTAGCC TGTGGGGAAG GTGACAAGGA 840 

CGAGACTCAG AGTCATTATT TCCTGTGCAA GGAGAAGGCC CCCGATGTGT TCGACTGGGG 900 

CAGCTCGGGC CCCCTCTGTG TCAGCCCCAA GTATGGCTGC AACTTCAACA ATGGGGGCTG 960 

CCACCAGGAC TGCTTTGAAG GGGGGGATGG CTCCTTCCTC TGCGGCTGCC GACCAGGATT 1020 

CCGGCTGCTG GATGACCTGG TGACCTGTGC CTCTCGAAAC CCTTGCAGCT CCAGCCCATG 1080

TCGTGGGGGG GCCACGTGCG TCCTGGGACC CCATGGGAAA AACTACACGT GCCGCTGCCC 1140 

CCAAGGGTAC CAGCTGGACT CGAGTCAGCT GGACTGTGTG GACGTGGATG AATGCCAGGA 1200 

CTCCCCCTGT GCCCAGGAGT GTGTCAACAC CCCTGGGGGC TTCCGCTGCG AATGCTGGGT 1260

TGGCTATGAG CCGGGCGGTC CTGGAGAGGG GGCCTGTCAG GATGTGGATG AGTGTGCTCT 1320 

GGGTCGCTCG CCTTGCGCCC AGGGCTGCAC CAACACAGAT GGCTCATTTC ACTGCTCCTG 1380

TGAGGAGGGC TACGTCCTGG CCGGGGAGGA CGGGACTCAG TGCCAGGACG TGGATGAGTG 1440 

TGTGGGCCCG GGGGGCCCCC TCTGCGACAG CTTGTGCTTC AACACACAAG GGTCCTTCCA 1500 

CTGTGGCTGC CTGCCAGGCT GGGTGCTGGC CCCAAATGGG GTCTCTTGCA CCATGGGGCC 1560 

TGTGTCTCTG GGACCACCAT CTGGGCCCCC CGATGAGGAG GACAAAGGAG AGAAAGAAGG 1620 

GAGCACCGTG CCCCGCGCTG CAACAGCCAG TCCCACAAGG GGCCCCGAGG GCACCCCCAA 1680

GGCTACACCC ACCACAAGTA GACCTTCGCT GTCATCTGAC GCCCCCATCA CATCTGCCCC 1740 

ACTCAAGATG CTGGCCCCCA GTGGGTCCTC AGGCGTCTGG AGGGAGCCCA GCATCCATCA 1800 

CGCCACAGCT GCCTCTGGCC CCCAGGAGCC TGCAGGTGGG GACTCCTCCG TGGCCACACA 1860 

AAACAACGAT GGCACTGACG GGCAAAAGCT GCTTTTATTC TACATCCTAG GCACCGTGGT 1920 

GGCCATCCTA CTCCTGCTGG CCCTGGCTCT GGGGCTACTG GTCTATCGCA AGCGGAGAGC 1980

GAAGAGGGAG GAGAAGAAGG AGAAGAAGCC CCAGAATGCG GCAGACAGTT ACTCCTGGGT 2040 

TCCAGAGCGA GCTGAGAGCA GGGCCATGGA GAACCAGTAC AGTCCGACAC CTGGGACAGA 2100 

CTGCTGAAAG TGAGGTGGCC CTAGAGACAC TAGAGTCACC AGCCACCATC CTCAGAGCTT 2160 

TGAACTCCCC ATTCCAAAGG GGCACCCACA TTTTTTTGAA AGACTGGACT GGAATCTTAG 2220 

CAAACAATTG TAAGTCTCCT CCTTAAAGGC CCCTTGGAAC ATGCAGGTAT TTTCTACGGG 2280

TGTTTGATGT TCCTGAAGTG GAAGCTGTGT GTTGGCGTGC CACGGTGGGG ATTTCGTGAC 2340 

TCTATAATGA TTGTTACTCC CCCTCCCTTT TCAAATTCCA ATGTGACCAA TTCCGGATCA 2400 

GGGTGTGAGG AGGCTGGGGC TAAGGGGCTC CCCTGAATAT CTTCTCTGCT CACTTCCACC 2460 

ATCTAAGAGG AAAAGGTGAG TTGCTCATGC TGATTAGGAT TGAAATGATT TGTTTCTCTT 2520

CCTAGGATGA AAACTAAATC AATTAATTAT TCAATTAGGT AAGAAGATCT GGTTTTTTGG 2580

TCAAAGGGAA CATGTTCGGA CTGGAAACAT TTCTTTACAT TTGCATTCCT CCATTTCGCC 2640 

AGCACAAGTC TTGCTAAATG TGATACTGTT GACATCCTCC AGAATGGCCA GAAGTGCAAT 2700 

TAACCTCTTA GGTGGCAAGG AGGCAGGAAG TGCCTCTTTA GTTCTTACAT TTCTAATAGC 2760 

CTTGGGTTTA TTTGCAAAGG AAGCTTGAAA AATATGAGAA AAGTTGCTTG AAGTGCATTA 2820 

CAGGTGTTTG TGAAGTCACA TAATCTACGG GGCTAGGGCG AGAGAGGCCA GGGATTTGTT 2880

CACAGATACT TGAATTAATT CATCCAAATG TACTGAGGTT ACCACACACT TGACTACGGA 2940 

TGTGATCAAC ACTAACAAGG AAACAAATTC AAGGACAACC TGTCTTTGAG CCAGGGCAGG 3000 

CCTCAGACAC CCTGCCTGTG GCCCCGCCTC CACTTCATCC TGCCCGGAAT GCCAGTGCTC 3060 

CGAGCTCAGA CAGAGGAAGC CCTGCAGAAA GTTCCATCAG GCTGTTTCCT AAAGGATGTG 3120 

TGAACGGGAG ATGATGCACT GTGTTTTGAA AGTTGTCATT TTAAAGCATT TTAGCACAGT 3180

TCATAGTCCA CAGTTGATGC AGCATCCTGA GATTTTAAAT CCTGAAGTGT GGGTGGCGCA 3240 

CACACCAAGT AGGGAGCTAG TCAGGCAGTT TGCTTAAGGA ACTTTTGTTC TCTGTCTCTT 3300 

TTCCTTAAAA TTGGGGGTAA GGAGGGAAGG AAGAGGGAAA GAGATGACTA ACTAAAATCA 3360

TTTTTACAGC AAAAACTGCT CAAAGCCATT TAAATTATAT CCTCATTTTA AAAGTTACAT 3420 

TTGCAAATAT TTCTCCCTAT GATAATGCAG TCGATAGTGT GCACTCTTTC TCTCTCTCTC 3480

TCTCTCTCAC ACACACACAC ACACACACAC ACACACACAC AGAGACACGG CACCATTCTG 3540 

CCTGGGGCAC TGGAACACAT TCCTGGGGGT CACCGATGGT CAGAGTCACT AGAAGTTACC 3600 

TGAGTATCTC TGGGAGGCCT CATGTCTCCT GTGGGCTTTT TACCACCACT GTGCAGGAGA 3660 

ACAGACAGAG GAAATGTGTC TCCCTCCAAG GCCCCAAAGC CTCAGAGAAA GGGTGTTTCT 3720 

GGTTTTGCCT TAGCAATGCA TCGGTCTCTG AGGTGACACT CTGGAGTGGT TGAAGGGCCA 3780

CAAGGTGCAG GGTTAATACT CTTGCCAGTT TTGAAATATA GATGCTATGG TTCAGATTGT 3840 

TTTTAATAGA AAACTAAAGG GGCAGGGGAA GTGAAAGGAA AGATGGAGGT TTTGTGCGGC 3900 

TCGATGGGGC ATTTGGAACT TCTTTTTAAA GTCATCTCAT GGTCTCCAGT TTTCAGTTGG 3960 

AACTCTGGTG TTTAACACTT AAGGGAGACA AAGGCTGTGT CCATTTGGCA AAACTTCCTT 4020 

GGCCACGAGA CTCTAGGTGA TGTGTGAAGC TGGGCAGTCT GTGGTGTGGA GAGCAGCCAT 4080

CTGTCTGGCC ATTCAGAGGA TTCTAAAGAC ATGGCTGGAT GCGCTGCTGA CCAACATCAG 4140 

CACTTAAATA AATGCAAATG CAACATTTCT CCCTCTGGGC CTTGAAAATC CTTGCCCTTA 4200 

TCATTTGGGG TGAAGGAGAC ATTTCTGTCC TTGGCTTCCC ACAGCCCCAA CGCAGTCTGT 4260 

GTATGATTCC TGGGATCCAA CGAGCCCTCC TATTTTCACA GTGTTCTGAT TGCTCTCACA 4320 

GCCCAGGCCC ATCGTCTGTT CTCTGAATGC AGCCCTGTTC TCAACAACAG GGAGGTCATG 4380

GAACCCCTCT GTGGAACCCA CAAGGGGAGA AATGGGTGAT AAAGAATCCA GTTCCTCAAA 4440 

ACCTTCCCTG GCAGGCTGGG TCCCTCTCCT GCTGGGTGGT GCTTTCTCTT GCACACCACT 4500 

CCCACCACGG GGGGAGAGCC AGCAACCCAA CCAGACAGCT CAGGTTGTGC ATCTGATGGA 4560 

AACCACTGGG CTCAAACACG TGCTTTATTC TCCTGTTTAT TTTTGCTGTT ACTTTGAAGC 4620 

ATGGAAATTC TTGTTTGGGG GATCTTGGGG CTACAGTAGT GGGTAAACAA ATGCCCACCG 4680

GCCAAGAGGC CATTAACAAA TCGTCCTTGT CCTGAGGGGC CCCAGCTTGC TCGGGCGTGG 4740 

CACAGTGGGG AATCCAAGGG TCACAGTATG GGGAGAGGTG CACCCTGCCA CCTGCTAACT 4800

TCTCGCTAGA CACAGTGTTT CTGCCCAGGT GACCTGTTCA GCAGCAGAAC AAGCCAGGGC 4860

CATGGGGACG GGGGAAGTTT TCACTTGGAG ATGGACACCA AGACAATGAA GATTTGTTGT 4920 

CCAAATAGGT CAATAATTCT GGGAGACTCT TGGAAAAAAC TGAATATATT CAGGACCAAC 4980 

TCTCTCCCTC CCCTCATCCC ACATCTCAAA GCAGACAATG TAAAGAGAGA ACATCTCACA 5040 

CACCCAGCTC GCCATGCCTA CTCATTCCTG AATTTCAGGT GCCATCACTG CTCTTTCTTT 5100 

CTTCTTTGTC ATTTGAGAAA GGATGCAGGA GGACAATTCC CACAGATAAT CTGAGGAATG 5160 

CAGAAAAACC AGGGCAGGAC AGTTATCGAC AATGCATTAG AACTTGGTGA GCATCCTCTG 5220 

TAGAGGGACT CCACCCCTGC TCAACAGCTT GGCTTCCAGG CAAGACCAAC CACATCTGGT 5280 

CTCTGCCTTC GGTGGCCCAC ACACCTAAGC GTCATCGTCA TTGCCATAGC ATCATGATGC 5340 

AACACATCTA CGTGTAGCAC TACGACGTTA TGTTTGGGTA ATGTGGGGAT GAACTGCATG 5400 

AGGCTCTGAT TAAGGATGTG GGGAAGTGGG CTGCGGTCAC TGTCGGCCTT GCAAGGCCAC 5460 

CTGGAGGCCT GTCTGTTAGC CAGTGGTGGA GGAGCAAGGC TTCAGGAAGG GCCAGCCACA 5520 

TGCCATCTTC CCTGCGATCA GGCAAAAAAG TGGAATTAAA AAGTCAAACC TTTATATGCA 5580 

TGTGTTATGT CCATTTTGCA GGATGAACTG AGTTTAAAAG AATTTTTTTT TCTCTTCAAG 5640 

TTGCTTTGTC TTTTCCATCC TCATCACAAG CCCTTGTTTG AGTGTCTTAT CCCTGAGCAA 5700 

TCTTTCGATG GATGGAGATG ATCATTAGGT ACTTTTGTTT CAACCTTTAT TCCTGTAAAT 5760 

ATTTCTGTGA AAACTAGGAG AACAGAGATG AGATTTGACA AAAAAAAATT GAATTAAAAA 5820 

TAACACAGTC TTTTTAAAAC TAACATAGGA AAGCCTTTCC TATTATTTCT CTTCTTAGCT 5880 

TCTCCATTGT CTAAATCAGG AAAACAGGAA AACACAGCTT TCTAGCAGCT GCAAAATGGT 5940 

TTAATGCCCC CTACATATTT CCATCACCTT GAACAATAGC TTTAGCTTGG GAATCTGAGA 6000 

TATGATCCCA GAAAACATCT GTCTCTACTT CGGCTGCAAA ACCCATGGTT TAAATCTATA 6060 

TGGTTTGTGC ATTTTCTCAA CTAAAAATAG AGATGATAAT CCGAATTCTC CATATATTCA 6120

CTAATCAAAG ACACTATTTT CATACTAGAT TCCTGAGACA AATACTCACT GAAGGGCTTG 6180 

TTTAAAAATA AATTGTGTTT TGGTCTGTTC TTGTAGATAA TGCCCTTCTA TTTTAGGTAG 6240 

AAGCTCTGGA ATCCCTTTAT TGTGCTGTTG CTCTTATCTG CAAGGTGGCA AGCAGTTCTT 6300 

TTCAGCAGAT TTTGCCCACT ATTCCTCTGA GCTGAAGTTC TTTGCATAGA TTTGGCTTAA 6360 

GCTTGAATTA GATCCCTGCA AAGGCTTGCT CTGTGATGTC AGATGTAATT GTAAATGTCA 6420 

GTAATCACTT CATGAATGCT AAATGAGAAT GTAAGTATTT TTAAATGTGT GTATTTCAAA 6480 

TTTGTTTGAC TAATTCTGGA ATTACAAGAT TTCTATGCAG GATTTACCTT CATCCTGTGC 6540 

ATGTTTCCCA AACTGTGAGG AGGGAAGGCT CAGAGATCGA GCTTCTCCTC TGAGTTCTAA 6600 

CAAAATGGTG CTTTGAGGGT CAGCCTTTAG GAAGGTGCAG CTTTGTTGTC CTTTGAGCTT 6660 
TCTGTTATGT GCCTATCCTA ATAAACTCTT AAACACATT 

Seq ID NO: 345 Protein sequence 
Protein Accession #: NP_036204 

1 11 21 31 41 51 

I I I I I I

MATSMGLLLL LLLLLTQPGA GTGADTEAVV CVGTACYTAH SGKLSAAEAQ NHCNQNGGNL 60

ATVKSKEEAQ HVQRVLAQLL RREAALTARM SKFWIGLQRE KGKCLDPSLP LKGFSWVGGG 120 

EDTPYSNWHK ELRNSCISKR CVSLLLDLSQ PLLPNRLPKW SEGPCGSPGS PGSNIEGFVC 180 

KFSFKGMCRP LALGGPGQVT YTTPFQTTSS SLEAVPFASA ANVACGEGDK DETQSHYFLC 240

KEKAPDVFDW GSSGPLCVSP KYGCNFNNGG CHQDCFEGGD GSFLCGCRPG FRLLDDLVTC 300

ASRNPCSSSP CRGGATCVLG PHGKNYTCRC PQGYQLDSSQ LDCVDVDECQ DSPCAQECVN 360 

TPGGFRCECW VGYEPGGPGE GACQDVDECA LGRSPCAQGC TNTDGSFHCS CEEGYVLAGE 420 

DGTQCQDVDE CVGPGGPLCD SLCFNTQGSF HCGCLPGWVL APNGVSCTMG PVSLGPPSGP 480

PDEEDKGEKE GSTVPRAATA SPTRGPEGTP KATPTTSRPS LSSDAPITSA PLKMLAPSGS 540 

SGVWREPSIH HATAASGPQE PAGGDSSVAT QNNDGTDGQK LLLFYILGTV VAILLLLALA 600 
LGLLVYRKRR AKREEKKEKK PQNAADSYSW VPERAESRAM ENQYSPTPGT DC 

Seq ID NO: 346 DNA sequence 
Nucleic Acid Accession #: Z31560 
Coding sequence: <1-966

1 11 21 31 41 51

I I I I I I

CACAGCGCCC GCATGTACAA CATGATGGAG ACGGAGCTGA AGCCGCCGGG CCCGCAGCAA 60 

ACTTCGGGGG GCGGCGGCGG CAACTCCACC GCGGCGGCGG CCGGCGGCAA CCAGAAAAAC 120 

AGCCCGGACC GCGTCAAGCG GCCCATGAAT GCCTTCATGG TGTGGTCCCG CGGGCAGCGG 180 

CGCAAGATGG CCCAGGAGAA CCCCAAGATG CACAACTCGG AGATCAGCAA GCGCCTGGGC 240 

GCCGAGTGGA AACTTTTGTC GGAGACGGAG AAGCGGCCGT TCATCGACGA GGCTAAGCGG 300 

CTGCGAGCGC TGCACATGAA GGAGCACCCG GATTATAAAT ACCGGCCCCG GCGGAAAACC 360 

AAGACGCTCA TGAAGAAGGA TAAGTACACG CTGCCCGGCG GGCTGCTGGC CCCCGGCGGC 420 

AATAGCATGG CGAGCGGGGT CGGGGTGGGC GCCGGCCTGG GCGCGGGCGT GAACCAGCGC 480 

ATGGACAGTT ACGCGCACAT GAACGGCTGG AGCAACGGCA GCTACAGCAT GATGCAGGAC 540 

CAGCTGGGCT ACCCGCAGCA CCCGGGCCTC AATGCGCACG GCGCAGCGCA GATGCAGCCC 600 

ATGCACCGCT ACGACGTGAG CGCCCTGCAG TACAACTCCA TGACCAGCTC GCAGACCTAC 660 

ATGAACGGCT CGCCCACCTA CAGCATGTCC TACTCGCAGC AGGGCACCCC TGGCATGGCT 720 

CTTGGCTCCA TGGGTTCGGT GGTCAAGTCC GAGGCCAGCT CCAGCCCCCC TGTGGTTACC 780

TCTTCCTCCC ACTCCAGGGC GCCCTGCCAG GCCGGGGACC TCCGGGACAT GATCAGCATG 840 

TATCTCCCCG GCGCCGAGGT GCCGGAACCC GCCGCCCCCA GCAGACTTCA CATGTCCCAG 900 

CACTACCAGA GCGGCCCGGT GCCCGGCACG GCCATTAACG GCACACTGCC CCTCTCACAC 960

ATGTGAGGGC CGGACAGCGA ACTGGAGGGG GGAGAAATTT TCAAAGAAAA ACGAGGGAAA 1020 

TGGGAGGGGT GCAAAAGAGG AGAGTAAGAA ACAGCATGGA GAAAACCCGG TACGCTCAAA 1080 
AAAAA 

Seq ID NO: 347 Protein sequence 
Protein Accession #: CAA83435

1 11 21 31 41 51

I I I I I I

HSARMYNMME TELKPPGPQQ TSGGGGGNST AAAAGGNQKN SPDRVKRPMN AFMVWSRGQR 60

RKMAQENPKM HNSEISKRLG AEWKLLSETE KRPFIDEAKR LRALHMKEHP DYKYRPRRKT 120

KTLMKKDKYT LPGGLLAPGG NSMASGVGVG AGLGAGVNQR MDSYAHMNGW SNGSYSMMQD 180

QLGYPQHPGL NAHGAAQMQP MHRYDVSALQ YNSMTSSQTY MNGSPTYSMS YSQQGTPGMA 240

LGSMGSVVKS EASSSPPVVT SSSHSRAPCQ AGDLRDMISM YLPGAEVPEP AAPSRLHMSQ 300
HYQSGPVPGT AINGTLPLSH M

Seq ID NO: 348 DNA sequence 
Nucleic Acid Accession #: NM_002638
Coding sequence: 120-473

1 11 21 31 41 51

I I I I I I

CAATACAGCT AAGGAATTAT CCCTTGTAAA TACCACAGAC CCGCCCTGGA GCCAGGCCAA 60 

GCTGGACTGC ATAAAGATTG GTATGGCCTT AGCTCTTAGC CAAACACCTT CCTGACACCA 120 

TGAGGGCCAG CAGCTTCTTG ATCGTGGTCG TGTTCCTCAT CGCTGGGACG CTGGTTCTAG 180

AGGCAGCTGT CACGGGAGTT CCTGTTAAAG GTCAAGACAC TGTCAAAGGC CGTGTTCCAT 240 

TCAATGGACA AGATCCCGTT AAAGGACAAG TTTCAGTTAA AGGTCAAGAT AAAGTCAAAG 300 

CGCAAGAGCC AGTCAAAGGT CCAGTCTCCA CTAAGCCTGG CTCCTGCCCC ATTATCTTGA 360 

TCCGGTGCGC CATGTTGAAT CCCCCTAACC GCTGCTTGAA AGATACTGAC TGCCCAGGAA 420 

TCAAGAAGTG CTGTGAAGGC TCTTGCGGGA TGGCCTGTTT CGTTCCCCAG TGAAGGGAGC 480 

CGGTCCTTGC TGCACCTGTG CCGTCCCCAG AGCTACAGGC CCCATCTGGT CCTAAGTCCC 540 

TGCTGCCCTT CCCCTTCCCA CACTGTCCAT TCTTCCTCCC ATTCAGGATG CCCACGGCTG 600 
GAGCTGCCTC TCTCATCCAC TTTCCAATAA A 

Seq ID NO: 349 Protein sequence
Protein Accession #: NP_002629

1 11 21 31 41 51 

I I I I I I 

MRASSFLIVV VFLIAGTLVL EAAVTGVPVK GQDTVKGRVP FNGQDPVKGQ VSVKGQDKVK 60

AQEPVKGPVS TKPGSCPIIL IRCAMLNPPN RCLKDTDCPG IKKCCEGSCG MACFVPQ 



Seq ID NO: 350 DNA sequence 
Nucleic Acid Accession #: NM_007183
Coding sequence: 75-2468 

1 11 21 31 41 51 

I I I I I I 

GAATTCCGGA CAGGACGTGA AGATAGTTGG GTTTGGAGGC GGCCGCCAGG CCCAGGCCCG 60 

GTGGACCTGC CGCCATGCAG GACGGTAACT TCCTGCTGTC GGCCCTGCAG CCTGAGGCCG 120 

GCGTGTGCTC CCTGGCGCTG CCCTCTGACC TGCAGCTGGA CCGCCGGGGC GCCGAGGGGC 180 

CGGAGGCCGA GCGGCTGCGG GCAGCCCGCG TCCAGGAGCA GGTCCGCGCC CGCCTCTTGC 240 

AGCTGGGACA GCAGCCGCGG CACAACGGGG CCGCTGAGCC CGAGCCTGAG GCCGAGACTG 300 

CCAGAGGCAC ATCCAGGGGG CAGTACCACA CCCTGCAGGC TGGCTTCAGC TCTCGCTCTC 360 

AGGGCCTGAG TGGGGACAAG ACCTCGGGCT TCCGGCCCAT CGCCAAGCCG GCCTACAGCC 420 

CAGCCTCCTG GTCCTCCCGC TCCGCCGTGG ATCTGAGCTG CAGTCGGAGG CTGAGTTCAG 480 

CCCACAATGG GGGCAGCGCC TTTGGGGCCG CTGGGTACGG GGGTGCCCAG CCCACCCCTC 540 

CCATGCCCAC CAGGCCCGTG TCCTTCCATG AGCGCGGTGG GGTTGGGAGC CGGGCCGACT 600 

ATGACACACT CTCCCTGCGC TCGCTGCGGC TGGGGCCCGG GGGCCTGGAC GACCGCTACA 660 

GCCTGGTGTC TGAGCAGCTG GAGCCCGCGG CCACCTCCAC CTACAGGGCC TTTGCGTACG 720 

AGCGCCAGGC CAGCTCCAGC TCCAGCCGGG CAGGGGGGCT GGACTGGCCC GAGGCCACTG 780 

AGGTTTCCCC GAGCCGGACC ATCCGTGCCC CTGCCGTGCG GACCCTGCAG CGATTCCAGA 840 

GCAGCCACCG GAGCCGCGGG GTAGGCGGGG CAGTGCCGGG GGCCGTCCTG GAGCCAGTGG 900 

CTCGAGCGCC ATCTGTGCGC AGCCTCAGCC TCAGCCTGGC TGACTCGGGC CACCTGCCGG 960

ACGTGCATGG GTTCAACAGC TACGGTAGCC ACCGAACCCT GCAGAGACTC AGCAGCGGTT 1020 

TTGATGACAT TGACCTGCCC TCAGCAGTCA AGTACCTCAT GGCTTCAGAC CCCAACCTGC 1080 

AGGTGCTGGG AGCGGCCTAC ATCCAGCACA AGTGCTACAG CGATGCAGCC GCCAAGAAGC 1140 

AGGCCCGCAG CCTTCAGGCC GTGCCTAGGC TGGTGAAGCT CTTCAACCAC GCCAACCAGG 1200 

AAGTGCAGCG CCATGCCACA GGTGCCATGC GCAACCTCAT CTACGACAAC GCTGACAACA 1260 

AGCTGGCCCT GGTGGAGGAG AACGGGATCT TCGAGCTGCT GCGGACACTG CGGGAGCAGG 1320 

ATGATGAGCT TCGCAAAAAT GTCACAGGGA TCCTGTGGAA CCTTTCATCC AGCGACCACC 1380 

TGAAGGACCG CCTGGCCAGA GACACGCTGG AGCAGCTCAC GGACCTGGTG TTGAGCCCCC 1440 

TGTCGGGGGC TGGGGGTCCC CCCCTCATCC AGCAGAACGC CTCGGAGGCG GAGATCTTCT 1500 

ACAACGCCAC CGGCTTCCTC AGGAACCTCA GCTCAGCCTC TCAGGCCACT CGCCAGAAGA 1560 

TGCGGGAGTG CCACGGGCTG GTGGACGCCC TGGTCACCTC TATCAACCAC GCCCTGGACG 1620 

CGGGCAAATG CGAGGACAAG AGCGTGGAGA ACGCGGTGTG CGTCCTGCGG AACCTGTCCT 1680 

ACCGCCTCTA CGACGAGATG CCGCCGTCCG CGCTGCAGCG GCTGGAGGGT CGCGGCCGCA 1740 

GGGACCTGGC GGGGGCGCCG CCGGGAGAGG TCGTGGGCTG CTTCACGCCG CAGAGCCGGC 1800

GGCTGCGCGA GCTGCCCCTC GCCGCCGATG CGCTCACCTT CGCGGAGGTG TCCAAGGACC 1860 

CCAAGGGCCT CGAGTGGCTG TGGAGCCCCC AGATCGTGGG GCTGTACAAC CGGCTGCTGC 1920 

AGCGCTGCGA GCTCAACCGG CACACGACGG AGGCGGCCGC CGGGGCGCTG CAGAACATCA 1980 

CGGCAGGCGA CCGCAGGTGG GCGGGGGTGC TGAGCCGCCT GGCCCTGGAG CAGGAGCGTA 2040 

TTCTGAACCC CCTGCTAGAC CGTGTCAGGA CCGCCGACCA CCACCAGCTG CGCTCACTGA 2100 

CTGGCCTCAT CCGAAACCTG TCTCGGAACG CTAGGAACAA GGACGAGATG TCCACGAAGG 2160 

TGGTGAGCCA CCTGATCGAG AAGCTGCCAG GCAGCGTGGG TGAGAAGTCG CCCCCAGCCG 2220 

AGGTGCTGGT CAACATCATA GCTGTGCTCA ACAACCTGGT GGTGGCCAGC CCCATCGCTG 2280 

CCCGAGACCT GCTGTATTTT GACGGACTCC GAAAGCTCAT CTTCATCAAG AAGAAGCGGG 2340 

ACAGCCCCGA CAGTGAGAAG TCCTCCCGGG CAGCATCCAG CCTCCTGGCC AACCTGTGGC 2400 

AGTACAACAA GCTCCACCGT GACTTTCGGG CGAAGGGCTA TCGGAAGGAG GACTTCCTGG 2460 

GCCCATAGGT GAAGCCTTCT GGAGGAGAAG GTGACGTGGC CCAGCGTCCA AGGGACAGAC 2520 

TCAGCTCCAG GCTGCTTGGC AGCCCAGCCT GGAGGAGAAG GCTAATGACG GAGGGGCCCC 2580 

TCGCTGGGGC CCCTGTGTGC ATCTTTGAGG GTCCTGGGCC ACCAGGAGGG GCAGGGTCTT 2640 

ATAGCTGGGG ACTTGGCTTC CGCAGGGCAG GGGGTGGGGC AGGGCTCAAG GCTGCTCTGG 2700 

TGTATGGGGT GGTGACCCAG TCACATTGGC AGAGGTGGGG GTTGGCTGTG GCCTGGCAGT 2760 

ATCTTGGGAT AGCCAGCACT GGGAATAAAG ATGGCCATGA ACAGTCACAA AAAAAAAAAA 2820 
AAAAGGAATT C 

Seq ID NO: 351 Protein sequence 
Protein Accession #: NP_009114.1

1 11 21 31 41 51

I I I I I I

MQDGNFLLSA LQPEAGVCSL ALPSDLQLDR RGAEGPEAER LRAARVQEQV RARLLQLGQQ 60

PRHNGAAEPE PEAETARGTS RGQYHTLQAG FSSRSQGLSG DKTSGFRPIA KPAYSPASWS 120

SRSAVDLSCS RRLSSAHNGG SAFGAAGYGG AQPTPPMPTR PVSFHERGGV GSRADYDTLS 180

LRSLRLGPGG LDDRYSLVSE QLEPAATSTY RAFAYERQAS SSSSRAGGLD WPEATEVSPS 240

RTIRAPAVRT LQRFQSSHRS RGVGGAVPGA VLEPVARAPS VRSLSLSLAD SGHLPDVHGF 300

NSYGSHRTLQ RLSSGFDDID LPSAVKYLMA SDPNLQVLGA AYIQHKCYSD AAAKKQARSL 360

QAVPRLVKLF NHANQEVQRH ATGAMRNLIY DNADNKLALV EENGIFELLR TLREQDDELR 420

KNVTGILWNL SSSDHLKDRL ARDTLEQLTD LVLSPLSGAG GPPLIQQNAS EAEIFYNATG 480

FLRNLSSASQ ATRQKMRECH GLVDALVTSI NHALDAGKCE DKSVENAVCV LRNLSYRLYD 540

EMPPSALQRL EGRGRRDLAG APPGEVVGCF TPQSRRLREL PLAADALTFA EVSKDPKGLE 600

WLWSPQIVGL YNRLLQRCEL NRHTTEAAAG ALQNITAGDR RWAGVLSRLA LEQERILNPL 660

LDRVRTADHH QLRSLTGLIR NLSRNARNKD EMSTKVVSHL IEKLPGSVGE KSPPAEVLVN 720

IIAVLNNLVV ASPIAARDLL YFDGLRKLIF IKKKRDSPDS EKSSRAASSL LANLWQYNKL 780
HRDFRAKGYR KEDFLGP



Seq ID NO: 352 DNA sequence 
Nucleic Acid Accession #: M31469 
Coding sequence: 1-651 



1 11 21 31 41 51

I I I I I I

ATGGCTGCGC AGGGAGAGCC CCAGGTCCAG TTCAAACTTG TATTGGTTGG TGATGGTGGT 60

ACTGGAAAAA CGACCTTCGT GAAACGTCAT TTGACTGGTG AATTTGAGAA GAAGTATGTA 120

GCCACCTTGG GTGTTGAGGT TCATCCCCTA GTGTTCCACA CCAACAGAGG ACCTATTAAG 180

TTCAATGTAT GGGACACAGC CGGCCAGGAG AAATTCGGTG GACTGAGAGA TGGCTATTAT 240

ATCCAAGCCC AGTGTGCCAT CATAATGTTT GATGTAACAT CGAGAGTTAC TTACAAGAAT 300

GTGCCTAACT GGCATAGAGA TCTGGTACGA GTGTGTGAAA ACATCCCCAT TGTGTTGTGT 360

GGCAACAAAG TGGATATTAA GGACAGGAAA GTGAAGGCGA AATCCATTGT CTTCCACCGA 420

AAGAAGAATC TTCAGTACTA CGACATTTCT GCCAAAAGTA ACTACAACTT TGAAAAGCCC 480

TTCCTCTGGC TTGCTAGGAA GCTCATTGGA GACCCTAACT TGGAATTTGT TGCCATGCCT 540

GCTCTCGCCC CACCAGAAGT TGTCATGGAC CCAGCTTTGG CAGCACAGTA TGAGCACGAC 600
TTAGAGGTTG CTCAGACAAC TGCTCTCCCG GATGAGGATG ATGACCTGTG A



Seq ID NO: 353 Protein sequence 
Protein Accession #: AAA36546 



1 11 21 31 41 51

I I I I I I

MAAQGEPQVQ FKLVLVGDGG TGKTTFVKRH LTGEFEKKYV ATLGVEVHPL VFHTNRGPIK 60

FNVWDTAGQE KFGGLRDGYY IQAQCAIIMF DVTSRVTYKN VPNWHRDLVR VCENIPIVLC 120

GNKVDIKDRK VKAKSIVFHR KKNLQYYDIS AKSNYNFEKP FLWLARKLIG DPNLEFVAMP 180
ALAPPEVVMD PALAAQYEHD LEVAQTTALP DEDDDL



Seq ID NO: 354 DNA sequence 
Nucleic Acid Accession #: NM_002820 
Coding sequence: 304-831 



1 11 21 31 41 51

I I I I I I

CCGGTTCGCA AAGAAGCTGA CTTCAGAGGG GGAAACTTTC TTCTTTTAGG AGGCGGTTAG 60

CCCTGTTCCA CGAACCCAGG AGAACTGCTG GCCAGATTAA TTAGACATTG CTATGGGAGA 120

CGTGTAAACA CACTACTTAT CATTGATGCA TATATAAAAC CATTTTATTT TCGCTATTAT 180

TTCAGAGGAA GCGCCTCTGA TTTGTTTCTT TTTTCCCTTT TTGCTCTTTC TGGCTGTGTG 240

GTTTGGAGAA AGCACAGTTG GAGTAGCCGG TTGCTAAATA AGTCCCGAGC GCGAGCGGAG 300

ACGATGCAGC GGAGACTGGT TCAGCAGTGG AGCGTCGCGG TGTTCCTGCT GAGCTACGCG 360

GTGCCCTCCT GCGGGCGCTC GGTGGAGGGT CTCAGCCGCC GCCTCAAAAG AGCTGTGTCT 420

GAACATCAGC TCCTCCATGA CAAGGGGAAG TCCATCCAAG ATTTACGGCG ACGATTCTTC 480

CTTCACCATC TGATCGCAGA AATCCACACA GCTGAAATCA GAGCTACCTC GGAGGTGTCC 540

CCTAACTCCA AGCCCTCTCC CAACACAAAG AACCACCCCG TCCGATTTGG GTCTGATGAT 600

GAGGGCAGAT ACCTAACTCA GGAAACTAAC AAGGTGGAGA CGTACAAAGA GCAGCCGCTC 660

AAGACACCTG GGAAGAAAAA GAAAGGCAAG CCCGGGAAAC GCAAGGAGCA GGAAAAGAAA 720

AAACGGCGAA CTCGCTCTGC CTGGTTAGAC TCTGGAGTGA CTGGGAGTGG GCTAGAAGGG 780

GACCACCTGT CTGACACCTC CACAACGTCG CTGGAGCTCG ATTCACGGTA ACAGGCTTCT 840

CTGGCCCGTA GCCTCAGCGG GGTGCTCTCA GCTGGGTTTT GGAGCCTCCC TTCTGCCTTG 900

GCTTGGACAA ACCTAGAATT TTCTCCCTTT ATGTATCTCT ATCGATTGTG TAGCAATTGA 960

CAGAGAATAA CTCAGAATAT TGTCTGCCTT AAAGCAGTAC CCCCCTACCA CACACACCCC 1020

TGTCCTCCAG CACCATAGAG AGGCGCTAGA GCCCATTCCT CTTTCTCCAC CGTCACCCAA 1080

CATCAATCCT TTACCACTCT ACCAAATAAT TTCATATTCA AGCTTCAGAA GCTAGTGACC 1140

ATCTTCATAA TTTGCTGGAG AAGTGTATTT CTTCCCCTTA CTCTCACACC TGGGCAAACT 1200

TTCTTCAGTG TTTTTCATTT CTTACGTTCT TTCACTTCAA GGGAGAATAT AGAAGCATTT 1260

GATATTATCT ACAAACACTG CAGAACAGCA TCATGTCATA AACGATTCTG AGCCATTCAC 1320

ACTTTTTATT TAATTAAATG TATTTAATTA AATCTCAAAT TTATTTTAAT GTAAAGAACT 1380

TAAATTATGT TTTAAACACA TGCCTTAAAT TTGTTTAATT AAATTTAACT CTGGTTTCTA 1440

CCAGCTCATA CAAAATAAAT GGTTTCTGAA AATGTTTAAG TATTAACTTA CAAGGATATA 1500

GGTTTTTCTC ATGTATCTTT TTGTTCATTG GCAAGATGAA ATAATTTTTC TAGGGTAATG 1560
CCGTAGGAAA AATAAAACTT CACATTTAAA AAAAA



Seq ID NO: 355 Protein sequence 
Protein Accession #: NM_002820 



1 11 21 31 41 51

I I I I I I

MQRRLVQQWS VAVFLLSYAV PSCGRSVEGL SRRLKRAVSE HQLLHDKGKS IQDLRRRFFL 60

HHLIAEIHTA EIRATSEVSP NSKPSPNTKN HPVRFGSDDE GRYLTQETNK VETYKEQPLK 120
TPGKKKKGKP GKRKEQEKKK RRTRSAWLDS GVTGSGLEGD HLSDTSTTSL ELDSR

Seq ID NO: 356 DNA sequence 
Nucleic Acid Accession #: NM_017522 
Coding sequence: 1-2100 

1 11 21 31 41 51 

I I I I I I

ATGGGCCTCC CCGAGCCGGG CCCTCTCCGG CTTCTGGCGC TGCTGCTGCT GCTGCTGCTG 60 

CTGCTGCTGC TGCGGCTCCA GCATCTTGCG GCGGCAGCGG CTGATCCGCT GCTCGGCGGC 120 

CAAGGGCCGG CCAAGGAGTG CGAAAAGGAC CAATTCCAGT GCCGGAACGA GCGCTGCATC 180 

CCCTCTGTGT GGAGATGCGA CGAGGACGAT GACTGCTTAG ACCACAGCGA CGAGGACGAC 240 

TGCCCCAAGA AGACCTGTGC AGACAGTGAC TTCACCTGTG ACAACGGCCA CTGCATCCAC 300 

GAACGGTGGA AGTGTGACGG CGAGGAGGAG TGTCCTGATG GCTCCGATGA GTCCGAGGCC 360 

ACTTGCACCA AGCAGGTGTG TCCTGCAGAG AAGCTGAGCT GTGGACCCAC CAGCCACAAG 420 

TGTGTACCTG CCTCGTGGCG CTGCGACGGG GAGAAGGACT GCGAGGGTGG AGCGGATGAG 480 

GCCGGCTGTG CTACCTCACT GGGCACCTGC CGTGGGGACG AGTTCCAGTG TGGGGATGGG 540 

ACATGTGTCC TTGCAATCAA GCACTGCAAC CAGGAGCAGG ACTGTCCAGA TGGGAGTGAT 600 

GAAGCTGGCT GCCTACAGGG GCTGAACGAG TGTCTGCACA ACAATGGCGG CTGCTCACAC 660 

ATCTGCACTG ACCTCAAGAT TGGCTTTGAA TGCACGTGCC CAGCAGGCTT CCAGCTCCTG 720 

GACCAGAAGA CTTGTGGCGA CATTGATGAG TGCAAGGACC CAGATGCCTG CAGCCAGATC 780 

TGTGTCAATT ACAAGGGCTA TTTTAAGTGT GAGTGCTACC CTGGCTGCGA GATGGACCTA 840 

CTGACCAAGA ACTGCAAGGC TGCTGCTGGC AAGAGCCCAT CCCTAATCTT CACCAACCGC 900 

ACGAGTGCGG AGGATCGACC TGTGAAGCGG AACTATTCAC GCCTCATCCC CATGCTCAAG 960 

AATGTCGTGG CACTAGATGT GGAAGTTGCC ACCAATCGCA TCTACTGGTG TGACCTCTCC 1020 

TACCGTAAGA TCTATAGCGC CTACATGGAC AAGGCCAGTG ACCCGAAAGA GCGGGAGGTC 1080 

CTCATTGACG AGCAGTTGCA CTCTCCAGAG GGCCTGGCAG TGGACTGGGT CCACAAGCAC 1140 

ATCTACTGGA CTGACTCGGG CAATAAGACC ATCTCAGTGG CCACAGTTGA TGGTGGCCGC 1200 

CGACGCACTC TCTTCAGCCG TAACCTCAGT GAACCCCGGG CCATCGCTGT TGACCCCCTG 1260 

CGAGGGTTCA TGTATTGGTC TGACTGGGGG GACCAGGCCA AGATTGAGAA ATCTGGGCTC 1320 

AACGGTGTGG ACCGGCAAAC ACTGGTGTCA GACAATATTG AATGGCCCAA CGGAATCACC 1380 

CTGGATCTGC TGAGCCAGCG CTTGTACTGG GTAGACTCCA AGCTACACCA ACTGTCCAGC 1440 

ATTGACTTCA GTGGAGGCAA CAGAAAGACG CTGATCTCCT CCACTGACTT CCTGAGCCAC 1500 

CCTTTTGGGA TAGCTGTGTT TGAGGACAAG GTGTTCTGGA CAGACCTGGA GAACGAGGCC 1560 

ATTTTCAGTG CAAATCGGCT CAATGGCCTG GAAATCTCCA TCCTGGCTGA GAACCTCAAC 1620 

AACCCACATG ACATTGTCAT CTTCCATGAG CTGAAGCAGC CAAGAGCTCC AGATGCCTGT 1680 

GAGCTGAGTG TCCAGCCTAA TGGAGGCTGT GAATACCTGT GCCTTCCTGC TCCTCAGATC 1740 

TCCAGCCACT CTCCCAAGTA CACATGTGCC TGTCCTGACA CAATGTGGCT GGGTCCAGAC 1800 

ATGAAGAGGT GCTACCGAGA TGCAAATGAA GACAGTAAGA TGGGCTCAAC AGTCACTGCC 1860 

GCTGTTATCG GGATCATCGT GCCCATAGTG GTGATAGCCC TCCTGTGCAT GAGTGGATAC 1920 

CTGATCTGGA GAAACTGGAA GCGGAAGAAC ACCAAAAGCA TGAATTTTGA CAACCCAGTC 1980 

TACAGGAAAA CAACAGAAGA AGAAGATGAA GATGAGCTCC ATATAGGGAG AACTGCTCAG 2040 

ATTGGCCATG TCTATCCTGC ACGAGTGGCA TTAAGCCTTG AAGATGATGG ACTACCCTGA 2100 

GGATGGGATC ACCCCCTTCG TGCCTCATGG AATTCAGTCC CATGCACTAC ACTCCGGATG 2160 

GTGTATGACT GGATGAATGG GTTTCTATAT ATGGGTCTGT GTGAGTGTAT GTGTGTGTGT 2220 

GATTTTTTTT TTTAAATTTA TGTTGCGGAA AGGTAACCAC AAAGTTATGA TGAACTGCAA 2280 

ACATCCAAAG GATGTGAGAG TTTTTCTATG TATAATGTTT TATACACTTT TTAACTGGTT 2340 

GCACTACCCA TGAGGAATTC GTGGAATGGC TACTGCTGAC TAACATGATG CACATAACCA 2400 

AATGGGGGCC AATGGCACAG TACCTTACTC ATCATTTAAA AACTATATTT ACAGAAGATG 2460 

TTTGGTTGCT GGGGGGCTTT TTTAGGTTTT GGGCATTTGT TTTTTGTAAA TAAGATGATT 2520 
ATGCTTTGTG GCTATCCATC AACATAAGT 

Seq ID NO: 357 Protein sequence 
Protein Accession #: NP_059992

1 11 21 31 41 51 

I I I I I I 

MGLPEPGPLR LLALLLLLLL LLLLRLQHLA AAAADPLLGG QGPAKECEKD QFQCRNERCI 60

PSVWRCDEDD DCLDHSDEDD CPKKTCADSD FTCDNGHCIH ERWKCDGEEE CPDGSDESEA 120 

TCTKQVCPAE KLSCGPTSHK CVPASWRCDG EKDCEGGADE AGCATSLGTC RGDEFQCGDG 180 

TCVLAIKHCN QEQDCPDGSD EAGCLQGLNE CLHNNGGCSH ICTDLKIGFE CTCPAGFQLL 240

DQKTCGDIDE CKDPDACSQI CVNYKGYFKC ECYPGCEMDL LTKNCKAAAG KSPSLIFTNR 300 

TSAEDRPVKR NYSRLIPMLK NVVALDVEVA TNRIYWCDLS YRKIYSAYMD KASDPKEREV 360

LIDEQLHSPE GLAVDWVHKH IYWTDSGNKT ISVATVDGGR RRTLFSRNLS EPRAIAVDPL 420

RGFMYWSDWG DQAKIEKSGL NGVDRQTLVS DNIEWPNGIT LDLLSQRLYW VDSKLHQLSS 480 

IDFSGGNRKT LISSTDFLSH PFGIAVFEDK VFWTDLENEA IFSANRLNGL EISILAENLN 540

NPHDIVIFHE LKQPRAPDAC ELSVQPNGGC EYLCLPAPQI SSHSPKYTCA CPDTMWLGPD 600 

MKRCYRDANE DSKMGSTVTA AVIGIIVPIV VIALLCMSGY LIWRNWKRKN TKSMNFDNPV 660 
YRKTTEEEDE DELHIGRTAQ IGHVYPARVA LSLEDDGLP 

Seq ID NO: 358 DNA sequence 
Nucleic Acid Accession #: M27826 
Coding sequence: <1-503

1 11 21 31 41 51 

I I I I I I 

AGCCCAAGAA ACATCTCACC AATTTCAAAT CTGATCTATT CGGCTTAGCG ACTGAAGATT 60 

GACGCTGCCC GATCGCCTCG GAAGTCCCCT GGACCATCAC AGAAGCCGAG CTTCGGGTAA 120 

CTCTCACAGT GGAGGGTAAG TCCATCCCCT GTTTAATCGA TACGGGGGCT ACCCACTCCA 180 

CGTTGCCTTC TTTTCAAGGG CCTGTTTCCC TTGCCCCCAT AACTGTTGTG GGTATTGACG 240 

GCCAAGCTTC AAAACCCCTG AAAACTCCCC CACTCTGGTG CCAACTTGGA CAACACTCTT 300 

TTATGCACTC TTTTTTAGTT ATCCCCACCT GCCCACTTCC CTTATTAGGC CGAAATATTT 360 

TAACCAAATT ATCTGCTTCC CTGACTATTC CTGGAGTACA GCTACATCTC ATTGCTGCCC 420 

TTCTTCCCAA TCCAAAGCCT CCTTTGTGTC CTCTAACATC CCCACAATAT CAGCCCTTAC 480 

CACAAGACCT CCCTTCAGCT TAATCTCTCC CACTCTAGGT TCCCACGCCG CCCCTAATCC 540 

CACTTGAAGC AGCCCTGAGA AACATCGCCC ATTCTCTCTC CATACCACCC CCCAAAAATT 600 

TTCGCCGCTC CAACACTTCA ACACTATTTT GTTTTATTTG TCTTATTAAT ATCAGAAGGC 660 



AGGAATGTCA GGCCTCTGAG CCCAGGCCAG GCCATCGCAT CCCCTGTGAC TTGCACGTAT 720

ACATCCAGAT GGCCTGAAGT AACTGAAGAT CCACAAAAGA AGTAAAAACA GCCTTAACTG 780

ATGACATTCC ACCATTGTGA TTTGTTCCTG CCCCACCCTA ACTGATCAAT GTACTTTGTA 840

ATCTCCCCCA CCCTTAAGAA GGTTCTTTGT AATTCTCCCC ACCCTTGAGA ATGTACTTTG 900

TGAGATCCAC CCCTGCCCAC CAGAGAACAA CCCCCTTTGA TTGTAATTTT TTATTACCTT 960

CCCAAATCCT ATAAAACAGC CCCACCCCTA TCTTCCTTCA CTGACTCTCT TTTCGGACTC 1020
AGCCACCGGC ACCCAGGTGA AATAAACAGC TTTATTGCTC AC

Seq ID NO: 359 Protein sequence
Protein Accession #: AAA65999

1 11 21 31 41 51

I I I I I I

PKKHLTNFKS DLFGLATEDW RCPIASEVPW TITEAELRVT LTVEGKSIPC LIDTGATHST 60

LPSFQGPVSL APITVVGIDG QASKPLKTPP LWCQLGQHSF MHSFLVIPTC PLPLLGRNIL 120
TKLSASLTIP GVQLHLIAAL LPNPKPPLCP LTSPQYQPLP QDLPSA

Seq ID NO: 360 DNA sequence 
Nucleic Acid Accession #: NM_001854 
Coding sequence: 162-5582 



1 
I 

AACCATCAAA 
TGTGTAGCAA 
TTCGTGGGTT 
GGTGGAAAAC 
TCTTCCAAGC 
TTCACAATTC 
CTAAAGGCTC 
AACAGTTATT 
CAAAAAAAGG 
TTGGTGTTGA 
CCCCAGAAGA 
TAGCAATCAG 
CGAAACCACT 
GAACAAGGAT 
GTGATCCCAA 
CACCCAAGGC 
TCGAATATGA 
GACCCACTGT 
AAGAATACAA 
GGACAAATGA 
ATTATGATTC 
GCAGGGATTC 
AAGAATATGA 
CAGAAACTGA 
AGAAAGGAGA 
CAGGACCTGC 
ACCCTGGCGA 
GTCCTCCTGG 
CAACCATCTC 
TGAGAGGCCC 
GTTCATCTGG 
AGGGTCCCCC 
GAAGAGGAAT 
GTCTGCCAGG 
CTGGTGATGA 
AAGCTGGCCC 
GTATGGCAGG 
CTGGGCCTCC 
CAATTGGTCC 
GTGCTGATGG 
TGGGTCCCCC 
CAGATGGTGT 
GATTCAAAGG 
GAGGGNAAGA 
CTTCAGGTCA 
GAAGACAAGG 
AAGGTGCACG 
CTCGAGGTTC 
GTGGCGATGG 
TTGGATTCCC 
ACCCTGGGCA 
GAGTGGTTGG 
CTGGTCCTCC 
CAAAGGGTGA 
GTTTCCCAGG 
AAGGTCCCCA 
CAGCTGGCCC 
GAGAGAAAGG 
AAGGTCCTGT 
ACAAGGGTGA 
GCCCTCCCGG 
ATGGTGAACC 
CCAGAGGCTT 
GTGAAAAAGG 



11 
I 

TTTAGAAGAA 
ACATCCCTGG 
GAGTTCACAG 
GAAACGGTGG 
TAGAGAGGTC 
TCCAGAGGGA 
AGATACTGCT 
TCCAGGTGGA 
AATTCAGTCT 
GGTTGGGAGA 
CTATCCCCTC 
CGTGGAGAAG 
TGATAGAAGT 
TTTGGATGAA 
GGCAGCATAT 
TGCTCAAGCT 
CTATGAGTAT 
AACTGAGGAG 
CTATGGAACA 
GCCAAATCCA 
CCAGAGGAAA 
TGATCTTCTG 
AGATAAACCA 
TATTACAGAA 
ACCAGCAGTG 
AGGTATTATG 
TAGGGGCCCC 
TACTATGTTG 
TGCTCAGGAA 
ACCTGGCCCA 
GGCCAAAGGT 
TGGTCCAACG 
GCCAGGAGAA 
TGACAAAGGT 
TGGAATGAGG 
ACGAGGTTTG 
TGTAGATGGC 
AGGTCAACAA 
TCCTGGTGAA 
GCCTCCTGGT 
TGGTCCACAA 
CAGAGGTCTC 
TGACATGGGT 
TGGCCCTGAA 
AGCAGGAGAA 
TCCAAAGGGT 
GGGAGTAGCT 
AAGAGGTGCA 
CCCTCCTGGC 
TGGACCAAAA 
ACGTGGGGAG 
ACCACAGGGA 
TGGCCCTCCT 
TCCAGGTCCT 
GGAAAGAGGT 
GGGCCCACCA 
AATTGGTTTA 
TGCTCCTGGA 
TGGTCTCCCA 
AATTGGTGAG 
TCCCCCAGGT 
AGGTCCTAGA 
CCCTGGACCT 
TGAAAATGGG 



21 

i 

AAAGCCCTTT 
CGATACCTTG 
TTGTGAGTGC 
CTCTGGGATT 
AGAGGAGCTG 
ATATCAAAAA 
TACAGAGTTT 
ACTTTCCCAG 
TT CCTTTT AT 
TCACCTGTTT 
TTCAGAACTG 
AAAACTGTGA 
GAGAGAGCAA 
GAAGTTTTTG 
GACTACTGTG 
CAGGAACCTC 
GGGGAAGCAG 
ACAATAGCAC 
ATGGAAAGTT 
GTTGAAGAAA 
AATTCTGAGG 
GTAGATGGAG 
ACAAGCCCCC 
ACAAGCATAA 
GTTGAGCCTG 
GGTCCTCCAG 
CCAGGACGTC 
ATGTTACCGT 
GCTCAGGCTC 
ATGGGTCTAA 
GAGAGTGGTG 
GGAAAACCTG 
CCTGGGGCAA 
CACAGGGGTG 
GGAGAAGATG 
CTGGGTCCAA 
CCCCCAGGAC 
GGGAATCCAG 
AAAGGACCAC 
CATCCTGGGA 
GGTCCTATTG 
AAGGGATCTA 
CTAAAAGGTG 
GGACCCAAAG 
AAGGGAAAAC 
TCCACTGGAT 
GGCAAACCAG 
AGAGGTCCCA 
CCTCCAGGTG 
GGCCCTCCTG 
ACTGGATTTC 
CCAACCGGTG 
GGTGAGCAAG 
CAAGGTATCT 
CTTCCTGGAG 
GGTCCAGTTG 
CGAGGGCGCC 
GAAAAAGGTC 
GGGCCAGCTG 
CCGGGACAAA 
CTTCAAGGAC 
GGACAGCAGG 
CCTGGTCCAA 
GATGTTGGTC 



31 
I 

GACTTTTTCC 
GAAAGGACGA 
GGGGCTCGGA 
TCACCGTAAC 
CTCCAGTTGA 
CAACGGGATT 
CAAAGCAAGC 
AAGACTTTTC 
CTATATATAA 
TTCTGTTTGA 
TTAACATCGC 
CAATGATTGT 
TTGTTGATAC 
AGGGGGACAT 
AGCATTATAG 
AGATAGATGA 
AGTATAAAGA 
AGACGGAGGC 
ACCAGACAGA 
TATTTACTGA 
ATACACTATA 
ATTTAGGCGA 
CTAATGAAGA 
ATGGCCATGG 
GTATGCTTGT 
GTCTACAAGG 
CTGGCTTACC 
TCCGTTATGG 
AAGCTATTCT 
CTGGAAGACC 
ATCCAGGTCC 
GAAAAAGGGG 
AGGGAGATCG 
AACGAGGTCC 
GAGAAATTGG 
GGGGAACTCC 
CAAAAGGGAA 
GACCTCAGGG 
AAGGAAAACC 
AAGAAGGCCA 
GATNNCCGGG 
AAGGTGAAAA 
ACAGAGGAGA 
GTCGAGCAGG 
TTGGAGTTCC 
TCCCTGGGTT 
GCCCTCGGGG 
CTGGGAAACC 
AAAGAGGTCC 
GACCACCAGG 
AAGGCAAGAC 
AGACTGGTCC 
GTCTTCCTGG 
CAGGGAAAGA 
CTCAGGGTGC 
GCTCACCAGG 
CGGGACCTCA 
CCCAAGGGCC 
GTCCTGCCGG 
AAGGCAGCAA 
CAGTTGGTGC 
GGATGTTTGG 
TAGGTCTTCA 
CATGGGGGCC 



41 
I 

CCCTCTCCCT 
AGTTGGTCTG 
GATGGAGCCG 
AACCCTCGCA 
TGTACTAAAA 
TTGCACAAAC 
ACAACTCAGT 
AATACTATTT 
TGAGCATGGT 
AGACCACACT 
TGACGGGAAG 
TGATTGTAAG 
CAATGGAATC 
TCAGCAGTTT 
TCCAGACTGT 
GTATGCACCA 
GGCTGAAAGT 
AAACATCGTT 
AGCTCCTAGG 
AGAATATCTA 
TGAAAACAAA 
ATATGATTTT 
ATTTGGTCCA 
TGCATATGGA 
CGAAGGACCA 
CCCCACTGGA 
AGGGGCTGAT 
TGGTGATGGT 
TCAGCAGGCT 
AGGTCCTGTG 
TCAGGGCCCT 
TCGTCCAGGT 
AGGGTTTGAT 
TCAAGGTCCT 
ACCAAGAGGT 
AGGAGCTCCA 
CATGGGTCCC 
TCTTCCTGGT 
AGGACTTGCT 
GTCTGGAGAA 
CCCCCGGGGA 
GGGTGAAGAT 
AGTTGGTCAA 
CCCAACTGGA 
AGGATTACCA 
TCCAGGTGCC 
TCAGCGTGGT 
TGGGCCAAAG 
TCAAGGACCT 
AAGGATGGGC 
CGGCCCTCCT 
AATAGGGGAA 
TGCTGCAGGA 
TGGACCAGCA 
ACCTGGACTG 
AGAACGTGGG 
GGGTCCTCCT 
TGCAGGGAGA 
CTCCCCTGGG 
GGGTGGCAAG 
CCCTGGAATT 
GCAAAAAGGT 
GGGTCTGCCA 
ACCTGGTCCT 



51 
I 

CCCCAATGGC 
CAGTCGCAAT 
TGGTCCTCTA 
TTGACCTTCC 
GCACTAGATT 
AGAAAGAATT 
GCCCCAACAA 
ACAGTAAAAC 
ATTCAGCAAA 
GGAAAACCTG 
TGGCATCGGG 
AAGAAAACCA 
ACGGTTTTTG 
TTGATCACAG 
GACTCTTCAG 
GAGGATATAA 
GTAACAGAGG 
GATGATTTTC 
CATGTTTCTG 
ACGGGAGAGG 
GAAATAGACG 
TATGAATATA 
GGTGTACCAG 
GAGAAAGGAC 
CCAGGACCAG 
CCCCCTGGTG 
GGTCTACCTG 
TCCAAAGGAC 
CGGATTGCTC 
GGGGGGCCTG 
CGAGGCGTCC 
GCAGATGGAG 
GGACTTCCGG 
CCAGGTCCTC 
CTTCCAGGTG 
GGGCAGCCTG 
CAAGGGGAGC 
CCACAAGGTC 
GGACTTCCTG 
AAGGGGGCTC 
GTAAAGGGAG 
GGTTTTCCAG 
ATTGGCCCAA 
GACCCAGGTC 
GGATATCCAG 
AATGGAGAGA 
CCAACGGGTC 
GGCACTTCAG 
CAGGGTCCAG 
TGCCCAGGAC 
GGGCCAGGGG 
CGTGGGTATC 
AAAGAAGGTG 
GGATTACGTG 
AAAGGAGGGG 
TCAGCAGGTA 
GGTCCAGCTG 
GATGGAGTTC 
GAAGACGGAG 
GGAGAAAATG 
GCTGGAGGTG 
GATGAGGGTG 
GGCCCACCTG 
CCAGGCCCAA 



60 
120 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840
GAGGCCCTCA AGGTCCCAAT GGAGCTGATG GACCACAAGG ACCCCCAGGT TCTGTTGGTT 3900

CAGTTGGTGG TGTTGGAGAA AAGGGTGAAC CTGGAGAAGC AGGAAACCCA GGGCCTCCTG 3960 

GGGAAGCAGG TGTAGGCGGT CCCAAAGGAG AAAGAGGAGA GAAAGGGGAA GCTGGTCCAC 4020 

CTGGAGCTGC TGGACCTCCA GGTGCCAAGG GGCCGCCAGG TGATGATGGC CCTAAGGGTA 4080 

ACCCGGGTCC TGTTGGTTTT CCTGGAGATC CTGGTCCTCC TGGGGAACTT GGCCCTGCAG 4140 

GTCAAGATGG TGTTGGTGGT GACAAGGGTG AAGATGGAGA TCCTGGTCAA CCGGGTCCTC 4200 

CTGGCCCATC TGGTGAGGCT GGCCCACCAG GTCCTCCTGG AAAACGAGGT CCTCCTGGAG 4260 

CTGCAGGTGC AGAGGGAAGA CAAGGTGAAA AAGGTGCTAA GGGGGAAGCA GGTGCAGAAG 4320 

GTCCTCCTGG AAAAACCGGC CCAGTCGGTC CTCAGGGACC TGCAGGAAAG CCTGGTCCAG 4380 

AAGGTCTTCG GGGCATCCCT GGTCCTGTGG GAGAACAAGG TCTCCCTGGA GCTGCAGGCC 4440 

AAGATGGACC ACCTGGTCCT ATGGGACCTC CTGGCTTACC TGGTCTCAAA GGTGACCCTG 4500 

GCTCCAAGGG TGAAAAGGGA CATCCTGGTT TAATTGGCCT GATTGGTCCT CCAGGAGAAC 4560 

AAGGGGAAAA AGGTGACCGA GGGCTCCCTG GAACTCAAGG ATCTCCAGGA GCAAAAGGGG 4620 

ATGGGGGAAT TCCTGGTCCT GCTGGTCCCT TAGGTCCACC TGGTCCTCCA GGCTTACCAG 4680 

GTCCTCAAGG CCCAAAGGGT AACAAAGGCT CTACTGGACC CGCTGGCCAG AAAGGTGACA 4740 

GTGGTCTTCC AGGGCCTCCT GGGCCTCCAG GTCCACCTGG TGAAGTCATT CAGCCTTTAC 4800 

CAATCTTGTC CTCCAAAAAA ACGAGAAGAC ATACTGAAGG CATGCAAGCA GATGCAGATG 4860 
ATAATATTCT TGATTACTCG GATGGAATGG AAGAAATATT TGGTTCCCTC AATTCCCTGA 4920

AACAAGACAT CGAGCATATG AAATTTCCAA TGGGTACTCA GACCAATCCA GCCCGAACTT 4980 

GTAAAGACCT GCAACTCAGC CATCCTGACT TCCCAGATGG TGAATATTGG ATTGATCCTA 5040 

ACCAAGGTTG CTCAGGAGAT TCCTTCAAAG TTTACTGTAA TTTCACATCT GGTGGTGAGA 5100 

CTTGCATTTA TCCAGACAAA AAATCTGAGG GAGTAAGAAT TTCATCATGG CCAAAGGAGA 5160 

AACCAGGAAG TTGGTTTAGT GAATTTAAGA GGGGAAAACT GCTTTCATAC TTAGATGTTG 5220 

AAGGAAATTC CATCAATATG GTGCAAATGA CATTCCTGAA ACTTCTGACT GCCTCTGCTC 5280 

GGCAAAATTT CACCTACCAC TGTCATCAGT CAGCAGCCTG GTATGATGTG TCATCAGGAA 5340 

GTTATGACAA AGCACTTCGC TTCCTGGGAT CAAATGATGA GGAGATGTCC TATGACAATA 5400 

ATCCTTTTAT CAAAACACTG TATGATGGTT GTACGTCCAG AAAAGGCTAT GAAAAAACTG 5460 

TCATTGAAAT CAATACACCA AAAATTGATC AAGTACCTAT TGTTGATGTC ATGATCAGTG 5520 

ACTTTGGTGA TCAGAATCAG AAGTTCGGAT TTGAAGTTGG TCCTGTTTGT TTTCTTGGCT 5580 

AAGATTAAGA CAAAGAACAT ATCAAATCAA CAGAAAATGT ACCTTGGTGC CACCAACCCA 5640 

TTTTGTGCCA CATGCAAGTT TTGAATAAGG ATGTATGGAA AACAACGCTG CATATACAGG 5700 

TACCATTTAG GAAATACCGA TGCCTTTGTG GGGGCAGAAT CACAGACAAA AGCTTTGAAA 5760 

ATCATAAAGA TATAAGTTGG TGTGGCTAAG ATGGAAACAG GGCTGATTCT TGATTCCCAA 5820

TTCTCAACTC TCCTTTTCCT ATTTGAATTT CTTTGGTGCT GTAGAAAACA AAAAAAGAAA 5880 

AATATATATT CATAAAAAAT ATGGTGCTCA TTCTCATCCA TCCAGGATGT ACTAAAACAG 5940 

TGTGTTTAAT AAATTGTAAT TATTTTGTGT ACAGTTCTAT ACTGTTATCT GTGTCCATTT 6000 

CCAAAACTTG CACGTGTCCC TGAATTCCGC TGACTCTAAT TTATGAGGAT GCCGAACTCT 6060 

GATGGCAATA ATATATGTAT TATGAAAATG AAGTTATGAT TTCCGATGAC CCTAAGTCCC 6120 
TTTCTTTGGT TAATGATGAA ATTCCTTTGT GTGTGTTT 

Seq ID NO: 361 Protein sequence 
Protein Accession #: NP_001845

1 11 21 31 41 51 

I I I I I I

MEPWSSRWKT KRWLWDFTVT TLALTFLFQA REVRGAAPVD VLKALDFHNS PEGISKTTGF 60 

CTNRKNSKGS DTAYRVSKQA QLSAPTKQLF PGGTFPEDFS ILFTVKPKKG IQSFLLSIYN 120

EHGIQQIGVE VGRSPVFLFE DHTGKPAPED YPLFRTVNIA DGKWHRVAIS VEKKTVTMIV 180 

DCKKKTTKPL DRSERAIVDT NGITVFGTRI LDEEVFEGDI QQFLITGDPK AAYDYCEHYS 240

PDCDSSAPKA AQAQEPQIDE YAPEDIIEYD YEYGEAEYKE AESVTEGPTV TEETIAQTEA 300 

NIVDDFQEYN YGTMESYQTE APRHVSGTNE PNPVEEIFTE EYLTGEDYDS QRKNSEDTLY 360 

ENKEIDGRDS DLLVDGDLGE YDFYEYKEYE DKPTSPPNEE FGPGVPAETD ITETSINGHG 420 

AYGEKGQKGE PAVVEPGMLV EGPPGPAGPA GIMGPPGLQG PTGPPGDPGD RGPPGRPGLP 480

GADGLPGPPG TMLMLPFRYG GDGSKGPTIS AQEAQAQAIL QQARIALRGP PGPMGLTGRP 540 

GPVGGPGSSG AKGESGDPGP QGPRGVQGPP GPTGKPGKRG RPGADGGRGM PGEPGAKGDR 600

GFDGLPGLPG DKGHRGERGP QGPPGPPGDD GMRGEDGEIG PRGLPGEAGP RGLLGPRGTP 660 

GAPGQPGMAG VDGPPGPKGN MGPQGEPGPP GQGGNPGPQG LPGPQGPIGP PGEKGPQGKP 720 

GLAGLPGADG PPGHPGKEGQ SGEKGALGPP GPQGPIGXPG PRGVKGADGV RGLKGSKGEK 780 

GEDGFPGFKG DMGLKGDRGE VGQIGPRGXO GPEGPKGRAG PTGDPGPSGQ AGEKGKLGVP 840

GLPGYPGRQG PKGSTGFPGF PGANGEKGAR GVAGKPGPRG QRGPTGPRGS RGARGPTGKP 900 

GPKGTSGGDG PPGPPGERGP QGPQGPVGFP GPKGPPGPPG RMGCPGHPGQ RGETGFQGKT 960 

GPPGPGGVVG PQGPTGETGP IGERGYPGPP GPPGEQGLPG AAGKEGAKGD PGPQGISGKD 1020

GPAGLRGFPG ERGLPGAQGA PGLKGGEGPQ GPPGPVGSPG ERGSAGTAGP IGLRGRPGPQ 1080 

GPPGPAGEKG APGEKGPQGP AGRDGVQGPV GLPGPAGPAG SPGEDGDKGE IGEPGQKGSK 1140 

GGKGENGPPG PPGLQGPVGA PGIAGGDGEP GPRGQQGMFG QKGDEGARGF PGPPGPIGLQ 1200 

GLPGPPGEKG ENGDVGPWGP PGPPGPRGPQ GPNGADGPQG PPGSVGSVGG VGEKGEPGEA 1260 

GNPGPPGEAG VGGPKGERGE KGEAGPPGAA GPPGAKGPPG DDGPKGNPGP VGFPGDPGPP 1320 

GELGPAGQDG VGGDKGEDGD PGQPGPPGPS GEAGPPGPPG KRGPPGAAGA EGRQGEKGAK 1380 

GEAGAEGPPG KTGPVGPQGP AGKPGPEGLR GIPGPVGEQG LPGAAGQDGP PGPMGPPGLP 1440

GLKGDPGSKG EKGHPGLIGL IGPPGEQGEK GDRGLPGTQG SPGAKGDGGI PGPAGPLGPP 1500 

GPPGLPGPQG PKGNKGSTGP AGQKGDSGLP GPPGPPGPPG EVIQPLPILS SKKTRRHTEG 1560 

MQADADDNIL DYSDGMEEIF GSLNSLKQDI EHMKFPMGTQ TNPARTCKDL QLSHPDFPDG 1620 

EYWIDPNQGC SGDSFKVYCN FTSGGETCIY PDKKSEGVRI SSWPKEKPGS WFSEFKRGKL 1680 

LSYLDVEGNS INMVQMTFLK LLTASARQNF TYHCHQSAAW YDVSSGSYDK ALRFLGSNDE 1740 

EMSYDNNPFI KTLYDGCTSR KGYEKTVIEI NTPKIDQVPI VDVMISDFGD QNQKFGFEVG 1800 
PVCFLG



Seq ID NO: 362 DNA sequence 
Nucleic Acid Accession #: NM_003107
Coding sequence: 351-1775 

1 11 21 31 41 51 

I I I I I I 

TTCCCCAGCA TTCGAGAAAC TCCTCTCTAC TTTAGCACGG TCTCCAGACT CAGCCGAGAG 60 
ACAGCAAACT GCAGCGCGGT GAGAGAGCGA GAGAGAGGGA GAGAGAGACT CTCCAGCCTG 120 
GGAACTATAA CTCCTCTGCG AGAGGCGGAG AACTCCTTCC CCAAATCTTT TGGGGACTTT 180
TCTCTCTTTA CCCACCTCCG CCCCTGCGAG GAGTTGAGGG GCCAGTTCGG CCGCCGCGCG 240

CGTCTTCCCG TTCGGCGTGT GCTTGGCCCG GGGAACCGGG AGGGCCCGGC 6ATCGCG0GG 300 

CGGCCGCCGC GAGGGTGTGA GCGCGCGTGG GCGCCCGCCG AGCCGAGGCC ATGGTGCAGC 360 

AAACCAACAA TGCCGAGAAC ACGGAAGCGC TGCTGGCCGG CGAGAGCTCG GACTCGGGCG 420 

CCGGCCTCGA GCTGGGAATC GCCTCCTCCC CCACGCCCGG CTCCACCGCC TCCACGGGCG 480 

GCAAGGCCGA CGACCCGAGC TGGTGCAAGA CCCCGAGTGG GCACATCAAG CGACCCATGA 540 

ACGCCTTCAT GGTGTGGTCG CAGATCGAGC GGCGCAAGAT CATGGAGCAG TCGCCCGACA 600 

TGCACAACGC CGAGATCTCC AAGCGGCTGG GCAAACGCTG GAAGCTGCTC AAAGACAGCG 660 

ACAAGATCCC TTTCATTCGA GAGGCGGAGC GGCTGCGCCT CAAGCACATG GCTGACTACC 720 

CCGACTACAA GTACCGGCCC AGGAAGAAGG TGAAGTCCGG CAACGCCAAC TCCAGCTCCT 780 

CGGCCGCCGC CTCCTCCAAG CCGGGGGAGA AGGGAGACAA GGTCGGTGGC AGTGGCGGGG 840 

GCGGCCATGG GGGCGGCGGC GGCGGCGGGA GCAGCAACGC GGGGGGAGGA GGCGGCGGTG 900 

CGAGTGGCGG CGGCGCCAAC TCCAAACCGG CGCAGAAAAA GAGCTGCGGC TCCAAAGTGG 960 

CGGGCGGCGC GGGCGGTGGG GTTAGCAAAC CGCACGCCAA GCTCATCCTG GCAGGCGGCG 1020

GCGGCGGCGG GAAAGCAGCG GCTGCCGCCG CCGCCTCCTT CGCCGCCGAA CAGGCGGGGG 1080 

CCGCCGCCCT GCTGCCCCTG GGCGCCGCCG CCGACCACCA CTCGCTGTAC AAGGCGCGGA 1140 

CTCCCAGCGC CTCGGCCTCC GCCTCCTCGG CAGCCTCGGC CTCCGCAGCG CTCGCGGCCC 1200 

CGGGCAAGCA CCTGGCGGAG AAGAAGGTGA AGCGCGTCTA CCTGTTCGGC GGCCTGGGCA 1260 

CGTCGTCGTC GCCCGTGGGC GGCGTGGGCG CGGGAGCCGA CCCCAGCGAC CCCCTGGGCC 1320 

TGTACGAGGA GGAGGGCGCG GGCTGCTCGC CCGACGCGCC CAGCCTGAGC GGCCGCAGCA 1380 

GCGCCGCCTC GTCCCCCGCC GCCGGCCGCT CGCCCGCCGA CCACCGCGGC TACGCCAGCC 1440

TGCGCGCCGC CTCGCCCGCC CCGTCCAGCG CGCCCTCGCA CGCGTCCTCC TCGGCCTCGT 1500 

CCCACTCCTC CTCTTCCTCC TCCTCGGGCT CCTCGTCCTC CGACGACGAG TTCGAAGACG 1560 

ACCTGCTCGA CCTGAACCCC AGCTCAAACT TTGAGAGCAT GTCCCTGGGC AGCTTCAGTT 1620 

CGTCGTCGGC GCTCGACCGG GACCTGGATT TTAACTTCGA GCCCGGCTCC GGCTCGCACT 1680 

TCGAGTTCCC GGACTACTGC ACGCCCGAGG TGAGCGAGAT GATCTCGGGA GACTGGCTCG 1740 

AGTCCAGCAT CTCCAACCTG GTTTTCACCT ACTGAAGGGC GCGCAGGCAG GGAGAAGGGC 1800

CGGGGGGGGT AGGAGAGGAG AAAAAAAAAG TGAAAAAAAG AAACGAAAAG GACAGACGAA 1860

GAGTTTAAAG AGAAAAGGGA AAAAAGAAAG AAAAAGTAAG CAGGGCTCGT TCGCCCGCGT 1920 

TCTCGTCGTC GGATCAAGGA GCGCGGCGGC GTTTTGGACC CGCGCTCCCA TCCCCCACCT 1980 

TCCCGGGCCG GGGACCCACT CTGCCCAGCC GGAGGGACGC GGAGGAGGAA GAGGGTAGAC 2040 

AGGGGCGACC TGTGATTGTT GTTATTGATG TTGTTGTTGA TGGCAAAAAA AAAAAGCGAC 2100 

TTCGAGTTTG CTCCCCTTTG CTTGAAGAGA CCCCCTCCCC CTTCCAACGA GCTTCCGGAC 2160 

TTGTCTGCAC CCCCAGCAAG AAGGCGAGTT AGTTTTCTAG AGACTTGAAG GAGTCTCCCC 2220 

CTTCCTGCAT CACCACCTTG GTTTTGTTTT ATTTTGCTTC TTGGTCAAGA AAGGAGGGGA 2280 

GAACCCAGCG CACCCCTCCC CCCCTTTTTT TAAACGCGTG ATGAAGACAG AAGGCTCCGG 2340 

GGTGACGAAT TTGGCCGATG GCAGATGTTT TGGGGGAACG CCGGGACTGA GAGACTCCAC 2400 

GCAGGCGAAT TCCCGTTTGG GGCCTTTTTT TCCTCCCTCT TTTCCCCTTG CCCCCTCTGC 2460 

AGCCGGAGGA GGAGATGTTG AGGGGAGGAG GCCAGCCAGT GTGACCGGCG CTAGGAAATG 2520 

ACCCGAGAAC CCCGTTGGAA GCGCAGCAGC GGGAGCTAGG GGCGGGGGCG GAGGAGGACA 2580 

CGAACTGGAA GGGGGTTCAC GGTCAAACTG AAATGGATTT GCACGTTGGG GAGCTGGCGG 2640 

CGGCGGCTGC TGGGCCTCCG CCTTCTTTTC TACGTGAAAT CAGTGAGGTG AGACTTCCCA 2700 

GACCCCGGAG GCGTGGAGGA GAGGAGACTG TTTGATGTGG TACAGGGGCA GTCAGTGGAG 2760 
GGCGAGTGGT TTCGGAAAAA AAAAAAGAAA AAAAGGG 

Seq ID NO: 363 Protein sequence 
Protein Accession #: NP_003098

1 11 21 31 41 51 

I I I I I I

MVQQTNNAEN TEALLAGESS DSGAGLELGI ASSPTPGSTA STGGKADDPS WCKTPSGHIK 60 

RPMNAFMVWS QIERRKIMEQ SPDMHNAEIS KRLGKRWKLL KDSDKIPFIR EAERLRLKHM 120 

ADYPDYKYRP RKKVKSGNAN SSSSAAASSK PGEKGDKVGG SGGGGHGGGG GGGSSNAGGG 180 

GGGASGGGAN SKPAQKKSCG SKVAGGAGGG VSKPHAKLIL AGGGGGGKAA AAAAASFAAE 240

QAGAAALLPL GAAADHHSLY KARTPSASAS ASSAASASAA LAAPGKHLAE KKVKRVYLFG 300 

GLGTSSSPVG GVGAGADPSD PLGLYEEEGA GCSPDAPSLS GRSSAASSPA AGRSPADHRG 360 

YASLRAASPA PSSAPSHASS SASSHSSSSS SSGSSSSDDE FEDDLLDLNP SSNFESMSLG 420
SFSSSSALDR DLDFNFEPGS GSHFEFPDYC TPEVSEMISG DWLESSISNL VFTY 
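
The coding-sequence range given for each DNA entry can be cross-checked against the length of the paired protein entry. The sketch below is illustrative only and is not part of the original listing: it assumes the range is 1-based and inclusive and that it ends with the stop codon (which holds for Seq ID NO: 362, where TGA occupies positions 1773-1775). The function names are hypothetical.

```python
def expected_protein_length(cds_start: int, cds_end: int) -> int:
    """Residues encoded by a coding range given as 1-based inclusive
    coordinates, assuming the final codon in the range is a stop codon."""
    span = cds_end - cds_start + 1
    if span % 3 != 0:
        raise ValueError("coding range is not a whole number of codons")
    return span // 3 - 1  # subtract the stop codon


def residues_in_row(row: str) -> int:
    """Count residue letters in one listing row, ignoring spaces and counters."""
    return sum(ch.isalpha() for ch in row)


# Seq ID NO: 362 states "Coding sequence: 351-1775".
expected = expected_protein_length(351, 1775)

# Seq ID NO: 363: the last full row ends at counter 420; the final row follows.
LAST_ROW_363 = "SFSSSSALDR DLDFNFEPGS GSHFEFPDYC TPEVSEMISG DWLESSISNL VFTY"
observed = 420 + residues_in_row(LAST_ROW_363)

print(expected, observed)
```

Under these assumptions both values agree, which is a quick way to detect a dropped or duplicated block in a transcribed row.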



Seq ID NO: 364 DNA sequence 
Nucleic Acid Accession #: U10860 
Coding sequence: 123-2204 

1 11 21 31 41 51 

I I I I I I 

TGCCGGCTGC TCCTCGACCA GGCCTCCTTC TCAACCTCAG CCCGCGGCGC CGACCCTTCC 60 

GGCACCCTCC CGCCCCGTCT CGTACTGTCG CCGTCACCGC CGCGGCTCCG GCCCTGGCCC 120 

CGATGGCTCT GTGCAACGGA GACTCCAAGC TGGAGAATGC TGGAGGAGAC CTTAAGGATG 180 

GCCACCACCA CTATGAAGGA GCTGTTGTCA TTCTGGATGC TGGTGCTCAG TACGGGAAAG 240 

TCATAGACCG AAGAGTGAGG GAACTGTTCG TGCAGTCTGA AATTTTCCCC TTGGAAACAC 300 

CAGCATTTGC TATAAAGGAA CAAGGATTCC GTGCTATTAT CATCTCTGGA GGACCTAATT 360 

CTGTGTATGC TGAAGATGCT CCCTGGTTTG ATCCAGCAAT ATTCACTATT GGCAAGCCTG 420 

TTCTTGGAAT TTGCTATGGT ATGCAGATGA TGAATAAGGT ATTTGGAGGT ACTGTGCACA 480 

AAAAAAGTGT CAGAGAAGAT GGAGTTTTCA ACATTAGTGT GGATAATACA TGTTCATTAT 540 

TCAGGGGCCT TCAGAAGGAA GAAGTTGTTT TGCTTACACA TGGAGATAGT GTAGACAAAG 600 

TAGCTGATGG ATTCAAGGTT GTGGCACGTT CTGGAAACAT AGTAGCAGGC ATAGCAAATG 660 

AATCTAAAAA GTTATATGGA GCACAGTTCC ACCCTGAAGT TGGCCTTACA GAAAATGGAA 720 

AAGTAATACT GAAGAATTTC CTTTATGATA TAGCTGGATG CAGTGGAACC TTCACCGTGC 780 

AGAACAGAGA ACTTGAGTGT ATTCGAGAGA TCAAAGAGAG AGTAGGCACG TCAAAAGTTT 840 

TGGTTTTACT CAGTGGTGGA GTAGACTCAA CAGTTTGTAC AGCTTTGCTA AATCGTGCTT 900 

TGAACCAAGA ACAAGTCATT GCTGTGCACA TTGATAATGG CTTTATGAGA AAACGAGAAA 960 

GCCAGTCTGT TGAAGAGGCC CTCAAAAAGC TTGGAATTCA GGTCAAACTG ATAAATGCTG 1020 

CTCATTCTTT CTACAATGGA ACAACAACCC TACCAATATC AGATGAAGAT AGAACCCCAC 1080 

GGAAAAGAAT TAGCAAAACG TTAAATATGA CCACAAGTCC TGAAGAGAAA AGAAAAATCA 1140 

TTGGGGATAC TTTTGTTAAG ATTGCCAATG AAGTAATTGG AGAAATGAAC TTGAAACCAG 1200 

AGGAGGTTTT CCTTGCCCAA GGTACTTTAC GGCCTGATCT AATTGAAAGT GCATCCCTTG 1260 



TTGCAAGTGG CAAAGCTGAA CTCATCAAAA CCCATCACAA TGACACAGAG CTCATCAGAA 1320
AGTTGAGAGA GGAGGGAAAA GTAATAGAAC CTCTGAAAGA TTTTCATAAA GATGAAGTGA 1380
GAATTTTGGG CAGAGAACTT GGACTTCCAG AAGAGTTAGT TTCCAGGCAT CCATTTCCAG 1440
GTCCTGGCCT GGCAATCAGA GTAATATGTG CTGAAGAACC TTATATTTGT AAGGACTTTC 1500
CTGAAACCAA CAATATTTTG AAAATAGTAG CTGATTTTTC TGCAAGTGTT AAAAAGCCAC 1560
ATACCCTATT ACAGAGAGTC AAAGCCTGCA CAACAGAAGA GGATCAGGAG AAGCTGATGC 1620
AAATTACCAG TCTGCATTCA CTGAATGCCT TCTTGCTGCC AATTAAAACT GTAGGTGTGC 1680
AGGGTGACTG TCGTTCCTAC AGTTACGTGT GTGGAATCTC CAGTAAAGAT GAACCTGACT 1740
GGGAATCACT TATTTTTCTG GCTAGGCTTA TACCTCGCAT GTGTCACAAC GTTAACAGAG 1800
TTGTTTATAT ATTTGGCCCA CCAGTTAAAG AACCTCCTAC AGATGTTACT CCCACTTTCT 1860
TGACAACAGG GGTGCTCAGT ACTTTACGCC AAGCTGATTT TGAGGCCCAT AACATTCTCA 1920
GGGAGTCTGG GTATGCTGGG AAAATCAGCC AGATGCCGGT GATTTTGACA CCATTACATT 1980
TTGATCGGGA CCCACTTCAA AAGCAGCCTT CATGCCAGAG ATCTGTGGTT ATTCGAACCT 2040
TTATTACTAG TGACTTCATG ACTGGTATAC CTGCAACACC TGGCAATGAG ATCCCTGTAG 2100
AGGTGGTATT AAAGATGGTC ACTGAGATTA AGAAGATTCC TGGTATTTCT CGAATTATGT 2160
ATGACTTAAC ATCAAAGCCC CCAGGAACTA CTGAGTGGGA GTAATAAACT TC

Seq ID NO: 365 Protein sequence
Protein Accession #: AAA60331

1 11 21 31 41 51

I I I I I I

MALCNGDSKL ENAGGDLKDG HHHYEGAVVI LDAGAQYGKV IDRRVRELFV QSEIFPLETP 60
AFAIKEQGFR AIIISGGPNS VYAEDAPWFD PAIFTIGKPV LGICYGMQMM NKVFGGTVHK 120
KSVREDGVFN ISVDNTCSLF RGLQKEEVVL LTHGDSVDKV ADGFKVVARS GNIVAGIANE 180
SKKLYGAQFH PEVGLTENGK VILKNFLYDI AGCSGTFTVQ NRELECIREI KERVGTSKVL 240
VLLSGGVDST VCTALLNRAL NQEQVIAVHI DNGFMRKRES QSVEEALKKL GIQVKVINAA 300
HSFYNGTTTL PISDEDRTPR KRISKTLNMT TSPEEKRKII GDTFVKIANE VIGEMNLKPE 360
EVFLAQGTLR PDLIESASLV ASGKAELIKT HHNDTELIRK LREEGKVIEP LKDFHKDEVR 420
ILGRELGLPE ELVSRHPFPG PGLAIRVICA EEPYICKDFP ETNNILKIVA DFSASVKKPH 480
TLLQRVKACT TEEDQEKLMQ ITSLHSLNAF LLPIKTVGVQ GDCRSYSYVC GISSKDEPDW 540
ESLIFLARLI PRMCHNVNRV VYIFGPPVKE PPTDVTPTFL TTGVLSTLRQ ADFEAHNILR 600
ESGYAGKISQ MPVILTPLHF DRDPLQKQPS CQRSVVIRTF ITSDFMTGIP ATPGNEIPVE 660
VVLKMVTEIK KIPGISRIMY DLTSKPPGTT EWE



Seq ID NO: 366 DNA sequence 
Nucleic Acid Accession #: NM_004219 
Coding sequence: 46-654 



1 11 21 31 41 51

I I I I I I

GCGGCCTCAG ATGAATGCGG CTGTTAAGAC CTGCAATAAT CCAGAATGGC TACTCTGATC 60
TATGTTGATA AGGAAAATGG AGAACCAGGC ACCCGTGTGG TTGCTAAGGA TGGGCTGAAG 120
CTGGGGTCTG GACCTTCAAT CAAAGCCTTA GATGGGAGAT CTCAAGTTTC AACACCACGT 180
TTTGGCAAAA CGTTCGATGC CCCACCAGCC TTACCTAAAG CTACTAGAAA GGCTTTGGGA 240
ACTGTCAACA GAGCTACAGA AAAGTCTGTA AAGACCAAGG GACCCCTCAA ACAAAAACAG 300
CCAAGCTTTT CTGCCAAAAA GATGACTGAG AAGACTGTTA AAGCAAAAAG CTCTGTTCCT 360
GCCTCAGATG ATGCCTATCC AGAAATAGAA AAATTCTTTC CCTTCAATCC TCTAGACTTT 420
GAGAGTTTTG ACCTGCCTGA AGAGCACCAG ATTGCGCACC TCCCCTTGAG TGGAGTGCCT 480
CTCATGATCC TTGACGAGGA GAGAGAGCTT GAAAAGCTGT TTCAGCTGGG CCCCCCTTCA 540
CCTGTGAAGA TGCCCTCTCC ACCATGGGAA TCCAATCTGT TGCAGTCTCC TTCAAGCATT 600
CTGTCGACCC TGGATGTTGA ATTGCCACCT GTTTGCTGTG ACATAGATAT TTAAATTTCT 660
TAGTGCTTCA GAGTTTGTGT GTATTTGTAT TAATAAAGCA TTCTTCAACA GAAAAAAAAA 720
AAAAAAAA



Seq ID NO: 367 Protein sequence 
Protein Accession #: NP_004210

1 11 21 31 41 51

I I I I I I

MATLIYVDKE NGEPGTRVVA KDGLKLGSGP SIKALDGRSQ VSTPRFGKTF DAPPALPKAT 60
RKALGTVNRA TEKSVKTKGP LKQKQPSFSA KKMTEKTVKA KSSVPASDDA YPEIEKFFPF 120
NPLDFESFDL PEEHQIAHLP LSGVPLMILD EERELEKLFQ LGPPSPVKMP SPPWESNLLQ 180
SPSSILSTLD VELPPVCCDI DI






Seq ID NO: 368 DNA sequence 
Nucleic Acid Accession #: NM_000597
Coding sequence: 118-1104 



1 11 21 31 41 51

I I I I I I

ATTCGGGGCG AGGGAGGAGG AAGAAGCGGA GGAGGCGGCT CCCGCTCGCA GGGCCGTGCA 60
CCTGCCCGCC CGCCCGCTCG CTCGCTCGCC CGCCGCGCCG CGCTGCCGAC CGCCAGCATG 120
CTGCCGAGAG TGGGCTGCCC CGCGCTGCCG CTGCCGCCGC CGCCGCTGCT GCCGCTGCTG 180
CCGCTGCTGC TGCTGCTACT GGGCGCGAGT GGCGGCGGCG GCGGGGCGCG CGCGGAGGTG 240
CTGTTCCGCT GCCCGCCCTG CACACCCGAG CGCCTGGCCG CCTGCGGGCC CCCGCCGGTT 300
GCGCCGCCCG CCGCGGTGGC CGCAGTGGCC GGAGGCGCCC GCATGCCATG CGCGGAGCTC 360
GTCCGGGAGC CGGGCTGCGG CTGCTGCTCG GTGTGCGCCC GGCTGGAGGG CGAGGCGTGC 420
GGCGTCTACA CCCCGCGCTG CGGCCAGGGG CTGCGCTGCT ATCCCCACCC GGGCTCCGAG 480
CTGCCCCTGC AGGCGCTGGT CATGGGCGAG GGCACTTGTG AGAAGCGCCG GGACGCCGAG 540
TATGGCGCCA GCCCGGAGCA GGTTGCAGAC AATGGCGATG ACCACTCAGA AGGAGGCCTG 600
GTGGAGAACC ACGTGGACAG CACCATGAAC ATGTTGGGCG GGGGAGGCAG TGCTGGCCGG 660
AAGCCCCTCA AGTCGGGTAT GAAGGAGCTG GCCGTGTTCC GGGAGAAGGT CACTGAGCAG 720
CACCGGCAGA TGGGCAAGGG TGGCAAGCAT CACCTTGGCC TGGAGGAGCC CAAGAAGCTG 780
CGACCACCCC CTGCCAGGAC TCCCTGCCAA CAGGAACTGG ACCAGGTCCT GGAGCGGATC 840
TCCACCATGC GCCTTCCGGA TGAGCGGGGC CCTCTGGAGC ACCTCTACTC CCTGCACATC 900
CCCAACTGTG ACAAGCATGG CCTGTACAAC CTCAAACAGT GCAAGATGTC TCTGAACGGG 960
CAGCGTGGGG AGTGCTGGTG TGTGAACCCC AACACCGGGA AGCTGATCCA GGGAGCCCCC 1020
ACCATCCGGG GGGACCCCGA GTGTCATCTC TTCTACAATG AGCAGCAGGA GGCTTGCGGG 1080
GTGCACACCC AGCGGATGCA GTAGACCGCA GCCAGCCGGT GCCTGGCGCC CCTGCCCCCC 1140
GCCCCTCTCC AAACACCGGC AGAAAACGGA GAGTGCTTGG GTGGTGGGTG CTGGAGGATT 1200
TTCCAGTTCT GACACACGTA TTTATATTTG GAAAGAGACC AGCACCGAGC TCGGCACCTC 1260
CCCGGCCTCT CTCTTCCCAG CTGCAGATGC CACACCTGCT CCTTCTTGCT TTCCCCGGGG 1320
GAGGAAGGGG GTTGTGGTCG GGGAGCTGGG GTACAGGTTT GGGGAGGGGG AAGAGAAATT 1380
TTTATTTTTG AACCCCTGTG TCCCTTTTGC ATAAGATTAA AGGAAGGAAA AGT

Seq ID NO: 369 Protein sequence
Protein Accession #: NP_000588

1 11 21 31 41 51

I I I I I I

MLPRVGCPAL PLPPPPLLPL LPLLLLLLGA SGGGGGARAE VLFRCPPCTP ERLAACGPPP 60
VAPPAAVAAV AGGARMPCAE LVREPGCGCC SVCARLEGEA CGVYTPRCGQ GLRCYPHPGS 120
ELPLQALVMG EGTCEKRRDA EYGASPEQVA DNGDDHSEGG LVENHVDSTM NMLGGGGSAG 180
RKPLKSGMKE LAVFREKVTE QHRQMGKGGK HHLGLEEPKK LRPPPARTPC QQELDQVLER 240
ISTMRLPDER GPLEHLYSLH IPNCDKHGLY NLKQCKMSLN GQRGECWCVN PNTGKLIQGA 300
PTIRGDPECH LFYNEQQEAC GVHTQRMQ



Seq ID NO: 370 DNA sequence
Nucleic Acid Accession #: NM_004264
Coding sequence: 6-440

1 11 21 31 41 51

I I I I I I

GGAACATGGC GGATCGGCTC ACGCAGCTTC AGGACGCTGT GAATTCGCTT GCAGATCAGT 60
TTTGTAATGC CATTGGAGTA TTGCAGCAAT GTGGTCCTCC TGCCTCTTTC AATAATATTC 120
AGACAGCAAT TAACAAAGAC CAGCCAGCTA ACCCTACAGA AGAGTATGCC CAGCTTTTTG 180
CAGCACTGAT TGCACGAACA GCAAAAGACA TTGATGTTTT GATAGATTCC TTACCCAGTG 240
AAGAATCTAC AGCTGCTTTA CAGGCTGCTA GCTTGTATAA GCTAGAAGAA GAAAACCATG 300
AAGCTGCTAC ATGTGTGGAG GATGTTGTTT ATCGAGGAGA CATGCTTCTG GAGAAGATAC 360
AAAGCGCACT TGCTGATATT GCACAGTCAC AGCTGAAGAC AAGAAGTGGT ACCCATAGCC 420
AGTCTCTTCC AGACTCATAG CATCAGTGGA TACCATGTGG CTGAGAAAAG AACTGTTTGA 480
GTGCCATTAA GAATTCTGCA TCAGACTTAG ATACAAGCCT TACCAACAAT TACAGAAACA 540
TTAAACACTA TGACACATTA CCTTTTTAGC TATTTTTAAT AGTCTTCTAT TTTCACTCTT 600
GATAAGCTTA TAAATCATGA TTGAATCAGC TTTAAAGCAT CATACCATCA TTTTTTAACT 660
GAGTGAAATT ATTAAGGCAT GTAATACATT AATGAACATA ATATAAGGAA ACATATGTAA 720
AATTCTGTTA TGACATAATT TATGTCTCCA TTTTGTTGTA TTGGCCAGTA CTTTTACAAT 780
C



Seq ID NO: 371 Protein sequence 
Protein Accession #: NP_004255

1 11 21 31 41 51

I I I I I I

MADRLTQLQD AVNSLADQFC NAIGVLQQCG PPASFNNIQT AINKDQPANP TEEYAQLFAA 60
LIARTAKDID VLIDSLPSEE STAALQAASL YKLEEENHEA ATCVEDVVYR GDMLLEKIQS 120
ALADIAQSQL KTRSGTHSQS LPDS

Seq ID NO: 372 DNA sequence 
Nucleic Acid Accession #: AJ271091 
Coding sequence: 1-1113 



1 11 21 31 41 51

I I I I I I

ATGGAGAATC AGGTGTTGAC GCCGCATGTC TACTGGGCTC AGCGACACCG CGAGCTATAT 60
CTGCGCGTGG AGCTGAGTGA CGTACAGAAC CCTGCCATCA GCATCACTGA AAACGTGCTG 120
CATTTCAAAG CTCAAGGACA TGGTGCCAAA GGAGACAATG TCTATGAATT TCACCTGGAG 180
TTCTTAGACC TTGTGAAACC AGAGCCTGTT TACAAACTGA CCCAGAGGCA GGTAAACATT 240
ACAGTACAGA AGAAAGTGAG TCAGTGGTGG GAGAGACTCA CAAAGCAGGA AAAGCGACCA 300
CTGTTTTTGG CTCCTGACTT TGATCGTTGG CTGGATGAAT CTGATGCGGA AATGGAGCTC 360
AGAGCTAAGG AAGAAGAGCG CCTAAATAAA CTCCGACTGG AAAGCGAAGG CTCTCCTGAA 420
ACTCTTACAA ACTTAAGGAA AGGATACCTG TTTATGTATA ATCTTGTGCA ATTCTTGGGA 480
TTCTCCTGGA TCTTTGTCAA CCTGACTGTG CGATTCTGTA TCTTGGGAAA AGAGTCCTTT 540
TATGACACAT TCCATACTGT GGCTGACATG ATGTATTTCT GCCAGATGCT GGCAGTTGTG 600
GAAACTATCA ATGCAGCAAT TGGAGTCACT ACGTCACCGG TGCTGCCTTC TCTGATCCAG 660
CTTCTTGGAA GAAATTTTAT TTTGTTTATC ATCTTTGGCA CCATGGAAGA AATGCAGAAC 720
AAAGCTGTGG TTTTCTTTGT GTTTTATTTG TGGAGTGCAA TTGAAATTTT CAGGTACTCT 780
TTCTACATGC TGACGTGCAT TGACATGGAT TGGAAGGTGC TCACATGGCT TCGTTACACT 840
CTGTGGATTC CCTTATATCC ACTGGGATGT TTGGCGGAAG CTGTCTCAGT GATTCAGTCC 900
ATTCCAATAT TCAATGAGAC CGGACGATTC AGTTTCACAT TGCCATATCC AGTGAAAATC 960
AAAGTTAGAT TTTCCTTTTT TCTTCAGATT TATCTTATAA TGATATTTTT AGGTTTATAC 1020
ATAAATTTTC GTCACCTTTA TAAACAGCGC AGACTGAAAA TGAGGGCAGG CGCAGTGGCT 1080
CATGCCTGTG ATCCCAGCGC TTTGGGAGGC TGA



Seq ID NO: 373 Protein sequence 
Protein Accession #: CAB69070 



1 11 21 31 41 51

I I I I I I






MENQVLTPHV YWAQRHRELY LRVELSDVQN PAISITENVL HFKAQGHGAK GDNVYEFHLE 60
FLDLVKPEPV YKLTQRQVNI TVQKKVSQWW ERLTKQEKRP LFLAPDFDRW LDESDAEMEL 120
RAKEEERLNK LRLESEGSPE TLTNLRKGYL FMYNLVQFLG FSWIFVNLTV RFCILGKESF 180
YDTFHTVADM MYFCQMLAVV ETINAAIGVT TSPVLPSLIQ LLGRNFILFI IFGTMEEMQN 240
KAVVFFVFYL WSAIEIFRYS FYMLTCIDMD WKVLTWLRYT LWIPLYPLGC LAEAVSVIQS 300
IPIFNETGRF SFTLPYPVKI KVRFSFFLQI YLIMIFLGLY INFRHLYKQR RLKMRAGAVA 360
HACDPSALGG



Seq ID NO: 374 DNA sequence 
Nucleic Acid Accession #: NM_016395
Coding sequence: 1-1113 

1 11 21 31 41 51 

I I I I I I 

ATGGAGAATC AGGTGTTGAC GCCGCATGTC TACTGGGCTC AGCGACACCG CGAGCTATAT 60 

CTGCGCGTGG AGCTGAGTGA CGTACAGAAC CCTGCCATCA GCATCACTGA AAACGTGCTG 120 

CATTTCAAAG CTCAAGGACA TGGTGCCAAA GGAGACAATG TCTATGAATT TCACCTGGAG 180

TTCTTAGACC TTGTGAAACC AGAGCCTGTT TACAAACTGA CCCAGAGGCA GGTAAACATT 240 

ACAGTACAGA AGAAAGTGAG TCAGTGGTGG GAGAGACTCA CAAAGCAGGA AAAGCGACCA 300 

CTGTTTTTGG CTCCTGACTT TGATCGTTGG CTGGATGAAT CTGATGCGGA AATGGAGCTC 360

AGAGCTAAGG AAGAAGAGCG CCTAAATAAA CTCCGACTGG AAAGCGAAGG CTCTCCTGAA 420 

ACTCTTACAA ACTTAAGGAA AGGATACCTG TTTATGTATA ATCTTGTGCA ATTCTTGGGA 480

TTCTCCTGGA TCTTTGTCAA CCTGACTGTG CGATTCTGTA TCTTGGGAAA AGAGTCCTTT 540 

TATGACACAT TCCATACTGT GGCTGACATG ATGTATTTCT GCCAGATGCT GGCAGTTGTG 600 

GAAACTATCA ATGCAGCAAT TGGAGTCACT ACGTCACCGG TGCTGCCTTC TCTGATCCAG 660 

CTTCTTGGAA GAAATTTTAT TTTGTTTATC ATCTTTGGCA CCATGGAAGA AATGCAGAAC 720 

AAAGCTGTGG TTTTCTTTGT GTTTTATTTG TGGAGTGCAA TTGAAATTTT CAGGTACTCT 780

TTCTACATGC TGACGTGCAT TGACATGGAT TGGAAGGTGC TCACATGGCT TCGTTACACT 840 

CTGTGGATTC CCTTATATCC ACTGGGATGT TTGGCGGAAG CTGTCTCAGT GATTCAGTCC 900 

ATTCCAATAT TCAATGAGAC CGGACGATTC AGTTTCACAT TGCCATATCC AGTGAAAATC 960

AAAGTTAGAT TTTCCTTTTT TCTTCAGATT TATCTTATAA TGATATTTTT AGGTTTATAC 1020 

ATAAATTTTC GTCACCTTTA TAAACAGCGC AGACTGAAAA TGAGGGCAGG CGCAGTGGCT 1080
CATGCCTGTG ATCCCAGCGC TTTGGGAGGC TGA 



Seq ID NO: 375 Protein sequence 
Protein Accession #: NP_057479

1 11 21 31 41 51 

I I I I I I 

MENQVLTPHV YWAQRHRELY LRVELSDVQN PAISITENVL HFKAQGHGAK GDNVYEFHLE 60

FLDLVKPEPV YKLTQRQVNI TVQKKVSQWW ERLTKQEKRP LFLAPDFDRW LDESDAEMEL 120

RAKEEERLNK LRLESEGSPE TLTNLRKGYL FMYNLVQFLG FSWIFVNLTV RFCILGKESF 180

YDTFHTVADM MYFCQMLAVV ETINAAIGVT TSPVLPSLIQ LLGRNFILFI IFGTMEEMQN 240

KAVVFFVFYL WSAIEIFRYS FYMLTCIDMD WKVLTWLRYT LWIPLYPLGC LVEAVSVIQS 300

IPIFNETGRF SFTLPYPVKI KVRFSFFLQI YLIMIFLGLY INFRHLYKQR RRRYGKKRKR 360 
STKKKDLDGF LPV






Seq ID NO: 376 DNA sequence 
Nucleic Acid Accession #: NM_005987 
Coding sequence: 1-270 



1 11 21 31 41 51

I I I I I I 

ATGAATTCTC AGCAGCAGAA GCAGCCTTGC ACCCCACCCC CTCAGCCTCA GCAGCAGCAG 60 
GTGAAACAAC CTTGCCAGCC TCCACCCCAG GAACCATGCA TCCCCAAAAC CAAGGAGCCC 120 
TGCCAACCCA AGGTGCCTGA GCCCTGCCAC CCCAAAGTGC CTGAGCCCTG CCAGCCCAAG 180
ATTCCAGAGC CCTGCCAGCC CAAGGTGCCT GAGCCCTGCC CTTCAACGGT CACTCCAGCA 240
CCAGCCCAGC AGAAGACCAA GCAGAAGTAA 

Seq ID NO: 377 Protein sequence 
Protein Accession #: NP_005978

1 11 21 31 41 51 

I I I I I I 

MNSQQQKQPC TPPPQPQQQQ VKQPCQPPPQ EPCIPKTKEP CQPKVPEPCH PKVPEPCQPK 60 
IPEPCQPKVP EPCPSTVTPA PAQQKTKQK
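
Each DNA entry and the protein entry that follows it can be checked against one another by translating the stated coding range. The sketch below is illustrative and not part of the original listing: it assumes the standard genetic code, treats the coding range as 1-based and inclusive with a terminal stop codon, and uses Seq ID NO: 376 (coding sequence 1-270) and Seq ID NO: 377 as transcribed above. The helper names `clean` and `translate_cds` are hypothetical.

```python
import re

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(listing: str) -> str:
    """Remove spaces, line breaks, and position counters from listing rows."""
    return re.sub(r"[^A-Za-z]", "", listing).upper()


def translate_cds(dna: str, start: int, end: int) -> str:
    """Translate the 1-based inclusive range [start, end] and drop the stop."""
    cds = dna[start - 1:end]
    if len(cds) % 3 != 0:
        raise ValueError("coding range is not a whole number of codons")
    # Codons containing ambiguity codes such as N are reported as X.
    protein = "".join(CODON_TABLE.get(cds[i:i + 3], "X")
                      for i in range(0, len(cds), 3))
    if not protein.endswith("*"):
        raise ValueError("coding range does not end with a stop codon")
    return protein[:-1]


SEQ_376 = """
ATGAATTCTC AGCAGCAGAA GCAGCCTTGC ACCCCACCCC CTCAGCCTCA GCAGCAGCAG 60
GTGAAACAAC CTTGCCAGCC TCCACCCCAG GAACCATGCA TCCCCAAAAC CAAGGAGCCC 120
TGCCAACCCA AGGTGCCTGA GCCCTGCCAC CCCAAAGTGC CTGAGCCCTG CCAGCCCAAG 180
ATTCCAGAGC CCTGCCAGCC CAAGGTGCCT GAGCCCTGCC CTTCAACGGT CACTCCAGCA 240
CCAGCCCAGC AGAAGACCAA GCAGAAGTAA
"""

SEQ_377 = """
MNSQQQKQPC TPPPQPQQQQ VKQPCQPPPQ EPCIPKTKEP CQPKVPEPCH PKVPEPCQPK 60
IPEPCQPKVP EPCPSTVTPA PAQQKTKQK
"""

translated = translate_cds(clean(SEQ_376), 1, 270)
print(translated == clean(SEQ_377))
```

The same comparison can be applied to any DNA/protein pair in the listing; a mismatch at a single residue usually points to a transcription error in one of the two entries rather than to a biological difference.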

Seq ID NO: 378 DNA sequence 
Nucleic Acid Accession #: NM_002105 
Coding sequence: 74-505

1 11 21 31 41 51

I I I I I I

ACAGCAGTTA CACTGCGGCG GGCGTCTGTT CTAGTGTTTG AGCCGTCGTG CTTCACCGGT 60 

CTACCTCGCT AGCATGTCGG GCCGCGGCAA GACTGGCGGC AAGGCCCGCG CCAAGGCCAA 120

GTCGCGCTCG TCGCGCGCCG GCCTCCAGTT CCCAGTGGGC CGTGTACACC GGCTGCTGCG 180 

GAAGGGCCAC TACGCCGAGC GCGTTGGCGC CGGCGCGCCA GTGTACCTGG CGGCAGTGCT 240 

GGAGTACCTC ACCGCTGAGA TCCTGGAGCT GGCGGGCAAT GCGGCCCGCG ACAACAAGAA 300 

GACGCGAATC ATCCCCCGCC ACCTGCAGCT GGCCATCCGC AACGACGAGG AGCTCAACAA 360 

GCTGCTGGGC GGCGTGACGA TCGCCCAGGG AGGCGTCCTG CCCAACATCC AGGCCGTGCT 420

GCTGCCCAAG AAGACCAGCG CCACCGTGGG GCCGAAGGCG CCCTCGGGCG GCAAGAAGGC 480 

CACCCAGGCC TCCCAGGAGT ACTAAGAGGG CCCGCGCCGC GGCCGGCCGC CCCAGCTCCC 540 

CATGCCACCA CAAAGGCCCT TTTAAGGGCC ACCACCGCCC TCATGGAAAG AGCTGAGCCG 600 

CTTCAGACTG CGGGGCAAGC GGGCCGCGGC TCCCTTCCCC TCCCCTCCCC TCGCCCGCCT 660 

TCGCCGCCCG GCCTCGAGTC CCCGCCCGCC CCCGCTCCCG TCCCGCACCG CCTGCCGCGT 720

CGGCCTCGGG CCTGCCCTGT CCGCCGTCCG CCCTCCGGTA GGGTTCGGGC CTTCCGGATG 780 

CGGCTTGGGC GCTCTTCGGG GACCTCCGTG GCGCGGAAGA CCCGAGCCTG CCGGGGGGAG 840 




GCCGGCGGCG CCGCACCTGC CCGCCTCGGC GTTCGTGACT CAGCCGCCCC ATCCCGAGTC 900 

GCTAAGGGGC TGCGGGGAGG CCGCAGCACC TTCTGGAAGA CTTGGCCTTC CGCTCTGACG 960 

CAGGGCCGAG GTGGGCAGTC CAGGCCGAGA GCCGGCGGCC CTGAAGGTGA GTGAGGCCCT 1020 

CGGCAGCTGC AGCCGGGGTG TCTGGTACCC CCCCGGCGTG GTGCTTAGCC CAGGACTTTC 1080 

AGACGGCCGC TGGCCGGGAG GCTTTGGTGG GAGAGACGCG ATCGCCGATT TCGGTCTGGC 1140 

GCCCCTTCTG CGGCCGGGAC CCAGGCCTTT CACATCAGCT CTCCCTCCAT CTTCATTCAT 1200 

AGGTCTGCGC TGGGGCCGGG ACGAAGCACT TGGTAACAGG CACATCTTCC TCCCGAGTGA 1260 

CTGCCTCCTA GGAGGACATT TAGGGGAGGG CAGAGGCCTG CAGTTTGGCT TCACGGCTGG 1320 

CTATGTGGAC AGCAAGAGTC GTTTTGCGGA ACGCGACTGG CAGCCAGGCC TGTCGGGCCC 1380 

CCGACGCCGC CCCATTTCCC TTCCAGCAAA CTCAACTCGG CAATCCAAGC ACCTAGATAC 1440 

CAGCACAAGT CGGTTAATCC CTGTCTGGAC TGAGCCTCCG TTGGCTTCTG AACTGGAATT 1500 

CTGCAGCTAA CCCTTCCACG ACTAGAACCT TAGGCATTGG GGAGTTTTAG ATGGACTAAT 1560 
TTTATTAAAG GATTGTTTTT TTTTT 

Seq ID NO: 379 Protein sequence
Protein Accession #: NP_002096

1 11 21 31 41 51

I I I I I I

MSGRGKTGGK ARAKAKSRSS RAGLQFPVGR VHRLLRKGHY AERVGAGAPV YLAAVLEYLT 60
AEILELAGNA ARDNKKTRII PRHLQLAIRN DEELNKLLGG VTIAQGGVLP NIQAVLLPKK 120
TSATVGPKAP SGGKKATQAS QEY

Seq ID NO: 380 DNA sequence
Nucleic Acid Accession #: AL136942 
Coding sequence: 184-864 



1 11 21 31 41 51

I I I I I I

ACGCGTCCGG CAGAAGCTCG GAGCTCTCGG GGTATCGAGG AGGCAGGCCC GCGGGCGCAC 60
GGGCGAGCGG GCCGGGAGCC GGAGCGGCGG AGGAGCCGGC AGCAGCGGCG CGGCGGGCTC 120
CAGGCGAGGC GGTCGACGCT CCTGAAAACT TGCGCGCGCG CTCGCGCCAC TGCGCCCGGA 180
GCGATGAAGA TGGTCGCGCC CTGGACGCGG TTCTACTCCA ACAGCTGCTG CTTGTGCTGC 240
CATGTCCGCA CCGGCACCAT CCTGCTCGGC GTCTGGTATC TGATCATCAA TGCTGTGGTA 300
CTGTTGATTT TATTGAGTGC CCTGGCTGAT CCGGATCAGT ATAACTTTTC AAGTTCTGAA 360
CTGGGAGGTG ACTTTGAGTT CATGGATGAT GCCAACATGT GCATTGCCAT TGCGATTTCT 420
CTTCTCATGA TCCTGATATG TGCTATGGCT ACTTACGGAG CGTACAAGCA ACGCGCAGCC 480
TGGATCATCC CATTCTTCTG TTACCAGATC TTTGACTTTG CCCTGAACAT GTTGGTTGCA 540
ATCACTGTGC TTATTTATCC AAACTCCATT CAGGAATACA TACGGCAACT GCCTCCTAAT 600
TTTCCCTACA GAGATGATGT CATGTCAGTG AATCCTACCT GTTTGGTCCT TATTATTCTT 660
CTGTTTATTA GCATTATCTT GACTTTTAAG GGTTACTTGA TTAGCTGTGT TTGGAACTGC 720
TACCGATACA TCAATGGTAG GAACTCCTCT GATGTCCTGG TTTATGTTAC CAGCAATGAC 780
ACTACGGTGC TGCTACCCCC GTATGATGAT GCCACTGTGA ATGGTGCTGC CAAGGAGCCA 840
CCGCCACCTT ACGTGTCTGC CTAAGCCTTC AAGTGGGCGG AGCTGAGGGC AGCAGCTTGA 900
CTTTGCAGAC ATCTGAGCAA TAGTTCTGTT ATTTCACTTT TGCCATGAGC CTCTCTGAGC 960
TTGTTTGTTG CTGAAATGCT ACTTTTTAAA ATTTAGATGT TAGATTGAAA ACTGTAGTTT 1020
TCAACATATG CTTTGCTAGA ACACTGTGAT AGATTAACTG TAGAATTCTT CCTGTACGAT 1080
TGGGGATATA ACGGGCTTCA CTAACCTTCC CTAGGCATTG AAACTTCCCC CAAATCTGAT 1140
GGACCTAGAA GTCTGCTTTT GTACCTGCTG GGCCCCAAAG TTGGGCATTT TTCTCTCTGT 1200
TCCCTCTCTT TTGAAAATGT AAAATAAAAC CAAAAATAGA CAACTTTTTC TTCAGCCATT 1260
CCAGCATAGA GAACAAAACC TTATGGAAAC AGGAATGTCA ATTGTGTAAT CATTGTTCTA 1320
ATTAGGTAAA TAGAAGTCCT TATGTATGTG TTACAAGAAT TTCCCCCACA ACATCCTTTA 1380
TGACTGAAGT TCAATGACAG TTTGTGTTTG GTGGTAAAGG ATTTTCTCCA TGGCCTGAAT 1440
TAAGACCATT AGAAAGCACC AGGCCGTGGG AGCAGTGACC ATCTACTGAC TGTTCTTGTG 1500
GATCTTGTGT CCAGGGACAT GGGGTGACAT GCCTCGTATG TGTTAGAGGG TGGAATGGAT 1560
GTGTTTGGCG CTGCATGGGA TCTGGTGCCC CTCTTCTCCT GGATTCACAT CCCCACCCAG 1620
GGCCCGCTTV TACTAAGTGT TCTGCCCTAG ATTGGTTCAA GGAGGTCATC CAACTGACTT 1680
TATCAAGTGG AATTGGGATA TATTTGATAT ACTTCTGCCT AACAACATGG AAAAGGGTTT 1740
TCTTTTCCCT GCAAGCTACA TCCTACTGCT TTGAACTTCC AAGTATGTCT AGTCACCTTT 1800
TAAAATGTAA ACATTTTCAG AAAAATGAGG ATTGCCTTCC TTGTATGCGC TTTTTACCTT 1860
GACTACCTGA ATTGCAAGGG ATTTTTATAT ATTCATATGT TACAAAGTCA GCAACTCTCC 1920
TGTTGGTTCA TTATTGAATG TGCTGTAAAT TAAGTCGTTT GCAATTAAAA CAAGGTTTGC 1980
CCACATCCAA AAAAAAAAAA AAAAA


Seq ID NO: 381 Protein sequence
Protein Accession #: CAB66876

1 11 21 31 41 51

I I I I I I

MKMVAPWTRF YSNSCCLCCH VRTGTILLGV WYLIINAVVL LILLSALADP DQYNFSSSEL 60
GGDFEFMDDA NMCIAIAISL LMILICAMAT YGAYKQRAAW IIPFFCYQIF DFALNMLVAI 120
TVLIYPNSIQ EYIRQLPPNF PYRDDVMSVN PTCLVLIILL FISIILTFKG YLISCVWNCY 180
RYINGRNSSD VLVYVTSNDT TVLLPPYDDA TVNGAAKEPP PPYVSA

Seq ID NO: 382 DNA sequence 
Nucleic Acid Accession #: NM_002510 
Coding sequence: 92-1774 



1 11 21 31 41 51

I I I I I I

CAGATGCCAG AAGAACACTG TTGCTCTTGG TGGACGGGCC CAGAGGAATT CAGAGTTAAA 60
CCTTGAGTGC CTGCGTCCGT GAGAATTCAG CATGGAATGT CTCTACTATT TCCTGGGATT 120
TCTGCTCCTG GCTGCAAGAT TGCCACTTGA TGCCGCCAAA CGATTTCATG ATGTGCTGGG 180
CAATGAAAGA CCTTCTGCTT ACATGAGGGA GCACAATCAA TTAAATGGCT GGTCTTCTGA 240
TGAAAATGAC TGGAATGAAA AACTCTACCC AGTGTGGAAG CGGGGAGACA TGAGGTGGAA 300
AAACTCCTGG AAGGGAGGCC GTGTGCAGGC GGTCCTGACC AGTGACTCAC CAGCCCTCGT 360



GGGCTCAAAT ATAACATTTG CGGTGAACCT GATATTCCCT AGATGCCAAA AGGAAGATGC 420

CAATGGCAAC ATAGTCTATG AGAAGAACTG CAGAAATGAG GCTGGTTTAT CTGCTGATCC 480

ATATGTTTAC AACTGGACAG CATGGTCAGA GGACAGTGAC GGGGAAAATG GCACCGGCCA 540 

AAGCCATCAT AACGTCTTCC CTGATGGGAA ACCTTTTCCT CACCACCCCG GATGGAGAAG 600 

ATGGAATTTC ATCTACGTCT TCCACACACT TGGTCAGTAT TTCCAGAAAT TGGGACGATG 660 

TTCAGTGAGA GTTTCTGTGA ACACAGCCAA TGTGACACTT GGGCCTCAAC TCATGGAAGT 720 

GACTGTCTAC AGAAGACATG GACGGGCATA TGTTCCCATC GCACAAGTGA AAGATGTGTA 780 

CGTGGTAACA GATCAGATTC CTGTGTTTGT GACTATGTTC CAGAAGAACG ATCGAAATTC 840 

ATCCGACGAA ACCTTCCTCA AAGATCTCCC CATTATGTTT GATGTCCTGA TTCATGATCC 900 

TAGCCACTTC CTCAATTATT CTACCATTAA CTACAAGTGG AGCTTCGGGG ATAATACTGG 960 

CCTGTTTGTT TCCACCAATC ATACTGTGAA TCACACGTAT GTGCTCAATG GAACCTTCAG 1020 

CCTTAACCTC ACTGTGAAAG CTGCAGCACC AGGACCTTGT CCGCCACCGC CACCACCACC 1080

CAGACCTTCA AAACCCACCC CTTCTTTAGG ACCTGCTGGT GACAACCCCC TGGAGCTGAG 1140 

TAGGATTCCT GATGAAAACT GCCAGATTAA CAGATATGGC CACTTTCAAG CCACCATCAC 1200 

AATTGTAGAG GGAATCTTAG AGGTTAACAT CATCCAGATG ACAGACGTCC TGATGCCGGT 1260 

GCCATGGCCT GAAAGCTCCC TAATAGACTT TGTGGTGACC TGCCAAGGGA GCATTCCCAC 1320 

GGAGGTCTGT ACCATCATTT CTGACCCCAC CTGCGAGATC ACCCAGAACA CAGTCTGCAG 1380 

CCCTGTGGAT GTGGATGAGA TGTGTCTGCT GACTGTGAGA CGAACCTTCA ATGGGTCTGG 1440 

GACGTACTGT GTGAACCTCA CCCTGGGGGA TGACACAAGC CTGGCTCTCA CGAGCACCCT 1500 

GATTTCTGTT CCTGACAGAG ACCCAGCCTC GCCTTTAAGG ATGGCAAACA GTGCCCTGAT 1560 

CTCCGTTGGC TGCTTGGCCA TATTTGTCAC TGTGATCTCC CTCTTGGTGT ACAAAAAACA 1620 

CAAGGAATAC AACCCAATAG AAAATAGTCC TGGGAATGTG GTCAGAAGCA AAGGCCTGAG 1680 

TGTCTTTCTC AACCGTGCAA AAGCCGTGTT CTTCCCGGGA AACCAGGAAA AGGATCCGCT 1740 

ACTCAAAAAC CAAGAATTTA AAGGAGTTTC TTAAATTTCG ACCTTGTTTC TGAAGCTCAC 1800 

TTTTCAGTGC CATTGATGTG AGATGTGCTG GAGTGGCTAT TAACCTTTTT TTCCTAAAGA 1860 

TTATTGTTAA ATAGATATTG TGGTTTGGGG AAGTTGAATT TTTTATAGGT TAAATGTCAT 1920 

TTTAGAGATG GGGAGAGGGA TTATACTGCA GGCAGCTTCA GCCATGTTGT GAAACTGATA 1980 

AAAGCAACTT AGCAAGGCTT CTTTTCATTA TTTTTTATGT TTCACTTATA AAGTCTTAGG 2040 

TAACTAGTAG GATAGAAACA CTGTGTCCCG AGAGTAAGGA GAGAAGCTAC TATTGATTAG 2100 

AGCCTAACCC AGGTTAACTG CAAGAAGAGG CGGGATACTT TCAGCTTTCC ATGTAACTGT 2160 

ATGCATAAAG CCAATGTAGT CCAGTTTCTA AGATCATGTT CCAAGCTAAC TGAATCCCAC 2220 

TTCAATACAC ACTCATGAAC TCCTGATGGA ACAATAACAG GCCCAAGCCT GTGGTATGAT 2280 

GTGCACACTT GCTAGACTCA GAAAAAATAC TACTCTCATA AATGGGTGGG AGTATTTTGG 2340 

TGACAACCTA CTTTGCTTGG CTGAGTGAAG GAATGATATT CATATATTCA TTTATTCCAT 2400 

GGACATTTAG TTAGTGCTTT TTATATACCA GGCATGATGC TGAGTGACAC TCTTGTGTAT 2460 

ATTTCCAAAT TTTTGTATAG TCGCTGCACA TATTTGAAAT CATATATTAA GACTTTCCAA 2520 

AGATGAGGTC CCTGGTTTTT CATGGCAACT TGATCAGTAA GGATTTCACC TCTGTTTGTA 2580 

ACTAAAACCA TCTACTATAT GTTAGACATG ACATTCTTTT TCTCTCCTTC CTGAAAAATA 2640 
AAGTGTGGGA AGAGACAAAA AAAAAAAAA 

Seq ID NO: 383 Protein sequence 
Protein Accession #: NP_002501 

1 11 21 31 41 51 

I I I I I I 

MECLYYFLGF LLLAARLPLD AAKRFHDVLG NERPSAYMRE HNQLNGWSSD ENDWNEKLYP 60

VWKRGDMRWK NSWKGGRVQA VLTSDSPALV GSNITFAVNL IFPRCQKEDA NGNIVYEKNC 120

RNEAGLSADP YVYNWTAWSE DSDGENGTGQ SHHNVFPDGK PFPHHPGWRR WNFIYVFHTL 180 

GQYFQKLGRC SVRVSVNTAN VTLGPQLMEV TVYRRHGRAY VPIAQVKDVY VVTDQIPVFV 240

TMFQKNDRNS SDETFLKDLP IMFDVLIHDP SHFLNYSTIN YKWSFGDNTG LFVSTNHTVN 300 

HTYVLNGTFS LNLTVKAAAP GPCPPPPPPP RPSKPTPSLG PAGDNPLELS RIPDENCQIN 360 

RYGHFQATIT IVEGILEVNI IQMTDVLMPV PWPESSLIDF VVTCQGSIPT EVCTIISDPT 420

CEITQNTVCS PVDVDEMCLL TVRRTFNGSG TYCVNLTLGD DTSLALTSTL ISVPDRDPAS 480 

PLRMANSALI SVGCLAIFVT VISLLVYKKH KEYNPIENSP GNVVRSKGLS VFLNRAKAVF 540
FPGNQEKDPL LKNQEFKGVS 

Seq ID NO: 384 DNA sequence 
Nucleic Acid Accession #: NM_001134
Coding sequence: 48-1877 

1 11 21 31 41 51 

I I I I I I 

TCCATATTGT GCTTCCACCA CTGCCAATAA CAAAATAACT AGCAACCATG AAGTGGGTGG 60 

AATCAATTTT TTTAATTTTC CTACTAAATT TTACTGAATC CAGAACACTG CATAGAAATG 120 

AATATGGAAT AGCTTCCATA TTGGATTCTT ACCAATGTAC TGCAGAGATA AGTTTAGCTG 180 

ACCTGGCTAC CATATTTTTT GCCCAGTTTG TTCAAGAAGC CACTTACAAG GAAGTAAGCA 240 

AAATGGTGAA AGATGCATTG ACTGCAATTG AGAAACCCAC TGGAGATGAA CAGTCTTCAG 300 

GGTGTTTAGA AAACCAGCTA CCTGCCTTTC TGGAAGAACT TTGCCATGAG AAAGAAATTT 360 

TGGAGAAGTA CGGACATTCA GACTGCTGCA GCCAAAGTGA AGAGGGAAGA CATAACTGTT 420 

TTCTTGCACA CAAAAAGCCC ACTCCAGCAT CGATCCCACT TTTCCAAGTT CCAGAACCTG 480 

TCACAAGCTG TGAAGCATAT GAAGAAGACA GGGAGACATT CATGAACAAA TTCATTTATG 540 

AGATAGCAAG AAGGCATCCC TTCCTGTATG CACCTACAAT TCTTCTTTGG GCTGCTCGCT 600 

ATGACAAAAT AATTCCATCT TGCTGCAAAG CTGAAAATGC AGTTGAATGC TTCCAAACAA 660 

AGGCAGCAAC AGTTACAAAA GAATTAAGAG AAAGCAGCTT GTTAAATCAA CATGCATGTG 720 

CAGTAATGAA AAATTTTGGG ACCCGAACTT TCCAAGCCAT AACTGTTACT AAACTGAGTC 780 

AGAAGTTTAC CAAAGTTAAT TTTACTGAAA TCCAGAAACT AGTCCTGGAT GTGGCCCATG 840 

TACATGAGCA CTGTTGCAGA GGAGATGTGC TGGATTGTCT GCAGGATGGG GAAAAAATCA 900 

TGTCCTACAT ATGTTCTCAA CAAGACACTC TGTCAAACAA AATAACAGAA TGCTGCAAAC 960 

TGACCACGCT GGAACGTGGT CAATGTATAA TTCATGCAGA AAATGATGAA AAACCTGAAG 1020 

GTCTATCTCC AAATCTAAAC AGGTTTTTAG GAGATAGAGA TTTTAACCAA TTTTCTTCAG 1080 

GGGAAAAAAA TATCTTCTTG GCAAGTTTTG TTCATGAATA TTCAAGAAGA CATCCTCAGC 1140 

TTGCTGTCTC AGTAATTCTA AGAGTTGCTA AAGGATACCA GGAGTTATTG GAGAAGTGTT 1200 

TCCAGACTGA AAACCCTCTT GAATGCCAAG ATAAAGGAGA AGAAGAATTA CAGAAATACA 1260 

TCCAGGAGAG CCAAGCATTG GCAAAGCGAA GCTGCGGCCT CTTCCAGAAA CTAGGAGAAT 1320 

ATTACTTACA AAATGCGTTT CTCGTTGCTT ACACAAAGAA AGCCCCCCAG CTGACCTCGT 1380 

CGGAGCTGAT GGCCATCACC AGAAAAATGG CAGCCACAGC AGCCACTTGT TGCCAACTCA 1440 

GTGAGGACAA ACTATTGGCC TGTGGCGAGG GAGCGGCTGA CATTATTATC GGACACTTAT 1500 
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GTATCAGACA TGAAATGACT CCAGTAAACC CTGGTGTTGG CCAGTGCTGC ACTTCTTCAT 1560 

ATGCCAACAG GAGGCCATGC TTCAGCAGCT TGGTGGTGGA TGAAACATAT GTCCCTCCTG 1620 

CATTCTCTGA TGACAAGTTC ATTTTCCATA AGGATCTGTG CCAAGCTCAG GGTGTAGCGC 1680

TGCAAACGAT GAAGCAAGAG TTTCTCATTA ACCTTGTGAA GCAAAAGCCA CAAATAACAG 1740 

AGGAACAACT TGAGGCTGTC ATTGCAGATT TCTCAGGCCT GTTGGAGAAA TGCTGCCAAG 1800 

GCCAGGAACA GGAAGTCTGC TTTGCTGAAG AGGGACAAAA ACTGATTTCA AAAACTCGTG 1860 

CTGCTTTGGG AGTTTAAATT ACTTCAGGGG AAGAGAAGAC AAAACGAGTC TTTCATTCGG 1920 

TGTGAACTTT TCTCTTTAAT TTTAACTGAT TTAACACTTT TTGTGAATTA ATGAAATGAT 1980 
AAAGACTTTT ATGTGAGATT TCCTTATCAC AGAAATAAAA TATCTCCAAA TG 

Seq ID NO: 385 Protein sequence 
Protein Accession #: NP_001125 

1 11 21 31 41 51 

I I I I I I 

MKWVESIFLI FLLNFTESRT LHRNEYGIAS ILDSYQCTAE ISLADLATIF FAQFVQEATY 60

KEVSKMVKDA LTAIEKPTGD EQSSGCLENQ LPAFLEELCH EKEILEKYGH SDCCSQSEEG 120

RHNCFLAHKK PTPASIPLFQ VPEPVTSCEA YEEDRETFMN KFIYEIARRH PFLYAPTILL 180

WAARYDKIIP SCCKAENAVE CFQTKAATVT KELRESSLLN QHACAVMKNF GTRTFQAITV 240 

TKLSQKFTKV NFTEIQKLVL DVAHVHEHCC RGDVLDCLQD GEKIMSYICS QQDTLSNKIT 300 

ECCKLTTLER GQCIIHAEND EKPEGLSPNL NRFLGDRDFN QFSSGEKNIF LASFVHEYSR 360 

RHPQLAVSVI LRVAKGYQEL LEKCFQTENP LECQDKGEEE LQKYIQESQA LAKRSCGLFQ 420 

KLGEYYLQNA FLVAYTKKAP QLTSSELMAI TRKMAATAAT CCQLSEDKLL ACGEGAADII 480 

IGHLCIRHEM TPVNPGVGQC CTSSYANRRP CFSSLVVDET YVPPAFSDDK FIFHKDLCQA 540

QGVALQTMKQ EFLINLVKQK PQITEEQLEA VIADFSGLLE KCCQGQEQEV CFAEEGQKLI 600
SKTRAALGV 

Seq ID NO: 386 DNA sequence 

Nucleic Acid Accession #: NM_002205.1 

Coding sequence: 1..3149 

1 11 21 31 41 51 

I I I I I I

ATGGGGAGCC GGACGCCAGA GTCCCCTCTC CACGCCGTGC AGCTGCGCTG GGGCCCCCGG 60 

CGCCGACCCC CGCTSSTGCC GCTGCTGTTG CTGCTSSTGC CGCCGCCACC CAGGGTCGGG 120 

GGCTTCAACT TAGACGCGGA GGCCCCAGCA GTACTCTCGG GGCCCCCGGG CTCCTTCTTC 180 

GGATTCTCAG TGGAGTTTTA CCGGCCGGGA ACAGACGGGG TCAGTGTGCT GGTGGGAGCA 240 

CCCAAGGCTA ATACCAGCCA GCCAGGAGTG CTGCAGGGTG GTGCTGTCTA CCTCTGTCCT 300 

TGGGGTGCCA GCCCCACACA GTGCACCCCC ATTGAATTTG ACAGCAAAGG CTCTCGGCTC 360 

CTGGAGTCCT CACTGTCCAG CTCAGAGGGA GAGGAGCCTG TGGAGTACAA GTCCTTGCAG 420 

TGGTTCGGGG CAACAGTTCG AGCCCATGGC TCCTCCATCT TGGCATGCGC TCCACTGTAC 480 

AGCTGGCGCA CAGAGAAGGA GCCACTGAGC GACCCCGTGG GCACCTGCTA CCTCTCCACA 540 

GATAACTTCA CCCGAATTCT GGAGTATGCA CCCTGCCGCT CAGATTTCAG CTGGGCAGCA 600 

GGACAGGGTT ACTGCCAAGG AGGCTTCAGT GCCGAGTTCA CCAAGACTGG CCGTGTGGTT 660 

TTAGGTGGAC CAGGAAGCTA TTTCTGGCAA GGCCAGATCC TGTCTGCCAC TCAGGAGCAG 720 

ATTGCAGAAT CTTATTACCC CGAGTACCTG ATCAACCTGG TTCAGGGGCA GCTGCAGACT 780 

CGCCAGGCCA GTTCCATCTA TGATGACAGC TACCTAGGAT ACTCTGTGGC TGTTGGTGAA 840 

TTCAGTGGTG ATGACACAGA AGACTTTGTT GCTGGTGTGC CCAAAGGGAA CCTCACTTAC 900 

GGCTATGTCA CCATCCTTAA TGGCTCAGAC ATTCGATCCC TCTACAACTT CTCAGGGGAA 960 

CAGATGGCCT CCTACTTTGG CTATGCAGTG GCCGCCACAG ACGTCAATGG GGACGGGCTG 1020 

GATGACTTGC TGGTGGGGGC ACCCCTGCTC ATGGATCGGA CCCCTGACGG GCGGCCTCAG 1080 

GAGGTGGGCA GGGTCTACGT CTACCTGCAG CACCCAGCCG GCATAGAGCC CACGCCCACC 1140 

CTTACCCTCA CTGGCCATGA TGAGTTTGGC CGATTTGGCA GCTCCTTGAC CCCCCTGGGG 1200 

GACCTGGACC AGGATGGCTA CAATGATGTG GCCATCGGGG CTCCCTTTGG TGGGGAGACC 1260 

CAGCAGGGAG TAGTGTTTGT ATTTCCTGGG GGCCCAGGAG GGCTGGGCTC TAAGCCTTCC 1320 

CAGGTTCTGC AGCCCCTGTG GGCAGCCAGC CACACCCCAG ACTTCTTTGG CTCTGCCCTT 1380 

CGAGGAGGCC GAGACCTGGA TGGCAATGGA TATCCTGATC TGATTGTGGG GTCCTTTGGT 1440 

GTGGACAAGG CTGTGGTATA CAGGGGCCGC CCCATCGTGT CCGCTAGTGC CTCCCTCACC 1500 

ATCTTCCCCG CCATGTTCAA CCCAGAGGAG CGGAGCTGCA GCTTAGAGGG GAACCCTGTG 1560 

GCCTGCATCA ACCTTAGCTT CTGCCTCAAT GCTTCTGGAA AACACGTTGC TGACTCCATT 1620 

GGTTTCACAG TGGAACTTCA GCTGGACTGG CAGAAGCAGA AGGGAGGGGT ACGGCGGGCA 1680 

CTGTTCCTGG CCTCCAGGCA GGCAACCCTG ACCCAGACCC TGCTCATCCA GAATGGGGCT 1740 

CGAGAGGATT GCAGAGAGAT GAAGATCTAC CTCAGGAACG AGTCAGAATT TCGAGACAAA 1800 

CTCTCGCCGA TTCACATCGC TCTCAACTTC TCCTTGGACC CCCAAGCCCC AGTGGACAGC 1860 

CACGGCCTCA GGCCAGCCCT ACATTATCAG AGCAAGAGCC GGATAGAGGA CAAGGCTCAG 1920 

ATCTTGCTGG ACTGTGGAGA AGACAACATC TGTGTGCCTG ACCTGCAGCT GGAAGTGTTT 1980 

GGGGAGCAGA ACCATGTGTA CCTGGGTGAC AAGAATGCCC TGAACCTCAC TTTCCATGCC 2040 

CAGAATGTGG GTGAGGGTGG CGCCTATGAG GCTGAGCTTC GGGTCACCGC CCCTCCAGAG 2100 

GCTGAGTACT CAGGACTCGT CAGACACCCA GGGAACTTCT CCAGCCTGAG CTGTGACTAC 2160 

TTTGCCGTGA ACCAGAGCCG CCTGCTGGTG TGTGACCTGG GCAACCCCAT GAAGGCAGGA 2220 

GCCAGTCTGT GGGGTGGCCT TCGGTTTACA GTCCCTCATC TCCGGGACAC TAAGAAAACC 2280 

ATCCAGTTTG ACTTCCAGAT CCTCAGCAAG AATCTCAACA ACTCGCAAAG CGACGTGGTT 2340 

TCCTTTCGGC TCTCCGTGGA GGCTCAGGCC CAGGTCACCC TGAACGGTGT CTCCAAGCCT 2400 

GAGGCAGTGC TATTCCCAGT AAGCGACTGG CATCCCCGAG ACCAGCCTCA GAAGGAGGAG 2460 

GACCTGGGAC CTGCTGTCCA CCATGTCTAT GAGCTCATCA ACCAAGGCCC CAGCTCCATT 2520 

AGCCAGGGTG TGCTGGAACT CAGCTGTCCC CAGGCTCTGG AAGGTCAGCA GCTCCTATAT 2580 

GTGACCAGAG TTACGGGACT CAACTGCACC ACCAATCACC CCATTAACCC AAAGGGCCTG 2640 

GAGTTGGATC CCGAGGGTTC CCTGCACCAC CAGCAAAAAC GGGAAGCTCC AAGCCGCAGC 2700 

TCTGCTTCCT CGGGACCTCA GATCCTGAAA TGCCCGGAGG CTGAGTGTTT CAGGCTGCGC 2760 

TGTGAGCTCG GGCCCCTGCA CCAACAAGAG AGCCAAAGTC TGCAGTTGCA TTTCCGAGTC 2820 

TGGGCCAAGA CTTTCTTGCA GCGGGAGCAC CAGCCATTTA GCCTGCAGTG TGAGGCTGTG 2880

TACAAAGCCC TGAAGATGCC CTACCGAATC CTGCCTCGGC AGCTGCCCCA AAAAGAGCGT 2940 

CAGGTGGCCA CAGCTGTGCA ATGGACCAAG GCAGAAGGCA GCTATGGCGT CCCACTGTGG 3000 

ATCATCATCC TAGCCATCCT GTTTGGCCTC CTGCTCCTAG GTCTACTCAT CTACATCCTC 3060 

TACAAGCTTG GATTCTTCAA ACGCTCCCTC CCATATGGCA CCGCCATGGA AAAAGCTCAG 3120 
CTCAAGCCTC CAGCCACCTC TGATGCCTGA

Seq ID NO: 387 Protein sequence
Protein Accession #: NP_002196.1

1 11 21 31 41 51

I I I I I I 

MGSRTPESPL HAVQLRWGPR RRPPLLPLLL LLLPPPPRVG GFNLDAEAPA VLSGPPGSFF 60

GFSVEFYRPG TDGVSVLVGA PKANTSQPGV LQGGAVYLCP WGASPTQCTP IEFDSKGSRL 120 

LESSLSSSEG EEPVEYKSLQ WFGATVRAHG SSILACAPLY SWRTEKEPLS DPVGTCYLST 180

DNFTRILEYA PCRSDFSWAA GQGYCQGGFS AEFTKTGRVV LGGPGSYFWQ GQILSATQEQ 240

IAESYYPEYL INLVQGQLQT RQASSIYDDS YLGYSVAVGE FSGDDTEDFV AGVPKGNLTY 300

GYVTILNGSD IRSLYNFSGE QMASYFGYAV AATDVNGDGL DDLLVGAPLL MDRTPDGRPQ 360 

EVGRVYVYLQ HPAGIEPTPT LTLTGHDEFG RFGSSLTPLG DLDQDGYNDV AIGAPFGGET 420 

QQGVVFVFPG GPGGLGSKPS QVLQPLWAAS HTPDFFGSAL RGGRDLDGNG YPDLIVGSFG 480

VDKAVVYRGR PIVSASASLT IFPAMFNPEE RSCSLEGNPV ACINLSFCLN ASGKHVADSI 540

GFTVELQLDW QKQKGGVRRA LFLASRQATL TQTLLIQNGA REDCREMKIY LRNESEFRDK 600 

LSPIHIALNF SLDPQAPVDS HGLRPALHYQ SKSRIEDKAQ ILLDCGEDNI CVPDLQLEVF 660

GEQNHVYLGD KNALNLTFHA QNVGEGGAYE AELRVTAPPE AEYSGLVRHP GNFSSLSCDY 720 

FAVNQSRLLV CDLGNPMKAG ASLWGGLRFT VPHLRDTKKT IQFDFQILSK NLNNSQSDVV 780

SFRLSVEAQA QVTLNGVSKP EAVLFPVSDW HPRDQPQKEE DLGPAVHHVY ELINQGPSSI 840

SQGVLELSCP QALEGQQLLY VTRVTGLNCT TNHPINPKGL ELDPEGSLHH QQKREAPSRS 900 

SASSGPQILK CPEAECFRLR CELGPLHQQE SQSLQLHFRV WAKTFLQREH QPFSLQCEAV 960

YKALKMPYRI LPRQLPQKER QVATAVQWTK AEGSYGVPLW IIILAILFGL LLLGLLIYIL 1020 
YKLGFFKRSL PYGTAMEKAQ LKPPATSDA 






Seq ID NO: 388 DNA sequence

Nucleic Acid Accession #: NM_002425

Coding sequence: 26..1453

1 11 21 31 41 51

I I I I I I 

AAAGAAGGTA AGGGCAGTGA GAATGATGCA TCTTGCATTC CTTGTGCTGT TGTGTCTGCC 60 

AGTCTGCTCT GCCTATCCTC TGAGTGGGGC AGCAAAAGAG GAGGACTCCA ACAAGGATCT 120 

TGCCCAGCAA TACCTAGAAA AGTACTACAA CCTCGAAAAG GATGTGAAAC AGTTTAGAAG 180 

AAAGGACAGT AATCTCATTG TTAAAAAAAT CCAAGGAATG CAGAAGTTCC TTGGGTTGGA 240

GGTGACAGGG AAGCTAGACA CTGACACTCT GGAGGTGATG CGCAAGCCCA GGTGTGGAGT 300

TCCTGACGTT GGTCACTTCA GCTCCTTTCC TGGCATGCCG AAGTGGAGGA AAACCCACCT 360

TACATACAGG ATTGTGAATT ATACACCAGA TTTGCCAAGA GATGCTGTTG ATTCTGCCAT 420 

TGAGAAAGCT CTGAAAGTCT GGGAAGAGGT GACTCCACTC ACATTCTCCA GGCTGTATGA 480 

AGGAGAGGCT GATATAATGA TCTCTTTCGC AGTTAAAGAA CATGGAGACT TTTACTCTTT 540

TGATGGCCCA GGACACAGTT TGGCTCATGC CTACCCACCT GGACCTGGGC TTTATGGAGA 600 

TATTCACTTT GATGATGATG AAAAATGGAC AGAAGATGCA TCAGGCACCA ATTTATTCCT 660 

CGTTGCTGCT CATGAACTTG GCCACTCCCT GGGGCTCTTT CACTCAGCCA ACACTGAAGC 720 

TTTGATGTAC CCACTCTACA ACTCATTCAC AGAGCTCGCC CAGTTCCGCC TTTCGCAAGA 780 

TGATGTGAAT GGCATTCAGT CTCTCTACGG ACCTCCCCCT GCCTCTACTG AGGAACCCCT 840

GGTGCCCACA AAATCTGTTC CTTCGGGATC TGAGATGCCA GCCAAGTGTG ATCCTGCTTT 900 

GTCCTTCGAT GCCATCAGCA CTCTGAGGGG AGAATATCTG TTCTTTAAAG ACAGATATTT 960 

TTGGCGAAGA TCCCACTGGA ACCCTGAACC TGAATTTCAT TTGATTTCTG CATTTTGGCC 1020 

CTCTCTTCCA TCATATTTGG ATGCTGCATA TGAAGTTAAC AGCAGGGACA CCGTTTTTAT 1080 

TTTTAAAGGA AATGAGTTCT GGGCCATCAG AGGAAATGAG GTACAAGCAG GTTATCCAAG 1140

AGGCATCCAT ACCCTGGGTT TTCCTCCAAC CATAAGGAAA ATTGATGCAG CTGTTTCTGA 1200 

CAAGGAAAAG AAGAAAACAT ACTTCTTTGC AGCGGACAAA TACTGGAGAT TTGATGAAAA 1260 

TAGCCAGTCC ATGGAGCAAG GCTTCCCTAG ACTAATAGCT GATGACTTTC CAGGAGTTGA 1320 

GCCTAAGGTT GATGCTGTAT TACAGGCATT TGGATTTTTC TACTTCTTCA GTGGATCATC 1380 

ACAGTTTGAG TTTGACCCCA ATGCCAGGAT GGTGACACAC ATATTAAAGA GTAACAGCTG 1440

GTTACATTGC TAGGCGAGAT AGGGGGAAGA CAGATATGGG TGTTTTTAAT AAATCTAATA 1500 

ATTATTCATC TAATGTATTA TGAGCCAAAA TGGTTAATTT TTCCTGCATG TTCTGTGACT 1560 

GAAGAAGATG AGCCTTGCAG ATATCTGCAT GTGTCATGAA GAATGTTTCT GGAATTCTTC 1620 

ACTTGCTTTT GAATTGCACT GAACAGAATT AAGAAATACT CATGTGCAAT AGGTGAGAGA 1680 

ATGTATTTTC ATAGATGTGT TATTACTTCC TCAATAAAAA GTTTTATTTT GGGCCTGTTC 1740
CTT 



Seq ID NO: 389 Protein sequence 
Protein Accession #: NP_002416



1 11 21 31 41 51 

I I I I I I 

MHLAFLVLLC LPVCSAYPLS GAAKEEDSNK DLAQQYLEKY YNLEKDVKQF RRKDSNLIVK 60 

KIQGMQKFLG LEVTGKLDTD TLEVMRKPRC GVPDVGHFSS FPGMPKWRKT HLTYRIVNYT 120 

PDLPRDAVDS AIEKALKVWE EVTPLTFSRL YEGEADIMIS FAVKEHGDFY SFDGPGHSLA 180

HAYPPGPGLY GDIHFDDDEK WTEDASGTNL FLVAAHELGH SLGLFHSANT EALMYPLYNS 240 

FTELAQFRLS QDDVNGIQSL YGPPPASTEE PLVPTKSVPS GSEMPAKCDP ALSFDAISTL 300 

RGEYLFFKDR YFWRRSHWNP EPEFHLISAF WPSLPSYLDA AYEVNSRDTV FIFKGNEFWA 360 

IRGNEVQAGY PRGIHTLGFP PTIRKIDAAV SDKEKKKTYF FAADKYWRFD ENSQSMEQGF 420 
PRLIADDFPG VEPKVDAVLQ AFGFFYFFSG SSQFEFDPNA RMVTHILKSN SWLHC 



Seq ID NO: 390 DNA sequence 

Nucleic Acid Accession #: NM_002421.2 

Coding sequence: 1..1409 



1 11 21 31 41 51 

I I I I I I

ATGCACAGCT TTCCTCCACT GCTGCTGCTG CTGTTCTGGG GTGTGGTGTC ACACAGCTTC 60 

CCAGCGACTC TAGAAACACA AGAGCAAGAT GTGGACTTAG TCCAGAAATA CCTGGAAAAA 120 

TACTACAACC TGAAGAATGA TGGGAGGCAA GTTGAAAAGC GGAGAAATAG TGGCCCAGTG 180

GTTGAAAAAT TGAAGCAAAT GCAGGAATTC TTTGGGCTGA AAGTGACTGG GAAACCAGAT 240 

GCTGAAACCC TGAAGGTGAT GAAGCAGCCC AGATGTGGAG TGCCTGATGT GGCTCAGTTT 300

GTCCTCACTG AGGGGAACCC TCGCTGGGAG CAAACACATC TGACCTACAG GATTGAAAAT 360

TACACGCCAG ATTTGCCAAG AGCAGATGTG GACCATGCCA TTGAGAAAGC CTTCCAACTC 420

TGGAGTAATG TCACACCTCT GACATTCACC AAGGTCTCTG AGGGTCAAGC AGACATCATG 480

ATATCTTTTG TCAGGGGAGA TCATCGGGAC AACTCTCCTT TTGATGGACC TGGAGGAAAT 540

CTTGCTCATG CTTTTCAACC AGGCCCAGGT ATTGGAGGGG ATGCTCATTT TGATGAAGAT 600

GAAAGGTGGA CCAACAATTT CAGAGAGTAC AACTTACATC GTGTTGCGGC TCATGAACTC 660

GGCCATTCTC TTGGACTCTC CCATTCTACT GATATCGGGG CTTTGATGTA CCCTAGCTAC 720

ACCTTCAGTG GTGATGTTCA GCTAGCTCAG GATGACATTG ATGGCATCCA AGCCATATAT 780

GGACGTTCCC AAAATCCTGT CCAGCCCATC GGCCCACAAA CCCCAAAAGC ATGTGACAGT 840

AAGCTAACCT TTGATGCTAT AACTACGATT CGGGGAGAAG TGATGTTCTT TAAAGACAGA 900

TTCTACATGC GCACAAATCC CTTCTACCCG GAAGTTGAGC TCAATTTCAT TTCTGTTTTC 960

TGGCCACAAC TGCCAAATGG GCTTGAAGCT GCTTACGAAT TTGCCGACAG AGATGAAGTC 1020

CGGTTTTTCA AAGGGAATAA GTACTGGGCT GTTCAGGGAC AGAATGTGCT ACACGGATAC 1080

CCCAAGGACA TCTACAGCTC CTTTGGCTTC CCTAGAACTG TGAAGCATAT CGATGCTGCT 1140

CTTTCTGAGG AAAACACTGG AAAAACCTAC TTCTTTGTTG CTAACAAATA CTGGAGGTAT 1200

GATGAATATA AACGATCTAT GGATCCAGGT TATCCCAAAA TGATAGCACA TGACTTTCCT 1260

GGAATTGGCC ACAAAGTTGA TGCAGTTTTC ATGAAAGATG GATTTTTCTA TTTCTTTCAT 1320

GGAACAAGAC AATACAAATT TGATCCTAAA ACGAAGAGAA TTTTGACTCT CCAGAAAGCT 1380
AATAGCTGGT TCAACTGCAG GAAAAATTAG

Seq ID NO: 391 Protein sequence
Protein Accession #: NP_002412.1

1 11 21 31 41 51

I I I I I I

MHSFPPLLLL LFWGVVSHSF PATLETQEQD VDLVQKYLEK YYNLKNDGRQ VEKRRNSGPV 60

VEKLKQMQEF FGLKVTGKPD AETLKVMKQP RCGVPDVAQF VLTEGNPRWE QTHLTYRIEN 120

YTPDLPRADV DHAIEKAFQL WSNVTPLTFT KVSEGQADIM ISFVRGDHRD NSPFDGPGGN 180

LAHAFQPGPG IGGDAHFDED ERWTNNFREY NLHRVAAHEL GHSLGLSHST DIGALMYPSY 240

TFSGDVQLAQ DDIDGIQAIY GRSQNPVQPI GPQTPKACDS KLTFDAITTI RGEVMFFKDR 300

FYMRTNPFYP EVELNFISVF WPQLPNGLEA AYEFADRDEV RFFKGNKYWA VQGQNVLHGY 360

PKDIYSSFGF PRTVKHIDAA LSEENTGKTY FFVANKYWRY DEYKRSMDPG YPKMIAHDFP 420
GIGHKVDAVF MKDGFFYFFH GTRQYKFDPK TKRILTLQKA NSWFNCRKN

Seq ID NO: 392 DNA sequence

Nucleic Acid Accession #: NM_002421.2

Coding sequence: 1..1409

1 11 21 31 41 51

I I I I I I

ATGCACAGCT TTCCTCCACT GCTGCTGCTG CTGTTCTGGG GTGTGGTGTC ACACAGCTTC 60

CCAGCGACTC TAGAAACACA AGAGCAAGAT GTGGACTTAG TCCAGAAATA CCTGGAAAAA 120

TACTACAACC TGAAGAATGA TGGGAGGCAA GTTGAAAAGC GGAGAAATAG TGGCCCAGTG 180

GTTGAAAAAT TGAAGCAAAT GCAGGAATTC TTTGGGCTGA AAGTGACTGG GAAACCAGAT 240

GCTGAAACCC TGAAGGTGAT GAAGCAGCCC AGATGTGGAG TGCCTGATGT GGCTCAGTTT 300

GTCCTCACTG AGGGGAACCC TCGCTGGGAG CAAACACATC TGACCTACAG GATTGAAAAT 360

TACACGCCAG ATTTGCCAAG AGCAGATGTG GACCATGCCA TTGAGAAAGC CTTCCAACTC 420

TGGAGTAATG TCACACCTCT GACATTCACC AAGGTCTCTG AGGGTCAAGC AGACATCATG 480

ATATCTTTTG TCAGGGGAGA TCATCGGGAC AACTCTCCTT TTGATGGACC TGGAGGAAAT 540

CTTGCTCATG CTTTTCAACC AGGCCCAGGT ATTGGAGGGG ATGCTCATTT TGATGAAGAT 600

GAAAGGTGGA CCAACAATTT CAGAGAGTAC AACTTACATC GTGTTGCGGC TCATGCCCTC 660

GGCCATTCTC TTGGACTCTC CCATTCTACT GATATCGGGG CTTTGATGTA CCCTAGCTAC 720

ACCTTCAGTG GTGATGTTCA GCTAGCTCAG GATGACATTG ATGGCATCCA AGCCATATAT 780

GGACGTTCCC AAAATCCTGT CCAGCCCATC GGCCCACAAA CCCCAAAAGC ATGTGACAGT 840

AAGCTAACCT TTGATGCTAT AACTACGATT CGGGGAGAAG TGATGTTCTT TAAAGACAGA 900

TTCTACATGC GCACAAATCC CTTCTACCCG GAAGTTGAGC TCAATTTCAT TTCTGTTTTC 960

TGGCCACAAC TGCCAAATGG GCTTGAAGCT GCTTACGAAT TTGCCGACAG AGATGAAGTC 1020

CGGTTTTTCA AAGGGAATAA GTACTGGGCT GTTCAGGGAC AGAATGTGCT ACACGGATAC 1080

CCCAAGGACA TCTACAGCTC CTTTGGCTTC CCTAGAACTG TGAAGCATAT CGATGCTGCT 1140

CTTTCTGAGG AAAACACTGG AAAAACCTAC TTCTTTGTTG CTAACAAATA CTGGAGGTAT 1200

GATGAATATA AACGATCTAT GGATCCAGGT TATCCCAAAA TGATAGCACA TGACTTTCCT 1260

GGAATTGGCC ACAAAGTTGA TGCAGTTTTC ATGAAAGATG GATTTTTCTA TTTCTTTCAT 1320

GGAACAAGAC AATACAAATT TGATCCTAAA ACGAAGAGAA TTTTGACTCT CCAGAAAGCT 1380
AATAGCTGGT TCAACTGCAG GAAAAATTAG

Seq ID NO: 393 Protein sequence
Protein Accession #: NP_002412.1

1 11 21 31 41 51

I I I I I I

MHSFPPLLLL LFWGVVSHSF PATLETQEQD VDLVQKYLEK YYNLKNDGRQ VEKRRNSGPV 60

VEKLKQMQEF FGLKVTGKPD AETLKVMKQP RCGVPDVAQF VLTEGNPRWE QTHLTYRIEN 120

YTPDLPRADV DHAIEKAFQL WSNVTPLTFT KVSEGQADIM ISFVRGDHRD NSPFDGPGGN 180

LAHAFQPGPG IGGDAHFDED ERWTNNFREY NLHRVAAHAL GHSLGLSHST DIGALMYPSY 240

TFSGDVQLAQ DDIDGIQAIY GRSQNPVQPI GPQTPKACDS KLTFDAITTI RGEVMFFKDR 300

FYMRTNPFYP EVELNFISVF WPQLPNGLEA AYEFADRDEV RFFKGNKYWA VQGQNVLHGY 360

PKDIYSSFGF PRTVKHIDAA LSEENTGKTY FFVANKYWRY DEYKRSMDPG YPKMIAHDFP 420
GIGHKVDAVF MKDGFFYFFH GTRQYKFDPK TKRILTLQKA NSWFNCRKN







Seq ID NO: 394 DNA sequence 

Nucleic Acid Accession #: NM_014331.2

Coding sequence: 1..1506

1 11 21 31 41 51

I I I I I I

ATGGTCAGAA AGCCTGTTGT GTCCACCATC TCCAAAGGAG GTTACCTGCA GGGAAATGTT 60

AACGGGAGGC TGCCTTCCCT GGGCAACAAG GAGCCACCTG GGCAGGAGAA AGTGCAGCTG 120 

AAGAGGAAAG TCACTTTACT GAGGGGAGTC TCCATTATCA TTGGCACCAT CATTGGAGCA 180 

GGAATCTTCA TCTCTCCTAA GGGCGTGCTC CAGAACACGG GCAGCGTGGG CATGTCTCTG 240 

ACCATCTGGA CGGTGTGTGG GGTCCTGTCA CTATTTGGAG CTTTGTCTTA TGCTGAATTG 300 

GGAACAACTA TAAAGAAATC TGGAGGTCAT TACACATATA TTTTGGAAGT CTTTGGTCCA 360

TTACCAGCTT TTGTACGAGT CTGGGTGGAA CTCCTCATAA TACGCCCTGC AGCTACTGCT 420 

GTGATATCCC TGGCATTTGG ACGCTACATT CTGGAACCAT TTTTTATTCA ATGTGAAATC 480 

CCTGAACTTG CGATCAAGCT CATTACAGCT GTGGGCATAA CTGTAGTGAT GGTCCTAAAT 540 

AGCATGAGTG TCAGCTGGAG CGCCCGGATC CAGATTTTCT TAACCTTTTG CAAGCTCACA 600 

GCAATTCTGA TAATTATAGT CCCTGGAGTT ATGCAGCTAA TTAAAGGTCA AACGCAGAAC 660 

TTTAAAGACG CGTTTTCAGG AAGAGATTCA AGTATTACGC GGTTGCCACT GGCTTTTTAT 720

TATGGAATGT ATGCATATGC TGGCTGGTTT TACCTCAACT TTGTTACTGA AGAAGTAGAA 780 

AACCCTGAAA AAACCATTCC CCTTGCAATA TGTATATCCA TGGCCATTGT CACCATTGGC 840 

TATGTGCTGA CAAATGTGGC CTACTTTACG ACCATTAATG CTGAGGAGCT GCTGCTTTCA 900

AATGCAGTGG CAGTGACCTT TTCTGAGCGG CTACTGGGAA ATTTCTCATT AGCAGTTCCG 960 

ATCTTTGTTG CCCTCTCCTG CTTTGGCTCC ATGAACGGTG GTGTGTTTGC TGTCTCCAGG 1020 

TTATTCTATG TTGCGTCTCG AGAGGGTCAC CTTCCAGAAA TCCTCTCCAT GATTCATGTC 1080 

CGCAAGCACA CTCCTCTACC AGCTGTTATT GTTTTGCACC CTTTGACAAT GATAATGCTC 1140 

TTCTCTGGAG ACCTCGACAG TCTTTTGAAT TTCCTCAGTT TTGCCAGGTG GCTTTTTATT 1200 

GGGCTGGCAG TTGCTGGGCT GATTTATCTT CGATACAAAT GCCCAGATAT GCATCGTCCT 1260 

TTCAAGGTGC CACTGTTCAT CCCAGCTTTG TTTTCCTTCA CATGCCTCTT CATGGTTGCC 1320

CTTTCCCTCT ATTCGGACCC ATTTAGTACA GGGATTGGCT TCGTCATCAC TCTGACTGGA 1380 

GTCCCTGCGT ATTATCTCTT TATTATATGG GACAAGAAAC CCAGGTGGTT TAGAATAATG 1440 

TCAGAGAAAA TAACCAGAAC ATTACAAATA ATACTGGAAG TTGTACCAGA AGAAGATAAG 1500 

TTATGAACTA ATGGACTTGA GATCTTGGCA ATCTGCCCAA GGGGAGACAC AAAATAGGGA 1560 

TTTTTACTTC ATTTTCTGAA AGTCTAGAGA ATTACAACTT TGGTGATAAA CAAAAGGAGT 1620 

CAGTTATTTT TATTCATATA TTTTAGCATA TTCGAACTAA TTTCTAAGAA ATTTAGTTAT 1680 

AACTCTATGT AGTTATAGAA AGTGAATATG CAGTTATTCT ATGAGTCGCA CAATTCTTGA 1740 

GTCTCTGATA CCTACCTATT GGGGTTAGGA GAAAAGACTA GACAATTACT ATGTGGTCAT 1800 

TCTCTACAAC ATATGTTAGC ACGGCAAAGA ACCTTCAAAT TGAAGACTGA GATTTTTCTG 1860 

TATATATGGG TTTTGTAAAG ATGGTTTXAC ACACTACAGA TGTCTATACT GTGAAAAGTG 1920 

TTTTCAATTC TGAAAAAAAG CATACATCAT GATTATGGCA AAGAGGAGAG AAAGAAATTT 1980 

ATTTTACATT GACATTGCAT TGCTTCCCCT TAGATACCAA TTTAGATAAC AAACACTCAT 2040 

GCTTTAATGG ATTATACCCA GAGCACTTTG AACAAAGGTC AGTGGGGATT GTTGAATACA 2100 

TTAAAGAAGA GTTTCTAGGG GCTACTGTTT ATGAGACACA TCCAGGAGTT ATGTTTAAGT 2160 

AAAAATCCTT GAGAATTTAT TATGTCAGAT GTTTTTTCAT TCATTATCAG GAAGTTTTAG 2220 

TTATCTGTCA TTTTTTTTTT TCACATCAGT TTGATCAGGA AAGTGTATAA CACATCTTAG 2280 

AGCAAGAGTT AGTTTGGTAT TAAATCCTCA TTAGAACAAC CACCTGTTTC ACTAATAACT 2340 

TACCCCTGAT GAGTCTATCT AAACATATGC ATTTTAAGCC TTCAAATTAC ATTATCAACA 2400 

TGAGAGAAAT AACCAACAAA GAAGATGTTC AAAATAATAG TCCCATATCT GTAATCATAT 2460 

CTACATGCAA TGTTAGTAAT TCTGAAGTTT TTTAAATTTA TGGCTATTTT TACACGATGA 2520 

TGAATTTTGA CAGTTTGTGC ATTTTCTTTA TACATTTTAT ATTCTTCTGT TAAAATATCT 2580 

CTTCAGATGA AACTGTCCAG ATTAATTAGG AAAAGGCATA TATTAACATA AAAATTGCAA 2640 

AAGAAATGTC GCTGTAAATA AGATTTACAA CTGATGTTTC TAGAAAATTT CCACTTCTAT 2700 

ATCTAGGCTT TGTCAGTAAT TTCCACACCT TAATTATCAT TCAACTTGCA AAAGAGACAA 2760 

CTGATAAGAA GAAAATTGAA ATGAGAATCT GTGGATAAGT GTTTGTGTTC AGAAGATGTT 2820 

GTTTTGCCAG TATTAGAAAA TACTGTGAGC CGGGCATGGT GGCTTACATC TGTAATCCCA 2880 

GCACTTTGGG AGGCTGAGGG GGTGGATCAC CTGAGGTCGG GAGTTCTAGA CCAGCCTGAC 2940 

CAACATGGAG AAACCCCATC TCTACTAAAA ATACAAAATT AGCTGGGCAT GGTGGCACAT 3000 

GCTGGTAATC TCAGCTATTG AGGAGGCTGA GGCAGGAGAA TTGCTTGAAC CCGGGAGGCG 3060 

GAGGTTGCAG TGAGCCAAGA TTGCACCACT GTACTCCAGC CTGGGTGACA AAGTCAGACT 3120 
CCATCTCCAA AAAAAAAAAA AAAA 

Seq ID NO: 395 Protein sequence 
Protein Accession #: NP_055146.1

1 11 21 31 41 51 

I I I I I I 

MVRKPVVSTI SKGGYLQGNV NGRLPSLGNK EPPGQEKVQL KRKVTLLRGV SIIIGTIIGA 60

GIFISPKGVL QNTGSVGMSL TIWTVCGVLS LFGALSYAEL GTTIKKSGGH YTYILEVFGP 120 

LPAFVRVWVE LLIIRPAATA VISLAFGRYI LEPFFIQCEI PELAIKLITA VGITVVMVLN 180

SMSVSWSARI QIFLTFCKLT AILIIIVPGV MQLIKGQTQN FKDAFSGRDS SITRLPLAFY 240 

YGMYAYAGWF YLNFVTEEVE NPEKTIPLAI CISMAIVTIG YVLTNVAYFT TINAEELLLS 300

NAVAVTFSER LLGNFSLAVP IFVALSCFGS MNGGVFAVSR LFYVASREGH LPEILSMIHV 360 

RKHTPLPAVI VLHPLTMIML FSGDLDSLLN FLSFARWLFI GLAVAGLIYL RYKCPDMHRP 420

FKVPLFIPAL FSFTCLFMVA LSLYSDPFST GIGFVITLTG VPAYYLFIIW DKKPRWFRIM 480 
SEKITRTLQI ILEVVPEEDK L



Seq ID NO: 396 DNA sequence 
Nucleic Acid Accession #: NM_006528
Coding sequence: 57..764

1 11 21 31 41 51 

I I I I I I 

GCCGCCAGCG GCTTTCTCGG ACGCCTTGCC CAGCGGGCCG CCCGACCCCC TGCACCATGG 60 

ACCCCGCTCG CCCCCTGGGG CTGTCGATTC TGCTGCTTTT CCTGACGGAG GCTGCACTGG 120 

GCGATGCTGC TCAGGAGCCA ACAGGAAATA ACGCGGAGAT CTGTCTCCTG CCCCTAGACT 180 

ACGGACCCTG CCGGGCCCTA CTTCTCCGTT ACTACTACGA CAGGTACACG CAGAGCTGCC 240 

GCCAGTTCCT GTACGGGGGC TGCGAGGGCA ACGCCAACAA TTTCTACACC TGGGAGGCTT 300 

GCGACGATGC TTGCTGGAGG ATAGAAAAAG TTCCCAAAGT TTGCCGGCTG CAAGTGAGTG 360 

TGGACGACCA GTGTGAGGGG TCCACAGAAA AGTATTTCTT TAATCTAAGT TCCATGACAT 420 

GTGAAAAATT CTTTTCCGGT GGGTGTCACC GGAACCGGAT TGAGAACAGG TTTCCAGATG 480 

AAGCTACTTG TATGGGCTTC TGCGCACCAA AGAAAATTCC ATCATTTTGC TACAGTCCAA 540 

AAGATGAGGG ACTGTGCTCT GCCAATGTGA CTCGCTATTA TTTTAATCCA AGATACAGAA 600 

CCTGTGATGC TTTCACCTAT ACTGGCTGTG GAGGGAATGA CAATAACTTT GTTAGCAGGG 660 



AGGATTGCAA ACGTGCATGT GCAAAAGCTT TGAAAAAGAA AAAGAAGATG CCAAAGCTTC 720

GCTTTGCCAG TAGAATCCGG AAAATTCGGA AGAAGCAATT TTAAACATTC TTAATATGTC 780 

ATCTTGTTTG TCTTTATGGC TTATTTGCCT TTATGGTTGT ATCTGAAGAA TAATATGACA 840 

GCATGAGGAA ACAAATCATT GGTGATTTAT TCACCAGTTT TTATTAATAC AAGTCACTTT 900 

TTCAAAAATT TGGATTTTTT TATATATAAC TAGCTGCTAT TCAAATGTGA GTCTACCATT 960 

TTTAATTTAT GGTTCAACTG TTTGTGAGAC GAATTCTTGC AATGCATAAG ATATAAAAGC 1020 

AAATATGACT CACTCATTTC TTGGGGTCGT ATTCCTGATT TCAGAAGAGG ATCATAACTG 1080 

AAACAACATA AGACAATATA ATCATGTGCT TTTAACATAT TTGAGAATAA AAAGGACTAG 1140 
CC 



Seq ID NO: 397 Protein sequence 
Protein Accession #: NP_006519

1 11 21 31 41 51 

I I I I I I 

MDPARPLGLS ILLLFLTEAA LGDAAQEPTG NNAEICLLPL DYGPCRALLL RYYYDRYTQS 60 

CRQFLYGGCE GNANNFYTWE ACDDACWRIE KVPKVCRLQV SVDDQCEGST EKYFFNLSSM 120

TCEKFFSGGC HRNRIENRFP DEATCMGFCA PKKIPSFCYS PKDEGLCSAN VTRYYFNPRY 180 
RTCDAFTYTG CGGNDNNFVS REDCKRACAK ALKKKKKMPK LRFASRIRKI RKKQF 

Seq ID NO: 398 DNA sequence 

Nucleic Acid Accession #: NM_001508.1 

Coding sequence: 1..1361 

1 11 21 31 41 51 

I I I I I I

ATGGCTTCAC CCAGCCTCCC GGGCAGTGAC TGCTCCCAAA TCATTGATCA CAGTCATGTC 60 

CCCGAGTTTG AGGTGGCCAC CTGGATCAAA ATCACCCTTA TTCTGGTGTA CCTGATCATC 120

TTCGTGATGG GCCTTCTGGG GAACAGCGTC ACCATTCGGG TCACCCAGGT GCTGCAGAAG 180 

AAAGGATACT TGCAGAAGGA GGTGACAGAC CACATGGTGA GTTTGGCTTG CTCGGACATC 240 

TTGGTGTTCC TCATCGGCAT GCCCATGGAG TTCTACAGCA TCATCTGGAA TCCCCTGACC 300 

ACGTCCAGCT ACACCCTGTC CTGCAAGCTG CACACTTTCC TCTTCGAGGC CTGCAGCTAC 360 

GCTACGCTGC TGCACGTGCT GACGCTCAGC TTTGAGCGCT ACATCGCCAT CTGTCACCCC 420 

TTCAGGTACA AGGCTGTGTC GGGACCTTGC CAGGTGAAGC TGCTGATTGG CTTCGTCTGG 480 

GTCACCTCCG CCCTGGTGGC ACTGCCCTTG CTGTTTGCCA TGGGTACTGA GTACCCCCTG 540 

GTGAACGTGC CCAGCCACCG GGGTCTCACT TGCAACCGCT CCAGCACCCG CCACCACGAG 600 

CAGCCCGAGA CCTCCAATAT GTCCATCTGT ACCAACCTCT CCAGCCGCTG GACCGTGTTC 660 

CAGTCCAGCA TCTTCGGCGC CTTCGTGGTC TACCTCGTGG TCCTGCTCTC CGTAGCCTTC 720 

ATGTGCTGGA ACATGATGCA GGTGCTCATG AAAAGCCAGA AGGGCTCGCT GGCCGGGGGC 780 

ACGCGGCCTC CGCAGCTGAG GAAGTCCGAG AGCGAAGAGA GCAGGACCGC CAGGAGGCAG 840 

ACCATCATCT TCCTGAGGCT GATTGTTGTG ACATTGGCCG TATGCTGGAT GCCCAACCAG 900 

ATTCGGAGGA TCATGGCTGC GGCCAAACCC AAGCACGACT GGACGAGGTC CTACTTCCGG 960 

GCGTACATGA TCCTCCTCCC CTTCTCGGAG ACGTTTTTCT ACCTCAGCTC GGTCATCAAC 1020 

CCGCTCCTGT ACACGGTGTC CTCGCAGCAG TTTCGGCGGG TGTTCGTGCA GGTGCTGTGC 1080 

TGCCGCCTGT CGCTGCAGCA CGCCAACCAC GAGAAGCGCC TGCGCGTACA TGCGCACTCC 1140 

ACCACCGACA GCGCCCGCTT TGTGCAGCGC CCGTTGCTCT TCGCGTCCCG GCGCCAGTCC 1200 

TCTGCAAGGA GAACTGAGAA GATTTTCTTA AGCACTTTTC AGAGCGAGGC CGAGCCCCAG 1260 

TCTAAGTCCC AGTCATTGAG TCTCGAGTCA CTAGAGCCCA ACTCAGGCGC GAAACCAGCC 1320 
AATTCTGCTG CAGAGAATGG TTTTCAGGAG CATGAAGTTT GA 



Seq ID NO: 399 Protein sequence 
Protein Accession #: NP_001499.1 



1 11 21 31 41 51 

I I I I I I

MASPSLPGSD CSQIIDHSHV PEFEVATWIK ITLILVYLII FVMGLLGNSV TIRVTQVLQK 60

KGYLQKEVTD HMVSLACSDI LVFLIGMPME FYSIIWNPLT TSSYTLSCKL HTFLFEACSY 120 

ATLLHVLTLS FERYIAICHP FRYKAVSGPC QVKLLIGFVW VTSALVALPL LFAMGTEYPL 180 

VNVPSHRGLT CNRSSTRHHE QPETSNMSIC TNLSSRWTVF QSSIFGAFVV YLVVLLSVAF 240

MCWNMMQVLM KSQKGSLAGG TRPPQLRKSE SEESRTARRQ TIIFLRLIVV TLAVCWMPNQ 300

IRRIMAAAKP KHDWTRSYFR AYMILLPFSE TFFYLSSVIN PLLYTVSSQQ FRRVFVQVLC 360 

CRLSLQHANH EKRLRVHAHS TTDSARFVQR PLLFASRRQS SARRTEKIFL STFQSEAEPQ 420 
SKSQSLSLES LEPNSGAKPA NSAAENGFQE HEV 



Seq ID NO: 400 DNA sequence 

Nucleic Acid Accession #: NM_006475.1 

Coding sequence: 28..2538



1 11 21 31 41 51 

I I I I I I 

AACAGAACTG CAACGGAGAG ACTCAAGATG ATTCCCTTTT TACCCATGTT TTCTCTACTA 60 

TTGCTGCTTA TTGTTAACCC TATAAACGCC AACAATCATT ATGACAAGAT CTTGGCTCAT 120 

AGTCGTATCA GGGGTCGGGA CCAAGGCCCA AATGTCTGTG CCCTTCAACA GATTTTGGGC 180 

ACCAAAAAGA AATACTTCAG CACTTGTAAG AACTGGTATA AAAAGTCCAT CTGTGGACAG 240 

AAAACGACTG TTTTATATGA ATGTTGCCCT GGTTATATGA GAATGGAAGG AATGAAAGGC 300 

TGCCCAGCAG TTTTGCCCAT TGACCATGTT TATGGCACTC TGGGCATCGT GGGAGCCACC 360 

ACAACGCAGC GCTATTCTGA CGCCTCAAAA CTGAGGGAGG AGATCGAGGG AAAGGGATCC 420 

TTCACTTACT TTGCACCGAG TAATGAGGCT TGGGACAACT TGGATTCTGA TATCCGTAGA 480 

GGTTTGGAGA GCAACGTGAA TGTTGAATTA CTGAATGCTT TACATAGTCA CATGATTAAT 540 

AAGAGAATGT TGACCAAGGA CTTAAAAAAT GGCATGATTA TTCCTTCAAT GTATAACAAT 600 

TTGGGGCTTT TCATTAACCA TTATCCTAAT GGGGTTGTCA CTGTTAATTG TGCTCGAATC 660 

ATCCATGGGA ACCAGATTGC AACAAATGGT GTTGTCCATG TCATTGACCG TGTGCTTACA 720

CAAATTGGTA CCTCAATTCA AGACTTCATT GAAGCAGAAG ATGACCTTTC ATCTTTTAGA 780 

GCAGCTGCCA TCACATCGGA CATATTGGAG GCCCTTGGAA GAGACGGTCA CTTCACACTC 840 

TTTGCTCCCA CCAATGAGGC TTTTGAGAAA CTTCCACGAG GTGTCCTAGA AAGGTTCATG 900 

GGAGACAAAG TGGCTTCCGA AGCTCTTATG AAGTACCACA TCTTAAATAC TCTCCAGTGT 960 

TCTGAGTCTA TTATGGGAGG AGCAGTCTTT GAGACGCTGG AAGGAAATAC AATTGAGATA 1020 
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GGATGTGACG GTGACAGTAT AACAGTAAAT GGAATCAAAA TGGTGAACAA AAAGGATATT 1080 

GTGACAAATA ATGGTGTGAT CCATTTGATT GATCAGGTCC TAATTCCTGA TTCTGCCAAA 1140 

CAAGTTATTG AGCTGGCTGG AAAACAGCAA ACCACCTTCA CGGATCTTGT GGCCCAATTA 1200 

GGCTTGGCAT CTGCTCTGAG GCCAGATGGA GAATACACTT TGCTGGCACC TGTGAATAAT 1260 

GCATTTTCTG ATGATACTCT CAGCATGGTT CAGCGCCTCC TTAAATTAAT TCTGCAGAAT 1320 

CACATATTGA AAGTAAAAGT TGGCCTTAAT GAGCTTTACA ACGGGCAAAT ACTGGAAACC 1380 

ATCGGAGGCA AACAGCTCAG AGTCTTCGTA TATCGTACAG CTGTCTGCAT TGAAAATTCA 1440 

TGCATGGAGA AAGGGAGTAA GCAAGGGAGA AACGGTGCGA TTCACATATT CCGCGAGATC 1500 

ATCAAGCCAG CAGAGAAATC CCTCCATGAA AAGTTAAAAC AAGATAAGCG CTTTAGCACC 1560 

TTCCTCAGCC TACTTGAAGC TGCAGACTTG AAAGAGCTCC TGACACAACC TGGAGACTGG 1620 

ACATTATTTG TGCCAACCAA TGATGCTTTT AAGGGAATGA CTAGTGAAGA AAAAGAAATT 1680 

CTGATACGGG ACAAAAATGC TCTTCAAAAC ATCATTCTTT ATCACCTGAC ACCAGGAGTT 1740 

TTCATTGGAA AAGGATTTGA ACCTGGTGTT ACTAACATTT TAAAGACCAC ACAAGGAAGC 1800 

AAAATCTTTC TGAAAGAAGT AAATGATACA CTTCTGGTGA ATGAATTGAA ATCAAAAGAA 1860 

TCTGACATCA TGACAACAAA TGGTGTAATT CATGTTGTAG ATAAACTCCT CTATCCAGCA 1920 

GACACACCTG TTGGAAATGA TCAACTGCTG GAAATACTTA ATAAATTAAT CAAATACATC 1980 

CAAATTAAGT TTGTTCGTGG TAGCACCTTC AAAGAAATCC CCGTGACTGT CTATACAACT 2040 

AAAATTATAA CCAAAGTTGT GGAACCAAAA ATTAAAGTGA TTGAAGGCAG TCTTCAGCCT 2100 

ATTATCAAAA CTGAAGGACC CACACTAACA AAAGTCAAAA TTGAAGGTGA ACCTGAATTC 2160 

AGACTGATTA AAGAAGGTGA AACAATAACT GAAGTGATCC ATGGAGAGCC AATTATTAAA 2220 

AAATACACCA AAATCATTGA TGGAGTGCCT GTGGAAATAA CTGAAAAAGA GACACGAGAA 2280

GAACGAATCA TTACAGGTCC TGAAATAAAA TACACTAGGA TTTCTACTGG AGGTGGAGAA 2340 

ACAGAAGAAA CTCTGAAGAA ATTGTTACAA GAAGAGGTCA CCAAGGTCAC CAAATTCATT 2400 

GAAGGTGGTG ATGGTCATTT ATTTGAAGAT GAAGAAATTA AAAGACTGCT TCAGGGAGAC 2460 

ACACCCGTGA GGAAGTTGCA AGCCAACAAA AAAGTTCAAG GTTCTAGAAG ACGATTAAGG 2520 

GAAGGTCGTT CTCAGTGAAA ATCCAAAAAC CAGAAAAAAA TGTTTATACA ACCCTAAGTC 2580 

AATAACCTGA CCTTAGAAAA TTGTGAGAGC CAAGTTGACT TCAGGAACTG AAACATCAGC 2640 

ACAAAGAAGC AATCATCAAA TAATTCTGAA CACAAATTTA ATATTTTTTT TTCTGAATGA 2700 

GAAACATGAG GGAAATTGTG GAGTTAGCCT CCTGTGGTAA AGGAATTGAA GAAAATATAA 2760 

CACCTTACAC CCTTTTTCAT CTTGACATTA AAAGTTCTGG CTAACTTTGG AATCCATTAG 2820 

AGAAAAATCC TTGTCACCAG ATTCATTACA ATTCAAATCG AAGAGTTGTG AACTGTTATC 2880 

CCATTGAAAA GACCGAGCCT TGTATGTATG TTATGGATAC ATAAAATGCA CGCAAGCCAT 2940 

TATCTCTCCA TGGGAAGCTA AGTTATAAAA ATAGGTGCTT GGTGTACAAA ACTTTTTATA 3000 

TCAAAAGGCT TTGCACATTT CTATATGAGT GGGTTTACTG GTAAATTATG TTATTTTTTA 3060 

CAACTAATTT TGTACTCTCA GAATGTTTGT CATATGCTTC TTGCAATGCA TATTTTTTAA 3120 

TCTCAAACGT TTCAATAAAA CCATTTTTCA GATATAAAGA GAATTACTTC AAATTGAGTA 3180 
ATTCAGAAAA ACTCAAGATT TAAGTTAAAA AGTGGTTTGG ACTTGGGAA 

Seq ID NO: 401 Protein sequence 
Protein Accession #: NP_006466.1 

1 11 21 31 41 51 

I I I I I I 

MIPFLPMFSL LLLLIVNPIN ANNHYDKILA HSRIRGRDQG PNVCALQQIL GTKKKYFSTC 60 

KNWYKKSICG QKTTVLYECC PGYMRMEGMK GCPAVLPIDH VYGTLGIVGA TTTQRYSDAS 120 

KLREEIEGKG SFTYFAPSNE AWDNLDSDIR RGLESNVNVE LLNALHSHMI NKRMLTKDLK 180

NGMIIPSMYN NLGLFINHYP NGVVTVNCAR IIHGNQIATN GVVHVIDRVL TQIGTSIQDF 240

IEAEDDLSSF RAAAITSDIL EALGRDGHFT LFAPTNEAFE KLPRGVLERF MGDKVASEAL 300 

MKYHILNTLQ CSESIMGGAV FETLEGNTIE IGCDGDSITV NGIKMVNKKD IVTNNGVIHL 360 

IDQVLIPDSA KQVIELAGKQ QTTFTDLVAQ LGLASALRPD GEYTLLAPVN NAFSDDTLSM 420 

VQRLLKLILQ NHILKVKVGL NELYNGQILE TIGGKQLRVF VYRTAVCIEN SCMEKGSKQG 480

RNGAIHIFRE IIKPAEKSLH EKLKQDKRFS TFLSLLEAAD LKELLTQPGD WTLFVPTNDA 540

FKGMTSEEKE ILIRDKNALQ NIILYHLTPG VFIGKGFEPG VTNILKTTQG SKIFLKEVND 600

TLLVNELKSK ESDIMTTNGV IHVVDKLLYP ADTPVGNDQL LEILNKLIKY IQIKFVRGST 660

FKEIPVTVYT TKIITKVVEP KIKVIEGSLQ PIIKTEGPTL TKVKIEGEPE FRLIKEGETI 720

TEVIHGEPII KKYTKIIDGV PVEITEKETR EERIITGPEI KYTRISTGGG ETEETLKKLL 780 
QEEVTKVTKF IEGGDGHLFE DEEIKRLLQG DTPVRKLQAN KKVQGSRRRL REGRSQ 

Seq ID NO: 402 DNA sequence 

Nucleic Acid Accession #: NM_002416 

Coding sequence: 40..417

1 11 21 31 41 51 

I I I I I I

ATCCAATACA GGAGTGACTT GGAACTCCAT TCTATCACTA TGAAGAAAAG TGGTGTTCTT 60 

TTCCTCTTGG GCATCATCTT GCTGGTTCTG ATTGGAGTGC AAGGAACCCC AGTAGTGAGA 120 

AAGGGTCGCT GTTCCTGCAT CAGCACCAAC CAAGGGACTA TCCACCTACA ATCCTTGAAA 180 

GACCTTAAAC AATTTGCCCC AAGCCCTTCC TGCGAGAAAA TTGAAATCAT TGCTACACTG 240 

AAGAATGGAG TTCAAACATG TCTAAACCCA GATTCAGCAG ATGTGAAGGA ACTGATTAAA 300 

AAGTGGGAGA AACAGGTCAG CCAAAAGAAA AAGCAAAAGA ATGGGAAAAA ACATCAAAAA 360 

AAGAAAGTTC TGAAAGTTCG AAAATCTCAA CGTTCTCGTC AAAAGAAGAC TACATAAGAG 420 

ACCACTTCAC CAATAAGTAT TCTGTGTTAA AAATGTTCTA TTTTAATTAT ACCGCTATCA 480 

TTCCAAAGGA GGATGGCATA TAATACAAAG GCTTATTAAT TTGACTAGAA AATTTAAAAC 540 

ATTACTCTGA AATTGTAACT AAAGTTAGAA AGTTGATTTT AAGAATCCAA ACGTTAAGAA 600 

TTGTTAAAGG CTATGATTGT CTTTGTTCTT CTACCACCCA CCAGTTGAAT TTCATCATGC 660 

TTAAGGCCAT GATTTTAGCA ATACCCATGT CTACACAGAT GTTCACCCAA CCACATCCCA 720 

CTCACAACAG CTGCCTGGAA GAGCAGCCCT AGGCTTCCAC GTACTGCAGC CTCCAGAGAG 780 

TATCTGAGGC ACATGTCAGC AAGTCCTAAG CCTGTTAGCA TGCTGGTGAG CCAAGCAGTT 840 

TGAAATTGAG CTGGACCTCA CCAAGCTGCT GTGGCCATCA ACCTCTGTAT TTGAATCAGC 900 

CTACAGGCCT CACACACAAT GTGTCTGAGA GATTCATGCT GATTGTTATT GGGTATCACC 960 

ACTGGAGATC ACCAGTGTGT GGCTTTCAGA GCCTCCTTTC TGGCTTTGGA AGCCATGTGA 1020 

TTCCATCTTG CCCGCTCAGG CTGACCACTT TATTTCTTTT TGTTCCCCTT TGCTTCATTC 1080 

AAGTCAGCTC TTCTCCATCC TACCACAATG CAGTGCCTTT CTTCTCTCCA GTGCACCTGT 1140 

CATATGCTCT GATTTATCTG AGTCAACTCC TTTCTCATCT TGTCCCCAAC ACCCCACAGA 1200 

AGTGCTTTCT TCTCCCAATT CATCCTCACT CAGTCCAGCT TAGTTCAAGT CCTGCCTCTT 1260 

AAATAAACCT TTTTGGACAC ACAAATTATC TTAAAACTCC TGTTTCACTT GGTTCAGTAC 1320 

CACATGGGTG AACACTCAAT GGTTAACTAA TTCTTGGGTG TTTATCCTAT CTCTCCAACC 1380 
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AGATTGTCAG CTCCTTGAGG GCAAGAGCCA 
CTAATAATAC TGTGGAACTA GGTTTTAATA 
TGGCAACCAG ACCATTGTCT CAGAGCAGGT 
CTAGCCTCTG GTAACCTCTT ACTTATTATC 
GATGCAACAT CCTTGTCTTT TTATGACAGG 
GCACGTGGTA AAACACTTGC GGATATTCTG 
AAAATCATAT AATCTTACAA TGAAAAGGAC 
CCAACCATAC AAAAATTCCT TTTCCCGAAG 
TCTAAGATCT AACAAGATAG CCACCGAGAT 
AGTTTTATTG TCCGTTTACT TGTTTCAGAG 
TCTCCCATGA AGAAAGGGAA CGGTGAAGTA 
TAGTGGAAGC ATGATTGGTG CCCAGTTAGC 
GGAGGTTCAG TGAATTGTGT AGGAGAGGTT 
CTTTCCCAAA TTGAATCACT GCTCACACTG 
TCCCACCCGA ACGTCTTATC TAATCATGAA 
AAAAATCTAA GTGTTTCATA AATTTGAGAG 
GTAGACAGTA TATAACTAAC AACCAAAGAC 
TCATTTATCA TATATATACA TACATGCATA 
AAACAGTATT GACTTGTATA CCTTGTAATT 
TATCAATAAA TAGACCATTA ATCAG 

Seq ID NO: 403 Protein sequence 
Protein Accession ft: NP 002407 



PCT/US02/12476 



CAGTATATTT 
ATTTTTTAAT 
GCTGGCTCTT 
TTCAGGACAC 
ATGTTTGCTC 
GACTGTTTTT 
TTTATAGATC 
GAAAAGGGCT 
CCTTATCGAA 
TTTGTATTGT 
CTAAGCGCTA 
CTCTGCAGGA 
GTCTGTGGCC 
CTGATGATTT 
ACTCCCTAGT 
TCTGTGACCC 
TACATATTGT 
CACTCTCAAA 
TGAAATATTT 



CCCTGTTTCT 
TGATGTTGTT 
TCCTGGCTAC 
TCACTACAGG 
AGCTTCTCCA 
AAAAAATATA 
AGCCAGTGAC 
TTCTCAATAA 
ACTCATTTTA 
GATTATCAAT 
GAGGAAGCAG 
TGTGGAAACC 
AGAATTTAAA 
AGAGTGCTGT 
TCCTTCATGT 
ACTTACCTTG 
CACTGACACA 
GCAAATAATT 
TCTTTGTTAA 



TCCACAGTGC 
ATGGGCAGGA 
TCCATGTTGG 
GACCAGGGAT 
ACAATAAGAA 
CAGTTTACCG 
CAACCTTTTC 
GCCTCAGCTT 
GGCAAATATG 
TACCACACCA 
CCAAGTCGGT 
TCCTTCCAGG 
CCTATACTCA 
CCGGTGGAGA 
AACTTCCCTG 
CATCTCACAG 
CACGTTATAA 
TTTCACTTCA 
AATAGAATGG 



11 



21 



31 



41 



I I I I I I 1 

MKKSGVLFLL GIILLVLIGV QGTPWRKGR CSCISTNQGT IHLQSLKDLK QFAPSPSCEK 
IEIIATLKNG VQTCLNPDSA DVKELIKKWE KQVSQKKKQK NGKKHQKKKV LKVRKSQRSR 
QKKTT 

Seq ID NO: 404 DNA sequence 

Nucleic Acid Accession ft: NM_006670 

Coding sequence: 85.. 1347 



CCGGCTCGCG 
AGCTCCGGGG 
GACGGGCGTC 
TCTCCCACCT 
TCCGCCCAGC 
CGCACAGTCA 
GTGCGCAACC 
CGCCGGCCGC 
GTGCGCGCGG 
CCACTGGCCG 
AGTCCCCTTG 
CGGAGCTTCG 
CGCCGCTTGG 
CTGCCCAGCC 
TCCTTCCGCA 
CTTCACAATG 
AACAATCCCT 
GAGGTAGTGC 
GTCCTCTTGG 
CAAACCTCTT 
GTTTTGTATT 
AGGGATCACA 
AACCTCAGTT 
CATGAGATGT 
TAGATACAAC 
TTTCTCGGTG 
TTCTTTTTCT 
TGGGCTTCTT 
ACAGATAGCA 
TATCAGTTTT 
CTGCAGACGT 
AGAGCATGCT 
TTCTTTGACA 
TTTTAATAAA 
ATTCTTAAAA 



11 
I 

CCCTCCGGGC 
AAACGCGAGC 
TGCGGCTGGC 
CCTCGGCATC 
CCCCGCTGCC 
AGTGCGTTAA 
TCTTCCTTAC 
CGCTGGCGGA 
GCGCCTTCGA 
ACCTCAGTCC 
TGGAACTGAT 
AGGGCATGGT 
AGCTGGCCAG 
TCAGGCACCT 
ACCTGACACA 
GCACCCTGGC 
GGGTCTGCGA 
AGGGCAAAGA 
AACTCAACAG 
ATGTCTTCCT 
TGAACCGCAA 
TGGAAGGGTA 
CTAACTCGGA 
AGACTTAAGC 
GGACTTTGAC 
TGTTCTGTTA 
TGGAACTCCT 
GCTGTCTGTC 
TTCAACAAAA 
ATTCTCATGT 
TAGCAGGCTC 
TACATTTTAC 
AAGTAAATTA 
CTGCATCGAG 
GAA 



21 
I 

CCAGCCTCCC 
CGCGATGCCT 
GCGACTAGCG 
CTCCTTCTCC 
GGACCAGTGC 
CCGCAATCTG 
CGGCAACCAG 
GCTGGCCGCG 
GCATCTGCCC 
CTTCGCTTTC 
CCTGAACCAC 
GGTGGCGGCC 
CAACCACTTC 
GGACTTAAGT 
TCTAGAAAGC 
TGAGTTGCAA 
CTGCCACATG 
CCGGCTCACC 
TGCTGACCTG 
GGGTATTGTT 
GGGGATAAAA 
TCATTACAGA 
TGTCTGAGAA 
TTTATCCCTA 
TAAAAGCAGT 
ATGTAAGACG 
CAACACGTAT 
TCTCTCTCAG 
GCTGCCTCAA 
ACCTAAGTTG 
TTCAAAATAA 
TGTTCTGCAT 
CTTTTTTGAT 
ATCCAACCGA 



31 

I 

GAGCCTTCGG 
GGGGGGTGCT 
CTGGTACTCC 
TCCTCGGCGC 
CCCGCGCTGT 
ACCGAGGTGC 
CTGGCCGTGC 
CTCAACCTCA 
AGCCTGCGCC 
TCGGGCAGCA 
ATCGTGCCCC 
CTGCTGGCGG 
CTTTACCTGC 
AATAATTCGC 
CTCCACCTGG 
GGTCTACCCC 
GCAGACATGG 
TGTGCATATC 
GACTGTGACC 
TTAGCCCTGA 
AAGTGGATGC 
TATGAAATCA 
ATATTAGAGG 
CTAGGCTTGC 
GAAGGGGATT 
ATGAACAGTT 
GGAGGGATTT 
TACAGTTCAA 
CTTTTTCGAG 
TGGAGAAAAT 
CTCCATGGTG 
ATTACAAAAA 
TGCAGTTTAT 
CTGAATTGTT 



41 
I 

AGCGGGCGCC 
CCCGGGGCCC 
TGGGCTGGGT 
CGTTCCTGGC 
GCGAGTGCTC 
CCACGGACCT 
TCCCTGCCGG 
GCGGCAGCCG 
AGCTCGACCT 
ATGCCAGCGT 
CTGAAGATGA 
GCCGTGCACT 
CGCGGGATGT 
TGGTGAGCCT 
AGGACAATGC 
ACATTAGGGT 
TGACCTGGCT 
CGGAAAAAAT 
CGATTCTTCC 
TAGGCGCTAT 
ATAACATCAG 
ATGOGGACCC 
ACAGACCAAG 
TCCACTTTCA 
TGCTTCCTTG 
GTGTATAGTG 
TTCAGGTTTC 
GGTGTAGCAA 
AAAAATACTT 
AATTGCATCC 
CACAGGAGCA 
ATAACTTGCA 
ATGAAAATGT 
AAAAAAAAAA 



51 
I 

GTCCCAGCCC 
CGCCGCCGGG 
CTCCTCGTCT 
TTCCGCCGTG 
CGAGGCAGCG 
GCCCGCCTAC 
CGCCTTCGCC 
CCTGGACGAG 
CAGCCACAAC 
CTCGGCCCCC 
GCGGCAGAAC 
GCAGGGGCTC 
GCTGGCCCAA 
GACCTACGTG 
CCTCAAGGTC 
TTTCCTGGAC 
CAAGGAAACA 
GAGGAATCGG 
CCCATCCCTG 
TTTCCTCCTG 
AGATGCCTGC 
CAGATTAACA 
GACAACTCTG 
TCCTCCACTA 
TTATGTAAAG 
TTTTACCCTC 
AGCATGAACA 
GTGTACCCAC 
TATTCATAAA 
TATAAACTGC 
CCTGCATCCA 
ACTTCATAAC 
ACTGATTTTT 
AAAAATAAAG 



Seq ID NO: 405 Protein sequence 
Protein Accession #: NP_006661 



1 
I 

MPGGCSRGPA 
QCPALCECSE 
AALNLSGSRL 
NHIVPPEDER 
LSNNSLVSLT 
HMADMVTWLK 
IVLALIGAIF 



11 
I 

AGDGRLRLAR 
AARTVKCVNR 
DEVRAGAFEH 
QNRSFEGMW 
YVSFRNLTHL 
ETEWQGKDR 
LLVLYLNRKG 



21 

I 

LALVLLGWVS 
NLTEVPTDLP 
LPSLRQLDLS 
AALLAGRALQ 
ESLHLEDNAL 
LTCAYPEKMR 
IKKWMHNIRD 



31 

I 

SSSPTSSASS 
AYVRNLFLTG 
HNPLADLSPF 
GLRRLELASN 
KVLHNGTLAE 
NRVLLELNSA 
ACRDHMEGYH 



41 
I 

FSSSAPFLAS 
NQLAVLPAGA 
AFSGSNASVS 
HFLYLPRDVL 
LQGLPHIRVP 
DLDCDPILPP 
YRYEINADPR 



51 
I 

AVSAQPPLPD 
FARRPPLiAEL 
APSPLVELIL 
AQLPSLRHLD 
LDNNPWVCDC 
SLQTSYVFLG 
LTNLSSNSDV 



1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 



60 
120 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 



60 
120 
180 
240 
300 
360 



Seq ID NO: 406 DNA sequence 

Nucleic Acid Accession #: Eos sequence

Coding sequence: 1..927

1 11 21 31 41 51 

I I I I I I

ATGCCTGGGG GGTGCTCCCG GGGCCCCGCC GCCGGGGACG GGCGTCTGCG GCTGGCGCGA 60

CTAGCGCTGG TACTCCTGGG CTGGGTCTCC TCGTCTTCTC CCACCTCCTC GGCATCCTCC 120 

TTCTCCTCCT CGGCGCCGTT CCTGGCTTCC GCCGTGTCCG CCCAGCCCCC GCTGCCGGAC 180 

CAGTGCCCCG CGCTGTGCGA GTGCTCCGAG GCAGCGCGCA CAGTCAAGTG CGTTAACCGC 240 

AATCTGACCG AGGTGCCCAC GGACCTGCCC GCCTACGTGC GCAACCTCTT CCTTACCGGC 300 

AACCAGCTGG CCAGCAACCA CTTCCTTTAC CTGCCGCGGG ATGTGCTGGC CCAACTGCCC 360

AGCCTCAGGC ACCTGGACTT AAGTAATAAT TCGCTGGTGA GCCTGACCTA CGTGTCCTTC 420 

CGCAACCTGA CACATCTAGA AAGCCTCCAC CTGGAGGACA ATGCCCTCAA GGTCCTTCAC 480 

AATGGCACCC TGGCTGAGTT GCAAGGTCTA CCCCACATTA GGGTTTTCCT GGACAACAAT 540 

CCCTGGGTCT GCGACTGCCA CATGGCAGAC ATGGTGACCT GGCTCAAGGA AACAGAGGTA 600 

GTGCAGGGCA AAGACCGGCT CACCTGTGCA TATCCGGAAA AAATGAGGAA TCGGGTCCTC 660

TTGGAACTCA ACAGTGCTGA CCTGGACTGT GACCCGATTC TTCCCCCATC CCTGCAAACC 720 

TCTTATGTCT TCCTGGGTAT TGTTTTAGCC CTGATAGGCG CTATTTTCCT CCTGGTTTTG 780 

TATTTGAACC GCAAGGGGAT AAAAAAGTGG ATGCATAACA TCAGAGATGC CTGCAGGGAT 840 

CACATGGAAG GGTATCATTA CAGATATGAA ATCAATGCGG ACCCCAGATT AACAAACCTC 900 

AGTTCTAACT CGGATGTCCT CGAGTGA

Seq ID NO: 407 Protein sequence 
Protein Accession #: Eos sequence

1 11 21 31 41 51

I I I I I I 

MPGGCSRGPA AGDGRLRLAR LALVLLGWVS SSSPTSSASS FSSSAPFLAS AVSAQPPLPD 60 

QCPALCECSE AARTVKCVNR NLTEVPTDLP AYVRNLFLTG NQLASNHFLY LPRDVLAQLP 120 

SLRHLDLSNN SLVSLTYVSF RNLTHLESLH LEDNALKVLH NGTLAELQGL PHIRVFLDNN 180 

PWVCDCHMAD MVTWLKETEV VQGKDRLTCA YPEKMRNRVL LELNSADLDC DPILPPSLQT 240

SYVFLGIVLA LIGAIFLLVL YLNRKGIKKW MHNIRDACRD HMEGYHYRYE INADPRLTNL 300 
SSNSDVLE 

Seq ID NO: 408 DNA sequence 
Nucleic Acid Accession #: NM_000095.1
Coding sequence: 26..2299

1 11 21 31 41 51 

I I I I I I

CAGCACCCAG CTCCCCGCCA CCGCCATGGT CCCCGACACC GCCTGCGTTC TTCTGCTCAC 60

CCTGGCTGCC CTCGGCGCGT CCGGACAGGG CCAGAGCCCG TTGGGCTCAG ACCTGGGCCC 120 

GCAGATGCTT CGGGAACTGC AGGAAACCAA CGCGGCGCTG CAGGACGTGC GGGACTGGCT 180 

GCGGCAGCAG GTCAGGGAGA TCACGTTCCT GAAAAACACG GTGATGGAGT GTGACGCGTG 240 

CGGGATGCAG CAGTCAGTAC GCACCGGCCT ACCCAGCGTG CGGCCCCTGC TCCACTGCGC 300 

GCCCGGCTTC TGCTTCCCCG GCGTGGCCTG CATCCAGACG GAGAGCGGCG GCCGCTGCGG 360

CCCCTGCCCC GCGGGCTTCA CGGGCAACGG CTCGCACTGC ACCGACGTCA ACGAGTGCAA 420 

CGCCCACCCC TGCTTCCCCC GAGTCCGCTG TATCAACACC AGCCCGGGGT TCCGCTGCGA 480 

GGCTTGCCCG CCGGGGTACA GCGGCCCCAC CCACCAGGGC GTGGGGCTGG CTTTCGCCAA 540 

GGCCAACAAG CAGGTTTGCA CGGACATCAA CGAGTGTGAG ACCGGGCAAC ATAACTGCGT 600 

CCCCAACTCC GTGTGCATCA ACACCCGGGG CTCCTTCCAG TGCGGCCCGT GCCAGCCCGG 660

CTTCGTGGGC GACCAGGCGT CCGGCTGCCA GCGCGGCGCA CAGCGCTTCT GCCCCGACGG 720 

CTCGCCCAGC GAGTGCCACG AGCATGCAGA CTGCGTCCTA GAGCGCGATG GCTCGCGGTC 780 

GTGCGTGTGT CGCGTTGGCT GGGCCGGCAA CGGGATCCTC TGTGGTCGCG ACACTGACCT 840 

AGACGGCTTC CCGGACGAGA AGCTGCGCTG CCCGGAGCCG CAGTGCCGTA AGGACAACTG 900 

CGTGACTGTG CCCAACTCAG GGCAGGAGGA TGTGGACCGC GATGGCATCG GAGACGCCTG 960

CGATCCGGAT GCCGACGGGG ACGGGGTCCC CAATGAAAAG GACAACTGCC CGCTGGTGCG 1020 

GAACCCAGAC CAGCGCAACA CGGACGAGGA CAAGTGGGGC GATGCGTGCG ACAACTGCCG 1080 

GTCCCAGAAG AACGACGACC AAAAGGACAC AGACCAGGAC GGCCGGGGCG ATGCGTGCGA 1140 

CGACGACATC GACGGCGACC GGATCCGCAA CCAGGCCGAC AACTGCCCTA GGGTACCCAA 1200 

CTCAGACCAG AAGGACAGTG ATGGCGATGG TATAGGGGAT GCCTGTGACA ACTGTCCCCA 1260

GAAGAGCAAC CCGGATCAGG CGGATGTGGA CCACGACTTT GTGGGAGATG CTTGTGACAG 1320 

CGATCAAGAC CAGGATGGAG ACGGACATCA GGACTCTCGG GACAACTGTC CCACGGTGCC 1380 

TAACAGTGCC CAGGAGGACT CAGACCACGA TGGCCAGGGT GATGCCTGCG ACGACGACGA 1440 

CGACAATGAC GGAGTCCCTG ACAGTCGGGA CAACTGCCGC CTGGTGCCTA ACCCCGGCCA 1500 

GGAGGACGCG GACAGGGACG GCGTGGGCGA CGTGTGCCAG GACGACTTTG ATGCAGACAA 1560

GGTGGTAGAC AAGATCGACG TGTGTCCGGA GAACGCTGAA GTCACGCTCA CCGACTTCAG 1620 

GGCCTTCCAG ACAGTCGTGC TGGACCCGGA GGGTGACGCG CAGATTGACC CCAACTGGGT 1680 

GGTGCTCAAC CAGGGAAGGG AGATCGTGCA GACAATGAAC AGCGACCCAG GCCTGGCTGT 1740 

GGGTTACACT GCCTTCAATG GCGTGGACTT CGAGGGCACG TTCCATGTGA ACACGGTCAC 1800 

GGATGACGAC TATGCGGGCT TCATCTTTGG CTACCAGGAC AGCTCCAGCT TCTACGTGGT 1860

CATGTGGAAG CAGATGGAGC AAACGTATTG GCAGGCGAAC CCCTTCCGTG CTGTGGCCGA 1920 

GCCTGGCATC CAACTCAAGG CTGTGAAGTC TTCCACAGGC CCCGGGGAAC AGCTGCGGAA 1980 

CGCTCTGTGG CATACAGGAG ACACAGAGTC CCAGGTGCGG CTGCTGTGGA AGGACCCGCG 2040 

AAACGTGGGT TGGAAGGACA AGAAGTCCTA TCGTTGGTTC CTGCAGCACC GGCCCCAAGT 2100

GGGCTACATC AGGGTGCGAT TCTATGAGGG CCCTGAGCTG GTGGCCGACA GCAACGTGGT 2160

CTTGGACACA ACCATGCGGG GTGGCCGCCT GGGGGTCTTC TGCTTCTCCC AGGAGAACAT 2220 

CATCTGGGCC AACCTGCGTT ACCGCTGCAA TGACACCATC CCAGAGGACT ATGAGACCCA 2280 

TCAGCTGCGG CAAGCCTAGG GACCAGGGTG AGGACCCGCC GGATGACAGC CACCCTCACC 2340 

GCGGCTGGAT GGGGGCTCTG CACCCAGCCC AAGGGGTGGC CGTCCTGAGG GGGAAGTGAG 2400

AAGGGCTCAG AGAGGACAAA ATAAAGTGTG TGTGCAGGG

Seq ID NO: 409 Protein sequence
Protein Accession #: NP_000086.1

1 11 21 31 41 51

I I I I I I

MVPDTACVLL LTLAALGASG QGQSPLGSDL GPQMLRELQE TNAALQDVRD WLRQQVREIT 60

FLKNTVMECD ACGMQQSVRT GLPSVRPLLH CAPGFCFPGV ACIQTESGGR CGPCPAGFTG 120

NGSHCTDVNE CNAHPCFPRV RCINTSPGFR CEACPPGYSG PTHQGVGLAF AKANKQVCTD 180

INECETGQHN CVPNSVCINT RGSFQCGPCQ PGFVGDQASG CQRGAQRFCP DGSPSECHEH 240

ADCVLERDGS RSCVCRVGWA GNGILCGRDT DLDGFPDEKL RCPEPQCRKD NCVTVPNSGQ 300

EDVDRDGIGD ACDPDADGDG VPNEKDNCPL VRNPDQRNTD EDKWGDACDN CRSQKNDDQK 360

DTDQDGRGDA CDDDIDGDRI RNQADNCPRV PNSDQKDSDG DGIGDACDNC PQKSNPDQAD 420

VDHDFVGDAC DSDQDQDGDG HQDSRDNCPT VPNSAQEDSD HDGQGDACDD DDDNDGVPDS 480

RDNCRLVPNP GQEDADRDGV GDVCQDDFDA DKVVDKIDVC PENAEVTLTD FRAFQTVVLD 540

PEGDAQIDPN WVVLNQGREI VQTMNSDPGL AVGYTAFNGV DFEGTFHVNT VTDDDYAGFI 600

FGYQDSSSFY VVMWKQMEQT YWQANPFRAV AEPGIQLKAV KSSTGPGEQL RNALWHTGDT 660

ESQVRLLWKD PRNVGWKDKK SYRWFLQHRP QVGYIRVRFY EGPELVADSN VVLDTTMRGG 720
RLGVFCFSQE NIIWANLRYR CNDTIPEDYE THQLRQA

Seq ID NO: 410 DNA sequence

Nucleic Acid Accession #: NM_001565.1

Coding sequence: 67..363

1 11 21 31 41 51

I I I I I I

GAGACATTCC TCAATTGCTT AGACATATTC TGAGCCTACA GCAGAGGAAC CTCCAGTCTC 60

AGCACCATGA ATCAAACTGC GATTCTGATT TGCTGCCTTA TCTTTCTGAC TCTAAGTGGC 120

ATTCAAGGAG TACCTCTCTC TAGAACCGTA CGCTGTACCT GCATCAGCAT TAGTAATCAA 180

CCTGTTAATC CAAGGTCTTT AGAAAAACTT GAAATTATTC CTGCAAGCCA ATTTTGTCCA 240

CGTGTTGAGA TCATTGCTAC AATGAAAAAG AAGGGTGAGA AGAGATGTCT GAATCCAGAA 300

TCGAAGGCCA TCAAGAATTT ACTGAAAGCA GTTAGCAAGG AAATGTCTAA AAGATCTCCT 360

TAAAACCAGA GGGGAGCAAA ATCGATGCAG TGCTTCCAAG GATGGACCAC ACAGAGGCTG 420

CCTCTCCCAT CACTTCCCTA CATGGAGTAT ATGTCAAGCC ATAATTGTTC TTAGTTTGCA 480

GTTACACTAA AAGGTGACCA ATGATGGTCA CCAAATCAGC TGCTACTACT CCTGTAGGAA 540

GGTTAATGTT CATCATCCTA AGCTATTCAG TAATAACTCT ACCCTGGCAC TATAATGTAA 600

GCTCTACTGA GGTGCTATGT TCTTAGTGGA TGTTCTGACC CTGCTTCAAA TATTTCCCTC 660

ACCTTTCCCA TCTTCCAAGG GTACTAAGGA ATCTTTCTGC TTTGGGGTTT ATCAGAATTC 720

TCAGAATCTC AAATAACTAA AAGGTATGCA ATCAAATCTG CTTTTTAAAG AATGCTCTTT 780

ACTTCATGGA CTTCCACTGC CATCCTCCCA AGGGGCCCAA ATTCTTTCAG TGGCTACCTA 840

CATACAATTC CAAACACATA CAGGAAGGTA GAAATATCTG AAAATGTATG TGTAAGTATT 900

CTTATTTAAT GAAAGACTGT ACAAAGTATA AGTCTTAGAT GTATATATTT CCTATATTGT 960

TTTCAGTGTA CATGGAATAA CATGTAATTA AGTACTATGT ATCAATGAGT AACAGGAAAA 1020

TTTTAAAAAT ACAGATAGAT ATATGCTCTG CATGTTACAT AAGATAAATG TGCTGAATGG 1080
TTTTCAAATA AAAATGAGGT ACTCTCCTGG AAATATTAAG

Seq ID NO: 411 Protein sequence
Protein Accession #: NP_001556.1

1 11 21 31 41 51

I I I I I I

MNQTAILICC LIFLTLSGIQ GVPLSRTVRC TCISISNQPV NPRSLEKLEI IPASQFCPRV 60

EIIATMKKKG EKRCLNPESK AIKNLLKAVS KEMSKRSP

Seq ID NO: 412 DNA sequence

Nucleic Acid Accession #: XM_057014

Coding sequence: 143..874

1 11 21 31 41 51

I I I I I I

GGGAGGGAGA GAGGCGCGCG GGTGAAAGGC GCATTGATGC AGCCTGCGGC GGCCTCGGAG 60

CGCGGCGGAG CCAGACGCTG ACCACGTTCC TCTCCTCGGT CTCCTCCGCC TCCAGCTCCG 120

CGCTGCCCGG CAGCCGGGAG CCATGCGACC CCAGGGCCCC GCCGCCTCCC CGCAGCGGCT 180

CCGCGGCCTC CTGCTGCTCC TGCTGCTGCA GCTGCCCGCG CCGTCGAGCG CCTCTGAGAT 240

CCCCAAGGGG AAGCAAAAGG CGCAGCTCCG GCAGAGGGAG GTGGTGGACC TGTATAATGG 300

AATGTGCTTA CAAGGGCCAG CAGGAGTGCC TGGTCGAGAC GGGAGCCCTG GGGCCAATGG 360

CATTCCGGGT ACACCTGGGA TCCCAGGTCG GGATGGATTC AAAGGAGAAA AGGGGGAATG 420

TCTGAGGGAA AGCTTTGAGG AGTCCTGGAC ACCCAACTAC AAGCAGTGTT CATGGAGTTC 480

ATTGAATTAT GGCATAGATC TTGGGAAAAT TGCGGAGTGT ACATTTACAA AGATGCGTTC 540

AAATAGTGCT CTAAGAGTTT TGTTCAGTGG CTCACTTCGG CTAAAATGCA GAAATGCATG 600

CTGTCAGCGT TGGTATTTCA CATTCAATGG AGCTGAATGT TCAGGACCTC TTCCCATTGA 660

AGCTATAATT TATTTGGACC AAGGAAGCCC TGAAATGAAT TCAACAATTA ATATTCATCG 720

CACTTCTTCT GTGGAAGGAC TTTGTGAAGG AATTGGTGCT GGATTAGTGG ATGTTGCTAT 780

CTGGGTTGGC ACTTGTTCAG ATTACCCAAA AGGAGATGCT TCTACTGGAT GGAATTCAGT 840

TTCTCGCATC ATTATTGAAG AACTACCAAA ATAAATGCTT TAATTTTCAT TTGCTACCTC 900

TTTTTTTATT ATGCCTTGGA ATGGTTCACT TAAATGACAT TTTAAATAAG TTTATGTATA 960

CATCTGAATG AAAAGCAAAG CTAAATATGT TTACAGACCA AAGTGTGATT TCACACTGTT 1020

TTTAAATCTA GCATTATTCA TTTTGCTTCA ATCAAAAGTG GTTTCAATAT TTTTTTTAGT 1080

TGGTTAGAAT ACTTTCTTCA TAGTCACATT CTCTCAACCT ATAATTTGGA ATATTGTTGT 1140

GGTCTTTTGT TTTTTCTCTT AGTATAGCAT TTTTAAAAAA ATATAAAAGC TACCAATCTT 1200

TGTACAATTT GTAAATGTTA AGAATTTTTT TTATATCTGT TAAATAAAAA TTATTTCCAA 1260
CAACCTTAAA AAAAAAAAAA AAAA

Seq ID NO: 413 Protein sequence 
Protein Accession #: XP_057014 

1 11 21 31 41 51

I I I I I I

MRPQGPAASP QRLRGLLLLL LLQLPAPSSA SEIPKGKQKA QLRQREVVDL YNGMCLQGPA 60
GVPGRDGSPG ANGIPGTPGI PGRDGFKGEK GECLRESFEE SWTPNYKQCS WSSLNYGIDL 120
GKIAECTFTK MRSNSALRVL FSGSLRLKCR NACCQRWYFT FNGAECSGPL PIEAIIYLDQ 180
GSPEMNSTIN IHRTSSVEGL CEGIGAGLVD VAIWVGTCSD YPKGDASTGW NSVSRIIIEE 240
LPK

Seq ID NO: 414 DNA sequence

Nucleic Acid Accession #: XM_084007

Coding sequence: 138..2405

1 11 21 31 41 51

I I I I I I

CTCGTGCCGA ATTCGGCACG AGACCGCGTG TTCGCGCCTG GTAGAGATTT CTCGAAGACA 60 

CCAGTGGGCC CGTGTGGAAC CAAACCTGCG CGCGTGGCCG GGCCGTGGGA CAACGAGGCC 120

GCGGAGACGA AGGCGCAATG GCGAGGAAGT TATCTGTAAT CTTGATCCTG ACCTTTGCCC 180 

TCTCTGTCAC AAATCCCCTT CATGAACTAA AAGCAGCTGC TTTCCCCCAG ACCACTGAGA 240 

AAATTAGTCC GAATTGGGAA TCTGGCATTA ATGTTGACTT GGCAATTTCC ACACGGCAAT 300 

ATCATCTACA ACAGCTTTTC TACCGCTATG GAGAAAATAA TTCTTTGTCA GTTGAAGGGT 360 

TCAGAAAATT ACTTCAAAAT ATAGGCATAG ATAAGATTAA AAGAATCCAT ATACACCATG 420 

ACCACGACCA TCACTCAGAC CACGAGCATC ACTCAGACCA TGAGCGTCAC TCAGACCATG 480 

AGCATCACTC AGACCACGAG CATCACTCTG ACCATGATCA TCACTCCCAC CATAATCATG 540 

CTGCTTCTGG TAAAAATAAG CGAAAAGCTC TTTGCCCAGA CCATGACTCA GATAGTTCAG 600 

GTAAAGATCC TAGAAACAGC CAGGGGAAAG GAGCTCACCG ACCAGAACAT GCCAGTGGTA 660 

GAAGGAATGT CAAGGACAGT GTTAGTGCTA GTGAAGTGAC CTCAACTGTG TACAACACTG 720 

TCTCTGAAGG AACTCACTTT CTAGAGACAA TAGAGACTCC AAGACCTGGA AAACTCTTCC 780 

CCAAAGATGT AAGCAGCTCC ACTCCACCCA GTGTCACATC AAAGAGCCGG GTGAGCCGGC 840 

TGGCTGGTAG GAAAACAAAT GAATCTGTGA GTGAGCCCCG AAAAGGCTTT ATGTATTCCA 900 
GAAACACAAA TGAAAATCCT CAGGAGTGTT TCAATGCATC AAAGCTACTG ACATCTCATG 960

GCATGGGCAT CCAGGTTCCG CTGAATGCAA CAGAGTTCAA CTATCTCTGT CCAGCCATCA 1020 

TCAACCAAAT TGATGCTAGA TCTTGTCTGA TTCATACAAG TGAAAAGAAG GCTGAAATCC 1080 

CTCCAAAGAC CTATTCATTA CAAATAGCCT GGGTTGGTGG TTTTATAGCC ATTTCCATCA 1140 

TCAGTTTCCT GTCTCTGCTG GGGGTTATCT TAGTGCCTCT CATGAATCGG GTGTTTTTCA 1200 

AATTTCTCCT GAGTTTCCTT GTGGCACTGG CCGTTGGGAC TTTGAGTGGT GATGCTTTTT 1260 

TACACCTTCT TCCACATTCT CATGCAAGTC ACCACCATAG TCATAGCCAT GAAGAACCAG 1320

CAATGGAAAT GAAAAGAGGA CCACTTTTCA GTCATCTGTC TTCTCAAAAC ATAGAAGAAA 1380

GTGCCTATTT TGATTCCACG TGGAAGGGTC TAACAGCTCT AGGAGGCCTG TATTTCATGT 1440 

TTCTTGTTGA ACATGTCCTC ACATTGATCA AACAATTTAA AGATAAGAAG AAAAAGAATC 1500 

AGAAGAAACC TGAAAATGAT GATGATGTGG AGATTAAGAA GCAGTTGTCC AAGTATGAAT 1560 

CTCAACTTTC AACAAATGAG GAGAAAGTAG ATACAGATGA TCGAACTGAA GGCTATTTAC 1620 

GAGCAGACTC ACAAGAGCCC TCCCACTTTG ATTCTCAGCA GCCTGCAGTC TTGGAAGAAG 1680 

AAGAGGTCAT GATAGCTCAT GCTCATCCAC AGGAAGTCTA CAATGAATAT GTACCCAGAG 1740 

GGTGCAAGAA TAAATGCCAT TCACATTTCC ACGATACACT CGGCCAGTCA GACGATCTCA 1800 

TTCACCACCA TCATGACTAC CATCATATTC TCCATCATCA CCACCACCAA AACCACCATC 1860 

CTCACAGTCA CAGCCAGCGC TACTCTCGGG AGGAGCTGAA AGATGCCGGC GTCGCCACTT 1920 

TGGCCTGGAT GGTGATAATG GGTGATGGCC TGCACAATTT CAGCGATGGC CTAGCAATTG 1980 

GTGCTGCTTT TACTGAAGGC TTATCAAGTG GTTTAAGTAC TTCTGTTGCT GTGTTCTGTC 2040 

ATGAGTTGCC TCATGAATTA GGTGACTTTG CTGTTCTACT AAAGGCTGGC ATGACCGTTA 2100 

AGCAGGCTGT CCTTTATAAT GCATTGTCAG CCATGCTGGC GTATCTTGGA ATGGCAACAG 2160 

GAATTTTCAT TGGTCATTAT GCTGAAAATG TTTCTATGTG GATATTTGCA CTTACTGCTG 2220 

GCTTATTCAT GTATGTTGCT CTGGTTGATA TGGTACCTGA AATGCTGCAC AATGATGCTA 2280 

GTGACCATGG ATGTAGCCGC TGGGGGTATT TCTTTTTACA GAATGCTGGG ATGCTTTTGG 2340 

GTTTTGGAAT TATGTTACTT ATTTCCATAT TTGAACATAA AATCGTGTTT CGTATAAATT 2400 

TCTAGTTAAG GTTTAAATGC TAGAGTAGCT TAAAAAGTTG TCATAGTTTC AGTAGGTCAT 2460 

AGGGAGATGA GTTTGTATGC TGTACTATGC AGCGTTTAAA GTTAGTGGGT TTTGTGATTT 2520 

TTGTATTGAA TATTGCTGTC TGTTACAAAG TCAGTTAAAG GTACGTTTTA ATATTTAAGT 2580 

TATTCTATCT TGGAGATAAA ATCTGTATGT GCAATTCACC GGTATTACCA GTTTATTATG 2640 

TAAACAAGAG ATTTGGCATG ACATGTTCTG TATGTTTCAG GGAAAAATGT CTTTAATGCT 2700 

TTTTCAAGAA CTAACACAGT TATTCCTATA CTGGATTTTA GGTCTCTGAA GAACTGCTGG 2760 

TGTTTAGGAA TAAGAATGTG CATGAAGCCT AAAATACCAA GAAAGCTTAT ACTGAATTTA 2820 

AGCAAAGAAA TAAAGGAGAA AAGAGAAGAA TCTGAGAATT GGGGAGGCAT AGATTCTTAT 2880 

AAAAATCACA AAATTTGTTG TAAATTAGAG GGGAGAAATT TAGAATTAAG TATAAAAAGG 2940 

CAGAATTAGT ATAGAGTACA TTCATTAAAC ATTTTTGTCA GGATTATTTC CCGTAAAAAC 3000 

GTAGTGAGCA CTCTCATATA CTAATTAGTG TACATTTAAC TTTGTATAAT ACAGAAATCT 3060 

AAATATATTT AATGAATTCA AGCAATATAC ACTTGACCAA GAAATTGGAA TTTCAAAATG 3120 

TTCGTGCGGG TTATATACCA GATGAGTACA GTGAGTAGTT TATGTATCAC CAGACTGGGT 3180 

TATTGCCAAG TTATATATCA CCAAAAGCTG TATGACTGGA TGTTCTGGTT ACCTGGTTTA 3240 

CAAAATTATC AGAGTAGTAA AACTTTGATA TATATGAGGA TATTAAAACT ACACTAAGTA 3300 

TCATTTGATT CGATTCAGAA AGTACTTTGA TATCTCTCAG TGCTTCAGTG CTATCATTGT 3360 

GAGCAATTGT CTTTATATAC GGTACTGTAG CCATACTAGG CCTGTCTGTG GCATTCTCTA 3420 
GATGTTTCTT TTTTACACAA TAAATTCCTT ATATCAGCTT G 

Seq ID NO: 415 Protein sequence 
Protein Accession #: XP_084007

1 11 21 31 41 51

I I I I I I

MARKLSVILI LTFALSVTNP LHELKAAAFP QTTEKISPNW ESGINVDLAI STRQYHLQQL 60

FYRYGENNSL SVEGFRKLLQ NIGIDKIKRI HIHHDHDHHS DHEHHSDHER HSDHEHHSDH 120

EHHSDHDHHS HHNHAASGKN KRKALCPDHD SDSSGKDPRN SQGKGAHRPE HASGRRNVKD 180 

SVSASEVTST VYNTVSEGTH FLETIETPRP GKLFPKDVSS STPPSVTSKS RVSRLAGRKT 240 

NESVSEPRKG FMYSRNTNEN PQECFNASKL LTSHGMGIQV PLNATEFNYL CPAIINQIDA 300

RSCLIHTSEK KAEIPPKTYS LQIAWVGGFI AISIISFLSL LGVILVPLMN RVFFKFLLSF 360 

LVALAVGTLS GDAFLHLLPH SHASHHHSHS HEEPAMEMKR GPLFSHLSSQ NIEESAYFDS 420 

TWKGLTALGG LYFMFLVEHV LTLIKQFKDK KKKNQKKPEN DDDVEIKKQL SKYESQLSTN 480 

EEKVDTDDRT EGYLRADSQE PSHFDSQQPA VLEEEEVMIA HAHPQEVYNE YVPRGCKNKC 540 

HSHFHDTLGQ SDDLIHHHHD YHHILHHHHH QNHHPHSHSQ RYSREELKDA GVATLAWMVI 600

MGDGLHNFSD GLAIGAAFTE GLSSGLSTSV AVFCHELPHE LGDFAVLLKA GMTVKQAVLY 660

NALSAMLAYL GMATGIFIGH YAENVSMWIF ALTAGLFMYV ALVDMVPEML HNDASDHGCS 720
RWGYFFLQNA GMLLGFGIML LISIFEHKIV FRINF 

Seq ID NO: 416 DNA sequence 

Nucleic Acid Accession #: NM_015419.1

Coding sequence: 1..8487

1 11 21 31 41 51

I I I I I I 

ATGCCCAAGC GCGCGCACTG GGGGGCCCTC TCCGTGGTGC TGATCCTGCT TTGGGGCCAT 60

CCGCGAGTGG CGCTGGCCTG CCCGCATCCT TGTGCCTGCT ACGTCCCCAG CGAGGTCCAC 120 

TGCACGTTCC GATCCCTGGC TTCCGTGCCC GCTGGCATTG CTAGACACGT GGAAAGAATC 180 

AATTTGGGGT TTAATAGCAT ACAGGCCCTG TCAGAAACCT CATTTGCAGG ACTGACCAAG 240 

TTGGAGCTAC TTATGATTCA CGGCAATGAG ATCCCAAGCA TCCCCGATGG AGCTTTAAGA 300 

GACCTCAGCT CTCTTCAGGT TTTCAAGTTC AGCTACAACA AGCTGAGAGT GATCACAGGA 360 

CAGACCCTCC AGGGTCTCTC TAACTTAATG AGGCTGCACA TTGACCACAA CAAGATCGAG 420 

TTTATCCACC CTCAAGCTTT CAACGGCTTA ACGTCTCTGA GGCTACTCCA TTTGGAAGGA 480 

AATCTCCTCC ACCAGCTGCA CCCCAGCACC TTCTCCACGT TCACATTTTT GGATTATTTC 540 

AGACTCTCCA CCATAAGGCA CCTCTACTTA GCAGAGAACA TGGTTAGAAC TCTTCCTGCC 600 

AGCATGCTTC GGAACATGCC GCTTCTGGAG AATCTTTACT TGCAGGGAAA TCCGTGGACC 660 

TGCGATTGTG AGATGAGATG GTTTTTGGAA TGGGATGCAA AATCCAGAGG AATTCTGAAG 720 

TGTAAAAAGG ACAAAGCTTA TGAAGGCGGT CAGTTGTGTG CAATGTGCTT CAGTCCAAAG 780 

AAGTTGTACA AACATGAGAT ACACAAGCTG AAGGACATGA CTTGTCTGAA GCCTTCAATA 840 

GAGTCCCCTC TGAGACAGAA CAGGAGCAGG AGTATTGAGG AGGAGCAAGA ACAGGAAGAG 900 

GATGGTGGCA GCCAGCTCAT CCTGGAGAAA TTCCAACTGC CCCAGTGGAG CATCTCTTTG 960 

AATATGACCG ACGAGCACGG GAACATGGTG AACTTGGTCT GTGACATCAA GAAACCAATG 1020 

GATGTGTACA AGATTCACTT GAACCAAACG GATCCTCCAG ATATTGACAT AAATGCAACA 1080 

GTTGCCTTGG ACTTTGAGTG TCCAATGACC CGAGAAAACT ATGAAAAGCT ATGGAAATTG 1140 

ATAGCATACT ACAGTGAAGT TCCCGTGAAG CTACACAGAG AGCTCATGCT CAGCAAAGAC 1200 

CCCAGAGTCA GCTACCAGTA CAGGCAGGAT GCTGATGAGG AAGCTCTTTA CTACACAGGT 1260 

GTGAGAGCCC AGATTCTTGC AGAACCAGAA TGGGTCATGC AGCCATCCAT AGATATCCAG 1320 

CTGAACCGAC GTCAGAGTAC GGCCAAGAAG GTGCTACTTT CCTACTACAC CCAGTATTCT 1380 

CAAACAATAT CCACCAAAGA TACAAGGCAG GCTCGGGGCA GAAGCTGGGT AATGATTGAG 1440

CCTAGTGGAG CTGTGCAAAG AGATCAGACT GTCCTGGAAG GGGGTCCATG CCAGTTGAGC 1500 

TGCAACGTGA AAGCTTCTGA GAGTCCATCT ATCTTCTGGG TGCTTCCAGA TGGCTCCATC 1560 

CTGAAAGCGC CCATGGATGA CCCAGACAGC AAGTTCTCCA TTCTCAGCAG TGGCTGGCTG 1620 

AGGATCAAGT CCATGGAGCC ATCTGACTCA GGCTTGTACC AGTGCATTGC TCAAGTGAGG 1680 

GATGAAATGG ACCGCATGGT ATATAGGGTA CTTGTGCAGT CTCCCTCCAC TCAGCCAGCC 1740 

GAGAAAGACA CAGTGACAAT TGGCAAGAAC CCAGGGGAGT CGGTGACATT GCCTTGCAAT 1800 

GCTTTAGCAA TACCCGAAGC CCACCTTAGC TGGATTCTTC CAAACAGAAG GATAATTAAT 1860 

GATTTGGCTA ACACATCACA TGTATACATG TTGCCAAATG GAACTCTTTC CATCCCAAAG 1920 

GTCCAAGTCA GTGATAGTGG TTACTACAGA TGTGTGGCTG TCAACCAGCA AGGGGCAGAC 1980 

CATTTTACGG TGGGAATCAC AGTGACCAAG AAAGGGTCTG GCTTGCCATC CAAAAGAGGC 2040 

AGACGCCCAG GTGCAAAGGC TCTTTCCAGA GTCAGAGAAG ACATCGTGGA GGATGAAGGG 2100 

GGCTCGGGCA TGGGAGATGA AGAGAACACT TCAAGGAGAC TTCTGCATCC AAAGGACCAA 2160 

GAGGTGTTCC TCAAAACAAA GGATGATGCC ATCAATGGAG ACAAGAAAGC CAAGAAAGGG 2220 

AGAAGAAAGC TGAAACTCTG GAAGCATTCG GAAAAAGAAC CAGAGACCAA TGTTGCAGAA 2280 

GGTCGCAGAG TGTTTGAATC TAGACGAAGG ATAAACATGG CAAACAAACA GATTAATCCG 2340 

GAGCGCTGGG CTGATATTTT AGCCAAAGTC CGTGGGAAAA ATCTCCCTAA GGGCACAGAA 2400 

GTACCCCCGT TGATTAAAAC CACAAGTCCT CCATCCTTGA GCCTAGAAGT CACACCACCT 2460 

TTTCCTGCTG TTTCTCCCCC CTCAGCATCT CCTGTGCAGA CAGTAACCAG TGCTGAAGAA 2520

TCCTCAGCAG ATGTACCTCT ACTTGGTGAA GAAGAGCACG TTTTGGGTAC CATTTCCTCA 2580 

GCCAGCATGG GGCTAGAACA CAACCACAAT GGAGTTATTC TTGTTGAACC TGAAGTAACA 2640 

AGCACACCTC TGGAGGAAGT TGTTGATGAC CTTTCTGAGA AGACTGAGGA GATAACTTCC 2700 

ACTGAAGGAG ACCTGAAGGG GACAGCAGCC CCTACACTTA TATCTGAGCC TTATGAACCA 2760 

TCTCCTACTC TGCACACATT AGACACAGTC TATGAAAAGC CCACCCATGA AGAGACGGCA 2820 

ACAGAGGGTT GGTCTGCAGC AGATGTTGGA TCGTCACCAG AGCCCACATC CAGTGAGTAT 2880 

GAGCCTCCAT TGGATGCTGT CTCCTTGGCT GAGTCTGAGC CCATGCAATA CTTTGACCCA 2940 

GATTTGGAGA CTAAGTCACA ACCAGATGAG GATAAGATGA AAGAAGACAC CTTTGCACAC 3000 

CTTACTCCAA CCCCCACCAT CTGGGTTAAT GACTCCAGTA CATCACAGTT ATTTGAGGAT 3060 

TCTACTATAG GGGAACCAGG TGTCCCAGGC CAATCACATC TACAAGGACT GACAGACAAC 3120 

ATCCACCTTG TGAAAAGTAG TCTAAGCACT CAAGACACCT TACTGATTAA AAAGGGTATG 3180 

AAAGAGATGT CTCAGACACT ACAGGGAGGA AATATGCTAG AGGGAGACCC CACACACTCC 3240 

AGAAGTTCTG AGAGTGAGGG CCAAGAGAGC AAATCCATCA CTTTGCCTGA CTCCACACTG 3300 

GGTATAATGA GCAGTATGTC TCCAGTTAAG AAGCCTGCGG AAACCACAGT TGGTACCCTC 3360 

CTAGACAAAG ACACCACAAC AGTAACAACA ACACCAAGGC AAAAAGTTGC TCCGTCATCC 3420 

ACCATGAGCA CTCACCCTTC TCGAAGGAGA CCCAACGGGA GAAGGAGATT ACGCCCCAAC 3480 

AAATTCCGCC ACCGGCACAA GCAAACCCCA CCCACAACTT TTGCCCCATC AGAGACTTTT 3540 

TCTACTCAAC CAACTCAAGC ACCTGACATT AAGATTTCAA GTCAAGTGGA GAGTTCTCTG 3600 

GTTCCTACAG CTTGGGTGGA TAACACAGTT AATACCCCCA AACAGTTGGA AATGGAGAAG 3660 

AATGCAGAAC CCACATCCAA GGGAACACCA CGGAGAAAAC ACGGGAAGAG GCCAAACAAA 3720 

CATCGATATA CCCCTTCTAC AGTGAGCTCA AGAGCGTCCG GATCCAAGCC CAGCCCTTCT 3780 

CCAGAAAATA AACATAGAAA CATTGTTACT CCCAGTTCAG AAACTATACT TTTGCCTAGA 3840 

ACTGTTTCTC TGAAAACTGA GGGCCCTTAT GATTCCTTAG ATTACATGAC AACCACCAGA 3900

AAAATATATT CATCTTACCC TAAAGTCCAA GAGACACTTC CAGTCACATA TAAACCCACA 3960

TCAGATGGAA AAGAAATTAA GGATGATGTT GCCACAAATG TTGACAAACA TAAAAGTGAC 4020 

ATTTTAGTCA CTGGTGAATC AATTACTAAT GCCATACCAA CTTCTCGCTC CTTGGTCTCC 4080 

ACTATGGGAG AATTTAAGGA AGAATCCTCT CCTGTAGGCT TTCCAGGAAC TCCAACCTGG 4140 

AATCCCTCAA GGACGGCCCA GCCTGGGAGG CTACAGACAG ACATACCTGT TACCACTTCT 4200 

GGGGAAAATC TTACAGACCC TCCCCTTCTT AAAGAGCTTG AGGATGTGGA TTTCACTTCC 4260 

GAGTTTTTGT CCTCTTTGAC AGTCTCCACA CCATTTCACC AGGAAGAAGC TGGTTCTTCC 4320 

ACAACTCTCT CAAGCATAAA AGTGGAGGTG GCTTCAAGTC AGGCAGAAAC CACCACCCTT 4380 

GATCAAGATC ATCTTGAAAC CACTGTGGCT ATTCTCCTTT CTGAAACTAG ACCACAGAAT 4440 

CACACCCCTA CTGCTGCCCG GATGAAGGAG CCAGCATCCT CGTCCCCATC CACAATTCTC 4500 

ATGTCTTTGG GACAAACCAC CACCACTAAG CCAGCACTTC CCAGTCCAAG AATATCTCAA 4560 

GCATCTAGAG ATTCCAAGGA AAATGTTTTC TTGAATTATG TGGGGAATCC AGAAACAGAA 4620 

GCAACCCCAG TCAACAATGA AGGAACACAG CATATGTCAG GGCCAAATGA ATTATCAACA 4680 

CCCTCTTCCG ACCGGGATGC ATTTAACTTG TCTACAAAGC TGGAATTGGA AAAGCAAGTA 4740 

TTTGGTAGTA GGAGTCTACC ACGTGGCCCA GATAGCCAAC GCCAGGATGG AAGAGTTCAT 4800 

GCTTCTCATC AACTAACCAG AGTCCCTGCC AAACCCATCC TACCAACAGC AACAGTGAGG 4860 

CTACCTGAAA TGTCCACACA AAGCGCTTCC AGATACTTTG TAACTTCCCA GTCACCTCGT 4920 

CACTGGACCA ACAAACCGGA AATAACTACA TATCCTTCTG GGGCTTTGCC AGAGAACAAA 4980 

CAGTTTACAA CTCCAAGATT ATCAAGTACA ACAATTCCTC TCCCATTGCA CATGTCCAAA 5040 
CCCAGCATTC CTAGTAAGTT TACTGACCGA AGAACTGACC AATTCAATGG TTACTCCAAA 5100 

GTGTTTGGAA ATAACAACAT CCCTGAGGCA AGAAACCCAG TTGGAAAGCC TCCCAGTCCA 5160 

AGAATTCCTC ATTATTCCAA TGGAAGACTC CCTTTCTTTA CCAACAAGAC TCTTTCTTTT 5220 

CCACAGTTGG GAGTCACCCG GAGACCCCAG ATACCCACTT CTCCTGCCCC AGTAATGAGA 5280 

GAGAGAAAAG TTATTCCAGG TTCCTACAAC AGGATACATT CCCATAGCAC CTTCCATCTG 5340 

GACTTTGGCC CTCCGGCACC TCCGTTGTTG CACACTCCGC AGACCACGGG ATCACCCTCA 5400 

ACTAACTTAC AGAATATCCC TATGGTCTCT TCCACCCAGA GTTCTATCTC CTTTATAACA 5460 

TCTTCTGTCC AGTCCTCAGG AAGCTTCCAC CAGAGCAGCT CAAAGTTCTT TGCAGGAGGA 5520 

CCTCCTGCAT CCAAATTCTG GTCTCTTGGG GAAAAGCCCC AAATCCTCAC CAAGTCCCCA 5580 

CAGACTGTGT CCGTCACCGC TGAGACAGAC ACTGTGTTCC CCTGTGAGGC AACAGGAAAA 5640 

CCAAAGCCTT TCGTTACTTG GACAAAGGTT TCCACAGGAG CTCTTATGAC TCCGAATACC 5700 

AGGATACAAC GGTTTGAGGT TCTCAAGAAC GGTACCTTAG TGATACGGAA GGTTCAAGTA 5760 

CAAGATCGAG GCCAGTATAT GTGCACCGCC AGCAACCTGC ACGGCCTGGA CAGGATGGTG 5820 

GTCTTGCTTT CGGTCACCGT GCAGCAACCT CAAATCCTAG CCTCCCACTA CCAGGACGTC 5880 

ACTGTCTACC TGGGAGACAC CATTGCAATG GAGTGTCTGG CCAAAGGGAC CCCAGCCCCC 5940 

CAAATTTCCT GGATCTTCCC TGACAGGAGG GTGTGGCAAA CTGTGTCCCC CGTGGAGAGC 6000 

CGCATCACCC TGCACGAAAA CCGGACCCTT TCCATCAAGG AGGCGTCCTT CTCAGACAGA 6060 

GGCGTCTATA AGTGCGTGGC CAGCAATGCA GCCGGGGCGG ACAGCCTGGC CATCCGCCTG 6120 

CACGTGGCGG CACTGCCCCC CGTTATCCAC CAGGAGAAGC TGGAGAACAT CTCGCTGCCC 6180 

CCGGGGCTCA GCATTCACAT TCACTGCACT GCCAAGGCTG CGCCCCTGCC CAGCGTGCGC 6240 

TGGGTGCTCG GGGACGGTAC CCAGATCCGC CCCTCGCAGT TCCTCCACGG GAACTTGTTT 6300 

GTTTTCCCCA ACGGGACGCT CTACATCCGC AACCTCGCGC CCAAGGACAG CGGGCGCTAT 6360 

GAGTGCGTGG CCGCCAACCT GGTAGGCTCC GCGCGCAGGA CGGTGCAGCT GAACGTGCAG 6420 

CGTGCAGCAG CCAACGCGCG CATCACGGGC ACCTCCCCGC GGAGGACGGA CGTCAGGTAC 6480 

GGAGGAACCC TCAAGCTGGA CTGCAGCGCC TCGGGGGACC CCTGGCCGCG CATCCTCTGG 6540 

AGGCTGCCGT CCAAGAGGAT GATCGACGCG CTCTTCAGTT TTGATAGCAG AATCAAGGTG 6600 

TTTGCCAATG GGACCCTGGT GGTGAAATCA GTGACGGACA AAGATGCCGG AGATTACCTG 6660 

TGCGTAGCTC GAAATAAGGT TGGTGATGAC TACGTGGTGC TCAAAGTGGA TGTGGTGATG 6720 

AAACCGGCCA AGATTGAACA CAAGGAGGAG AACGACCACA AAGTCTTCTA CGGGGGTGAC 6780 

CTGAAAGTGG ACTGTGTGGC CACCGGGCTT CCCAATCCCG AGATCTCCTG GAGCCTCCCA 6840 

GACGGGAGTC TGGTGAACTC CTTCATGCAG TCGGATGACA GCGGTGGACG CACCAAGCGC 6900 

TATGTCGTCT TCAACAATGG GACACTCTAC TTTAACGAAG TGGGGATGAG GGAGGAAGGA 6960 

GACTACACCT GCTTTGCTGA AAATCAGGTC GGGAAGGACG AGATGAGAGT CAGAGTCAAG 7020 

GTGGTGACAG CGCCCGCCAC CATCCGGAAC AAGACTTACT TGGCGGTTCA GGTGCCCTAT 7080 

GGAGACGTGG TCACTGTAGC CTGTGAGGCC AAAGGAGAAC CCATGCCCAA GGTGACTTGG 7140 

TTGTCCCCAA CCAACAAGGT GATCCCCACC TCCTCTGAGA AGTATCAGAT ATACCAAGAT 7200 

GGCACTCTCC TTATTCAGAA AGCCCAGCGT TCTGACAGCG GCAACTACAC CTGCCTGGTC 7260 

AGGAACAGCG CGGGAGAGGA TAGGAAGACG GTGTGGATTC ACGTCAACGT CCAGCCACCC 7320 

AAGATCAACG GTAACCCCAA CCCCATCACC ACCGTGCGGG AGATAGCAGC CGGGGGCAGT 7380 

CGGAAACTGA TTGACTGCAA AGCTGAAGGC ATCCCCACCC CGAGGGTGTT ATGGGCTTTT 7440 

CCCGAGGGTG TGGTTCTGCC AGCTCCATAC TATGGAAACC GGATCACTGT CCATGGCAAC 7500 

GGTTCCCTGG ACATCAGGAG TTTGAGGAAG AGCGACTCCG TCCAGCTGGT ATGCATGGCA 7560 

CGCAACGAGG GAGGGGAGGC GAGGTTGATC GTGCAGCTCA CTGTCCTGGA GCCCATGGAG 7620 

AAACCCATCT TCCACGACCC GATCAGCGAG AAGATCACGG CCATGGCGGG CCACACCATC 7680 

AGCCTCAACT GCTCTGCCGC GGGGACCCCG ACACCCAGCC TGGTGTGGGT CCTTCCCAAT 7740 

GGCACCGATC TGCAGAGTGG ACAGCAGCTG CAGCGCTTCT ACCACAAGGC TGACGGCATG 7800 

CTACACATTA GCGGTCTCTC CTCGGTGGAC GCTGGGGCCT ACCGCTGCGT GGCCCGCAAT 7860 

GCCGCTGGCC ACACGGAGAG GCTGGTCTCC CTGAAGGTGG GACTGAAGCC AGAAGCAAAC 7920 

AAGCAGTATC ATAACCTGGT CAGCATCATC AATGGTGAGA CCCTGAAGCT CCCCTGCACC 7980 

CCTCCCGGGG CTGGGCAGGG ACGTTTCTCC TGGACGCTCC CCAATGGCAT GCATCTGGAG 8040 

GGCCCCCAAA CCCTGGGACG CGTTTCTCTT CTGGACAATG GCACCCTCAC GGTTCGTGAG 8100 

GCCTCGGTGT TTGACAGGGG TACCTATGTA TGCAGGATGG AGACGGAGTA CGGCCCTTCG 8160 

GTCACCAGCA TCCCCGTGAT TGTGATCGCC TATCCTCCCC GGATCACCAG CGAGCCCACC 8220 

CCGGTCATCT ACACCCGGCC CGGGAACACC GTGAAACTGA ACTGCATGGC TATGGGGATT 8280 

CCCAAAGCTG ACATCACGTG GGAGTTACCG GATAAGTCGC ATCTGAAGGC AGGGGTTCAG 8340 

GCTCGTCTGT ATGGAAACAG ATTTCTTCAC CCCCAGGGAT CACTGACCAT CCAGCATGCC 8400 

ACACAGAGAG ATGCCGGCTT CTACAAGTGC ATGGCAAAAA ACATTCTCGG CAGTGACTCC 8460 

AAAACAACTT ACATCCACGT CTTCTGAAAT GTGGATTCCA GAATGATTGC TTAGGAACTG 8520 

ACAACAAAGC GGGGTTTGTA AGGGAAGCCA GGTTGGGGAA TAGGAGCTCT TAAATAATGT 8580 

GTCACAGTGC ATGGTGGCCT CTGGTGGGTT TCAAGTTGAG GTTGATCTTG ATCTACAATT 8640 

GTTGGGAAAA GGAAGCAATG CAGACACGAG AAGGAGGGCT CAGCCTTGCT GAGACACTTT 8700 

CTTTTGTGTT TACATCATGC CAGGGGCTTC ATTCAGGGTG TCTGTGCTCT GACTGCAATT 8760 

TTTCTTCTTT TGCAAATGCC ACTCGACTGC CTTCATAAGC GTCCATAGGA TATCTGAGGA 8820 

ACATTCATCA AAAATAAGCC ATAGACATGA ACAACACCTC ACTACCCCAT TGAAGACGCA 8880 

TCACCTAGTT AACCTGCTGC AGTTTTTACA TGATAGACTT TGTTCCAGAT TGACAAGTCA 8940 

TCTTTCAGTT ATTTCCTCTG TCACTTCAAA ACTCCAGCTT GCCCAATAAG GATTTAGAAC 9000 

CAGAGTGACT GATATATATA TATATATTTT AATTCAGAGT TACATACATA CAGCTACCAT 9060 

TTTATATGAA AAAAGAAAAA CATTTCTTCC TGGAACTCAC TTTTTATATA ATGTTTTATA 9120 

TATATATTTT TTCCTTTCAA ATCAGACGAT GAGACTAGAA GGAGAAATAC TTTCTGTCTT 9180 

ATTAAAATTA ATAAATTATT GGTCTTTACA AGACTTGGAT ACATTACAGC AGACATGGAA 9240 

ATATAATTTT AAAAAATTTC TCTCCAACCT CCTTCAAATT CAGTCACCAC TGTTATATTA 9300 

CCTTCTCCAG GAACCCTCCA GTGGGGAAGG CTGCGATATT AGATTTCCTT GTATGCAAAG 9360 

TTTTTGTTGA AAGCTGTGCT CAGAGGAGGT GAGAGGAGAG GAAGGAGAAA ACTGCATCAT 9420 

AACTTTACAG AATTGAATCT AGAGTCTTCC CCGAAAAGCC CAGAAACTTC TCTGCAGTAT 9480 

CTGGCTTGTC CATCTGGTCT AAGGTGGCTG CTTCTTCCCC AGCCATGAGT CAGTTTGTGC 9540 

CCATGAATAA TACACGACCT GTTATTTCCA TGACTGCTTT ACTGTATTTT TAAGGTCAAT 9600 
ATACTGTACA TTTGATAATA AAATAATATT CTCCCAAAAA AAAAA 

Seq ID NO: 417 Protein sequence 
Protein Accession #: NP_056234.1 

1 11 21 31 41 51 

I I I I I I 

MPKRAHWGAL SWLILLWGH PRVALACPHP CACYVPSEVH CTFRSLASVP AGIARHVERI 60 

NLGFNSIQAL SETSFAGLTK LELLMIHGNE IPSIPDGALR DLSSLQVFKF SYNKLRVITG 120 

QTLQGLSNLM RLHIDHNKIE FIHPQAFNGL TSLRLLHLEG NLLHQLHPST FSTFTFLDYF 180 

RLSTIRHLYL AENMVRTLPA SMLRNMPLLE NLYLQGNPWT CDCEMRWFLE WDAKSRGILK 240 




CKKDKAYEGG QLCAMCFSPK KLYKHEIHKL KDMTCLKPSI BSPLRQNRSR SIEEEQEQEE 300 

DGGSQLILBK FQLPQWSISL NMTDEHGNMV NLVCDIKKPM DVYKIHLNQT DPPDIDINAT 360 

VALDFECPMT RENYEKLWKL IAYYSEVPVK LHRELMLSKD PRVSYQYRQD ADEEALYYTG 420 

VRAQILAEPE WVMQPSIDIQ LNRRQSTAKK VLLSYYTQYS QTISTKDTRQ ARGRSWVMIB 480 

PSGAVQRDQT VLEGGPCQLS CNVKASESPS IFWVLPDGSI LKAPMDDPDS KFSILSSGWL 540 

RIKSMEPSDS GLYQCIAQVR DEMDRMVYRV LVQSPSTQPA EKDTVTIGKN PGESVTLPCN 600 

ALAIPEAHLS WILPNRRIIN DLANTSHVYM LPNGTLSIPK VQVSDSGYYR CVAVNQQGAD 660 

HFTVGITVTK KGSGLPSKRG RRPGAKALSR VREDIVEDEG GSGMGDEENT SRRLLHPKDQ 720 

EVFLKTKDDA INGDKKAKKG RRKLKLWKHS EKEPETNVAE GRRVFESRRR INMANKQINP 780 

ERWADILAKV RGKNLPKGTE VPPLIKTTSP PSLSLEVTPP FPAVSPPSAS PVQTVTSAEE 840 

SSADVPLLGE EEHVLGTISS ASMGLEHNHN GVILVEPEVT STPLEEWDD LSEKTEEITS 900 

TEGDLKGTAA PTLISEPYEP SPTLHTLDTV YEKPTHEETA TEGWSAADVG SSPEPTSSEY 960 

EPPLDAVSLA ESEPMQYFDP DLETKSQPDE DKMKEDTPAH LTPTPTIWVN DSSTSQLFED 1020 

STIGEPGVPG QSHLQGLTDN IHLVKSSLST QDTLLIKKGM KEMSQTLQGG NMLEGDPTHS 1080 

RSSESEGQES KSITLPDSTL GIMSSMSPVK KPAETTVGTL LDKDTTTVTT TPRQKVAPSS 1140 

TMSTHPSRRR PNGRRRLRPN KFRHRHKQTP PTTFAPSETF STQPTQAPDI KISSQVESSL 1200 

VPTAWVDNTV NTPKQLEMEK NAEPTSKGTP RRKHGKRPNK HRYTPSTVSS RASGSKPSPS 1260 

PENKHRNIVT PSSETILLPR TVSLKTEGPY DSLDYMTTTR KIYSSYPKVQ ETLPVTYKPT 1320 

SDGKEIKDDV ATNVDKHKSD ILVTGESITN AIPTSRSLVS TMGEFKEESS PVGFPGTPTW 1380 

NPSRTAQPGR LQTDIPVTTS GENLTDPPLL KELEDVDFTS EFLSSLTVST PFHQEEAGSS 1440 

TTLSSIKVEV ASSQAETTTL DQDHLETTVA ILLSETRPQN HTPTAARMKE PASSSPSTIL 1500 

MSLGQTTTTK PALPSPRISQ ASRDSKENVF LNYVGNPETE ATPVNNEGTQ HMSGPNELST 1560 

PSSDRDAFNL STKLELEKQV FGSRSLPRGP DSQRQDGRVH ASHQLTRVPA KPILPTATVR 1620 

LPEMSTQSAS RYFVTSQSPR HWTNKPEITT YPSGALPENK QFTTPRLSST TIPLPLHMSK 1680 

PSIPSKFTDR RTDQFNGYSK VFGNNNIPEA RNPVGKPPSP RIPHYSNGRL PFFTNKTLSF 1740 

PQLGVTRRPQ IPTSPAPVMR ERKVIPGSYN RIHSHSTFHL DFGPPAPPLL HTPQTTGSPS 1800 

TNLQNIPMVS STQSSISFIT SSVQSSGSFH QSSSKFFAGG PPASKFWSLG EKPQILTKSP 1860 

QTVSVTAETD TVFPCEATGK PKPFVTWTKV STGALMTPNT RIQRFEVLKN GTLVIRKVQV 1920 

QDRGQYMCTA SNLHGLDRMV VLLSVTVQQP QILASHYQDV TVYLGDTIAM ECLAKGTPAP 1980 

QISWIFPDRR VWQTVSPVES RITLHENRTL SIKEASFSDR GVYKCVASNA AGADSLAIRL 2040 

HVAALPPVIH QEKLENISLP PGLSIHIHCT AKAAPLPSVR WVLGDGTQIR PSQFLHGNLF 2100 

VFPNGTLYIR NLAPKDSGRY ECVAANLVGS ARRTVQLNVQ RAAANARITG TSPRRTDVRY 2160 

GGTLKLDCSA SGDPWPRILW RLPSKRMIDA LFSFDSRIKV FANGTLVVKS VTDKDAGDYL 2220 

CVARNKVGDD YVVLKVDVVM KPAKIEHKEE NDHKVFYGGD LKVDCVATGL PNPEISWSLP 2280 

DGSLVNSFMQ SDDSGGRTKR YVVFNNGTLY FNEVGMREEG DYTCFAENQV GKDEMRVRVK 2340 

VVTAPATIRN KTYLAVQVPY GDVVTVACEA KGEPMPKVTW LSPTNKVIPT SSEKYQIYQD 2400 

GTLLIQKAQR SDSGNYTCLV RNSAGEDRKT VWIHVNVQPP KINGNPNPIT TVREIAAGGS 2460 

RKLIDCKAEG IPTPRVLWAF PEGVVLPAPY YGNRITVHGN GSLDIRSLRK SDSVQLVCMA 2520 

RNEGGEARLI VQLTVLEPME KPIFHDPISE KITAMAGHTI SLNCSAAGTP TPSLVWVLPN 2580 

GTDLQSGQQL QRFYHKADGM LHISGLSSVD AGAYRCVARN AAGHTERLVS LKVGLKPEAN 2640 

KQYHNLVSII NGETLKLPCT PPGAGQGRFS WTLPNGMHLE GPQTLGRVSL LDNGTLTVRE 2700 

ASVFDRGTYV CRMETEYGPS VTSIPVIVIA YPPRITSEPT PVIYTRPGNT VKLNCMAMGI 2760 

PKADITWELP DKSHLKAGVQ ARLYGNRFLH PQGSLTIQHA TQRDAGFYKC MAKNILGSDS 2820 
KTTYIHVF 

Seq ID NO: 418 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 1..5001 

1 11 21 31 41 51 

I I I I I I 

ATGCCAGGCA CAAAACTAAC CCGAACAGGC GCCCCAGCAG ACTACAGAGT GATATTGAAG 60 

ACCTCTCAAG AGGACGAATT GGATGTACCT GACGACATCA GCGTCCGGGT TATGTCATCT 120 

CAGTCTGTGC TTGTGTCCTG GGTGGATCCT GTTCTGGAAA AACAGAAGAA AGTTGTTGCA 180 

TCAAGACAGT ACACCGTGCG CTATCGAGAG AAGGGGGAAT TGGCCAGGTG GGATTATAAG 240 

CAGATCGCTA ACAGGCGTGT GCTGATTGAG AACCTGATTC CAGACACTGT GTATGAATTT 300 

GCAGTCCGTA TTTCACAGGG TGAAAGAGAT GGCAAATGGA GTACGTCAGT CTTCCAAAGA 360 

ACACCAGAAT CTGCCCCTAC CACAGCTCCT GAAAACTTGA ACGTCTGGCC AGTCAATGGC 420 

AAACCTACAG TTGTCGCTGC ATCTTGGGAT GCGCTACCAG AGACTGAGGG GAAAGTGAAA 480 

GTCTGTCTGC TGGACACAGG ACTGTTTTCA GTTTCCTCCT TCCAACCATC TGCCAAATCA 540 

TTTCAGAATA CATTCTTTCA TACGCCCCGG CTCTCAAACC ATTTGGAGCA AAGTCCCTCA 600 

CCTATCCTGG AGACACTACT TCTGCCCTGG TGGATGGTCT GCAGCCTGGG GAACGCTATC 660 

TTTTCAAAAT CCGGGCCACA AACAGGAGAG GCCTGGGACC TCACTCCAAA GCCTTCATTG 720 

TCGCTATGCC AACAAGAATG CAGCTGTACC CAGAAGGATT TCAGTTGTCT AGCTTACCTG 780 

ATCGATATCC AAACCAAACA AGTTAATAAA GATCCACAAC TGGAAGGGAG TGTTTTTGGA 840 

CCATGTTTTC TTTTCTACTT CCTCACATTT ATGCTGGATA TTGGCGGCTT TTCCTTCATT 900 

ATGTGCTATG AAGACCCANN TGTTTCTTCT TTGACAGGCA ATTCTTTAAA ATCTGTTGCA 960 

GCCAGTAAGG CGGATGTTCA GCAGAACACG GAGGACAATG GGAAACCCGA AAAACCTGAG 1020 

CCTTCCTCAC CTTCTCCCAG AGCTCCAGCT TCCTCCCAAC ACCCCTCTGT GCCTGCTTCT 1080 

CCCCAAGGGA GAAATGCCAA GGACCTTCTT CTTGACTTGA AGAACAAAAT ATTGGCTAAT 1140 

GGTGGGGCGC CCCGAAAACC CCAGCTTCGC GCCAAGAAGG CAGAGGAGCT GGATCTTCAG 1200 

TCGACAGAAA TCACTGGGGA GGAGGAGCTG GGTTCCCGGG AGGACTCGCC CATGTCACCC 1260 

TCAGACACCC AAGACCAGAA ACGGACCCTG AGGCCGCCAA GTAGACACGG CCACTCGGTG 1320 

GTTGCTCCCG GCAGGACTGC AGTGAGGGCC CGGATGCCAG CGCTGCCCCG AAGGGAAGGC 1380 

GTAGATAAGC CTGGCTTTTC CCTGGCCACG CAGCCCCGCC CAGGGGCGCC CCCCTCGGCT 1440 

TCGGCCTCTC CTGCCCACCA CGCGTCCACC CAGGGCACCT CTCATCGTCC TTCCCTGCCT 1500 

GCCAGCTTGA ATGACAACGA CTTGGTGGAC TCAGACGAAG ATGAGCGCGC TGTGGGCTCC 1560 

CTCCACCCCA AGGGCGCCTT CGCCCAGCCC CGGCCAGCCC TGTCCCCCAG CCGCCAGTCC 1620 

CCGTCCAGCG TTCTCCGCGA CAGAAGCTCT GTGCACCCCG GCGCAAAGCC AGCCTCGCCG 1680 

GCGCGGAGGA CCCCCCATTC AGGGGCCGCA GAGGAAGATT CCAGTGCCTC AGCCCCACCC 1740 

TCAAGACTTT CTCCACCCCA TGGGGGATCA TCTCGGCTGC TGCCCACCCA GCCACACCTG 1800 

AGCTCTCCAC TTTCCAAGGG CGGGAAGGAT GGTGAGGACG CCCCAGCCAC CAACTCCAAT 1860 

GCGCCATCAC GGTCCACCAT GTCCTCCTCC GTCTCTTCTC ATCTCTCGTC CAGGACGCAG 1920 

GTCTCTGAGG GAGCGGAGGC TTCTGATGGT GAAAGCCACG GTGACGGCGA TAGGGAAGAC 1980 

GGCGGAAGGC AGGCGGAGGC CACGGCCCAG ACGCTGCGGG CCCGGCCTGC CTCTGGACAC 2040 

TTCCATTTGC TCAGACACAA ACCCTTTGCT GCCAACGGGA GGTCTCCAAG CAGGTTCAGC 2100 

ATTGGGCGGG GACCTCGGCT GCAGCCCTCC AGCTCCCCAC AGTCGACTGT GCCCTCCCGA 2160 
GCCCACCCCA GGGTTCCCTC TCACTCTGAT TCCCACCCTA AGCTTAGCTC AGGTATCCAT 2220 

GGAGACGAGG AGGATGAGAA GCCGCTTCCT GCCACCGTTG TCAATGACCA CGTGCCTTCC 2280 

TCCTCCAGGC AGCCCATCTC CCGGGGCTGG GAGGACTTAA GGAGAAGCCC GCAGAGAGGG 2340 

GCCAGCCTGC ATCGGAAGGA ACCCATCCCA GAGAACCCCA AATCCACAGG GGCAGATACA 2400 

CATCCTCAGG GCAAGTACTC CTCCCTGGCC TCCAAGGCTC AGGATGTTCA ACAGAGCACA 2460 

GACGCGGACA CGGAGGGTCA TTCTCCCAAA GCACAGCCAG GGTCCACAGA CCGCCACGCG 2520 

TCCCCTGCTC GTCCTCCCGC AGCACGGTCA CAGCAGCATC CCAGTGTTCC CAGAAGGATG 2580 

ACACCOGGCC GGGCCCCAGA ACAGCAGCCC CCTCCTCCCG TCGCCACGTC CCAGCACCAC 2640 

CCGGGACCCC AGAGCAGAGA CGCGGGTCGG TCACCTTCCC AGCCCAGGCT CTCACTGACC 2700 

CAGGCCGGGC GGCCCCGCCC CACGTCGCAG GGCCGCTCCC ACTCCTCCTC GGACCCTTAC 2760 

ACGGCGAGCT CCAGAGGGAT GCTCCCCACG GCCCTCCAGA ACCAGGACGA GGATGCCCAG 2820 

GGCAGCTACG ACGACGACAG CACAGAAGTC GAGGCCCAGG ATGTGCGGGC CCCCGCGCAC 2880 

GCCGCGCGCG CCAAGGAGGC AGCTGCGTCC CTTCCCAAGC ACCAGCAGGT GGAGTCTCCC 2940 

ACAGGCGCAG GGGCAGGTGG CGACCACAGG TCCCAGCGCG GACATGCGGC CTCCCCCGCC 3000 

AGGCCCAGCC GACCCGGCGG CCCCCAGTCC CGCGCCCGGG TCCCCAGCAG GGCAGCGCCG 3060 

GGGAAGTCGG AGCCTCCTTC CAAGCGGCCC CTGTCCTCCA AGTCCCAGCA GTCGGTCTCA 3120 

GCCGAGGACG AGGAGGAGGA GGACGCGGGG TTTTTTAAAG GCGGGAAAGA AGACCTTCTG 3180 

TCTTCCTCTG TGCCAAAGTG GCCCTCTTCC TCCACTCCCA GGGGCGGCAA AGACGCCGAT 3240 

GGGAGCCTCG CCAAGGAAGA GAGGGAGCCT GCCATCGCGC TTGCCCCTCG CGGAGGGAGC 3300 

CTGGCTCCTG TGAAGCGACC TCTCCCCCCA CCTCCAGGCA GCTCCCCCAG GGCCTCCCAC 3360 

GTCCCTTCCC GACCGCCGCC TCX5CAGCGCT GCCACCGTGA GCCCCGTCGC GGGCACCCAC 3420 

CCCTGGCCGC GGTACACCAC GCGCGCCCCV CCTGGCCACT TCTCCACCAC CCCGATGCTG 3480 

TCCTTGCGCC AGAGGATGAT GCATGCCAGA TTCCGTAACC CTCTCTCCCG ACAGCCTGCC 3540 

AGACCCTCTT ACAGACAAGG TTATAATGGC AGACCAAATG TAGAAGGGAA AGTCCTTCCT 3600 

GGTAGTAATG GAAAACCGAA TGGACAGAGA ATTATCAATG GCCCTCAAGG AACAAAGTGG 3660 

GTTGTGGACC TTGATCGTGG GTTAGTATTG AATGCAGAAG GAAGGTACCT CCAAGATTCA 3720 

CATGGAAATC CTCTTCGGAT TAAACTAGGA GGAGATGGTC GAACCATTGT AGATCTGGAA 3780 

GGGACCCCCG TGGTGAGTCC TGACGGCCTC CCACTCTTTG GGCAGGGGCG ACATGGCACA 3840 

CCTCTGGCCA ATGCCCAAGA TAAGCCAATT TTGAGTCTTG GAGGAAAGCC GCTGGTGGGC 3900 

TTGGAGGTCA TCAAAAAAAC CACCCATCCC CCTACCACTA CCATGCAGCC CACCACTACT 3960 

ACGACGCCCC TGCCTACCAC TACAACCCCG AGGCCCACCA CTGCCACCAC CATGCAGCCC 4020 

ACCACTACTA CGACGCCCCT GCCTACCACT ACACCGAGGC CCACCACTGC CACCACCCGC 4080 

CGCACGACCA CCAGGCGTCC AACAACCACA GTCCGAACCA CTACGCGGAC AACCACCACC 4140 

ACCACCCCCA AACCCACCAC TCCCATCCCC ACCTGTCCCC CTGGGACCTT GGAACGGCAC 4200 

GACGATGATG GCAACCTGAT AATGAGCTCC AATGGGATCC CAGAGTGCTA CGCTGAAGAA 4260 

GATGAGTTCT CAGGCTTGGA GACTGACACT GCAGTACCTA CGGAAGAGGC CTACGTTATA 4320 

TATGATGAAG ATTATGAATT TGAGACGTCA AGGCCACCAA CCACCACTGA GCCTTCGACC 4380 

ACTGCTACCA CACCGAGGGT GATCCCAGAG GAAGGCGCCA TCAGTTCCTT TCCTGAAGAA 4440 

GAATTTGATC TGGCTGGAAG GAAACGATTT GTTGCTCCTT ACGTGACGTA CCTAAATAAA 4500 

GACCCATCAG CCCCGTGCTC TCTGACTGAT GCACTGGATC ACTTCCAAGT GGACAGCCTG 4560 

GATGAAATCA TCCCCAATGA CCTGAAGAAG AGTGATCTGC CTCCCCAGCA TGCTCCCCGC 4620 

AACATCACCG TGGTGGCCGT GGAAGGTTGC CACTCATTTG TCATTGTGGA TTGGGACAAA 4680 

GCCACCCCAG GAGATTTGGT CACAGGTTAT TTGGTTTACA GTGCATCCTA TGAAGATTTC 4740 

ATCAGGAACA AGTTTTCCAC TCAAGCTTCA TCAGTAACTC ACTTGCCCAT TGAGAACCTA 4800 

AAGCCCAACA CGAGGTATTA TTTTAAAGTG CAAGCACAAA ATCCTCATGG CTACGGACCT 4860 

ATCAGCCCTT CGGTCTCATT TGTCACCGAA TCAGATAATC CTCTGCTTGT TGTGAGGCCC 4920 

CCAGGCGGTG AGCTATCTGG ATCCCATTCG CTTTCAAACA TGATCCCAGC TACACGGACT 4980 

GCCATGGACG GCAATATGTG AAGCGCACGT GGTATCGAAA GTTCGTGGGA GTTGTTCTTT 5040 

GTAATTCACT GAGGTATAAA ATCTACCTCA GTGACAACCT GAAAGATACA TTCTACAGCA 5100 

TTGGAGACAG CTGGGGAAGA GGTGAAGACC ATTGCCAATT TGTGGATTCA CACCTTGATG 5160 

GAAGAACAGG GCCTCAGTCC TATGTAGAAG CCCTCCCTAC TATTCAAGGC TACTATCGCC 5220 

AGTATCGTCA GGAGCCTGTC AGGTTTGGGA ACATCGGCTT CGGAACCCCC TACTACTATG 5280 

TGGGCTGGTA CGAGTGTGGG GTCTCCATCC CTGGAAAGTG GTAATCACAG GACCGTCATG 5340 

CTGCAAGCTT GCCCTGCCCA GCCCCACCAA CTAAGTCGCA CTAGGGGCTG TGAGCAAAGA 5400 

CAGCCAGCAT GCTCAGCCCC GCTGCCCTAG GTGCCAGGAA GGTCACAGAT GGACACTGGC 5460 

CATTCTGGTC ATCTCAGTCT GGAACTCAGT CCCACTTCTT GGCCTGGACA ATGAACAGGA 5520 

TTCAGTTTTG CTGTTAACTT TGCTTCTCTA CTTTTTTTTG TTTGTTTGTA ATAGCACATC 5580 

CCAGAGACAT CAGAAACCAG CAACTGATTC AGTGTGATTT CCCAGACTTT TTAGGCATGA 5640 

AATTCGGACA CTTCAGTATT TCCAGGAATA GCATATGCAC GCTGTTCTTG CTTCATGGAA 5700 

TGCTACATGC TTTCTGTTTT TCTCATTTTG GATTTCTCCA AAACTAACTG AATTTAAGCT 5760 

TCAGGTCCCT TTGTATGCAG TAGAAAGGAA TTATTAAAAA CACCACCAAA GAAAATAAAT 5820 

ATATCCTACT TGAAATTTAC TCTATGGACT TACCCACTGC TAGAATAAAT GTATCAAATC 5880 

TTATTTGTAA ATTCTCAATT TTGATATATA TATGTATATA TGCATATACA TATCCACACT 5940 

TGTCTGCAAG AATATTGATT AAAATTGCTA AATTTGTACT TGTTCACCAA AAAAAAAAAA 6000 
AAAAAAA 

Seq ID NO: 419 Protein sequence 
Protein Accession #: Eos sequence 

1 11 21 31 41 51 

I I I I I I 

MPGTKLTRTG APADYRVILK TSQEDELDVP DDISVRVMSS QSVLVSWVDP VLEKQKKVVA 60 

SRQYTVRYRE KGELARWDYK QIANRRVLIE NLIPDTVYEF AVRISQGERD GKWSTSVFQR 120 

TPESAPTTAP ENLNVWPVNG KPTVVAASWD ALPETEGKVK VCLLDTGLFS VSSFQPSAKS 180 

FQNTFFHTPR LSNHLEQSPS PILETLLLPW WMVCSLGNAI FSKSGPQTGE AWDLTPKPSL 240 

SLCQQECSCT QKDFSCLAYL IDIQTKQVNK DPQLEGSVFG PCFLFYFLTF MLDIGGFSFI 300 

MCYEDPVSSL TGNSLKSVAA SKADVQQNTE DNGKPEKPEP SSPSPRAPAS SQHPSVPASP 360 

QGRNAKDLLL DLKNKILANG GAPRKPQLRA KKAEELDLQS TEITGEEELG SREDSPMSPS 420 

DTQDQKRTLR PPSRHGHSVV APGRTAVRAR MPALPRREGV DKPGFSLATQ PRPGAPPSAS 480 

ASPAHHASTQ GTSHRPSLPA SLNDNDLVDS DEDERAVGSL HPKGAFAQPR PALSPSRQSP 540 

SSVLRDRSSV HPGAKPASPA RRTPHSGAAE EDSSASAPPS RLSPPHGGSS RLLPTQPHLS 600 

SPLSKGGKDG EDAPATNSNA PSRSTMSSSV SSHLSSRTQV SEGAEASDGE SHGDGDREDG 660 

GRQAEATAQT LRARPASGHF HLLRHKPFAA NGRSPSRFSI GRGPRLQPSS SPQSTVPSRA 720 

HPRVPSHSDS HPKLSSGIHG DEEDEKPLPA TVVNDHVPSS SRQPISRGWE DLRRSPQRGA 780 

SLHRKEPIPE NPKSTGADTH PQGKYSSLAS KAQDVQQSTD ADTEGHSPKA QPGSTDRHAS 840 

PARPPAARSQ QHPSVPRRMT PGRAPEQQPP PPVATSQHHP GPQSRDAGRS PSQPRLSLTQ 900 

AGRPRPTSQG RSHSSSDPYT ASSRGMLPTA LQNQDEDAQG SYDDDSTEVE AQDVRAPAHA 960 




ARAKEAAASL PKHQQVESPT GAGAGGDHRS QRGHAASPAR PSRPGGPQSR ARVPSRAAPG 1020 

KSEPPSKRPL SSKSQQSVSA EDEEEEDAGF FKGGKEDLLS SSVPKWPSSS TPRGGKDADG 1080 

SLAKEEREPA IALAPRGGSL APVKRPLPPP PGSSPRASHV PSRPPPRSAA TVSPVAGTHP 1140 

WPRYTTRAPP GHFSTTPMLS LRQRMMHARF RNPLSRQPAR PSYRQGYNGR PNVEGKVLPG 1200 

SNGKPNGQRI INGPQGTKWV VDLDRGLVLN AEGRYLQDSH GNPLRIKLGG DGRTIVDLEG 1260 

TPVVSPDGLP LFGQGRHGTP LANAQDKPIL SLGGKPLVGL EVIKKTTHPP TTTMQPTTTT 1320 

TPLPTTTTPR PTTATTMQPT TTTTPLPTTT PRPTTATTRR TTTRRPTTTV RTTTRTTTTT 1380 

TPKPTTPIPT CPPGTLERHD DDGNLIMSSN GIPECYAEED EFSGLETDTA VPTEEAYVIY 1440 

DEDYEFETSR PPTTTEPSTT ATTPRVIPEE GAISSFPEEE FDLAGRKRFV APYVTYLNKD 1500 

PSAPCSLTDA LDHFQVDSLD EIIPNDLKKS DLPPQHAPRN ITVVAVEGCH SFVIVDWDKA 1560 

TPGDLVTGYL VYSASYEDFI RNKFSTQASS VTHLPIENLK PNTRYYFKVQ AQNPHGYGPI 1620 
SPSVSFVTES DNPLLVVRPP GGELSGSHSL SNMIPATRTA MDGNM 






Seq ID NO: 420 DNA sequence 
Nucleic Acid Accession #: NM_022743 
Coding sequence: 128..1237 

1 11 21 31 41 51 

I I I I I I 

GTGGATTTTA GAGATACCTC CCCTCCTTCT GCTCAGCTGC CTTGCAGTAA TTAAACTCTT 60 

TCTCTGCTGC AACACCCCTA CTGTTCTCCG TGTATTGGCT TTTCTGGGCA GCAGGAAGGA 120 

AAAGCTGATG CGATGCTCTC AGTGCCGCGT CGCCAAATAC TGTAGTGCTA AGTGTCAGAA 180 

AAAAGCTTGG CCAGACCACA AGCGGGAATG CAAATGCCTT AAAAGCTGCA AACCCAGATA 240 

TCCTCCAGAC TCCGTTCGAC TTCTTGGCAG AGTTGTCTTC AAACTTATGG ATGGAGCACC 300 

TTCAGAATCA GAGAAGCTTT ACTCATTTTA TGATCTGGAG TCAAATATTA ACAAACTGAC 360 

TGAAGATAAG AAAGAGGGCC TCAGGCAACT CGTAATGACA TTTCAACATT TCATGAGAGA 420 

AGAAATACAG GATGCCTCTC AGCTGCCACC TGCCTTTGAC CTTTTTGAAG CCTTTGCAAA 480 

AGTGATCTGC AACTCTTTCA CCATCTGTAA TGCGGAGATG CAGGAAGTTG GTGTTGGCCT 540 

ATATCCCAGT ATCTCTTTGC TCAATCACAG CTGTGACCCC AACTGTTCGA TTGTGTTCAA 600 

TGGGCCCCAC CTCTTACTGC GAGCAGTCCG AGACATCGAG GTGGGAGAGG AGCTCACCAT 660 

CTGCTACCTG GATATGCTGA TGACCAGTGA GGAGCGCCGG AAGCAGCTGA GGGACCAGTA 720 

CTGCTTTGAA TGTGACTGTT TCCGTTGCCA AACCCAGGAC AAGGATGCTG ATATGCTAAC 780 

TGGTGATGAG CAAGTATGGA AGGAAGTTCA AGAATCCCTG AAAAAAATTG AAGAACTGAA 840 

GGCACACTGG AAGTGGGAGC AGGTTCTGGC CATGTGCCAG GCGATCATAA GCAGCAATTC 900 

TGAACGGCTT CCCGATATCA ACATCTACCA GCTGAAGGTG CTCGACTGCG CCATGGATGC 960 

CTGCATCAAC CTCGGCCTGT TGGAGGAAGC CTTGTTCTAT GGTACTCGGA CCATGGAGCC 1020 

ATACAGGATT TTTTTCCCAG GAAGCCATCC CGTCAGAGGG GTTCAAGTGA TGAAAGTTGG 1080 

CAAACTGCAG CTACATCAAG GCATGTTTCC CCAAGCAATG AAGAATCTGA GACTGGCTTT 1140 

TGATATTATG AGAGTGACAC ATGGCAGAGA ACACAGCCTG ATTGAAGATT TGATTCTACT 1200 

TTTAGAAGAA TGCGACGCCA ACATCAGAGC ATCCTAAGGG AACGCAGTCA GAGGGAAATA 1260 

CGGCGTGTGT CTTTGTTGAA TGCCTTATTG AGGTCACACA CTCTATGCTT TGTTAGCTGT 1320 

GTGAACCTCT CTTATTGGAA ATTCTGTTCC GTGTTTGTGT AGGTAAATAA AGGCAGACAT 1380 

GGTTTGCAAA CCACAAGAAT CATTAGTTGT AGAGAAGCAC GATTATAATA AATTCAAAAC 1440 
ATTTGGTTGA GGATGCCAAA AAAAAAAAAA AAAAAAA 

Seq ID NO: 421 Protein sequence 
Protein Accession #: NP_073580 

1 11 21 31 41 51 

I I I I I I 

MRCSQCRVAK YCSAKCQKKA WPDHKRECKC LKSCKPRYPP DSVRLLGRVV FKLMDGAPSE 60 

SEKLYSFYDL ESNINKLTED KKEGLRQLVM TFQHFMREEI QDASQLPPAF DLFEAFAKVI 120 

CNSFTICNAE MQEVGVGLYP SISLLNHSCD PNCSIVFNGP HLLLRAVRDI EVGEELTICY 180 

LDMLMTSEER RKQLRDQYCF ECDCFRCQTQ DKDADMLTGD EQVWKEVQES LKKIEELKAH 240 

WKWEQVLAMC QAIISSNSER LPDINIYQLK VLDCAMDACI NLGLLEEALF YGTRTMEPYR 300 

IFFPGSHPVR GVQVMKVGKL QLHQGMFPQA MKNLRLAFDI MRVTHGREHS LIEDLILLLE 360 
ECDANIRAS 

Seq ID NO: 422 DNA sequence 

Nucleic Acid Accession #: NM_003014.2 

Coding sequence: 238.-648 

1 11 21 31 41 51 

I I I I I I 

GGCGGGTTCG CGCCCCGAAG GCTGAGAGCT GGCGCTGCTC GTGCCCTGTG TGCCAGACGG 60 

CGGAGCTCCG CGGCCGGACC CCGCGGCCCC GCTTTGCTGC CGACTGGAGT TTGGGGGAAG 120 

AAACTCTCCT GCGCCCCAGA AGATTTCTTC CTCGGCGAAG GGACAGCGAA AGATGAGGGT 180 

GGCAGGAAGA GAAGGCGCTT TCTGTCTGCC GGGGTCGCAG CGCGAGAGGG CAGTGCCATG 240 

TTCCTCTCCA TCCTAGTGGC GCTGTGCCTG TGGCTGCACC TGGCGCTGGG CGTGCGCGGC 300 

GCGCCCTGCG AGGCGGTGCG CATCCCTATG TGCCGGCACA TGCCCTGGAA CATCACGCGG 360 

ATGCCCAACC ACCTGCACCA CAGCACGCAG GAGAACGCCA TCCTGGCCAT CGAGCAGTAC 420 

GAGGAGCTGG TGGACGTGAA CTGCAGCGCC GTGCTGCGCT TCTTCTTCTG TGCCATGTAC 480 

GCGCCCATTT GCACCCTGGA GTTCCTGCAC GACCCTATCA AGCCGTGCAA GTCGGTGTGC 540 

CAACGCGCGC GCGACGACTG CGAGCCCCTC ATGAAGATGT ACAACCACAG CTGGCCCGAA 600 

AGCCTGGCCT GCGACGAGCT GCCTGTCTAT GACCGTGGCG TGTGCATTTC GCCTGAAGCC 660 

ATCGTCACGG ACCTCCCGGA GGATGTTAAG TGGATAGACA TCACACCAGA CATGATGGTA 720 

CAGGAAAGGC CTCTTGATGT TGACTGTAAA CGCCTAAGCC CCGATCGGTG CAAGTGTAAA 780 

AAGGTGAAGC CAACTTTGGC AACGTATCTC AGCAAAAACT ACAGCTATGT TATTCATGCC 840 

AAAATAAAAG CTGTGCAGAG GAGTGGCTGC AATGAGGTCA CAACGGTGGT GGATGTAAAA 900 

GAGATCTTCA AGTCCTCATC ACCCATCCCT CGAACTCAAG TCCCGCTCAT TACAAATTCT 960 

TCTTGCCAGT GTCCACACAT CCTGCCCCAT CAAGATGTTC TCATCATGTG TTACGAGTGG 1020 

CGTTCAAGGA TGATGCTTCT TGAAAATTGC TTAGTTGAAA AATGGAGAGA TCAGCTTAGT 1080 

AAAAGATCCA TACAGTGGGA AGAGAGGCTG CAGGAACAGC GGAGAACAGT TCAGGACAAG 1140 

AAGAAAACAG CCGGGCGCAC CAGTCGTAGT AATCCCCCCA AACCAAAGGG AAAGCCTCCT 1200 

GCTCCCAAAC CAGCCAGTCC CAAGAAGAAC ATTAAAACTA GGAGTGCCCA GAAGAGAACA 1260 

AACCCGAAAA GAGTGTGAGC TAACTAGTTT CCAAAGCGGA GACTTCCGAC TTCCTTACAG 1320 

GATGAGGCTG GGCATTGCCT GGGACAGCCT ATGTAAGGCC ATGTGCCCCT TGCCCTAACA 1380 

ACTCACTGCA GTGCTCTTCA TAGACACATC TTGCAGCATT TTTCTTAAGG CTATGCTTCA 1440 

GTTTTTCTTT GTAAGCCATC ACAAGCCATA GTGGTAGGTT TGCCCTTTGG TACAGAAGGT 1500 

GAGTTAAAGC TGGTGGAAAA GGCTTATTGC ATTGCATTCA GAGTAACCTG TGTGCATACT 1560 

CTAGAAGAGT AGGGAAAATA ATGCTTGTTA CAATTCGACC TAATATGTGC ATTGTAAAAT 1620 

AAATGCCATA TTTCAAACAA AACACGTAAT TTTTTTACAG TATGTTTTAT TACCTTTTGA 1680 

TATCTGTTGT TGCAATGTTA GTGATGTTTT AAAATGTGAT GAAAATATAA TGTTTTTAAG 1740 

AAGGAACAGT AGTGGAATGA ATGTTAAAAG ATCTTTATGT GTTTATGGTC TGCAGAAGGA 1800 

TTTTTGTGAT GAAAGGGGAT TTTTTGAAAA ATTAGAGAAG TAGCATATGG AAAATTATAA 1860 

TGTGTTTTTT TACCAATGAC TTCAGTTTCT GTTTTTAGCT AGAAACTTAA AAACAAAAAT 1920 

AATAATAAAG AAAAATAAAT AAAAAGGAGA GGCAGACAAT GTCTGGATTC CTGTTTTTTG 1980 

GTTACCTGAT TTCCATGATC ATGATGCTTC TTGTCAACAC CCTCTTAAGC AGCACCAGAA 2040 

ACAGTGAGTT TGTCTGTACC ATTAGGAGTT AGGTACTAAT TAGTTGGCTA ATGCTCAAGT 2100 

ATTTTATACC CACAAGAGAG GTATGTCACT CATCTTACTT CCCAGGACAT CCACCCTGAG 2160 

AATAATTTGA CAAGCTTAAA AATGGCCTTC ATGTGAGTGC CAAATTTTGT TTTTCTTCAT 2220 

TTAAATATTT TCTTTGCCTA AATACATGTG AGAGGAGTTA AATATAAATG TACAGAGAGG 2280 

AAAGTTGAGT TCCACCTCTG AAATGAGAAT TACTTGACAG TTGGGATACT TTAATCAGAA 2340 

AAAAAGAACT TATTTGCAGC ATTTTATCAA CAAATTTCAT AATTGTGGAC AATTGGAGGC 2400 

ATTTATTTTA AAAAACAATT TTATTGGCCT TTTGCTAACA CAGTAAGCAT GTATTTTATA 2460 

AGGCATTCAA TAAATGCACA ACGCCCAAAG GAAATAAAAT CCTATCTAAT CCTACTCTCC 2520 

ACTACACAGA GGTAATCACT ATTAGTATTT TGGCATATTA TTCTCCAGGT GTTTGCTTAT 2580 

GCACTTATAA AATGATTTGA ACAAATAAAA CTAGGAACCT GTATACATGT GTTTCATAAC 2640 

CTGCCTCCTT TGCTTGGCCC TTTATTGAGA TAAGTTTTCC TGTCAAGAAA GCAGAAACCA 2700 

TCTCATTTCT AACAGCTGTG TTATATTCCA TAGTATGCAT TACTCAACAA ACTGTTGTGC 2760 
TATTGGATAC TTAGGTGGTT TCTTCACTGA CAATACTGAA TAAACATCTC ACCGGAATTC 

Seq ID NO: 423 Protein sequence 
Protein Accession #: NP_003005.1 

1 11 21 31 41 51 

I I I I I I 

MFLSILVALC LWLHLALGVR GAPCEAVRIP MCRHMPWNIT RMPNHLHHST QENAILAIEQ 60 

YEELVDVNCS AVLRFFFCAM YAPICTLEFL HDPIKPCKSV CQRARDDCEP LMKMYNHSWP 120 

ESLACDELPV YDRGVCISPE AIVTDLPEDV KWIDITPDMM VQERPLDVDC KRLSPDRCKC 180 

KKVKPTLATY LSKNYSYVIH AKIKAVQRSG CNEVTTVVDV KEIFKSSSPI PRTQVPLITN 240 

SSCQCPHILP HQDVLIMCYE WRSRMMLLEN CLVEKWRDQL SKRSIQWEER LQEQRRTVQD 300 
KKKTAGRTSR SNPPKPKGKP PAPKPASPKK NIKTRSAQKR TNPKRV 







Seq ID NO: 424 DNA sequence 
Nucleic Acid Accession #: BC010423 
Coding sequence: 248..1780 

1 11 21 31 41 51 

I I I I I I 

CACAGCGTGG GAAGCAGCTC TGGGGGAGCT CGGAGCTCCC GATCACGGCT TCTTGGGGGT 60 

AGCTACGGCT GGGTGTGTAG AACGGGGCCG GGGCTGGGGC TGGGTCCCCT AGTGGAGACC 120 

CAAGTGCGAG AGGCAAGAAC TCTGCAGCTT CCTGCCTTCT GGGTCAGTTC CTTATTCAAG 180 

TCTGCAGCCG GCTCCCAGGG AGATCTCGGT GGAACTTCAG AAACGCTGGG CAGTCTGCCT 240 

TTCAACCATG CCCCTGTCCC TGGGAGCCGA GATGTGGGGG CCTGAGGCCT GGCTGCTGCT 300 

GCTGCTACTG CTGGCATCAT TTACAGGCCG GTGCCCCGCG GGTGAGCTGG AGACCTCAGA 360 

CGTGGTAACT GTGGTGCTGG GCCAGGACGC AAAACTGCCC TGCTTCTACC GAGGGGACTC 420 

CGGCGAGCAA GTGGGGCAAG TGGCATGGGC TCGGGTGGAC GCGGGCGAAG GCGCCCAGGA 480 

ACTAGCGCTA CTGCACTCCA AATACGGGCT TCATGTGAGC CCGGCTTACG AGGGCCGCGT 540 

GGAGCAGCCG CCGCCCCCAC GCAACCCCCT GGACGGCTCA GTGCTCCTGC GCAACGCAGT 600 

GCAGGCGGAT GAGGGCGAGT ACGAGTGCCG GGTCAGCACC TTCCCCGCCG GCAGCTTCCA 660 

GGCGCGGCTG CGGCTCCGAG TGCTGGTGCC TCCCCTGCCC TCACTGAATC CTGGTCCAGC 720 

ACTAGAAGAG GGCCAGGGCC TGACCCTGGC AGCCTCCTGC ACAGCTGAGG GCAGCCCAGC 780 

CCCCAGCGTG ACCTGGGACA CGGAGGTCAA AGGCACAACG TCCAGCCGTT CCTTCAAGCA 840 

CTCCCGCTCT GCTGCCGTCA CCTCAGAGTT CCACTTGGTG CCTAGCCGCA GCATGAATGG 900 

GCAGCCACTG ACTTGTGTGG TGTCCCATCC TGGCCTGCTC CAGGACCAAA GGATCACCCA 960 

CATCCTCCAC GTGTCCTTCC TTGCTGAGGC CTCTGTGAGG GGCCTTGAAG ACCAAAATCT 1020 

GTGGCACATT GGCAGAGAAG GAGCTATGCT CAAGTGCCTG AGTGAAGGGC AGCCCCCTCC 1080 

CTCATACAAC TGGACACGGC TGGATGGGCC TCTGCCCAGT GGGGTACGAG TGGATGGGGA 1140 

CACTTTGGGC TTTCCCCCAC TGACCACTGA GCACAGCGGC ATCTACGTCT GCCATGTCAG 1200 

CAATGAGTTC TCCTCAAGGG ATTCTCAGGT CACTGTGGAT GTTCTTGACC CCCAGGAAGA 1260 

CTCTGGGAAG CAGGTGGACC TAGTGTCAGC CTCGGTGGTG GTGGTGGGTG TGATCGCCGC 1320 

ACTCTTGTTC TGCCTTCTGG TGGTGGTGGT GGTGCTCATG TCCCGATACC ATCGGCGCAA 1380 

GGCCCAGCAG ATGACCCAGA AATATGAGGA GGAGCTGACC CTGACCAGGG AGAACTCCAT 1440 

CCGGAGGCTG CATTCCCATC ACACGGACCC CAGGAGCCAG CCGGAGGAGA GTGTAGGGCT 1500 

GAGAGCCGAG GGCCACCCTG ATAGTCTCAA GGACAACAGT AGCTGCTCTG TGATGAGTGA 1560 

AGAGCCCGAG GGCCGCAGTT ACTCCACGCT GACCACGGTG AGGGAGATAG AAACACAGAC 1620 

TGAACTGCTG TCTCCAGGCT CTGGGCGGGC CGAGGAGGAG GAAGATCAGG ATGAAGGCAT 1680 

CAAACAGGCC ATGAACCATT TTGTTCAGGA GAATGGGACC CTACGGGCCA AGCCCACGGG 1740 

CAATGGCATC TACATCAATG GGCGGGGACA CCTGGTCTGA CCCAGGCCTG CCTCCCTTCC 1800 

CTAGGCCTGG CTCCTTCTGT TGACATGGGA GATTTTAGCT CATCTTGGGG GCCTCCTTAA 1860 

ACACCCCCAT TTCTTGCGGA AGATGCTCCC CATCCCACTG ACTGCTTGAC CTTTACCTCC 1920 

AACCCTTCTG TTCATCGGGA GGGCTCCACC AATTGAGTCT CTCCCACCAT GCATGCAGGT 1980 

CACTGTGTGT GTGCATGTGT GCCTGTGTGA GTGTTGACTG ACTGTGTGTG TGTGGAGGGG 2040 

TGACTGTCCG TGGAGGGGTG ACTGTGTCCG TGGTGTGTAT TATGCTGTCA TATCAGAGTC 2100 

AAGTGAACTG TGGTGTATGT GCCACGGGAT TTGAGTGGTT GCGTGGGCAA CACTGTCAGG 2160 

GTTTGGCGTG TGTGTCATGT GGCTGTGTGT GACCTCTGCC TGAAAAAGCA GGTATTTTCT 2220 

CAGACCCCAG AGCAGTATTA ATGATGCAGA GGTTGGAGGA GAGAGGTGGA GACTGTGGCT 2280 

CAGACCCAGG TGTGCGGGCA TAGCTGGAGC TGGAATCTGC CTCCGGTGTG AGGGAACCTG 2340 

TCTCCTACCA CTTCGGAGCC ATGGGGGCAA GTGTGAAGCA GCCAGTCCCT GGGTCAGCCA 2400 

GAGGCTTGAA CTGTTACAGA AGCCCTCTGC CCTCTGGTGG CCTCTGGGCC TGCTGCATGT 2460 

ACATATTTTC TGTAAATATA CATGCGCCGG GAGCTTCTTG CAGGAATACT GCTCCGAATC 2520 

ACTTTTAATT TTTTTCTTTT TTTTTTCTTG CCCTTTCCAT TAGTTGTATT TTTTATTTAT 2580 

TTTTATTTTT ATTTTTTTTT AGAGTTTGAG TCCAGCCTGG ACGATATAGC CAGACCCTGT 2640 

CTGTAAAAAA ACCAAAACCC AAAAAAAAAA AAAAAAAAAA 

Seq ID NO: 425 Protein sequence 
Protein Accession #: AAH10423 

1 11 21 31 41 51 

I I I I I I 

MPLSLGAEMW GPEAWLLLLL LLASFTGRCP AGELETSDVV TVVLGQDAKL PCFYRGDSGE 60 

QVGQVAWARV DAGEGAQELA LLHSKYGLHV SPAYEGRVEQ PPPPRNPLDG SVLLRNAVQA 120 

DEGEYECRVS TFPAGSFQAR LRLRVLVPPL PSLNPGPALE EGQGLTLAAS CTAEGSPAPS 180 

VTWDTEVKGT TSSRSFKHSR SAAVTSEFHL VPSRSMNGQP LTCVVSHPGL LQDQRITHIL 240 

HVSFLAEASV RGLEDQNLWH IGREGAMLKC LSEGQPPPSY NWTRLDGPLP SGVRVDGDTL 300 

GFPPLTTEHS GIYVCHVSNE FSSRDSQVTV DVLDPQEDSG KQVDLVSASV VVVGVIAALL 360 

FCLLVVVVVL MSRYHRRKAQ QMTQKYEEEL TLTRENSIRR LHSHHTDPRS QPEESVGLRA 420 

EGHPDSLKDN SSCSVMSEEP EGRSYSTLTT VREIETQTEL LSPGSGRAEE EEDQDEGIKQ 480 
AMNHFVQENG TLRAKPTGNG IYINGRGHLV 

Seq ID NO: 426 DNA sequence 

Nucleic Acid Accession #: NM_003474.2 

Coding sequence: 307..3036 

1 11 21 31 41 51 

I I I I I I 

CACTAACGCT CTTCCTAGTC CCCGGGCCAA CTCGGACAGT TTGCTCATTT ATTGCAACGG 60 

TCAAGGCTGG CTTGTGCCAG AACGGCGCGC GCGCGACGCA CGCACACACA CGGGGGGAAA 120 

CTTTTTTAAA AATGAAAGGC TAGAAGAGCT CAGCGGCGGC GCGGGCCGTG CGCGAGGGCT 180 

CCGGAGCTGA CTCGCCGAGG CAGGAAATCC CTCCGGTCGC GACGCCCGGC CCCGCTCGGC 240 

GCCCGCGTGG GATGGTGCAG CGCTCGCCGC CGGGCCCGAG AGCTGCTGCA CTGAAGGCCG 300 

GCGACGATGG CAGCGCGCCC GCTGCCCGTG TCCCCCGCCC GCGCCCTCCT GCTCGCCCTG 360 

GCCGGTGCTC TGCTCGCGCC CTGCGAGGCC CGAGGGGTGA GCTTATGGAA CGAAGGAAGA 420 

GCTGATGAAG TTGTCAGTGC CTCTGTTCGG AGTGGGGACC TCTGGATCCC AGTGAAGAGC 480 

TTCGACTCCA AGAATCATCC AGAAGTGCTG AATATTCGAC TACAACGGGA AAGCAAAGAA 540 

CTGATCATAA ATCTGGAAAG AAATGAAGGT CTCATTGCCA GCAGTTTCAC GGAAACCCAC 600 
TATCTGCAAG ACGGTACTGA TGTCTCCCTC GCTCGAAATT ACACGGTAAT TCTGGGTCAC 660 

TGTTACTACC ATGGACATGT ACGGGGATAT TCTGATTCAG CAGTCAGTCT CAGCACGTGT 720 

TGTGGTCTCA GGGGACTTAT TGTGTTTGAA AATGAAAGCT ATGTCTTAGA ACCAATGAAA 780 

AGTGCAACCA ACAGATACAA ACTCTTCCCA GCGAAGAAGC TGAAAAGCGT CCGGGGATCA 840 

TGTGGATCAC ATCACAACAC ACCAAACCTC GCTGCAAAGA ATGTGTTTCC ACCACCCTCT 900 

CAGACATGGG CAAGAAGGCA TAAAAGAGAG ACCCTCAAGG CAACTAAGTA TGTGGAGCTG 960 

GTGATCGTGG CAGACAACCG AGAGTTTCAG AGGCAAGGAA AAGATCTGGA AAAAGTTAAG 1020 

CAGCGATTAA TAGAGATTGC TAATCACGTT GACAAGTTTT ACAGACCACT GAACATTCGG 1080 

ATCGTGTTGG TAGGCGTGGA AGTGTGGAAT GACATGGACA AATGCTCTGT AAGTCAGGAC 1140 

CCATTCACCA GCCTCCATGA ATTTCTGGAC TGGAGGAAGA TGAAGCTTCT ACCTCGCAAA 1200 

TCCCATGACA ATGCGCAGCT TGTCAGTGGG GTTTATTTCC AAGGGACCAC CATCGGCATG 1260 

GCCCCAATCA TGAGCATGTG CACGGCAGAC CAGTCTGGGG GAATTGTCAT GGACCATTCA 1320 

GACAATCCCC TTGGTGCAGC CGTGACCCTG GCACATGAGC TGGGCCACAA TTTCGGGATG 1380 

AATCATGACA CACTGGACAG GGGCTGTAGC TGTCAAATGG CGGTTGAGAA AGGAGGCTGC 1440 

ATCATGAACG CTTCCACCGG GTACCCATTT CCCATGGTGT TCAGCAGTTG CAGCAGGAAG 1500 

GACTTGGAGA CCAGCCTGGA GAAAGGAATG GGGGTGTGCC TGTTTAACCT GCCGGAAGTC 1560 

AGGGAGTCTT TCGGGGGCCA GAAGTGTGGG AACAGATTTG TGGAAGAAGG AGAGGAGTGT 1620 

GACTGTGGGG AGCCAGAGGA ATGTATGAAT CGCTGCTGCA ATGCCACCAC CTGTACCCTG 1680 

AAGCCGGACG CTGTGTGCGC ACATGGGCTG TGCTGTGAAG ACTGCCAGCT GAAGCCTGCA 1740 

GGAACAGCGT GCAGGGACTC CAGCAACTCC TGTGACCTCC CAGAGTTCTG CACAGGGGCC 1800 

AGCCCTCACT GCCCAGCCAA CGTGTACCTG CACGATGGGC ACTCATGTCA GGATGTGGAC 1860 

GGCTACTGCT ACAATGGCAT CTGCCAGACT CACGAGCAGC AGTGTGTCAC ACTCTGGGGA 1920 

CCAGGTGCTA AACCTGCCCC TGGGATCTGC TTTGAGAGAG TCAATTCTGC AGGTGATCCT 1980 

TATGGCAACT GTGGCAAAGT CTCGAAGAGT TCCTTTGCCA AATGCGAGAT GAGAGATGCT 2040 

AAATGTGGAA AAATCCAGTG TCAAGGAGGT GCCAGCCGGC CAGtCATTGG TACCAATGCC 2100 

GTTTCCATAG AAACAAACAT CCCCCTGCAG CAAGGAGGCC GGATTCTGTG CCGGGGGACC 2160 

CACGTGTACT TGGGCGATGA CATGCCGGAC CCAGGGCTTG TGCTTGCAGG CACAAAGTGT 2220 

GCAGATGGAA AAATCTGCCT GAATCGTCAA TGTCAAAATA TTAGTGTCTT TGGGGTTCAC 2280 

GAGTGTGCAA TGCAGTGCCA CGGCAGAGGG GTGTGCAACA ACAGGAAGAA CTGCCACTGC 2340 

GAGGCCCACT GGGCACCTCC CTTCTGTGAC AAGTTTGGCT TTGGAGGAAG CACAGACAGC 2400 

GGCCCCATCC GGCAAGCAGA TAACCAAGGT TTAACCATAG GAATTCTGGT GACCATCCTG 2460 

TGTCTTCTTG CTGCCGGATT TGTGGTTTAT CTCAAAAGGA AGACCTTGAT ACGACTGCTG 2520 

TTTACAAATA AGAAGACCAC CATTGAAAAA CTAAGGTGTG TGCGCCCTTC CCGGCCACCC 2580 

CGTGGCTTCC AACCCTGTCA GGCTCACCTC GGCCACCTTG GAAAAGGCCT GATGAGGAAG 2640 

CCGCCAGATT CCTACCCACC GAAGGACAAT CCCAGGAGAT TGCTGCAGTG TCAGAATGTT 2700 

GACATCAGCA GACCCCTCAA CGGCCTGAAT GTCCCTCAGC CCCAGTCAAC TCAGCGAGTG 2760 

CTTCCTCCCC TCCACCGGGC CCCACGTGCA CCTAGCGTCC CTGCCAGACC CCTGCCAGCC 2820 

AAGCCTGCAC TTAGGCAGGC CCAGGGGACC TGTAAGCCAA ACCCCCCTCA GAAGCCTCTG 2880 

CCTGCAGATC CTCTGGCCAG AACAACTCGG CTCACTCATG CCTTGGCCAG GACCCCAGGA 2940 

CAATGGGAGA CTGGGCTCCG CCTGGCACCC CTCAGACCTG CTCCACAATA TCCACACCAA 3000 

GTGCCCAGAT CCACCCACAC CGCCTATATT AAGTGAGAAG CCGACACCTT TTTTCAACAG 3060 

TGAAGACAGA AGTTTGCACT ATCTTTCAGC TCCAGTTGGA GTTTTTTGTA CCAACTTTTA 3120 

GGATTTTTTT TAATGTTTAA AACATCATTA CTATAAGAAC TTTGAGCTAC TGCCGTCAGT 3180 

GCTGTGCTGT GCTATGGTGC TCTGTCTACT TGCACAGGTA CTTGTAAATT ATTAATTTAT 3240 

GCAGAATGTT GATTACAGTG CAGTGCGCTG TAGTAGGCAT TTTTACCATC ACTGAGTTTT 3300 

CCATGGCAGG AAGGCTTGTT GTGCTTTTAG TATTTTAGTG AACTTGAAAT ATCCTGCTTG 3360 

ATGGGATTCT GGACAGGATG TGTTTGCTTT CTGATCAAGG CCTTATTGGA AAGCAGTCCC 3420 

CCAACTACCC CCAGCTGTGC TTATGGTACC AGATGCAGCT CAAGAGATCC CAAGTAGAAT 3480 

CTCAGTTGAT TTTCTGGATT CCCCATCTCA GGCCAGAGCC AAGGGGCTTC AGGTCCAGGC 3540 

TGTGTTTGGC TTTCAGGGAG GCCCTGTGCC CCTTGACAAC TGGCAGGCAG GCTCCCAGGG 3600 

ACACCTGGGA GAAATCTGGC TTCTGGCCAG GAAGCTTTGG TGAGAACCTG GGTTGCAGAC 3660 

AGGAATCTTA AGGTGTAGCC ACACCAGGAT AGAGACTGGA ACACTAGACA AGCCAGAACT 3720 

TGACCCTGAG CTGACCAGCC GTGAGCATGT TTGGAAGGGG TCTGTAGTGT CACTCAAGGC 3780 

GGTGCTTGAT AGAAATGCCA AGCACTTCTT TTTCTCGCTG TCCTTTCTAG AGCACTGCCA 3840
CCAGTAGGTT ATTTAGCTTG GGAAAGGTGG TGTTTCTGTA AGAAACCTAC TGCCCAGGCA 3900

CTGCAAACCG CCACCTCCCT ATACTGCTTG GAGCTGAGCA AATCACCACA AACTGTAATA 3960 

CAATGATCCT GTATTCAGAC AGATGAGGAC TTTCCATGGG ACCACAACTA TTTTCAGATG 4020 

TGAACCATTA ACCAGATCTA GTCAATCAAG TCTGTTTACT GCAAGGTTCA ACTTATTAAC 4080 

AATTAGGCAG ACTCTTTATG CTTGCAAAAA CTACAACCAA TGGAATGTGA TGTTCATGGG 4140 

TATAGTTCAT GTCTGCTATC ATTATTCGTA GATATTGGAC AAAGAACCTT CTCTATGGGG 4200 

CATCCTCTTT TTCCAACTTG GCTGCAGGAA TCTTTAAAAG ATGCTTTTAA CAGAGTCTGA 4260 

ACCTATTTCT TAAACACTTG CAACCTACCT GTTGAGCATC ACAGAATGTG ATAAGGAAAT 4320 

CAACTTGCTT ATCAACTTCC TAAATATTAT GAGATGTGGC TTGGGCAGCA TCCCCTTGAA 4380 

CTCTTCACTC TTCAAATGCC TGACTAGGGA GCCATGTTTC ACAAGGTCTT TAAAGTGACT 4440 

AATGGCATGA GAAATACAAA AATACTCAGA TAAGGTAAAA TGCCATGATG CCTCTGTCTT 4500 

CTGGACTGGT TTTCACATTA GAAGACAATT GACAACAGTT ACATAATTCA CTCTGAGTGT 4560

TTTATGAGAA AGCCTTCTTT TGGGGTCAAC AGTTTTCCTA TGCTTTGAAA CAGAAAAATA 4620 

TGTACCAAGA ATCTTGGTTT GCCTTCCAGA AAACAAAACT GCATTTCACT TTCCCGGTGT 4680 

TCCCCACTGT ATCTAGGCAA CATAGTATTC ATGACTATGG ATAAACTAAA CACGTGACAC 4740 

AAACACACAC AAAAGGGAAC CCAGCTCTAA TACATTCCAA CTCGTATAGC ATGCATCTGT 4800 

TTATTCTATA GTTATTAAGT TCTTTAAAAT GTAAAGCCAT GCTGGAAAAT AATACTGCTG 4860 

AGATACATAC AGAATTACTG TAACTGATTA CACTTGGTAA TTGTACTAAA GCCAAACATA 4920 

TATATACTAT TAAAAAGGTT TACAGAATTT TATGGTGCAT TACGTGGGCA TTGTCTTTTT 4980 

AGATGCCCAA ATCCTTAGAT CTGGCATGTT AGCCCTTCCT CCAATTATAA GAGGATATGA 5040 
ACCAAAAAAA AAAAAAAAAA AA 



Seq ID NO: 427 Protein sequence 
Protein Accession #: NP_003465



1 11 21 31 41 51 

I I I I I I

MAARPLPVSP ARALLLALAG ALLAPCEARG VSLWNEGRAD EVVSASVRSG DLWIPVKSFD 60

SKNHPEVLNI RLQRESKELI INLERNEGLI ASSFTETHYL QDGTDVSLAR NYTVILGHCY 120

YHGHVRGYSD SAVSLSTCSG LRGLIVFENE SYVLEPMKSA TNRYKLFPAK KLKSVRGSCG 180 

SHHNTPNLAA KNVFPPPSQT WARRHKRETL KATKYVELVI VADNREFQRQ GKDLEKVKQR 240 

LIEIANHVDK FYRPLNIRIV LVGVEVWNDM DKCSVSQDPF TSLHEFLDWR KMKLLPRKSH 300 

DNAQLVSGVY FQGTTIGMAP IMSMCTADQS GGIVMDHSDN PLGAAVTLAH ELGHNFGMNH 360

DTLDRGCSCQ MAVEKGGCIM NASTGYPFPM VFSSCSRKDL ETSLEKGMGV CLFNLPEVRE 420

SFGGQKCGNR FVEEGEECDC GEPEECMNRC CNATTCTLKP DAVCAHGLCC EDCQLKPAGT 480 

ACRDSSNSCD LPEFCTGASP HCPANVYLHD GHSCQDVDGY CYNGICQTHE QQCVTLWGPG 540 

AKPAPGICFE RVNSAGDPYG NCGKVSKSSF AKCEMRDAKC GKIQCQGGAS RPVIGTNAVS 600 

IETNIPLQQG GRILCRGTHV YLGDDMPDPG LVLAGTKCAD GKICLNRQCQ NISVFGVHEC 660 

AMQCHGRGVC NNRKNCHCEA HWAPPFCDKF GFGGSTDSGP IRQADNQGLT IGILVTILCL 720 

LAAGFVVYLK RKTLIRLLFT NKKTTIEKLR CVRPSRPPRG FQPCQAHLGH LGKGLMRKPP 780

DSYPPKDNPR RLLQCQNVDI SRPLNGLNVP QPQSTQRVLP PLHRAPRAPS VPARPLPAKP 840

ALRQAQGTCK PNPPQKPLPA DPLARTTRLT HALARTPGQW ETGLRLAPLR PAPQYPHQVP 900 
RSTHTAYIK 



Seq ID NO: 428 DNA sequence 

Nucleic Acid Accession #: NM_003714 

Coding sequence: 135..1043



1 11 21 31 41 51 

I I I I I I 

GAGGAGGAGG GAAAAGGCGA GCAAAAAGGA AGAGTGGGAG GAGGAGGGGA AGCGGCGAAG 60 

GAGGAAGAGG AGGAGGAGGA AGAGGGGAGC ACAAAGGATC CAGGTCTCCC GACGGGAGGT 120 

TAATACCAAG AACCATGTGT GCCGAGCGGC TGGGCCAGTT CATGACCCTG GCTTTGGTGT 180 

TGGCCACCTT TGACCCGGCG CGGGGGACCG ACGCCACCAA CCCACCCGAG GGTCCCCAAG 240 

ACAGGAGCTC CCAGCAGAAA GGCCGCCTGT CCCTGCAGAA TACAGCGGAG ATCCAGCACT 300 

GTTTGGTCAA CGCTGGCGAT GTGGGGTGTG GCGTGTTTGA ATGTTTCGAG AACAACTCTT 360 

GTGAGATTCG GGGCTTACAT GGGATTTGCA TGACTTTTCT GCACAACGCT GGAAAATTTG 420 

ATGCCCAGGG CAAGTCATTC ATCAAAGACG CCTTGAAATG TAAGGCCCAC GCTCTGCGGC 480 

ACAGGTTCGG CTGCATAAGC CGGAAGTGCC CGGCCATCAG GGAAATGGTG TCCCAGTTGC 540 

AGCGGGAATG CTACCTCAAG CACGACCTGT GCGCGGCTGC CCAGGAGAAC ACCCGGGTGA 600 

TAGTGGAGAT GATCCATTTC AAGGACTTGC TGCTGCACGA ACCCTACGTG GACCTCGTGA 660 

ACTTGCTGCT GACCTGTGGG GAGGAGGTGA AGGAGGCCAT CACCCACAGC GTGCAGGTTC 720 

AGTGTGAGCA GAACTGGGGA AGCCTGTGCT CCATCTTGAG CTTCTGCACC TCGGCCATCC 780 

AGAAGCCTCC CACGGCGCCC CCCGAGCGCC AGCCCCAGGT GGACAGAACC AAGCTCTCCA 840 

GGGCCCACCA CGGGGAAGCA GGACATCACC TCCCAGAGCC CAGCAGTAGG GAGACTGGCC 900 

GAGGTGCCAA GGGTGAGCGA GGTAGCAAGA GCCACCCAAA CGCCCATGCC CGAGGCAGAG 960

TCGGGGGCCT TGGGGCTCAG GGACCTTCCG GAAGCAGCGA GTGGGAAGAC GAACAGTCTG 1020 

AGTATTCTGA TATCCGGAGG TGAAATGAAA GGCCTGGCCA CGAAATCTTT CCTCCACGCC 1080 

GTCCATTTTC TTATCTATGG ACATTCCAAA ACATTTACCA TTAGAGAGGG GGGATGTCAC 1140 

ACGCAGGATT CTGTGGGGAC TGTGGACTTC ATCGAGGTGT GTGTTCGCGG AACGGACAGG 1200 

TGAGATGGAG ACCCCTGGGG CCGTGGGGTC TCAGGGGTGC CTGGTGAATT CTGCACTTAC 1260 

ACGTACTCAA GGGAGCGCGC CCGCGTTATC CTCGTACCTT TGTCTTCTTT CCATCTGTGG 1320 

AGTCAGTGGG TGTCGGCCGC TCTGTTGTGG GGGAGGTGAA CCAGGGAGGG GCAGGGCAAG 1380 

GCAGGGCCCC CAGAGCTGGG CCACACAGTG GGTGCTGGGC CTCGCCCCGA AGCTTCTGGT 1440 

GCAGCAGCCT CTGGTGCTGT CTCCGCGGAA GTCAGGGCGG CTGGATTCCA GGACAGGAGT 1500 

GAATGTAAAA ATAAATATCG CTTAGAATGC AGGAGAAGGG TGGAGAGGAG GCAGGGGCCG 1560 

AGGGGGTGCT TGGTGCCAAA CTGAAATTCA GTTTCTTGTG TGGGGCCTTG CGGTTCAGAG 1620 

CTCTTGGCGA GGGTGGAGGG AGGAGTGTCA TTTCTATGTG TAATTTCTGA GCCATTGTAC 1680 

TGTCTGGGCT GGGGGGGACA CTGTCCAAGG GAGTGGCCCC TATGAGTTTA TATTTTAACC 1740 

ACTGCTTCAA ATCTCGATTT CACTTTTTTT ATTTATCCAG TTATATCTAC ATATCTGTCA 1800 

TCTAAATAAA TGGCTTTCAA ACAAAGCAAC TGGGTCATTA AAACCAGCTC AAAGGGGGTT 1860 

TAAAAAAAAA AAAACCAGCC CATCCTTTGA GGCTGATTTT TCTTTTTTTT AAGTTCTATT 1920 

TTAAAAGCTA TCAAACAGCG ACATAGCCAT ACATCTGACT GCCTGACATG GACTCCTGCC 1980 

CACTTGGGGG AAACCTTATA CCCAGAGGAA AATACACACC TGGGGAGTAC ATTTGACAAA 2040 

TTTCCCTTAG GATTTCGTTA TCTCACCTTG ACCCTCAGCC AAGATTGGTA AAGCTGCGTC 2100 

CTGGCGATTC CAGGAGACCC AGCTGGAAAC CTGGCTTCTC CATGTGAGGG GATGGGAAAG 2160 

GAAAGAAGAG AATGAAGACT ACTTAGTAAT TCCCATCAGG AAATGCTGAC CTTTTACATA 2220
AAATCAAGGA GACTGCTGAA AATCTCTAAG GGACAGGATT TTCCAGATCC TAATTGGAAA 2280

TTTAGCAATA AGGAGAGGAG TCCAAGGGGA CAAATAAAGG CAGAGAGAGA GAGAGAGAGA 2340 
GGGAGAGGAA GAAAAGAGAG AGAGAAAAGA gcctcgtgcc 

Seq ID NO: 429 Protein sequence 
Protein Accession #: NP_003705 

1 11 21 31 41 51

I I I I I I 

MCAERLGQFM TLALVLATFD PARGTDATNP PEGPQDRSSQ QKGRLSLQNT AEIQHCLVNA 60 

GDVGCGVFEC FENNSCEIRG LHGICMTFLH NAGKFDAQGK SFIKDALKCK AHALRHRFGC 120 

ISRKCPAIRE MVSQLQRECY LKHDLCAAAQ ENTRVIVEMI HFKDLLLHEP YVDLVNLLLT 180

CGEEVKEAIT HSVQVQCEQN WGSLCSILSF CTSAIQKPPT APPERQPQVD RTKLSRAHHG 240 

EAGHHLPEPS SRETGRGAKG ERGSKSHPNA HARGRVGGLG AQGPSGSSEW EDEQSEYSDI 300 
RR 

Seq ID NO: 430 DNA sequence

Nucleic Acid Accession #: NM_005940

Coding sequence: 23..1489

1 11 21 31 41 51 

I I I I I I 

AAGCCCAGCA GCCCCGGGGC GGATGGCTCC GGCCGCCTGG CTCCGCAGCG CGGCCGCGCG 60 

CGCCCTCCTG CCCCCGATGC TGCTGCTGCT GCTCCAGCCG CCGCCGCTGC TGGCCCGGGC 120 

TCTGCCGCCG GACGTCCACC ACCTCCATGC CGAGAGGAGG GGGCCACAGC CCTGGCATGC 180 

AGCCCTGCCC AGTAGCCCGG CACCTGCCCC TGCCACGCAG GAAGCCCCCC GGCCTGCCAG 240 

CAGCCTCAGG CCTCCCCGCT GTGGCGTGCC CGACCCATCT GATGGGCTGA GTGCCCGCAA 300 

CCGACAGAAG AGGTTCGTGC TTTCTGGCGG GCGCTGGGAG AAGACGGACC tCACCTACAG 360 

GATCCTTCGG TTCCCATGGC AGTTGGTGCA GGAGCAGGTG CGGCAGACGA TGGCAGAGGC 420 

CCTAAAGGTA TGGAGCGATG TGACGCCACT CACCTTTACT GAGGTGCACG AGGGCCGTGC 480 

TGACATCATG ATCGACTTCG CCAGGTACTG GCATGGGGAC GACCTGCCGT TTGATGGGCC 540 

TGGGGGCATC CTGGCCCATG CCTTCTTCCC CAAGACTCAC CGAGAAGGGG ATGTCCACTT 600 

CGACTATGAT GAGACCTGGA CTATCGGGGA TGACCAGGGC ACAGACCTGC TGCAGGTGGC 660 

AGCCCATGAA TTTGGCCACG TGCTGGGGCT GCAGCACACA ACAGCAGCCA AGGCCCTGAT 720 

GTCCGCCTTC TACACCTTTC GCTACCCACT GAGTCTCAGC CCAGATGACT GCAGGGGCGT 780 

TCAACACCTA TATGGCCAGC CCTGGCCCAC TGTCACCTCC AGGACCCCAG CCCTGGGCCC 840 

CCAGGCTGGG ATAGACACCA ATGAGATTGC ACCGCTGGAG CCAGACGCCC CGCCAGATGC 900 

CTGTGAGGCC TCCTTTGACG CGGTCTCCAC CATCCGAGGC GAGCTCTTTT TCTTCAAAGC 960 

GGGCTTTGTG TGGCGCCTCC GTGGGGGCCA GCTGCAGCCC GGCTACCCAG CATTGGCCTC 1020 

TCGCCACTGG CAGGGACTGC CCAGCCCTGT GGACGCTGCC TTCGAGGATG CCCAGGGCCA 1080 

CATTTGGTTC TTCCAAGGTG CTCAGTACTG GGTGTACGAC GGTGAAAAGC CAGTCCTGGG 1140 

CCCCGCACCC CTCACCGAGC TGGGCCTGGT GAGGTTCCCG GTCCATGCTG CCTTGGTCTG 1200 

GGGTCCCGAG AAGAACAAGA TCTACTTCTT CCGAGGCAGG GACTACTGGC GTTTCCACCC 1260 

CAGCACCCGG CGTGTAGACA GTCCCGTGCC CCGCAGGGCC ACTGACTGGA GAGGGGTGCC 1320 

CTCTGAGATC GACGCTGCCT TCCAGGATGC TGATGGCTAT GCCTACTTCC TGCGCGGCCG 1380 

CCTCTACTGG AAGTTTGACC CTGTGAAGGT GAAGGCTCTG GAAGGCTTCC CCCGTCTCGT 1440 

GGGTCCTGAC TTCTTTGGCT GTGCCGAGCC TGCCAACACT TTCCTCTGAC CATGGCTTGG 1500 

ATGCCCTCAG GGGTGCTGAC CCCTGCCAGG CCACGAATAT CAGGCTAGAG ACCCATGGCC 1560 

ATCTTTGTGG CTGTGGGCAC CAGGCATGGG ACTGAGCCCA TGTCTCCTGC AGGGGGATGG 1620 

GGTGGGGTAC AACCACCATG ACAACTGCCG GGAGGGCCAC GCAGGTCGTG GTCACCTGCC 1680 

AGCGACTGTC TCAGACTGGG CAGGGAGGCT TTGGCATGAC TTAAGAGGAA GGGCAGTCTT 1740 

GGGACCCGCT ATGCAGGTCC TGGCAAACCT GGCTGCCCTG TCTCATCCCT GTCCCTCAGG 1800 

GTAGCACCAT GGCAGGACTG GGGGAACTGG AGTGTCCTTG CTGTATCCCT GTTGTGAGGT 1860

TCCTTCCAGG GGCTGGCACT GAAGCAAGGG TGCTGGGGCC CCATGGCCTT CAGCCCTGGC 1920 

TGAGCAACTG GGCTGTAGGG CAGGGCCACT TCCTGAGGTC AGGTCTTGGT AGGTGCCTGC 1980 

ATCTGTCTGC CTTCTGGCTG ACAATCCTGG AAATCTGTTC TCCAGAATCC AGGCCAAAAA 2040 

GTTCACAGTC AAATGGGGAG GGGTATTCTT CATGCAGGAG ACCCCAGGCC CTGGAGGCTG 2100 

CAACATACCT CAATCCTGTC CCAGGCCGGA TCCTCCTGAA GCCCTTTTCG CAGCACTGCT 2160 

ATCCTCCAAA GCCATTGTAA ATGTGTGTAC AGTGTGTATA AACCTTCTTC TTCTTTTTTT 2220 
TTTTTAAACT GAGGATTGTC ATTAAACACA GTTGTTTTCT 

Seq ID NO: 431 Protein sequence 
Protein Accession #: NP_005931

1 11 21 31 41 51 

I I I I I I

MAPAAWLRSA AARALLPPML LLLLQPPPLL ARALPPDVHH LHAERRGPQP WHAALPSSPA 60 

PAPATQEAPR PASSLRPPRC GVPDPSDGLS ARNRQKRFVL SGGRWEKTDL TYRILRFPWQ 120 

LVQEQVRQTM AEALKVWSDV TPLTFTEVHE GRADIMIDFA RYWHGDDLPF DGPGGILAHA 180

FFPKTHREGD VHFDYDETWT IGDDQGTDLL QVAAHEFGHV LGLQHTTAAK ALMSAFYTFR 240 

YPLSLSPDDC RGVQHLYGQP WPTVTSRTPA LGPQAGIDTN EIAPLEPDAP PDACEASFDA 300 

VSTIRGELFF FKAGFVWRLR GGQLQPGYPA LASRHWQGLP SPVDAAFEDA QGHIWFFQGA 360 

QYWVYDGEKP VLGPAPLTEL GLVRFPVHAA LVWGPEKNKI YFFRGRDYWR FHPSTRRVDS 420 

PVPRRATDWR GVPSEIDAAF QDADGYAYFL RGRLYWKFDP VKVKALEGFP RLVGPDFFGC 480 
AEPANTFL 

Seq ID NO: 432 DNA sequence 

Nucleic Acid Accession #: NM_024022 

Coding sequence: 202.. 1563 

1 11 21 31 41 51 

I I I I I I 

ACCGGGCACC GGACGGCTCG GGTACTTTCG TTCTTAATTA GGTCATGCCC GTGTGAGCCA 60 

GGAAAGGGCT GTGTTTATGG GAAGCCAGTA ACACTGTGGC CTACTATCTC TTCCGTGGTG 120 

CCATCTACAT TTTTGGGACT CGGGAATTAT GAGGTAGAGG TGGAGGCGGA GCCGGATGTC 180 

AGAGGTCCTG AAATAGTCAC CATGGGGGAA AATGATCCGC CTGCTGTTGA AGCCCCCTTC 240 

TCATTCCGAT CGCTTTTTGG CCTTGATGAT TTGAAAATAA GTCCTGTTGC ACCAGATGCA 300

GATGCTGTTG CTGCACAGAT CCTGTCACTG CTGCCATTGA AGTTTTTTCC AATCATCGTC 360 

ATTGGGATCA TTGCATTGAT ATTAGCACTG GCCATTGGTC TGGGCATCCA CTTCGACTGC 420 

TCAGGGAAGT ACAGATGTCG CTCATCCTTT AAGTGTATCG AGCTGATAGC TCGATGTGAC 480 

GGAGTCTCGG ATTGCAAAGA CGGGGAGGAC GAGTACCGCT GTGTCCGGGT GGGTGGTCAG 540 

AATGCCGTGC TCCAGGTGTT CACAGCTGCT TCGTGGAAGA CCATGTGCTC CGATGACTGG 600 

AAGGGTCACT ACGCAAATGT TGCCTGTGCC CAACTGGGTT TCCCAAGCTA TGTGAGTTCA 660 

GATAACCTCA GAGTGAGCTC GCTGGAGGGG CAGTTCCGGG AGGAGTTTGT GTCCATCGAT 720 

CACCTCTTGC CAGATGACAA GGTGACTGCA TTACACCACT CAGTATATGT GAGGGAGGGA 780

TGTGCCTCTG GCCACGTGGT TACCTTGCAG TGCACAGCCT GTGGTCATAG AAGGGGCTAC 840

AGCTCACGCA TCGTGGGTGG AAACATGTCC TTGCTCTCGC AGTGGCCCTG GCAGGCCAGC 900

CTTCAGTTCC AGGGCTACCA CCTGTGCGGG GGCTCTGTCA TCACGCCCCT GTGGATCATC 960 

ACTGCTGCAC ACTGTGTTTA TGACTTGTAC CTCCCCAAGT CATGGACCAT CCAGGTGGGT 1020 

CTAGTTTCCC TGTTGGACAA TCCAGCCCCA TCCCACTTGG TGGAGAAGAT TGTCTACCAC 1080

AGCAAGTACA AGCCAAAGAG GCTGGGCAAT GACATCGCCC TTATGAAGCT GGCCGGGCCA 1140 

CTCACGTTCA ATGAAATGAT CCAGCCTGTG TGCCTGCCCA ACTCTGAAGA GAACTTCCCC 1200 

GATGGAAAAG TGTGCTGGAC GTCAGGATGG GGGGCCACAG AGGATGGAGG TGACGCCTCC 1260 

CCTGTCCTGA ACCACGCGGC CGTCCCTTTG ATTTCCAACA AGATCTGCAA CCACAGGGAC 1320 

GTGTACGGTG GCATCATCTC CCCCTCCATG CTCTGCGCGG GCTACCTGAC GGGTGGCGTG 1380

GACAGCTGCC AGGGGGACAG CGGGGGGCCC CTGGTGTGTC AAGAGAGGAG GCTGTGGAAG 1440 

TTAGTGGGAG CGACCAGCTT TGGCATCGGC TGCGCAGAGG TGAACAAGCC TGGGGTGTAC 1500 

ACCCGTGTCA CCTCCTTCCT GGACTGGATC CACGAGCAGA TGGAGAGAGA CCTAAAAACC 1560 

TGAAGAGGAA GGGGACAAGT AGCCACCTGA GTTCCTGAGG TGATGAAGAC AGCCCGATCC 1620 

TCCCCTGGAC TCCCGTGTAG GAACCTGCAC ACGAGCAGAC ACCCTTGGAG CTCTGAGTTC 1680 

CGGCACCAGT AGCAGGCCCG AAAGAGGCAC CCTTCCATCT GATTCCAGCA CAACCTTCAA 1740 

GCTGCTTTTT GTTTTTTGTT TTTTTGAGGT GGAGTCTCGC TCTGTTGCCC AGGCTGGAGT 1800 

GCAGTGGCGA AATCCCTGCT CACTGCAGCC TCCGCTTCCC TGGTTCAAGC GATTCTCTTG 1860 

CCTCAGCTTC CCCAGTAGCT GGGACCACAG GTGCCCGCCA CCACACCCAA CTAATTTTTG 1920 

TATTTTTAGT AGAGACAGGG TTTCACCATG TTGGCCAGGC TGCTCTCAAA CCCCTGACCT 1980 

CAAATGATGT GCCTGCTTCA GCCTCCCACA GTGCTGGGAT TACAGGCATG GGCCACCACG 2040 

CCTAGCCTCA CGCTCCTTTC TGATCTTCAC TAAGAACAAA AGAAGCAGCA ACTTGCAAGG 2100 

GCGGCCTTTC CCACTGGTCC ATCTGGTTTT CTCTCCAGGG GTCTTGCAAA ATTCCTGACG 2160 

AGATAAGCAG TTATGTGACC TCACGTGCAA AGCCACCAAC AGCCACTCAG AAAAGACGCA 2220 

CCAGCCCAGA AGTGCAGAAC TGCAGTCACT GCACGTTTTC ATCTCTAGGG ACCAGAACCA 2280 

AACCCACCCT TTCTACTTCC AAGACTTATT TTCACATGTG GGGAGGTTAA TCTAGGAATG 2340 

ACTCGTTTAA GGCCTATTTT CATGATTTCT TTGTAGCATT TGGTGCTTGA CGTATTATTG 2400 

TCCTTTGATT CCAAATAATA TGTTTCCTTC CCTCAAAAAA AAAAAAAAAA AAAAAAAAAA 2460 
AAAAA 

Seq ID NO: 433 Protein sequence 
Protein Accession #: NP_076927

1 11 21 31 41 51 

I I I I I I

MGENDPPAVE APFSFRSLFG LDDLKISPVA PDADAVAAQI LSLLPLKFFP IIVIGIIALI 60 

LALAIGLGIH FDCSGKYRCR SSFKCIELIA RCDGVSDCKD GEDEYRCVRV GGQNAVLQVF 120 

TAASWKTMCS DDWKGHYANV ACAQLGFPSY VSSDNLRVSS LEGQFREEFV SIDHLLPDDK 180 

VTALHHSVYV REGCASGHVV TLQCTACGHR RGYSSRIVGG NMSLLSQWPW QASLQFQGYH 240

LCGGSVITPL WIITAAHCVY DLYLPKSWTI QVGLVSLLDN PAPSHLVEKI VYHSKYKPKR 300

LGNDIALMKL AGPLTFNEMI QPVCLPNSEE NFPDGKVCWT SGWGATEDGG DASPVLNHAA 360

VPLISNKICN HRDVYGGIIS PSMLCAGYLT GGVDSCQGDS GGPLVCQERR LWKLVGATSF 420
GIGCAEVNKP GVYTRVTSFL DWIHEQMERD LKT 

Seq ID NO: 434 DNA sequence 

Nucleic Acid Accession #: NM_000493.2

Coding sequence: 97..2139

1 11 21 31 41 51 

I I I I I I 

CACCTTCTGC ACTGCTCATC TGGGCAGAGG AAGCTTCAGA AAGCTGCCAA GGCACCATCT 60 

CCAGGAACTC CCAGCACGCA GAATCCATCT GAGAATATGC TGCCACAAAT ACCCTTTTTG 120 

CTGCTAGTAT CCTTGAACTT GGTTCATGGA GTGTTTTACG CTGAACGATA CCAAATGCCC 180 

ACAGGCATAA AAGGCCCACT ACCCAACACC AAGACACAGT TCTTCATTCC CTACACCATA 240 

AAGAGTAAAG GTATAGCAGT AAGAGGAGAG CAAGGTACTC CTGGTCCACC AGGCCCTGCT 300 

GGACCTCGAG GGCACCCAGG TCCTTCTGGA CCACCAGGAA AACCAGGCTA CGGAAGTCCT 360 

GGACTCCAAG GAGAGCCAGG GTTGCCAGGA CCACCGGGAC CATCAGCTGT AGGGAAACCA 420 

GGTGTGCCAG GACTCCCAGG AAAACCAGGA GAGAGAGGAC CATATGGACC AAAAGGAGAT 480 

GTTGGACCAG CTGGCCTACC AGGACCCCGG GGCCCACCAG GACCACCTGG AATCCCTGGA 540 

CCGGCTGGAA TTTCTGTGCC AGGAAAACCT GGACAACAGG GACCCACAGG AGCCCCAGGA 600 

CCCAGGGGCT TTCCTGGAGA AAAGGGTGCA CCAGGAGTCC CTGGTATGAA TGGACAGAAA 660 

GGGGAAATGG GATATGGTGC TCCTGGTCGT CCAGGTGAGA GGGGTCTTCC AGGCCCTCAG 720 

GGTCCCACAG GACCATCTGG CCCTCCTGGA GTGGGAAAAA GAGGTGAAAA TGGGGTTCCA 780 

GGACAGCCAG GCATCAAAGG TGATAGAGGT TTTCCGGGAG AAATGGGACC AATTGGCCCA 840 

CCAGGTCCCC AAGGCCCTCC TGGGGAACGA GGGCCAGAAG GCATTGGAAA GCCAGGAGCT 900 

GCTGGAGCCC CAGGCCAGCC AGGGATTCCA GGAACAAAAG GTCTCCCTGG GGCTCCAGGA 960

ATAGCTGGGC CCCCAGGGCC TCCTGGCTTT GGGAAACCAG GCTTGCCAGG CCTGAAGGGA 1020

GAAAGAGGAC CTGCTGGCCT TCCTGGGGGT CCAGGTGCCA AAGGGGAACA AGGGCCAGCA 1080 

GGTCTTCCTG GGAAGCCAGG TCTGACTGGA CCCCCTGGGA ATATGGGACC CCAAGGACCA 1140 

AAAGGCATCC CGGGTAGCCA TGGTCTCCCA GGCCCTAAAG GTGAGACAGG GCCAGCTGGG 1200 

CCTGCAGGAT ACCCTGGGGC TAAGGGTGAA AGGGGTTCCC CTGGGTCAGA TGGAAAACCA 1260 

GGGTACCCAG GAAAACCAGG TCTCGATGGT CCTAAGGGTA ACCCAGGGTT ACCAGGTCCA 1320 

AAAGGTGATC CTGGAGTTGG AGGACCTCCT GGTCTCCCAG GCCCTGTGGG CCCAGCAGGA 1380 

GCAAAGGGAA TGCCCGGACA CAATGGAGAG GCTGGCCCAA GAGGTGCCCC TGGAATACCA 1440 

GGTACTAGAG GCCCTATTGG GCCACCAGGC ATTCCAGGAT TCCCTGGGTC TAAAGGGGAT 1500 

CCAGGAAGTC CCGGTCCTCC TGGCCCAGCT GGCATAGCAA CTAAGGGCCT CAATGGACCC 1560 

ACCGGGCCAC CAGGGCCTCC AGGTCCAAGA GGCCACTCTG GAGAGCCTGG TCTTCCAGGG 1620 

CCCCCTGGGC CTCCAGGCCC ACCAGGTCAA GCAGTCATGC CTGAGGGTTT TATAAAGGCA 1680 

GGCCAAAGGC CCAGTCTTTC TGGGACCCCT CTTGTTAGTG CCAACCAGGG GGTAACAGGA 1740
ATGCCTGTGT CTGCTTTTAC TGTTATTCTC TCCAAAGCTT ACCCAGCAAT AGGAACTCCC 1800

ATACCATTTG ATAAAATTTT GTATAACAGG CAACAGCATT ATGACCCAAG GACTGGAATC 1860 

TTTACTTGTC AGATACCAGG AATATACTAT TTTTCATACC ACGTGCATGT GAAAGGGACT 1920 

CATGTTTGGG TAGGCCTGTA TAAGAATGGC ACCCCTGTAA TGTACACCTA TGATGAATAC 1980 

ACCAAAGGCT ACCTGGATCA GGCTTCAGGG AGTGCCATCA TCGATCTCAC AGAAAATGAC 2040 

CAGGTGTGGC TCCAGCTTCC CAATGCCGAG TCAAATGGCC TATACTCCTC TGAGTATGTC 2100 

CACTCCTCTT TCTCAGGATT CCTAGTGGCT CCAATGTGAG TACACCCCAC AGAGCTAATC 2160 

TAAATCTTGT GCTAGAAAAA GCATTCTCTA ACTCTACCCC ACCCTACAAA ATGCATATGG 2220 

AGGTAGGCTG AAAAGAATGT AATTTTTATT TTCTGAAATA CAGATTTGAG CTATCAGACC 2280 

AACAAACCTT CCCCCTGAAA AGTGAGCAGC AACGTAAAAA CGTATGTGAA GCCTCTCTTG 2340 

AATTTCTAGT TAGCAATCTT AAGGCTCTTT AAGGTTTTCT CCAATATTAA AAAATATCAC 2400 

CAAAGAAGTC CTGCTATGTT AAAAACAAAC AACAAAAAAC AAAGCAACAA AAAAAAAAAT 2460 

TAAAAAAAAA AACAGAAATA GAGCTCTAAG TTATGTGAAA TTTGATTTGA GAAACTCGGC 2520 

ATTTCCTTTT TAAAAAAGCC TGTTTCTAAC TATGAATATG AGAACTTCTA GGAAACATCC 2580 

AGGAGGTATC ATATAACTTT GTAGAACTTA AATACTTGAA TATTCAAATT TAAAAGACAC 2640 

TGTATCCCCT AAAATATTTC TGATGGTGCA CTACTCTGAG GCCTGTATGG CCCCTTTCAT 2700 

CAATATCTAT TCAAATATAC AGGTGCATAT ATACTTGTTA AAGCTCTTAT ATAAAAAAGC 2760 

CCCAAAATAT TGAAGTTCAT CTGAAATGCA AGGTGCTTTC ATCAATGAAC CTTTTCAAAA 2820 

CTTTTCTATG ATTGCAGAGA AGCTTTTTAT ATACCCAGCA TAACTTGGAA ACAGGTATCT 2880

GACCTATTCT TATTTAGTTA ACACAAGTGT GATTAATTTG ATTTCTTTAA TTCCTTATTG 2940 

AATCTTATGT GATATGATTT TCTGGATTTA CAGAACATTA GCACATGTAC CTTGTGCCTC 3000 

CCATTCAAGT GAAGTTATAA TTTACACTGA GGGTTTCAAA ATTCGACTAG AAGTGGAGAT 3060 

ATATTATTTA TTTATGCACT GTACTGTATT TTTATATTGC TGTTTAAAAC TTTTAAGCTG 3120 

TGCCTCACTT ATTAAAGCAC AAAATGTTTT ACCTACTCCT TATTTACGAC ACAATAAAAT 3180 

AACATCAATA GATTTTTAGG CTGAATTAAT TTGAAAGCAG CAATTTGCTG TTCTCAACCA 3240 
TTCTTTCAAG GCTTTTCATT CGACACAATA AAATAACATC AATAG 

Seq ID NO: 435 Protein sequence 
Protein Accession #: NP_000484.2

1 11 21 31 41 51 

I I I I I I 

MLPQIPFLLL VSLNLVHGVF YAERYQMPTG IKGPLPNTKT QFFIPYTIKS KGIAVRGEQG 60 

TPGPPGPAGP RGHPGPSGPP GKPGYGSPGL QGEPGLPGPP GPSAVGKPGV PGLPGKPGER 120 

GPYGPKGDVG PAGLPGPRGP PGPPGIPGPA GISVPGKPGQ QGPTGAPGPR GFPGEKGAPG 180 

VPGMNGQKGE MGYGAPGRPG ERGLPGPQGP TGPSGPPGVG KRGENGVPGQ PGIKGDRGFP 240 

GEMGPIGPPG PQGPPGERGP EGIGKPGAAG APGQPGIPGT KGLPGAPGIA GPPGPPGFGK 300 

PGLPGLKGER GPAGLPGGPG AKGEQGPAGL PGKPGLTGPP GNMGPQGPKG IPGSHGLPGP 360

KGETGPAGPA GYPGAKGERG SPGSDGKPGY PGKPGLDGPK GNPGLPGPKG DPGVGGPPGL 420 

PGPVGPAGAK GMPGHNGEAG PRGAPGIPGT RGPIGPPGIP GFPGSKGDPG SPGPPGPAGI 480 

ATKGLNGPTG PPGPPGPRGH SGEPGLPGPP GPPGPPGQAV MPEGFIKAGQ RPSLSGTPLV 540 

SANQGVTGMP VSAFTVILSK AYPAIGTPIP FDKILYNRQQ HYDPRTGIFT CQIPGIYYFS 600 

YHVHVKGTHV WVGLYKNGTP VMYTYDEYTK GYLDQASGSA IIDLTENDQV WLQLPNAESN 660 
GLYSSEYVHS SFSGFLVAPM 



Seq ID NO: 436 DNA sequence 

Nucleic Acid Accession #: XM_062811 

Coding sequence: 1..888 

1 11 21 31 41 51

I I I I I I 

ATGTGGGGCG CTCGCCGCTC GTCCGTCTCC TCATCCTGGA ACGCCGCTTC GCTCCTGCAG 60 

CTGCTGCTGG CTGCGCTGCT GGCGGCGGGG GCGAGGGCCA GCGGCGAGTA CTGCCACGGC 120 

TGGCTGGACG CGCAGGGCGT CTGGCGCATC GGCTTCCAGT GTCCCGAGCG CTTCGACGGC 180 

GGCGACGCCA CCATCTGCTG CGGCAGCTGC GCGTTGCGCT ACTGCTGCTC CAGCGCCGAG 240 

GCGCGCCTGG ACCAGGGCGG CTGCGACAAT GACCGCCAGC AGGGCGCTGG CGAGCCTGGC 300 

CGGGCGGACA AAGACGGCCC CGACGGCTCG GCAGTGCCCA TCTACGTGCC GTTCCTCATT 360 

GTTGGCTCCG TGTTTGTCGC CTTTATCATC TTGGGGTCCC TGGTGGCAGC CTGTTGCTGC 420 

AGATGTCTCC GGCCTAAGCA GGATCCCCAG CAGAGCCGAG CCCCAGGGGG TAACCGCTTG 480 

ATGGAGACCA TCCCCATGAT CCCCAGTGCC AGCACCTCCC GGGGGTCGTC CTCACGCCAG 540 

TCCAGCACAG CTGCCAGTTC CAGCTCCAGC GCCAACTCAG GGGCCCGGGC GCCCCCAACA 600 

AGGTCACAGA CCAACTGTTG CTTGCCGGAA GGGACCATGA ACAACGTGTA TGTCAACATG 660 

CCCACGAATT TCTCTGTGCT GAACTGTCAG CAGGCCACCC AGATTGTGCC ACATCAAGGG 720 

CAGTATCTGC ATCCCCCATA CGTGGGGTAC ACGGTGCAGC ACGACTCTGT GCCCATGACA 780 

GCTGTGCCAC CTTTCATGGA CGGCCTGCAG CCTGGCTACA GGCAGATTCA GTCCCCCTTC 840 
CCTCACACCA ACAGTGAACA GAAGATGTAC CCAGCGGTGA CTGTATAA 



Seq ID NO: 437 Protein sequence 
Protein Accession #: XP_062811 

1 11 21 31 41 51 

I I I I I I

MWGARRSSVS SSWNAASLLQ LLLAALLAAG ARASGEYCHG WLDAQGVWRI GFQCPERFDG 60 
GDATICCGSC ALRYCCSSAE ARLDQGGCDN DRQQGAGEPG RADKDGPDGS AVPIYVPFLI 120 
VGSVFVAFII LGSLVAACCC RCLRPKQDPQ QSRAPGGNRL METIPMIPSA STSRGSSSRQ 180
SSTAASSSSS ANSGARAPPT RSQTNCCLPE GTMNNVYVNM PTNFSVLNCQ QATQIVPHQG 240 
QYLHPPYVGY TVQHDSVPMT AVPPFMDGLQ PGYRQIQSPF PHTNSEQKMY PAVTV 



Seq ID NO: 438 DNA sequence 

Nucleic Acid Accession #: NM_004004.1

Coding sequence: 1..681 

1 11 21 31 41 51 

I I I I I I

ATGGATTGGG GCACGCTGCA GACGATCCTG GGGGGTGTGA ACAAACACTC CACCAGCATT 60 

GGAAAGATCT GGCTCACCGT CCTCTTCATT TTTCGCATTA TGATCCTCGT TGTGGCTGCA 120 

AAGGAGGTGT GGGGAGATGA GCAGGCCGAC TTTGTCTGCA ACACCCTGCA GCCAGGCTGC 180
AAGAACGTGT GCTACGATCA CTACTTCCCC ATCTCCCACA TCCGGCTATG GGCCCTGCAG 240
CTGATCTTCG TGTCCAGCCC AGCGCTCCTA GTGGCCATGC ACGTGGCCTA CCGGAGACAT 300
GAGAAGAAGA GGAAGTTCAT CAAGGGGGAG ATAAAGAGTG AATTTAAGGA CATCGAGGAG 360
ATCAAAACCC AGAAGGTCCG CATCGAAGGC TCCCTGTGGT GGACCTACAC AAGCAGCATC 420
TTCTTCCGGG TCATCTTCGA AGCCGCCTTC ATGTACGTCT TCTATGTCAT GTACGACGGC 480
TTCTCCATGC AGCGGCTGGT GAAGTGCAAC GCCTGGCCTT GTCCCAACAC TGTGGACTGC 540
TTTGTGTCCC GGCCCACGGA GAAGACTGTC TTCACAGTGT TCATGATTGC AGTGTCTGGA 600
ATTTGCATCC TGCTGAATGT CACTGAATTG TGTTATTTGC TAATTAGATA TTGTTCTGGG 660
AAGTCAAAAA AGCCAGTTTA A

Seq ID NO: 439 Protein sequence
Protein Accession #: NP_003995.1

1 11 21 31 41 51

I I I I I I

MDWGTLQTIL GGVNKHSTSI GKIWLTVLFI FRIMILVVAA KEVWGDEQAD FVCNTLQPGC 60
KNVCYDHYFP ISHIRLWALQ LIFVSSPALL VAMHVAYRRH EKKRKFIKGE IKSEFKDIEE 120
IKTQKVRIEG SLWWTYTSSI FFRVIFEAAF MYVFYVMYDG FSMQRLVKCN AWPCPNTVDC 180
FVSRPTEKTV FTVFMIAVSG ICILLNVTEL CYLLIRYCSG KSKKPV

Seq ID NO: 440 DNA sequence

Nucleic Acid Accession #: XM_061091.1

Coding sequence: 1..2481

1 11 21 31 41 51

I I I I I I

ATGCCAAATA CTTCAGGAAC AACCAGGATT GAAATTTGGC TTCtCCAAGA GCCGCCCGGG 60
CACCGAGCGC TGGTCGCCGC TCTCCTTCCG GTGAGTCCCA GCCCCGAGTT GGCTCTGGCG 120
CCCGGGTACC CGCCAGTGCC GGCTGCCGAT GACCGATTCA CGCTCCCGAT GATTGGAGGT 180
CAGATGCATG GTGAGAAGGT AGATCTCTGG AGCCTTGGTG TTCTTTGCTA TGAATTTTTA 240
GTTGGGAAGC CTCCTTTTGA GGCAAACGAA GTCCATGTAA GCAAAGAAAC CATCGGGAAG 300
ATTTCAGCTG CCAGCAAAAT GATGTGGTGC TCGGCTGCAG TGGACATCAT GTTTCTGTTA 360
GATGGGTCTA ACAGCGTCGG GAAAGGGAGC TTTGAAAGGT CCAAGCACTT TGCCATCACA 420
GTCTGTGACG GTCTGGACAT CAGCCCCGAG AGGGTCAGAG TGGGAGCATT CCAGTTCAGT 480
TCCACTCCTC ATCTGGAATT CCCCTTGGAT TCATTTTCAA CCCAACAGGA AGTGAAGGCA 540
AGAATCAAGA GGATGGTTTT CAAAGGAGGG CGCACGGAGA CGGAACTTGC TCTGAAATAC 600
CTTCTGCACA GAGGGTTGCC TGGAGGCAGA AATGCTTCTG TGCCCCAGAT CCTCATCATC 660
GTCACTGATG GGAAGTCCCA GGGGGATGTG GCACTGCCAT CCAAGCAGCT GAAGGAAAGG 720
GGTGTCACTG TGTTTGCTGT GGGGGTCAGG TTTCCCAGGT GGGAGGAGCT GCATGCACTG 780
GCCAGCGAGC CTAGAGGGCA GCACGTGCTG TTGGCTGAGC AGGTGGAGGA TGCCACCAAC 840
GGCCTCTTCA GCACCCTCAG CAGCTCGGCC ATCTGCTCCA GCGCCACGCC AGCTGGGAGC 900
CCCGAGCTTG TCTTCATGGA GCGGTTAATG GGCATCTCTC TGATAGGCCC CTGTGACTCG 960
CAGCCCTGCC AGAATGGAGG CACATGTGTT CCAGAAGGAC TGGACGGCTA CCAGTGCCTC 1020
TGCCCGCTGG CCTTTGGAGG GGAGGCTAAC TGTGCCCTGA AGCTGAGCCT GGAATGCAGG 1080
GTCGACCTCC TCTTCCTGCT GGACAGCTCT GCGGGCACCA CTCTGGACGG CTTCCTGCGG 1140
GCCAAAGTCT TCGTGAAGCG GTTTGTGCGG GCCGTGCTGA GCGAGGACTC TCGGGCCCGA 1200
GTGGGTGTGG CCACATACAG CAGGGAGCTG CTGGTGGCGG TGCCTGTGGG GGAGTACCAG 1260
GATGTGCCTG ACCTGGTCTG GAGCCTCGAT GGCATTCCCT TCCGTGGTGG CCCCACCCTG 1320
ACGGGCAGTG CCTTGCGGCA GGCGGCAGAG CGTGGCTTCG GGAGCGCCAC CAGGACAGGC 1380
CAGGACCGGC CACGTAGAGT GGTGGTTTTG CTCACTGAGT CACACTCCGA GGATGAGGTT 1440
GCGGGCCCAG CGCGTCACGC AAGGGCGCGA GAGCTGCTCC TGCTGGGTGT AGGCAGTGAG 1500
GCCGTGCGGG CAGAGCTGGA GGAGATCACA GGCAGCCCAA AGCATGTGAT GGTCTACTCG 1560
GATCCTCAGG ATCTGTTCAA CCAAATCCCT GAGCTGCAGG GGAAGCTGTG CAGCCGGCAG 1620
CGGCCAGGGT GCCGGACACA AGCCCTGGAC CTCGTCTTCA TGTTGGACAC CTCTGCCTCA 1680
GTAGGGCCCG AGAATTTTGC TCAGATGCAG AGCTTTGTGA GAAGCTGTGC CCTCCAGTTT 1740
GAGGTGAACC CTGACGTGAC ACAGGTCGGC CTGGTGGTGT ATGGCAGCCA GGTGCAGACT 1800
GCCTTCGGGC TGGACACCAA ACCCACCCGG GCTGCGATGC TGCGGGCCAT TAGCCAGGCC 1860
CCCTACCTAG GTGGGGTGGG CTCAGCCGGC ACCGCCCTGC TGCACATCTA TGACAAAGTG 1920
ATGACCGTCC AGAGGGGTGC CCGGCCTGGT GTCCCCAAAG CTGTGGTGGT GCTCACAGGC 1980
GGGAGAGGCG CAGAGGATGC AGCCGTTCCT GCCCAGAAGC TGAGGAACAA TGGCATCTCT 2040
GTCTTGGTCG TGGGCGTGGG GCCTGTCCTA AGTGAGGGTC TGCGGAGGCT TGCAGGTCCC 2100
CGGGATTCCC TGATCCACGT GGCAGCTTAC GCCGACCTGC GGTACCACCA GGACGTGCTC 2160
ATTGAGTGGC TGTGTGGAGA AGCCAAGCAG CCAGTCAACC TCTGCAAACC CAGCCCGTGC 2220
ATGAATGAGG GCAGCTGCGT CCTGCAGAAT GGGAGCTACC GCTGCAAGTG TCGGGATGGC 2280
TGGGAGGGCC CCCACTGCGA GAACCGTGAG TGGAGCTCTT GCTCTGTATG TGTGAGCCAG 2340
GGATGGATTC TTGAGACGCC CCTGAGGCAC ATGGCTCCCG TGCAGGAGGG CAGCAGCCGT 2400
ACCCCTCCCA GCAACTACAG AGAAGGCCTG GGCACTGAAA TGGTGCCTAC CTTCTGGAAT 2460
GTCTGTGCCC CAGGTCCTTA G

Seq ID NO: 441 Protein sequence
Protein Accession #: XP_061091.1

1 11 21 31 41 51

I I I I I I

MPNTSGTTRI EIWLLQEPPG HRALVAALLP VSPSPELALA PGYPPVPAAD DRFTLPMIGG 60
QMHGEKVDLW SLGVLCYEFL VGKPPFEANE VHVSKETIGK ISAASKMMWC SAAVDIMFLL 120
DGSNSVGKGS FERSKHFAIT VCDGLDISPE RVRVGAFQFS STPHLEFPLD SFSTQQEVKA 180
RIKRMVFKGG RTETELALKY LLHRGLPGGR NASVPQILII VTDGKSQGDV ALPSKQLKER 240
GVTVFAVGVR FPRWEELHAL ASEPRGQHVL LAEQVEDATN GLFSTLSSSA ICSSATPAGS 300
PELVFMERLM GISLIGPCDS QPCQNGGTCV PEGLDGYQCL CPLAFGGEAN CALKLSLECR 360
VDLLFLLDSS AGTTLDGFLR AKVFVKRFVR AVLSEDSRAR VGVATYSREL LVAVPVGEYQ 420
DVPDLVWSLD GIPFRGGPTL TGSALRQAAE RGFGSATRTG QDRPRRVVVL LTESHSEDEV 480
AGPARHARAR ELLLLGVGSE AVRAELEEIT GSPKHVMVYS DPQDLFNQIP ELQGKLCSRQ 540
RPGCRTQALD LVFMLDTSAS VGPENFAQMQ SFVRSCALQF EVNPDVTQVG LVVYGSQVQT 600
AFGLDTKPTR AAMLRAISQA PYLGGVGSAG TALLHIYDKV MTVQRGARPG VPKAVVVLTG 660
GRGAEDAAVP AQKLRNNGIS VLVVGVGPVL SEGLRRLAGP RDSLIHVAAY ADLRYHQDVL 720
IEWLCGEAKQ PVNLCKPSPC MNEGSCVLQN GSYRCKCRDG WEGPHCENRE WSSCSVCVSQ 780
GWILETPLRH MAPVQEGSSR TPPSNYREGL GTEMVPTFWN VCAPGP



Seq ID NO: 442 DNA sequence 
Nucleic Acid Accession #: Eos sequence
Coding sequence: 1..2424 

1 11 21 31 41 51 

I I I I I I

ATGCCCCCTT TCCTGTTGCT GGAGGCCGTC TGTGTTTTCC TGTTTTCCAG AGTGCCCCCA 60

TCTCTCCCTC TCCAGGAAGT CCATGTAAGC AAAGAAACCA TCGGGAAGAT TTCAGCTGCC 120 

AGCAAAATGA TGTGGTGCTC GGCTGCAGTG GACATCATGT TTCTGTTAGA TGGGTCTAAC 180 

AGCGTCGGGA AAGGGAGCTT TGAAAGGTCC AAGCACTTTG CCATCACAGT CTGTGACGGT 240 

CTGGACATCA GCCCCGAGAG GGTCAGAGTG GGAGCATTCC AGTTCAGTTC CACTCCTCAT 300 

CTGGAATTCC CCTTGGATTC ATTTTCAACC CAACAGGAAG TGAAGGCAAG AATCAAGAGG 360

ATGGTTTTCA AAGGAGGGCG CACGGAGACG GAACTTGCTC TGAAATACCT TCTGCACAGA 420 

GGGTTGCCTG GAGGCAGAAA TGCTTCTGTG CCCCAGATCC TCATCATCGT CACTGATGGG 480 

AAGTCCCAGG GGGATGTGGC ACTGCCATCC AAGCAGCTGA AGGAAAGGGG TGTCACTGTG 540 

TTTGCTGTGG GGGTCAGGTT TCCCAGGTGG GAGGAGCTGC ATGCACTGGC CAGCGAGCCT 600 

AGAGGGCAGC ACGTGCTGTT GGCTGAGCAG GTGGAGGATG CCACCAACGG CCTCTTCAGC 660

ACCCTCAGCA GCTCGGCCAT CTGCTCCAGC GCCACGCCAG ACTGCAGGGT CGAGGCTCAC 720 

CCCTGTGAGC ACAGGACGCT GGAGATGGTC CGGGAGTTCG CTGGCAATGC CCCATGCTGG 780

AGAGGATCGC GGCGGACCCT TGCGGTGCTG GCTGCACACT GTCCCTTCTA CAGCTGGAAG 840 

AGAGTGTTCC TAACCCACCC TGCCACCTGC TACAGGACCA CCTGCCCAGG CCCCTGTGAC 900 

TCGCAGCCCT GCCAGAATGG AGGCACATGT GTTCCAGAAG GACTGGACGG CTACCAGTGC 960

CTCTGCCCGC TGGCCTTTGG AGGGGAGGCT AACTGTGCCC TGAAGCTGAG CCTGGAATGC 1020 

AGGGTCGACC TCCTCTTCCT GCTGGACAGC TCTGCGGGCA CCACTCTGGA CGGCTTCCTG 1080 

CGGGCCAAAG TCTTCGTGAA GCGGTTTGTG CGGGCCGTGC TGAGCGAGGA CTCTCGGGCC 1140 

CGAGTGGGTG TGGCCACATA CAGCAGGGAG CTGCTGGTGG CGGTGCCTGT GGGGGAGTAC 1200 

CAGGATGTGC CTGACCTGGT CTGGAGCCTC GATGGCATTC CCTTCCGTGG TGGCCCCACC 1260

CTGACGGGCA GTGCCTTGCG GCAGGCGGCA GAGCGTGGCT TCGGGAGCGC CACCAGGACA 1320 

GGCCAGGACC GGCCACGTAG AGTGGTGGTT TTGCTCACTG AGTCACACTC CGAGGATGAG 1380 

GTTGCGGGCC CAGCGCGTCA CGCAAGGGCG CGAGAGCTGC TCCTGCTGGG TGTAGGCAGT 1440 

GAGGCCGTGC GGGCAGAGCT GGAGGAGATC ACAGGCAGCC CAAAGCATGT GATGGTCTAC 1500 

TCGGATCCTC AGGATCTGTT CAACCAAATC CCTGAGCTGC AGGGGAAGCT GTGCAGCCGG 1560

CAGCGGCCAG GGTGCCGGAC ACAAGCCCTG GACCTCGTCT TCATGTTGGA CACCTCTGCC 1620 

TCAGTAGGGC CCGAGAATTT TGCTCAGATG CAGAGCTTTG TGAGAAGCTG TGCCCTCCAG 1680 

TTTGAGGTGA ACCCTGACGT GACACAGGTC GGCCTGGTGG TGTATGGCAG CCAGGTGCAG 1740 

ACTGCCTTCG GGCTGGACAC CAAACCCACC CGGGCTGCGA TGCTGCGGGC CATTAGCCAG 1800 

GCCCCCTACC TAGGTGGGGT GGGCTCAGCC GGCACCGCCC TGCTGCACAT CTATGACAAA 1860

GTGATGACCG TCCAGAGGGG TGCCCGGCCT GGTGTCCCCA AAGCTGTGGT GGTGCTCACA 1920 

GGCGGGAGAG GCGCAGAGGA TGCAGCCGTT CCTGCCCAGA AGCTGAGGAA CAATGGCATC 1980 

TCTGTCTTGG TCGTGGGCGT GGGGCCTGTC CTAAGTGAGG GTCTGCGGAG GCTTGCAGGT 2040 

CCCCGGGATT CCCTGATCCA CGTGGCAGCT TACGCCGACC TGCGGTACCA CCAGGACGTG 2100 

CTCATTGAGT GGCTGTGTGG AGAAGCCAAG CAGCCAGTCA ACCTCTGCAA ACCCAGCCCG 2160

TGCATGAATG AGGGCAGCTG CGTCCTGCAG AATGGGAGCT ACCGCTGCAA GTGTCGGGAT 2220 

GGCTGGGAGG GCCCCCACTG CGAGAACCGT GAGTGGAGCT CTTGCTCTGT ATGTGTGAGC 2280 

CAGGGATGGA TTCTTGAGAC GCCCCTGAGG CACATGGCTC CCGTGCAGGA GGGCAGCAGC 2340 

CGTACCCCTC CCAGCAACTA CAGAGAAGGC CTGGGCACTG AAATGGTGCC TACCTTCTGG 2400 

AATGTCTGTG CCCCAGGTCC TTAG

Seq ID NO: 443 Protein sequence 
Protein Accession #: Eos sequence 

1 11 21 31 41 51

I I I I I I 

MPPFLLLEAV CVFLFSRVPP SLPLQEVHVS KETIGKISAA SKMMWCSAAV DIMFLLDGSN 60 

SVGKGSFERS KHFAITVCDG LDISPERVRV GAFQFSSTPH LEFPLDSFST QQEVKARIKR 120 

MVFKGGRTET ELALKYLLHR GLPGGRNASV PQILIIVTDG KSQGDVALPS KQLKERGVTV 180 

FAVGVRFPRW EELHALASEP RGQHVLLAEQ VEDATNGLFS TLSSSAICSS ATPDCRVEAH 240

PCEHRTLEMV REFAGNAPCW RGSRRTLAVL AAHCPFYSWK RVFLTHPATC YRTTCPGPCD 300 

SQPCQNGGTC VPEGLDGYQC LCPLAFGGEA NCALKLSLEC RVDLLFLLDS SAGTTLDGFL 360 

RAKVFVKRFV RAVLSEDSRA RVGVATYSRE LLVAVPVGEY QDVPDLVWSL DGIPFRGGPT 420 

LTGSALRQAA ERGFGSATRT GQDRPRRVVV LLTESHSEDE VAGPARHARA RELLLLGVGS 480

EAVRAELEEI TGSPKHVMVY SDPQDLFNQI PELQGKLCSR QRPGCRTQAL DLVFMLDTSA 540

SVGPENFAQM QSFVRSCALQ FEVNPDVTQV GLVVYGSQVQ TAFGLDTKPT RAAMLRAISQ 600

APYLGGVGSA GTALLHIYDK VMTVQRGARP GVPKAVVVLT GGRGAEDAAV PAQKLRNNGI 660

SVLVVGVGPV LSEGLRRLAG PRDSLIHVAA YADLRYHQDV LIEWLCGEAK QPVNLCKPSP 720

CMNEGSCVLQ NGSYRCKCRD GWEGPHCENR EWSSCSVCVS QGWILETPLR HMAPVQEGSS 780 

RTPPSNYREG LGTEMVPTFW NVCAPGP

Seq ID NO: 444 DNA sequence 
Nucleic Acid Accession #: Eos sequence
Coding sequence: 89.. 2356

1 11 21 31 41 51 

I I I I I I

GCCCCCTGGC CCGAGCCGCG CCCGGGTCTG TGAGTAGAGC CGCCCGGGCA CCGAGCGCTG 60 

GTCGCCGCTC TCCTTCCGTT ATATCAACAT GCCCCCTTTC CTGTTGCTGG AAGCCGTCTG 120

TGTTTTCCTG TTTTCCAGAG TGCCCCCATC TCTCCCTCTC CAGGAAGTCC ATGTAAGCAA 180

AGAAACCATC GGGAAGATTT CAGCTGCCAG CAAAATGATG TGGTGCTCGG CTGCAGTGGA 240 

CATCATGTTT CTGTTAGATG GGTCTAACAG CGTCGGGAAA GGGAGCTTTG AAAGGTCCAA 300 

GCACTTTGCC ATCACAGTCT GTGACGGTCT GGACATCAGC CCCGAGAGGG TCAGAGTGGG 360 

AGCATTCCAG TTCAGTTCCA CTCCTCATCT GGAATTCCCC TTGGATTCAT TTTCAACCCA 420 

ACAGGAAGTG AAGGCAAGAA TCAAGAGGAT GGTTTTCAAA GGAGGGCGCA CGGAGACGGA 480

ACTTGCTCTG AAATACCTTC TGCACAGAGG GTTGCCTGGA GGCAGAAATG CTTCTGTGCC 540 

CCAGATCCTC ATCATCGTCA CTGATGGGAA GTCCCAGGGG GATGTGGCAC TGCCATCCAA 600

GCAGCTGAAG GAAAGGGGTG TCACTGTGTT TGCTGTGGGG GTCAGGTTTC CCAGGTGGGA 660

GGAGCTGCAT GCACTGGCCA GCGAGCCTAG AGGGCAGCAC GTGCTGTTGG CTGAGCAGGT 720 

GGAGGATGCC ACCAACGGCC TCTTCAGCAC CCTCAGCAGC TCGGCCATCT GCTCCAGCGC 780 

CACGCCAGAC TGCAGGGTCG AGGCTCACCC CTGTGAGCAC AGGACGCTGG AGATGGTCCG 840 

GGAGTTCGCT GGCAATGCCC CATGCTGGAG AGGATCGCGG CGGACCCTTG CGGTGCTGGC 900 

TGCACACTGT CCCTTCTACA GCTGGAAGAG AGTGTTCCTA ACCCACCCTG CCACCTGCTA 960 

CAGGACCACC TGCCCAGGCC CCTGTGACTC GCAGCCCTGC CAGAATGGAG GCACATGTGT 1020 

TCCAGAAGGA CTGGACGGCT ACCAGTGCCT CTGCCCGCTG GCCTTTGGAG GGGAGGCTAA 1080 

CTGTGCCCTG AAGCTGAGCC TGGAATGCAG GGTCGACCTC CTCTTCCTGC TGGACAGCTC 1140 

TGCGGGCACC ACTCTGGACG GCTTCCTGCG GGCCAAAGTC TTCGTGAAGC GGTTTGTGCG 1200 

GGCCGTGCTG AGCGAGGACT CTCGGGCCCG AGTGGGTGTG GCCACATACA GCAGGGAGCT 1260 

GCTGGTGGCG GTGCCTGTGG GGGAGTACCA GGATGTGCCT GACCTGGTCT GGAGCCTCGA 1320 

TGGCATTCCC TTCCGTGGTG GCCCCACCCT GACGGGCAGT GCCTTGCGGC AGGCGGCAGA 1380 

GCGTGGCTTC GGGAGCGCCA CCAGGACAGG CCAGGACCGG CCACGTAGAG TGGTGGTTTT 1440 

GCTCACTGAG TCACACTCCG AGGATGAGGT TGCGGGCCCA GCGCGTCACG CAAGGGCGCG 1500 

AGAGCTGCTC CTGCTGGGTG TAGGCAGTGA GGCCGTGCGG GCAGAGCTGG AGGAGATCAC 1560 

AGGCAGCCCA AAGCATGTGA TGGTCTACTC GGATCCTCAG GATCTGTTCA ACCAAATCCC 1620 

TGAGCTGCAG GGGAAGCTGT GCAGCCGGCA GCGGCCAGGG TGCCGGACAC AAGCCCTGGA 1680 

CCTCGTCTTC ATGTTGGACA CCTCTGCCTC AGTAGGGCCC GAGAATTTTG CTCAGATGCA 1740 

GAGCTTTGTG AGAAGCTGTG CCCTCCAGTT TGAGGTGAAC CCTGACGTGA CACAGGTCGG 1800 

CCTGGTGGTG TATGGCAGCC AGGTGCAGAC TGCCTTCGGG CTGGACACCA AACCCACCCG 1860 

GGCTGCGATG CTGCGGGCCA TTAGCCAGGC CCCCTACCTA GGTGGGGTGG GCTCAGCCGG 1920 

CACCGCCCTG CTGCACATCT ATGACAAAGT GATGACCGTC CAGAGGGGTG CCCGGCCTGG 1980 

TGTCCCCAAA GCTGTGGTGG TGCTCACAGG CGGGAGAGGC GCAGAGGATG CAGCCGTTCC 2040 

TGCCCAGAAG CTGAGGAACA ATGGCATCTC TGTCTTGGTC GTGGGCGTGG GGCCTGTCCT 2100 

AAGTGAGGGT CTGCGGAGGC TTGCAGGTCC CCGGGATTCC CTGATCCACG TGGCAGCTTA 2160 

CGCCGACCTG CGGTACCACC AGGACGTGCT CATTGAGTGG CTGTGTGGAG AAGCCAAGCA 2220 

GCCAGTCAAC CTCTGCAAAC CCAGCCCGTG CATGAATGAG GGCAGCTGCG TCCTGCAGAA 2280 

TGGGAGCTAC CGCTGCAAGT GTCGGGATGG CTGGGAGGGC CCCCACTGCG AGAACCGATT 2340 

CTTGAGACGC CCCTGAGGCA CATGGCTCCC GTGCAGGAGG GCAGCAGCCG TACCCCTCCC 2400 

AGCAACTACA GAGAAGGCCT GGGCACTGAA ATGGTGCCTA CCTTCTGGAA TGTCTGTGCC 2460 

CCAGGTCCTT AGAATGTCTG CTTCCCGCCG TGGCCAGGAC CACTATTCTC ACTGAGGGAG 2520 

GAGGATGTCC CAACTGCAGC CATGCTGCTT AGAGACAAGA AAGCAGCTGA TGTCACCCAC 2580 

AAACGATGTT GTTGAAAAGT TTTGATGTGT AAGTAAATAC CCACTTTCTG TACCTGCTGT 2640 

GCCTTGTTGA GGCTATGTCA TCTGCCACCT TTCCCTTGAG GATAAACAAG GGGTCCTGAA 2700 

GACTTAAATT TAGCGGCCTG ACGTTCCTTT GCACACAATC AATGCTCGCC AGAATGTTGT 2760 
TGACACAGTA ATGCCCAGCA GAGGCCTTTA CTAGAGCATC CTTTGGACGG 

Seq ID NO: 445 Protein sequence 
Protein Accession #: Eos sequence 

1 11 21 31 41 51 

I I I I I I

MPPFLLLEAV CVFLFSRVPP SLPLQEVHVS KETIGKISAA SKMMWCSAAV DIMFLLDGSN 60

SVGKGSFERS KHFAITVCDG LDISPERVRV GAFQFSSTPH LEFPLDSFST QQEVKARIKR 120 

MVFKGGRTET ELALKYLLHR GLPGGRNASV PQILIIVTDG KSQGDVALPS KQLKERGVTV 180 

FAVGVRFPRW EELHALASEP RGQHVLLAEQ VEDATNGLFS TLSSSAICSS ATPDCRVEAH 240 

PCEHRTLEMV REFAGNAPCW RGSRRTLAVL AAHCPFYSWK RVFLTHPATC YRTTCPGPCD 300 

SQPCQNGGTC VPEGLDGYQC LCPLAFGGEA NCALKLSLEC RVDLLFLLDS SAGTTLDGFL 360 

RAKVFVKRFV RAVLSEDSRA RVGVATYSRE LLVAVPVGEY QDVPDLVWSL DGIPFRGGPT 420 

LTGSALRQAA ERGFGSATRT GQDRPRRVVV LLTESHSEDE VAGPARHARA RELLLLGVGS 480

EAVRAELEEI TGSPKHVMVY SDPQDLFNQI PELQGKLCSR QRPGCRTQAL DLVFMLDTSA 540

SVGPENFAQM QSFVRSCALQ FEVNPDVTQV GLVVYGSQVQ TAFGLDTKPT RAAMLRAISQ 600

APYLGGVGSA GTALLHIYDK VMTVQRGARP GVPKAVVVLT GGRGAEDAAV PAQKLRNNGI 660

SVLVVGVGPV LSEGLRRLAG PRDSLIHVAA YADLRYHQDV LIEWLCGEAK QPVNLCKPSP 720
CMNEGSCVLQ NGSYRCKCRD GWEGPHCENR FLRRP 

Seq ID NO: 446 DNA sequence 

Nucleic Acid Accession #: NM_031942.1

Coding sequence: 145.. 1260 

1 11 21 31 41 51 

I I I I I I 

CCCGAGCCCC GCCCCTCCGG GCCCGGGTCG GCGCGCCCAG CCTGCCAGCC GCGCTGCTGC 60 

TGCTCCTCCT GCTGTGGGAC CGCTGACCGC GCGGCTGCTC CGCTCTCCCC GCTCCAAGCG 120 

CCGATCTGGG CACCCGCCAC CAGCATGGAC GCTCGCCGCG TGCCGCAGAA AGATCTCAGA 180 

GTAAAGAAGA ACTTAAAGAA ATTCAGATAT GTGAAGTTGA TTTCCATGGA AACCTCGTCA 240 

TCCTCTGATG ACAGTTGTGA CAGCTTTGCT TCTGATAATT TTGCAAACAC GAGGCTGCAG 300 

TCAGTTCGGG AAGGCTGTAG GACCCGCAGC CAGTGCAGGC ACTCTGGACC TCTCAGGGTG 360 

GCGATGAAGT TTCCAGCGCG GAGTACCAGG GGAGCAACCA ACAAAAAAGC AGAGTCCCGC 420 

CAGCCCTCAG AGAATTCTGT GACTGATTCC AACTCCGATT CAGAAGATGA AAGTGGAATG 480 

AATTTTTTGG AGAAAAGGGC TTTAAATATA AAGCAAAACA AAGCAATGCT TGCAAAACTC 540 

ATGTCTGAAT TAGAAAGCTT CCCTGGCTCG TTCCGTGGAA GACATCCCCT CCCAGGCTCC 600 

GACTCACAAT CAAGGAGACC GCGAAGGCGT ACATTCCCGG GTGTTGCTTC CAGGAGAAAC 660 

CCTGAACGGA GAGCTCGTCC TCTTACCAGG TCAAGGTCCC GGATCCTCGG GTCCCTTGAC 720 

GCTCTACCCA TGGAGGAGGA GGAGGAAGAG GATAAGTACA TGTTGGTGAG AAAGAGGAAG 780 

ACCGTGGATG GCTACATGAA TGAAGATGAC CTGCCCAGAA GCCGTCGCTC CAGATCATCC 840 

GTGACCCTTC CGCATATAAT TCGCCCAGTG GAAGAAATTA CAGAGGAGGA GTTGGAGAAC 900 

GTCTGCAGCA ATTCTCGAGA GAAGATATAT AACCGTTCAC TGGGCTCTAC TTGTCATCAA 960 

TGCCGTCAGA AGACTATTGA TACCAAAACA AACTGCAGAA ACCCAGACTG CTGGGGCGTT 1020 

CGAGGCCAGT TCTGTGGCCC CTGCCTTCGA AACCGTTATG GTGAAGAGGT CAGGGATGCT 1080 

CTGCTGGATC CGAACTGGCA TTGCCCGCCT TGTCGAGGAA TCTGCAACTG CAGTTTCTGC 1140 

CGGCAGCGAG ATGGACGGTG TGCGACTGGG GTCCTTGTGT ATTTAGCCAA ATATCATGGC 1200 

TTTGGGAATG TGCATGCCTA CTTGAAAAGC CTGAAACAGG AATTTGAAAT GCAAGCATAA 1260 

TATCTGGAAA ATTTGCTGCC TGCCTTCTAC TTCTCAAATC TTTCTTGTAA AAGTTTCCAA 1320 

TTTTTTCACT GAAACCTGAG TTAAAAATCT TGATGATCAG CCTGTTTCAT AAGAAACTCC 1380 

AATCAAGTTA ATCTTAGCAG ACATGTGTTT CTGGAGCATC ACAGAAGGTA TATTGCTAGT 1440

TACACTTTGC CCTCCTGCAG TTTCTTCTCT GCTCCCAACC CCCATCTCAT AGCATCCCCC 1500

TCTATTTCCA ATGCTCCTCT CCAACCGCTT AGTTTCTGAA TTTCTTTTAA ATTACAGTTT 1560 

TATGAAAGCA TATTTTATTT ACTTGGTGTT GAAATAGCCC TCATAAAACC TAAGCACTTG 1620 

GAAACACAAT AATAGTATTA ACTAACTAGA TCTATTGAAT TTCAGAGAAG AGCCTTCTAA 1680 

CTTGTTTACA CAAAAACGAG TATGATTTAG CACTCATACT AGTTGAAATT TTTAATAGAA 1740 

TCAAGGCACA AAAGTCTTAA AACCATGTGG AAAAATTAGG TAATTATTGC AGATTGATGT 1800 

CTCTCAATCC CATGTATTGC GCTTATGTTA CAAGTTGTTG TCACAGTTGA GACTTAATTT 1860 

CTCCTAATTT CTTCTGCCCG AAGGGTAAGT GGTGCGTCCA GCTTACACGA TCATAATTCA 1920 

AAGGTTGGTG GGCAATGTAA TACTTAATTA AAATAATGAT GGAAGAGCTA TCTGGAGATT 1980 

ATGAGTAAGC TGATTTGAAT TTTCAGTATA AAACTTTAGT ATAATTGTAG TTTGCAAAGT 2040 

TTATTTCAGT TCACATGTAA GGTATTGCAA ATAAATTCTT GGACAATTTT GTATGGAAAC 2100 

TTGATATTAA AAACTAGTCT GTGGTTCTTT GCAGTTTCTT GTAAATTTAT AAACCAGGCA 2160 

CAAGGTTCAA GTTTAGATTT TAAGCACTTT TATAACAATG ATAAGTGCCT TTTTGGAGAT 2220 

GTAACTTTTA GCAGTTTGTT AACCTGACAT CTCTGCCAGT CTAGTTTCTG GGCAGGTTTC 2280 

CTGTGTCAGT ATTCCCCCTC CTCTTTGCAT TAATCAAGGT ATTTGGTAGA GGTGGAATCT 2340 

AAGTGTTTGT ATGTCCAATT TACTTGCATA TGTAAACCAT TGCTGTGCCA TTCAATGTTT 2400 

GATGCATAAT TGGACCTTGA ATCGATAAGT GTAAATACAG CTTTTGATCT GTAATGCTTT 2460 
TATACAAAAG TTTATTTTAA TAATAAAATG TTTGTTCTAA AAAAAAAAAA 

Seq ID NO: 447 Protein sequence 
Protein Accession #: NP_114148.1 

1 11 21 31 41 51 

I I I I I I 

MDARRVPQKD LRVKKNLKKF RYVKLISMET SSSSDDSCDS FASDNFANTR LQSVREGCRT 60

RSQCRHSGPL RVAMKFPARS TRGATNKKAE SRQPSENSVT DSNSDSEDES GMNFLEKRAL 120 

NIKQNKAMLA KLMSELESFP GSFRGRHPLP GSDSQSRRPR RRTFPGVASR RNPERRARPL 180

TRSRSRILGS LDALPMEEEE EEDKYMLVRK RKTVDGYMNE DDLPRSRRSR SSVTLPHIIR 240

PVEEITEEEL ENVCSNSREK IYNRSLGSTC HQCRQKTIDT KTNCRNPDCW GVRGQFCGPC 300 

LRNRYGEEVR DALLDPNWHC PPCRGICNCS FCRQRDGRCA TGVLVYLAKY HGFGNVHAYL 360 
KSLKQEFEMQ A 

Seq ID NO: 448 DNA sequence

Nucleic Acid Accession #: NM_019894

Coding sequence: 1..1314 

1 11 21 31 41 51 

I I I I I I 

ATGTTACAGG ATCCTGACAG TGATCAACCT CTGAACAGCC TCGATGTCAA ACCCCTGCGC 60 

AAACCCCGTA TCCCCATGGA GACCTTCAGA AAGGTGGGGA TCCCCATCAT CATAGCACTA 120 

CTGAGCCTGG CGAGTATCAT CATTGTGGTT GTCCTCATCA AGGTGATTCT GGATAAATAC 180 

TACTTCCTCT GCGGGCAGCC TCTCCACTTC ATCCCGAGGA AGCAGCTGTG TGACGGAGAG 240 

CTGGACTGTC CCTTGGGGGA GGACGAGGAG CACTGTGTCA AGAGCTTCCC CGAAGGGCCT 300 

GCAGTGGCAG TCCGCCTCTC CAAGGACCGA TCCACACTGC AGGTGCTGGA CTCGGCCACA 360 

GGGAACTGGT TCTCTGCCTG TTTCGACAAC TTCACAGAAG CTCTCGCTGA GACAGCCTGT 420 

AGGCAGATGG GCTACAGCAG CAAACCCACT TTCAGAGCTG TGGAGATTGG CCCAGACCAG 480 

GATCTGGATG TTGTTGAAAT CACAGAAAAC AGCCAGGAGC TTCGCATGCG GAACTCAAGT 540 

GGGCCCTGTC TCTCAGGCTC CCTGGTCTCC CTGCACTGTC TTGCCTGTGG GAAGAGCCTG 600 

AAGACCCCCC GTGTGGTGGG TGGGGAGGAG GCCTCTGTGG ATTCTTGGCC TTGGCAGGTC 660 

AGCATCCAGT ACGACAAACA GCACGTCTGT GGAGGGAGCA TCCTGGACCC CCACTGGGTC 720 

CTCACGGCAG CCCACTGCTT CAGGAAACAT ACCGATGTGT TCAACTGGAA GGTGCGGGCA 780 

GGCTCAGACA AACTGGGCAG CTTCCCATCC CTGGCTGTGG CCAAGATCAT CATCATTGAA 840 

TTCAACCCCA TGTACCCCAA AGACAATGAC ATCGCCCTCA TGAAGCTGCA GTTCCCACTC 900 

ACTTTCTCAG GCACAGTCAG GCCCATCTGT CTGCCCTTCT TTGATGAGGA GCTCACTCCA 960 

GCCACCCCAC TCTGGATCAT TGGATGGGGC TTTACGAAGC AGAATGGAGG GAAGATGTCT 1020 

GACATACTGC TGCAGGCGTC AGTCCAGGTC ATTGACAGCA CACGGTGCAA TGCAGACGAT 1080 

GCGTACCAGG GGGAAGTCAC CGAGAAGATG ATGTGTGCAG GCATCCCGGA AGGGGGTGTG 1140 

GACACCTGCC AGGGTGACAG TGGTGGGCCC CTGATGTACC AATCTGACCA GTGGCATGTG 1200 

GTGGGCATCG TTAGCTGGGG CTATGGCTGC GGGGGCCCGA GCACCCCAGG AGTATACACC 1260 
AAGGTCTCAG CCTATCTCAA CTGGATCTAC AATGTCTGGA AGGCTGAGCT GTAA 

Seq ID NO: 449 Protein sequence 
Protein Accession #: NP_063947.1 

1 11 21 31 41 51 

I I I I I I

MLQDPDSDQP LNSLDVKPLR KPRIPMETFR KVGIPIIIAL LSLASIIIVV VLIKVILDKY 60

YFLCGQPLHF IPRKQLCDGE LDCPLGEDEE HCVKSFPEGP AVAVRLSKDR STLQVLDSAT 120 

GNWFSACFDN FTEALAETAC RQMGYSSKPT FRAVEIGPDQ DLDVVEITEN SQELRMRNSS 180

GPCLSGSLVS LHCLACGKSL KTPRVVGGEE ASVDSWPWQV SIQYDKQHVC GGSILDPHWV 240

LTAAHCFRKH TDVFNWKVRA GSDKLGSFPS LAVAKIIIIE FNPMYPKDND IALMKLQFPL 300

TFSGTVRPIC LPFFDEELTP ATPLWIIGWG FTKQNGGKMS DILLQASVQV IDSTRCNADD 360 

AYQGEVTEKM MCAGIPEGGV DTCQGDSGGP LMYQSDQWHV VGIVSWGYGC GGPSTPGVYT 420

KVSAYLNWIY NVWKAEL

Seq ID NO: 450 DNA sequence 

Nucleic Acid Accession #: XM_051860.2 

Coding sequence: 52.. 3042 

1 11 21 31 41 51 

I I I I I I 

GCTCACCCAG GAAAAATATG CAATCGTCCC ATTGATATAC AGGCCACTAC AATGGATGGA 60 
GTTAACCTCA GCACCGAGGT TGTCTACAAA AAAGGCCAGG ATTATAGGTT TGCTTGCTAC 120 
GACCGGGGCA GAGCCTGCCG GAGCTACCGT GTACGGTTCC TCTGTGGGAA GCCTGTGAGG 180 
CCCAAACTCA CAGTCACCAT TGACACCAAT GTGAACAGCA CCATTCTGAA CTTGGAGGAT 240 
AATGTACAGT CATGGAAACC TGGAGATACC CTGGTCATTG CCAGTACTGA TTACTCCATG 300 
TACCAGGCAG AAGAGTTCCA GGTGCTTCCC TGCAGATCCT GCGCCCCCAA CCAGGTCAAA 360
GTGGCAGGGA AACCAATGTA CCTGCACATC GGGGAGGAGA TAGACGGCGT GGACATGCGG 420

GCGGAGGTTG GGCTTCTGAG CCGGAACATC ATAGTGATGG GGGAGATGGA GGACAAATGC 480 

TACCCCTACA GAAACCACAT CTGCAATTTC TTTGACTTCG ATACCTTTGG GGGCCACATC 540 

AAGTTTGCTC TGGGATTTAA GGCAGCACAC TTGGAGGGCA CGGAGCTGAA GCATATGGGA 600

CAGCAGCTGG TGGGTCAGTA CCCGATTCAC TTCCACCTGG CCGGTGATGT AGACGAAAGG 660 

GGAGGTTATG ACCCACCCAC ATACATCAGG GACCTCTCCA TCCATCATAC ATTCTCTCGC 720 

TGCGTCACAG TCCATGGCTC CAATGGCTTG TTGATCAAGG ACGTTGTGGG CTATAACTCT 780 

TTGGGCCACT GCTTCTTCAC GGAAGATGGG CCGGAGGAAC GCAACACTTT TGACCACTGT 840 

CTTGGCCTCC TTGTCAAGTC TGGAACCCTC CTCCCCTCGG ACCGTGACAG CAAGATGTGC 900 

AAGATGATCA CAGGAGACTC CTACCCAGGG TACATCCCCA AGCCCAGGCA AGACTGCAAT 960 

GCTGTGTCCA CCTTCTGGAT GGCCAATCCC AACAACAACC TCATCAACTG TGCCGCTGCA 1020 

GGATCTGAGG AAACTGGATT TTGGTTTATT TTTCACCACG TACCAACGGG CCCCTCCGTG 1080 

GGAATGTACT CCCCAGGTTA TTCAGAGCAC ATTCCACTGG GAAAATTCTA TAACAACCGA 1140 

GCACATTCCA ACTACCGGGC TGGCATGATC ATAGACAACG GAGTCAAAAC CACCGAGGCC 1200 

TCTGCCAAGG ACAAGCGGCC GTTCCTCTCA ATCATCTCTG CCAGATACAG CCCTCACCAG 1260 

GACGCCGACC CGCTGAAGCC CCGGGAGCCG GCCATCATCA GACACTTCAT TGCCTACAAG 1320 

AACCAGGACC ACGGGGCCTG GCTGCGCGGC GGGGATGTGT GGCTGGACAG CTGCCGGTTT 1380 

GCTGACAATG GCATTGGCCT GACCCTGGCC AGTGGTGGAA CCTTCCCGTA TGACGACGGC 1440 

TCCAAGCAAG AGATAAAGAA CAGCTTGTTT GTTGGCGAGA GTGGCAACGT GGGGACGGAA 1500 

ATGATGGACA ATAGGATCTG GGGCCCTGGC GGCTTGGACC ATAGCGGAAG GACCCTCCCT 1560 

ATAGGCCAGA ATTTTCCAAT TAGAGGAATT CAGTTATATG ATGGCCCCAT CAACATCCAA 1620 

AACTGCACTT TCCGAAAGTT TGTGGCCCTG GAGGGCCGGC ACACCAGCGC CCTGGCCTTC 1680 

CGCCTGAATA ATGCCTGGCA GAGCTGCCCC CATAACAACG TGACCGGCAT TGCCTTTGAG 1740 

GACGTTCCGA TTACTTCCAG AGTGTTCTTC GGAGAGCCTG GGCCCTGGTT CAACCAGCTG 1800 

GACATGGATG GGGATAAGAC ATCTGTGTTC CATGACGTCG ACGGCTCCGT GTCCGAGTAC 1860 

CCTGGCTCCT ACCTCACGAA GAATGACAAC TGGCTGGTCC GGCACCCAGA CTGCATCAAT 1920 

GTTCCCGACT GGAGAGGGGC CATTTGCAGT GGGTGCTATG CACAGATGTA CATTCAAGCC 1980 

TACAAGACCA GTAACCTGCG AATGAAGATC ATCAAGAATG ACTTCCCCAG CCACCCTCTT 2040

TACCTGGAGG GGGCGCTCAC CAGGAGCACC CATTACCAGC AATACCAACC GGTTGTCACC 2100 

CTGCAGAAGG GCTACACCAT CCACTGGGAC CAGACGGCCC CCGCCGAACT CGCCATCTGG 2160 

CTCATCAACT TCAACAAGGG CGACTGGATC CGAGTGGGGC TCTGCTACCC GCGAGGCACC 2220 

ACATTCTCCA TCCTCTCGGA TGTTCACAAT CGCCTGCTGA AGCAAACGTC CAAGACGGGC 2280 

GTCTTCGTGA GGACCTTGCA GATGGACAAA GTGGAGCAGA GCTACCCTGG CAGGAGCCAC 2340 

TACTACTGGG ACGAGGACTC AGGGCTGTTG TTCCTGAAGC TGAAAGCTCA GAACGAGAGA 2400 

GAGAAGTTTG CTTTCTGCTC CATGAAAGGC TGTGAGAGGA TAAAGATTAA AGCTCTGATT 2460 

CCAAAGAACG CAGGCGTCAG TGACTGCACA GCCACAGCTT ACCCCAAGTT CACCGAGAGG 2520 

GCTGTCGTAG ACGTGCCGAT GCCCAAGAAG CTCTTTGGTT CTCAGCTGAA AACAAAGGAC 2580 

CATTTCTTGG AGGTGAAGAT GGAGAGTTCC AAGCAGCACT TCTTCCACCT CTGGAACGAC 2640 

TTCGCTTACA TTGAAGTGGA TGGGAAGAAG TACCCCAGTT CGGAGGATGG CATCCAGGTG 2700 

GTGGTGATTG ACGGGAACCA AGGGCGCGTG GTGAGCCACA CGAGCTTCAG GAACTCCATT 2760 

CTGCAAGGCA TACCATGGCA GCTTTTCAAC TATGTGGCGA CCATCCCTGA CAATTCCATA 2820 

GTGCTTATGG CATCAAAGGG AAGATACGTC TCCAGAGGCC CATGGACCAG AGTGCTGGAA 2880 

AAGCTTGGGG CAGACAGGGG TCTCAAGTTG AAAGAGCAAA TGGCATTCGT TGGCTTCAAA 2940 

GGCAGCTTCC GGCCCATCTG GGTGACACTG GACACTGAGG ATCACAAAGC CAAAATCTTC 3000 

CAAGTTGTGC CCATCCCTGT GGTGAAGAAG AAGAAGTTGT GAGGACAGCT GCCGCCCGGT 3060 

GCCACCTCGT GGTAGACTAT GACGGTGACT CTTGGCAGCA GACCAGTGGG GGATGGCTGG 3120 

GTCCCCCAGC CCCTGCCAGC AGCTGCCTGG GAAGGCCGTG TTTCAGCCCT GATGGGCCAA 3180 

GGGAAGGCTA TCAGAGACCC TGGTGCTGCC ACCTGCCCCT ACTCAAGTGT CTACCTGGAG 3240 

CCCCTGGGGC GGTGCTGGCC AATGCTGGAA ACATTCACTT TCCTGCAGCC TCTTGGGTGC 3300 

TTCTCTCCTA TCTGTGCCTC TTCAGTGGGG GTTTGGGGAC CATATCAGGA GACCTGGGTT 3360 

GTGCTGACAG CAAAGATCCA CTTTGGCAGG AGCCCTGACC CAGCTAGGAG GTAGTCTGGA 3420 

GGGCTGGTCA TTCACAGATC CCCATGGTCT TCAGCAGACA AGTGAGGGTG GTAAATGTAG 3480 

GAGAAAGAGC CTTGGCCTTA AGGAAATCTT TACTCCTGTA AGCAAGAGCC AACCTCACAG 3540 

GATTAGGAGC TGGGGTAGAA CTGGCTATCC TTGGGGAAGA GGCAAGCCCT GCCTCTGGCC 3600 

GTGTCCACCT TTCAGGAGAC TTTGAGTGGC AGGTTTGGAC TTGGACTAGA TGACTCTCAA 3660 

AGGCCCTTTT AGTTCTGAGA TTCCAGAAAT CTGCTGCATT TCACATGGTA CCTGGAACCC 3720 

AACAGTTCAT GGATATCCAC TGATATCCAT GATGCTGGGT GCCCCAGCGC ACACGGGATG 3780 

GAGAGGTGAG AACTAATGCC TAGCTTGAGG GGTCTGCAGT CCAGTAGGGC AGGCAGTCAG 3840 

GTCCATGTGC ACTGCAATGC CAGGTGGAGA AATCACAGAG AGGTAAAATG GAGGCCAGTG 3900 

CCATTTCAGA GGGGAGGCTC AGGAAGGCTT CTTGCTTACA GGAATGAAGG CTGGGGGCAT 3960 

TTTGCTGGGG GGAGATGAGG CAGCCTCTGG AATGGCTCAG GGATTCAGCC CTCCCTGCCG 4020 

CTGCCTGCTG AAGCTGGTGA CTACGGGGTC GCCCTTTGCT CACGTCTCTC TGGCCCACTC 4080 

ATGATGGAGA AGTGTGGTCA GAGGGGAGCA ATGGGCTTTG CTGCTTATGA GCACAGAGGA 4140 

ATTCAGTCCC CAGGCAGCCC TGCCTCTGAC TCCAAGAGGG TGAAGTCCAC AGAAGTGAGC 4200 

TCCTGCCTTA GGGCCTCATT TGCTCTTCAT CCAGGGAACT GAGCACAGGG GGCCTCCAGG 4260 

AGACCCTAGA TGTGCTCGTA CTCCCTCGGC CTGGGATTTC AGAGCTGGAA ATATAGAAAA 4320 

TATCTAGCCC AAAGCCTTCA TTTTAACAGA TGGGGAAAGT GAGCCCCCAA GATGGGAAAG 4380 

AACCACACAG CTAAGGGAGG GCCTGGGGAG CCCCACCCTA GCCCTTGCTG CCACACCACA 4440 

TTGCCTCAAC AACCGGCCCC AGAGTGCCCA GGCACTCCTG AGGTAGCTTC TGGAAATGGG 4500 

GACAAGTCCC CTCGAAGGAA AGGAAATGAC TAGAGTAGAA TGACAGCTAG CAGATCTCTT 4560 

CCCTCCTGCT CCCAGCGCAC ACAAACCCGC CCTCCCCTTG GTGTTGGCGG TCCCTGTGGC 4620 

CTTCACTTTG TTCACTACCT GTCAGCCCAG CCTGGGTGCA CAGTAGCTGC AACTCCCCAT 4680 

TGGTGCTACC TGGCTCTCCT GTCTCTGCAG CTCTACAGGT GAGGCCCAGC AGAGGGAGTA 4740 

GGGCTCGCCA TGTTTCTGGT GAGCCAATTT GGCTGATCTT GGGTGTCTGA ACAGCTATTG 4800 

GGTCCACCCC AGTCCCTTTC AGCTGCTGCT TAATGCCCTG CTCTCTCCCT GGCCCACCTT 4860 

ATAGAGAGCC CAAAGAGCTC CTGTAAGAGG GAGAACTCTA TCTGTGGTTT ATAATCTTGC 4920 

ACGAGGCACC AGAGTCTCCC TGGGTCTTGT GATGAACTAC ATTTATCCCC TTTCCTGCCC 4980 

CAACCACAAA CTCTTTCCTT CAAAGAGGGC CTGCCTGGCT CCCTCCACCC AACTGCACCC 5040 

ATGAGACTCG GTCCAAGAGT CCATTCCCCA GGTGGGAGCC AACTGTCAGG GAGGTCTTTC 5100 

CCACCAAACA TCTTTCAGCT GCTGGGAGGT GACCATAGGG CTCTGCTTTT AAAGATATGG 5160 

CTGCTTCAAA GGCCAGAGTC ACAGGAAGGA CTTCTTCCAG GGAGATTAGT GGTGATGGAG 5220 

AGGAGAGTTA AAATGACCTC ATGTCCTTCT TGTCCACGGT TTTGTTGAGT TTTCACTCTT 5280 

CTAATGCAAG GGTCTCACAC TGTGAACCAC TTAGGATGTG ATCACTTTCA GGTGGCCAGG 5340 

AATGTTGAAT GTCTTTGGCT CAGTTCATTT AAAAAAGATA TCTATTTGAA AGTTCTCAGA 5400 

GTTGTACATA TGTTTCACAG TACAGGATCT GTACATAAAA GTTTCTTTCC TAAACCATTC 5460 

ACCAAGAGCC AATATCTAGG CATTTTCTTG GTAGCACAAA TTTTCTTATT GCTTAGAAAA 5520 

TTGTCCTCCT TGTTATTTCT GTTTGTAAGA CTTAAGTGAG TTAGGTCTTT AAGGAAAGCA 5580
ACGCTCCTCT GAAATGCTTG TCTTTTTTCT GTTGCCGAAA TAGCTGGTCC TTTTTCGGGA 5640
GTTAGATGTA TAGAGTGTTT GTATGTAAAC ATTTCTTGTA GGCATCACCA TGAACAAAGA 5700 
TATATTTTCT ATTTATTTAT TATATGTGCA CTTCAAGAAG TCACTGTCAG AGAAATAAAG 5760 
AATTGTCTTA AATGTCAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAA 



Seq ID NO: 451 Protein sequence 
Protein Accession #: XP_051860.2 

1 11 21 31 41 51 

I I I I I I 

MDGVNLSTEV VYKKGQDYRF ACYDRGRACR SYRVRFLCGK PVRPKLTVTI DTNVNSTILN 60

LEDNVQSWKP GDTLVIASTD YSMYQAEEFQ VLPCRSCAPN QVKVAGKPMY LHIGEEIDGV 120 

DMRAEVGLLS RNIIVMGEME DKCYPYRNHI CNFFDFDTFG GHIKFALGFK AAHLEGTELK 180

HMGQQLVGQY PIHFHLAGDV DERGGYDPPT YIRDLSIHHT FSRCVTVHGS NGLLIKDVVG 240

YNSLGHCFFT EDGPEERNTF DHCLGLLVKS GTLLPSDRDS KMCKMITGDS YPGYIPKPRQ 300 

DCNAVSTFWM ANPNNNLINC AAAGSEETGF WFIFHHVPTG PSVGMYSPGY SEHIPLGKFY 360 

NNRAHSNYRA GMIIDNGVKT TEASAKDKRP FLSIISARYS PHQDADPLKP REPAIIRHFI 420

AYKNQDHGAW LRGGDVWLDS CRFADNGIGL TLASGGTFPY DDGSKQEIKN SLFVGESGNV 480

GTEMMDNRIW GPGGLDHSGR TLPIGQNFPI RGIQLYDGPI NIQNCTFRKF VALEGRHTSA 540 

LAFRLNNAWQ SCPHNNVTGI AFEDVPITSR VFFGEPGPWF NQLDMDGDKT SVFHDVDGSV 600 

SEYPGSYLTK NDNWLVRHPD CINVPDWRGA ICSGCYAQMY IQAYKTSNLR MKIIKNDFPS 660 

HPLYLEGALT RSTHYQQYQP VVTLQKGYTI HWDQTAPAEL AIWLINFNKG DWIRVGLCYP 720

RGTTFSILSD VHNRLLKQTS KTGVFVRTLQ MDKVEQSYPG RSHYYWDEDS GLLFLKLKAQ 780 

NEREKFAFCS MKGCERIKIK ALIPKNAGVS DCTATAYPKF TERAVVDVPM PKKLFGSQLK 840

TKDHFLEVKM ESSKQHFFHL WNDFAYIEVD GKKYPSSEDG IQVVVIDGNQ GRVVSHTSFR 900

NSILQGIPWQ LFNYVATIPD NSIVLMASKG RYVSRGPWTR VLEKLGADRG LKLKEQMAFV 960 

GFKGSFRPIW VTLDTEDHKA KIFQVVPIPV VKKKKL

Seq ID NO: 452 DNA sequence 

Nucleic Acid Accession #: Eos sequence
Coding sequence: 261.. 2861 

1 11 21 31 41 51 

I I I I I I

GAGCTAGCGC TCAAGCAGAG CCCAGCGCGG TGCTATCGGA CAGAGCCTGG CGAGCGCAAG 60 

CGGCGCGGGG AGCCAGCGGG GCTGAGCGCG GCCAGGGTCT GAACCCAGAT TTCCCAGACT 120 

AGCTACCACT CCGCTTGCCC ACGCCCCGGG AGCTCGCGGC GCCTGGCGGT CAGCGACCAG 180 

ACGTCCGGGG CCGCTGCGCT CCTGGCCCGC GAGGCGTGAC ACTGTCTCGG CTACAGACCC 240 

AGAGGGAGCA CACTGCCAGG ATGGGAGCTG CTGGGAGGCA GGACTTCCTC TTCAAGGCCA 300 

TGCTGACCAT CAGCTGGCTC ACTCTGACCT GCTTCCCTGG GGCCACATCC ACAGTGGCTG 360 

CTGGGTGCCC TGACCAGAGC CCTGAGTTGC AACCCTGGAA CCCTGGCCAT GACCAAGACC 420 

ACCATGTGCA TATCGGCCAG GGCAAGACAC TGCTGCTCAC CTCTTCTGCC ACGGTCTATT 480 

CCATCCACAT CTCAGAGGGA GGCAAGCTGG TCATTAAAGA CCACGACGAG CCGATTGTTT 540 

TGCGAACCCG GCACATCCTG ATTGACAACG GAGGAGAGCT GCATGCTGGG AGTGCCCTCT 600 

GCCCTTTCCA GGGCAATTTC ACCATCATTT TGTATGGAAG GGCTGATGAA GGTATTCAGC 660 

CGGATCCTTA CTATGGTCTG AAGTACATTG GGGTTGGTAA AGGAGGCGCT CTTGAGTTGC 720 

ATGGACAGAA AAAGCTCTCC TGGACATTTC TGAACAAGAC CCTTCACCCA GGTGGCATGG 780 

CAGAAGGAGG CTATTTTTTT GAAAGGAGCT GGGGCCACCG TGGAGTTATT GTTCATGTCA 840 

TCGACCCCAA ATCAGGCACA GTCATCCATT CTGACCGGTT TGACACCTAT AGATCCAAGA 900 

AAGAGAGTGA ACGTCTGGTC CAGTATTTGA ACGCGGTGCC CGATGGCAGG ATCCTTTCTG 960 

TTGCAGTGAA TGATGAAGGT TCTCGAAATC TGGATGACAT GGCCAGGAAG GCGATGACCA 1020 

AATTGGGAAG CAAACACTTC CTGCACCTTG GATTTAGACA CCCTTGGAGT TTTCTAACTG 1080 

TGAAAGGAAA TCCATCATCT TCAGTGGAAG ACCATATTGA ATATCATGGA CATCGAGGCT 1140 

CTGCTGCTGC CCGGGTATTC AAATTGTTCC AGACAGAGCA TGGCGAATAT TTCAATGTTT 1200 

CTTTGTCCAG TGAGTGGGTT CAAGACGTGG AGTGGACGGA GTGGTTCGAT CATGATAAAG 1260 

TATCTCAGAC TAAAGGTGGG GAGAAAATTT CAGACCTCTG GAAAGCTCAC CCAGGAAAAA 1320 

TATGCAATCG TCCCATTGAT ATACAGGCCA CTACAATGGA TGGAGTTAAC CTCAGCACCG 1380 

AGGTTGTCTA CAAAAAAGGC CAGGATTATA GGTTTGCTTG CTACGACCGG GGCAGAGCCT 1440 

GCCGGAGCTA CCGTGTACGG TTCCTCTGTG GGAAGCCTGT GAGGCCCAAA CTCACAGTCA 1500 

CCATTGACAC CAATGTGAAC AGCACCATTC TGAACTTGGA GGATAATGTA CAGTCATGGA 1560 

AACCTGGAGA TACCCTGGTC ATTGCCAGTA CTGATTACTC CATGTACCAG GCAGAAGAGT 1620 

TCCAGGTGCT TCCCTGCAGA TCCTGCGCCC CCAACCAGGT CAAAGTGGCA GGGAAACCAA 1680

TGTACCTGCA CATCGGGGAG GAGATAGACG GCGTGGACAT GCGGGCGGAG GTTGGGCTTC 1740 

TGAGCCGGAA CATCATAGTG ATGGGGGAGA TGGAGGACAA ATGCTACCCC TACAGAAACC 1800 

ACATCTGCAA TTTCTTTGAC TTCGATACCT TTGGGGGCCA CATCAAGTTT GCTCTGGGAT 1860 

TTAAGGCAGC ACACTTGGAG GGCACGGAGC TGAAGCATAT GGGACAGCAG CTGGTGGGTC 1920 

AGTACCCGAT TCACTTCCAC CTGGCCGGTG ATGTAGACGA AAGGGGAGGT TATGACCCAC 1980 

CCACATACAT CAGGGACCTC TCCATCCATC ATACATTCTC TCGCTGCGTC ACAGTCCATG 2040 

GCTCCAATGG CTTGTTGATC AAGGACGTTG TGGGCTATAA CTCTTTGGGC CACTGCTTCT 2100 

TCACGGAAGA TGGGCCGGAG GAACGCAACA CTTTTGACCA CTGTCTTGGC CTCCTTGTCA 2160 

AGTCTGGAAC CCTCCTCCCC TCGGACCGTG ACAGCAAGAT GTGCAAGATG ATCACAGAGG 2220 

ACTCCTACCC AGGGTACATC CCCAAGCCCA GGCAAGACTG CAATGCTGTG TCCACCTTCT 2280

GGATGGCCAA TCCCAACAAC AACCTCATCA ACTGTGCCGC TGCAGGATCT GAGGAAACTG 2340 

GATTTTGGTT TATTTTTCAC CACGTACCAA CGGGCCCCTC CGTGGGAATG TACTCCCCAG 2400 

GTTATTCAGA GCACATTCCA CTGGGAAAAT TCTATAACAA CCGAGCACAT TCCAACTACC 2460 

GGGCTGGCAT GATCATAGAC AACGGAGTCA AAACCACCGA GGCCTCTGCC AAGGACAAGC 2520 

GGCCGTTCCT CTCAATCATC TCTGCCAGAT ACAGCCCTCA CCAGGACGCC GACCCGCTGA 2580 

AGCCCCGGGA GCCGGCCATC ATCAGACACT TCATTGCCTA CAAGAACCAG GACCACGGGG 2640 

CCTGGCTGCG CGGCGGGGAT GTGTGGCTGG ACAGCTGCCA TTTCAGAGGG GAGGCTCAGG 2700 

AAGGCTTCTT GCTTACAGGA ATGAAGGCTG GGGGCATTTT GCTGGGGGGA GATGAGGCAG 2760 

CCTCTGGAAT GGCTCAGGGA TTCAGCCCTC CCTGCCGCTG CCTGCTGAAG CTGGTGACTA 2820 

CGGGGTCGCC CTTTGCTCAC GTCTCTCTGG CCCACTCATG ATGGAGAAGT GTGGTCAGAG 2880 

GGGAGCAATG GGCTTTGCTG CTTATGAGCA CAGAGGAATT CAGTCCCCAG GCAGCCCTGC 2940 

CTCTGACTCC AAGAGGGTGA AGTCCACAGA AGTGAGCTCC TGCCTTAGGG CCTCATTTGC 3000 

TCTTCATCCA GGGAACTGAG CACAGGGGGC CTCCAGGAGA CCCTAGATGT GCTCGTACTC 3060 

CCTCGGCCTG GGATTTCAGA GCTGGAAATA TAGAAAATAT CTAGCCCAAA GCCTTCATTT 3120

TAACAGATGG GGAAAGTGAG CCCCCAAGAT GGGAAAGAAC CACACAGCTA AGGGAGGGCC 3180

TGGGGAGCCC CACCCTAGCC CTTGCTGCCA CACCACATTG CCTCAACAAC CGGCCCCAGA 3240

GTGCCCAGGC ACTCCTGAGG TAGCTTCTGG AAATGGGGAC AAGTCCCCTC GAAGGAAAGG 3300

AAATGACTAG AGTAGAATGA CAGCTAGCAG ATCTCTTCCC TCCTGCTCCC AGCGCACACA 3360

AACCCGCCCT CCCCTTGGTG TTGGCGGTCC CTGTGGCCTT CACTTTGTTC ACTACCTGTC 3420

AGCCCAGCCT GGGTGCACAG TAGCTGCAAC TCCCCATTGG TGCTACCTGG CTCTCCTGTC 3480

TCTGCAGCTC TACAGGTGAG GCCCAGCAGA GGGAGTAGGG CTCGCCATGT TTCTGGTGAG 3540

CCAATTTGGC TGATCTTGGG TGTCTGAACA GCTATTGGGT CCACCCCAGT CCCTTTCAGC 3600

TGCTGCTTAA TGCCCTGCTC TCTCCCTGGC CCACCTTATA GAGAGCCCAA AGAGCTCCTG 3660

TAAGAGGGAG AACTCTATCT GTGGTTTATA ATCTTGCACG AGGCACCAGA GTCTCCCTGG 3720

GTCTTGTGAT GAACTACATT TATCCCCTTT CCTGCCCCAA CCACAAACTC TTTCCTTCAA 3780

AGAGGGCCTG CCTGGCTCCC TCCACCCAAC TGCACCCATG AGACTCGGTC CAAGAGTCCA 3840

TTCCCCAGGT GGGAGCCAAC TGTCAGGGAG GTCTTTCCCA CCAAACATCT TTCAGCTGCT 3900

GGGAGGTGAC CATAGGGCTC TGCTTTTAAA GATATGGCTG CTTCAAAGGC CAGAGTCACA 3960

GGAAGGACTT CTTCCAGGGA GATTAGTGGT GATGGAGAGG AGAGTTAAAA TGACCTCATG 4020

TCCTTCTTGT CCACGGTTTT GTTGAGTTTT CACTCTTCTA ATGCAAGGGT CTCACACTGT 4080

GAACCACTTA GGATGTGATC ACTTTCAGGT GGCCAGGAAT GTTGAATGTC TTTGGCTCAG 4140

TTCATTTAAA AAAGATATCT ATTTGAAAGT TCTCAGAGTT GTACATATGT TTCACAGTAC 4200

AGGATCTGTA CATAAAAGTT TCTTTCCTAA ACCATTCACC AAGAGCCAAT ATCTAGGCAT 4260

TTTCTTGGTA GCACAAATTT TCTTATTGCT TAGAAAATTG TCCTCCTTGT TATTTCTGTT 4320

TGTAAGACTT AAGTGAGTTA GGTCTTTAAG GAAAGCAACG CTCCTCTGAA ATGCTTGTCT 4380

TTTTTCTGTT GCCGAAATAG CTGGTCCTTT TTCGGGAGTT AGATGTATAG AGTGTTTGTA 4440

TGTAAACATT TCTTGTAGGC ATCACCATGA ACAAAGATAT ATTTTCTATT TATTTATTAT 4500

ATGTGCACTT CAAGAAGTCA CTGTCAGAGA AATAAAGAAT TGTCTTAAAT GTCATGATTG 4560

GAGATGTCCT TTGCATTGCT TGGAAGGGGT GTACCTAGAG CCAAGGAAAT TGGCTCTGGT 4620

TTGGAAAAAT TTTGCTGTTA TTATAGTAAA CATACAAAGG ATGTCAAAAA AAAAAAAAAA 4680

AAAAAAAAAA AAAAAAAAAA AA

Seq ID NO: 453 Protein sequence
Protein Accession #: Eos sequence

1 11 21 31 41 51

I I I I I I

MGAAGRQDFL FKAMLTISWL TLTCFPGATS TVAAGCPDQS PELQPWNPGH DQDHHVHIGQ 60

GKTLLLTSSA TVYSIHISEG GKLVIKDHDE PIVLRTRHIL IDNGGELHAG SALCPFQGNF 120

TIILYGRADE GIQPDPYYGL KYIGVGKGGA LELHGQKKLS WTFLNKTLHP GGMAEGGYFF 180

ERSWGHRGVI VHVIDPKSGT VIHSDRFDTY RSKKESERLV QYLNAVPDGR ILSVAVNDEG 240

SRNLDDMARK AMTKLGSKHF LHLGFRHPWS FLTVKGNPSS SVEDHIEYHG HRGSAAARVF 300

KLFQTEHGEY FNVSLSSEWV QDVEWTEWFD HDKVSQTKGG EKISDLWKAH PGKICNRPID 360

IQATTMDGVN LSTEVVYKKG QDYRFACYDR GRACRSYRVR FLCGKPVRPK LTVTIDTNVN 420

STILNLEDNV QSWKPGDTLV IASTDYSMYQ AEEFQVLPCR SCAPNQVKVA GKPMYLHIGE 480

EIDGVDMRAE VGLLSRNIIV MGEMEDKCYP YRNHICNFFD FDTFGGHIKF ALGFKAAHLE 540

GTELKHMGQQ LVGQYPIHFH LAGDVDERGG YDPPTYIRDL SIHHTFSRCV TVHGSNGLLI 600

KDVVGYNSLG HCFFTEDGPE ERNTFDHCLG LLVKSGTLLP SDRDSKMCKM ITEDSYPGYI 660

PKPRQDCNAV STFWMANPNN NLINCAAAGS EETGFWFIFH HVPTGPSVGM YSPGYSEHIP 720

LGKFYNNRAH SNYRAGMIID NGVKTTEASA KDKRPFLSII SARYSPHQDA DPLKPREPAI 780

IRHFIAYKNQ DHGAWLRGGD VWLDSCHFRG EAQEGFLLTG MKAGGILLGG DEAASGMAQG 840

FSPPCRCLLK LVTTGSPFAH VSLAHS

Seq ID NO: 454 DNA sequence

Nucleic Acid Accession #: NM_013282.2

Coding sequence: 85.. 2466

1 11 21 31 41 51

I I I I I I

CGACTCCTTA GAGCATGGCA TGGCTCAGAG GTGCTGGTAA AACTGATGGG GGTTTTTGCT 60

GTCCCTCCCC TCAGCGCCGA CACCATGTGG ATCCAGGTTC GGACCATGGA CGGGAGGCAG 120

ACCCACACGG TGGACTCGCT GTCCAGGCTG ACCAAGGTGG AGGAGCTGAG GCGGAAGATC 180

CAGGAGCTGT TCCACGTGGA GCCAGGCCTG CAGAGGCTGT TCTACAGGGG CAAACAGATG 240

GAGGACGGCC ATACCCTCTT CGACTACGAG GTCCGCCTGA ATGACACCAT CCAGCTCCTG 300

GTCCGCCAGA GCCTCGTGCT CCCCCACAGC ACCAAGGAGC GGGACTCCGA GCTCTCCGAC 360

ACCGACTCCG GCTGCTGCCT GGGCCAGAGT GAGTCAGACA AGTCCTCCAC CCACGGCGAG 420

GCGGCCGCCG AGACTGACAG CAGGCCAGCC GATGAGGACA TGTGGGATGA GACGGAATTG 480

GGGCTGTACA AGGTCAATGA GTACGTCGAT GCTCGGGACA CGAACATGGG GGCGTGGTTT 540

GAGGCGCAGG TGGTCAGGGT GACGCGGAAG GCCCCCTCCC GGGACGAGCC CTGCAGCTCC 600

ACGTCCAGGC CGGCGCTGGA GGAGGACGTC ATTTACCACG TGAAATACGA CGACTACCCG 660

GAGAACGGCG TGGTCCAGAT GAACTCCAGG GACGTCCGAG CGCGCGCCCG CACCATCATC 720

AAGTGGCAGG ACCTGGAGGT GGGCCAGGTG GTCATGCTCA ACTACAACCC CGACAACCCC 780

AAGGAGCGGG GCTTCTGGTA CGACGCGGAG ATCTCCAGGA AGCGCGAGAC CAGGACGGCG 840

CGGGAACTCT ACGCCAACGT GGTGCTGGGG GATGATTCTC TGAACGACTG TCGGATCATC 900

TTCGTGGACG AAGTCTTCAA GATTGAGCGG CCGGGTGAAG GGAGCCCCAT GGTTGACAAC 960

CCCATGAGAC GGAAGAGCGG GCCGTCCTGC AAGCACTGCA AGGACGACGT GAACAGACTC 1020

TGCCGGGTCT GCGCCTGCCA CCTGTGCGGG GGCCGGCAGG ACCCCGACAA GCAGCTCATG 1080

TGCGATGAGT GCGACATGGC CTTCCACATC TACTGCCTGG ACCCGCCCCT CAGCAGTGTT 1140

CCCAGCGAGG ACGAGTGGTA CTGCCCTGAG TGCCGGAATG ATGCCAGCGA GGTGGTACTG 1200

GCGGGAGAGC GGCTGAGAGA GAGCAAGAAG AAGGCGAAGA TGGCCTCGGC CACATCGTCC 1260

TCACAGCGGG ACTGGGGCAA GGGCATGGCC TGTGTGGGCC GCACCAAGGA ATGTACCATC 1320

GTCCCGTCCA ACCACTACGG ACCCATCCCG GGGATCCCCG TGGGCACCAT GTGGCGGTTC 1380

CGAGTCCAGG TCAGCGAGTC GGGTGTCCAT CGGCCCCACG TGGCTGGCAT ACACGGCCGG 1440

AGCAACGACG GAGCGTACTC CCTAGTCCTG GCGGGGGGCT ATGAGGATGA CGTGGACCAT 1500

GGGAATTTTT TCACATACAC GGGTAGTGGT GGTCGAGATC TTTCCGGCAA CAAGAGGACC 1560

GCGGAACAGT CTTGTGATCA GAAACTCACC AACACCAACA GGGCGCTGGC TCTCAACTGC 1620

TTTGCTCCCA TCAATGACCA AGAAGGGGCC GAGGCCAAGG ACTGGCGGTC GGGGAAGCCG 1680

GTCAGGGTGG TGCGCAATGT CAAGGGTGGC AAGAATAGCA AGTACGCCCC CGCTGAGGGC 1740

AACCGCTACG ATGGCATCTA CAAGGTTGTG AAATACTGGC CCGAGAAGGG GAAGTCCGGG 1800

TTTCTCGTGT GGCGCTACCT TCTGCGGAGG GACGATGATG AGCCTGGCCC TTGGACGAAG 1860

GAGGGGAAGG ACCGGATCAA GAAGCTGGGG CTGACCATGC AGTATCCAGA AGGCTACCTG 1920

GAAGCCCTGG CCAACCGAGA GCGAGAGAAG GAGAACAGCA AGAGGGAGGA
CAGGAGGGGG GCTTCGCGTC CCCCAGGACG GGCAAGGGCA AGTGGAAGCG 
GGAGGTGGCC CGAGCAGGGC CGGGTCCCCG CGCCGGACAT CCAAGAAAAC 
CCCTACAGTC TCACGGCCCA GCAGAGCAGC CTCATCAGAG AGGACAAGAG 
CTGTGGAATG AGGTCCTGGC GTCACTCAAG GACCGGCCGG GGAGCGGCAG 
TTGTTCCTGA GTAAAGTGGA GGAGACGTTC CAGTGTATCT GCTGTCAGGA 
CGGCCCATCA CGACCGTGTG CCAGCACAAC GTGTGCAAGG ACTGCCTGGA 
CGGGCACAGG TGTTCAGCTG CCCTGCCTGC CGCTACGACC TGGGCCGCAG 
CAGGTGAACC AGCCTCTGCA GACCGTCCTC AACCAGCTCT TCCCCGGCTA 
CGGTGATCTC CAAGCACTTC TCGACAGGCG TTTTGCTGAA AACGTGTCGG 
CATCGGCACT GATTTTGTTC TTAGTGGGCT TAACTTAAAC AGGTAGTGTT 
CCTAAAAAGG TTTGTCTTCC TTTTTTTTTA TTTTTATTTT TCAAATCTAT 
GAATTTATGT ATTCTGGCTA AAAGTTGGAC TTCTCAGTAT TGTGTTTAGT 
CATAAAAGCC TGCAATTTCT CGACAAAACA ACACAAGATT TTTTAAAGAT 
ACTACGTGGT GTGGAGGCTG TTGATGTTTC TGGTGTCAAG TTCTCAGAAG 
CAACTCTTTA AGAAGGCGAC AGGATCAGTC CTTCTCTAGG GTTCTGGCCC 
AGCAAGCATC TTCCTGACAG CATTTTGTCA TCTAAAGTCC AGTGACATGG 
TGGCCCGTGG CAGCCCGTGG CATGGCGTGG CTCAGCTGTC TGTTGAAGTT 
AAAGAGGAAA CATCTCGGGC CTAGTTCAAA CCTTTGCCTC AAAGCCATCC 
TGCTTAGCGT CTGAGATCCG CGTGAAAAGT CCTCTGCCCA CGAGAGCAGG 
CACGCAGAAA TGGCCTCAAG GGGACTCTGC TCCACGTGGG GCCAGGCGTG 
TGTCCGACGA AGGCGGCCAC GGACGGACGC CAGCACACGA AGTCACGTGC 
GATTCGTTCC TTCTTTCTAA AGACGACAGT CTTTGTTGTT AGCACTGAAT 
GTCAACCAGA TTCTAGAAAC TGCGGTCATC CAGTTCTTCC TGACACCGGA 
GGAACCGTTT GAGCCTTATA GATCATTTAC ATTCAATTTT TTTAACTCAG 
CTTACAAGAG GGTTTTTTTT TAATTTTTTT TTCTCTTAAT GAACACATTT 
TTTTTTTTGT AGTTACTGTA TATGTACCAA GAAAGATATA ACGTTAGGGT 
TTGTTTTTGT ATTTTTTTTC TTTTGAAAGG GTTTGTTAAT TTTTCTAATT 
TTGCAGCCTA TACCTCAATA AAACAGGGAT ATTTTAAATC ACATACCTGC 
GAGCAATGTT ATTTTTAAAG GGTTTTTTTC ACCTCCTTAT TCTTAGATTA 
AGGGAAGAAT GAGACAATTT TGTGTAGGCT TTTTCTAAAG TCCAGTACTT 
TTAGATTCTC AGAATAAATG TTTTTCACAG ATTGAAAAAA AAAAAAAA 

Seq ID NO: 455 Protein sequence 
Protein Accession #: NP_037414.2 



GGAGGAGCAG 
GAAGTCGGCA 
CAAGGTGGAG 
CAACGCCAAG 
CCCGTTCCAG 
GCTGGTGTTC 
CAGATCCTTT 
CTATGCCATG 
OGGCAATGGC 
AGGGCTCGTT 
TCCTCCGTTC 
ACATTTTCAG 
TCTTTGAAAA 
GGAATCAGAA 
TTGCTGCCAC 
CCAAGGTCAG 
TTCCCCGTGG 
GTTGCAAGGA 
CCCACCAGAC 
GAGTTGGGGC 
TGACTGACGC 
AAGTGCCTTT 
TATTGAAAAT 

CAAGTGAGAA 
TCTAAATGAA 



TTACCAAAGT 
AGACAAACTG 
TTAATGTATT 
TGTCCAGATT 



1 11 21 31 41 51 

I I I I I I 

MWIQVRTMDG RQTHTVDSLS RLTKVEELRR KIQELFHVEP GLQRLFYRGK QMEDGHTLFD 60 

YEVRLNDTIQ LLVRQSLVLP HSTKERDSEL SDTDSGCCLG QSESDKSSTH GEAAAETDSR 120 

PADEDMWDET ELGLYKVNEY VDARDTNMGA WFEAQWRVT RKAPSRDEPC SSTSRPALEE 180 

DVIYHVKYDD YPENGVVQMN SRDVRARART IIKWQDLEVG QVVMLNYNPD NPKERGFWYD 240 

ABISRKRETR TARELYANVV LGDDSLNDCR IIFVDEVFKI ERPGEGSPMV DNPMRRKSGP 300 

SCKHCKDDVN RLCRVCACHL CGGRQDPDKQ LMCDECDMAF HIYCLDPPLS SVPSEDEWYC 360 

PECRNDASEV VLAGERLRES KKKAKMASAT SSSQRDWGKG MACVGRTKEC TIVPSNHYGP 420 

IPGIPVGTMW RFRVQVSESG VHRPHVAGIH GRSNDGAYSL VLAGGYEDDV DHGNFFTYTG 480 

SGGRDLSGNK RTAEQSCDQK LTNTNRALAL NCFAPINDQE GAEAKDWRSG KPVRVVRNVK 540 

GGKNSKYAPA EGNRYDGIYK WKYWPEKGK SGFLVWRYLL RRDDDEPGPW TKEGKDRIKX 600 

LGLTMQYPEG YLEALANRER EKENSKREEE EQQEGGFASP RTGKGKWKRK SAGGGPSRAG 660 

SPRRTSKKTK VEPYSLTAQQ SSLIREDKSN AKLWNEVLAS LKDRPASGSP FQLFLSKVEE 720 

TFQCICCQEL VFRPITTVCQ HNVCKDCLDR SFRAQVFSCP ACRYDLGRSY AMQVNQPLQT 780 



Seq ID NO: 456 DNA sequence 

Nucleic Acid Accession #: NM_001200. 

Coding sequence: 325..1514 



1 11 21 31 41 51 

I I I I I I 

GGGGACTTCT TGAACTTGCA GGGAGAATAA CTTGCGCACC CCACTTTGCG CCGGTGCCTT 60 

TGCCCCAGCG GAGCCTGCTT CGCCATCTCC GAGCCCCACC GCCCCTCCAC TCCTCGGCCT 120 

TGCCCGACAC TGAGACGCTG TTCCCAGCGT GAAAAGAGAG ACTGCGCGGC CGGCACCCGG 180 

GAGAAGGAGG AGGCAAAGAA AAGGAACGGA CATTCGGTCC TTGCGCCAGG TCCTTTGACC 240 

AGAGTTTTTC CATGTGGACG CTCTTTCAAT GGACGTGTCC CCGCGTGCTT CTTAGACGGA 300 

CTGCGGTCTC CTAAAGGTCG ACCATGGTGG CCGGGACCCG CTGTCTTCTA GCGTTGCTGC 360 

TTCCCCAGGT CCTCCTGGGC GGCGCGGCTG GCCTCGTTCC GGAGCTGGGC CGCAGGAAGT 420 

TCGCGGCGGC GTCGTCGGGC CGCCCCTCAT CCCAGCCCTC TGACGAGGTC CTGAGCGAGT 480 

TCGAGTTGCG GCTGCTCAGC ATGTTCGGCC TGAAACAGAG ACCCACCCCC AGCAGGGACG 540 

CCGTGGTGCC CCCCTACATG CTAGACCTGT ATCGCAGGCA CTCAGGTCAG CCGGGCTCAC 600 

CCGCCCCAGA CCACCGGTTG GAGAGGGCAG CCAGCCGAGC CAACACTGTG CGCAGCTTCC 660 

ACCATGAAGA ATCTTTGGAA GAACTACCAG AAACGAGTGG GAAAACAACC CGGAGATTCT 720 

TCTTTAATTT AAGTTCTATC CCCACGGAGG AGTTTATCAC CTCAGCAGAG CTTCAGGTTT 780 

TCCGAGAACA GATGCAAGAT GCTTTAGGAA ACAATAGCAG TTTCCATCAC CGAATTAATA 840 

TTTATGAAAT CATAAAACCT GCAACAGCCA ACTCGAAATT CCCCGTGACC AGACTTTTGG 900 

ACACCAGGTT GGTGAATCAG AATGCAAGCA GGTGGGAAAG TTTTGATGTC ACCCCCGCTG 960 

TGATGCGGTG GACTGCACAG GGACACGCCA ACCATGGATT CGTGGTGGAA GTGGCCCACT 1020 

TGGAGGAGAA ACAAGGTGTC TCCAAGAGAC ATGTTAGGAT AAGCAGGTCT TTGCACCAAG 1080 

ATGAACACAG CTGGTCACAG ATAAGGCCAT TGCTAGTAAC TTTTGGCCAT GATGGAAAAG 1140 

GGCATCCTCT CCACAAAAGA GAAAAACGTC AAGCCAAACA CAAACAGCGG AAACGCCTTA 1200 

AGTCCAGCTG TAAGAGACAC CCTTTGTACG TGGACTTCAG TGACGTGGGG TGGAATGACT 1260 

GGATTGTGGC TCCCCCGGGG TATCACGCCT TTTACTGCCA CGGAGAATGC CCTTTTCCTC 1320 

TGGCTGATCA TCTGAACTCC ACTAATCATG CCATTGTTCA GACGTTGGTC AACTCTGTTA 1380 

ACTCTAAGAT TCCTAAGGCA TGCTGTGTCC CGACAGAACT CAGTGCTATC TCGATGCTGT 1440 

ACCTTGACGA GAATGAAAAG GTTGTATTAA AGAACTATCA GGACATGGTT GTGGAGGGTT 1500 

GTGGGTGTCG CTAGTACAGC AAAATTAAAT ACATAAATAT ATATATA 



1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 



Seq ID NO: 457 Protein sequence 
Protein Accession #: NP_001191.1 
1 11 21 31 

I I I I 

MVAGTRCLLA LLLPQVLLGG AAGLVPELGR RKFAAASSGR 
FGLKQRPTPS RDAVVPPYML DLYRRHSGQP GSPAPDHRLE 
LPETSGKTTR RFFFNLSSIP TKEFITSAEL QVFREQMQDA 
TANSKFPVTR LLDT 

Seq ID NO: 458 DNA sequence 
Nucleic Acid Accession #: NM_001999.2 
Coding sequence: 1..8736 

1 11 21 31 41 51 

I I I I I I 

ATGGGGAGAA GACGGAGGCT GTGTCTCCAG CTCTACTTCC TGTGGCTGGG CTGTGTGGTG 60 

CTCTGGGCGC AGGGCACGGC CGGCCAGCCT CAGCCTCCTC CGCCCAAGCC GCCCCGGCCC 120 

CAGCCGCCGC CGCAACAGGT TCGGTCCGCT ACAGCAGGCT CTGAAGGCGG GTTTCTAGCG 180 

CCCGAGTATC GCGAGGAGGG TGCCGCAGTG GCCAGCCGCG TCCGCCGGCG AGGACAGCAG 240 

GACGTGCTCC GAGGGCCCAA CGTGTGCGGC TCCAGATTCC ACTCCTACTG CTGCCCTGGA 300 

TGGAAGACGC TCCCTGGAGG AAACCAGTGC ATTGTCCCGA TTTGTAGAAA TAGTTGTGGA 360 

GATGGATTTT GTTCCCGTCC TAACATGTGT ACTTGTTCCA GTGGGCAAAT ATCATCAACC 420 

TGTGGATCAA AATCAATTCA GCAGTGCAGT GTGAGATGCA TGAATGGTGG GACCTGTGCA 480 

GATGACCACT GCCAGTGCCA GAAAGGATAT ATTGGAACTT ATTGTGGACA ACCTGTCTGT 540 

GAAAATGGAT GTCAGAATGG TGGACGTTGC ATCGCCCAAC CGTGTGCTTG TGTTTATGGG 600 

TTCACTGGTC CACAGTGTGA AAGAGATTAC AGGACAGGCC CGTGTTTCAC TCAGGTCAAC 660 

AACCAGATGT GCCAAGGGCA GCTGACAGGC ATTGTCTGCA CGAAGACTCT GTGCTGTGCC 720 

ACCACTGGAC GGGCGTGGGG CCATCCCTGT GAGATGTGTC CAGCCCAGCC TCAGCCCTGC 780 

CGACGGGGTT TCATCCCCAA CATCCGCACT GGAGCTTGCC AAGATGTTGA TGAATGCCAG 840 

GCTATCCCAG GGATATGCCA AGGAGGAAAC TGTATCAATA CAGTGGGCTC TTTTGAATGC 900 

AGATGCCCTG CTGGTCACAA ACAGAGTGAA ACTACTCAGA AATGTGAAGA CATTGATGAG 960 

TGCAGCATCA TTCCTGGGAT ATGTGAAACT GGTGAATGTT CCAACACCGT GGGAAGCTAT 1020 

TTTTGTGTTT GTCCACGTGG ATATGTAACC TCAACAGATG GCTCTCGATG CATCGATCAG 1080 

AGAACAGGCA TGTGTTTCTC GGGCCTGGTG AATGGCCGCT GTGCACAAGA GCTCCCGGGG 1140 

AGAATGACGA AAATGCAGTG CTGCTGTGAG CCTGGCCGCT GCTGGGGCAT CGGAACCATT 1200 

CCTGAAGCCT GTCCTGTCAG AGGTTCTGAG GAATATCGCA GACTTTGCAT GGATGGACTT 1260 

CCAATGGGAG GAATTCCAGG GAGTGCTGGT TCCAGACCTG GAGGCACTGG GGGAAATGGC 1320 

TTTGCCCCAA GTGGCAATGG CAATGGCTAT GGCCCAGGAG GGACAGGCTT CATCCCCATC 1380 

CCTGGAGGCA ATGGCTTTTC TCCTGGCGTT GGGGGAGCCG GTGTGGGGGC CGGGGGACAG 1440 

GGACCTATCA TCACTGGACT AACAATTCTG AACCAGACAA TAGATATCTG TAAGCATCAT 1500 

GCTAACCTTT GTTTAAATGG ACGCTGTATA CCAACTGTCT CAAGCTACCG ATGTGAATGC 1560 

AACATGGGTT ATAAGCAGGA TGCAAATGGA GATTGTATAG ATGTTGATGA ATGCACATCA 1620 

AATCCCTGCA CTAATGGAGA TTGTGTTAAC ACACCTGGTT CCTATTATTG TAAATGTCAT 1680 

GCTGGATTCC AGAGGACTCC TACCAAGCAA GCATGCATTG ATATTGATGA GTGCATCCAG 1740 

AATGGGGTTC TTTGTAAAAA CGGTCGATGC GTGAACTCAG ATGGAAGTTT CCAGTGCATT 1800 

TGCAATGCCG GCTTTGAATT AACTACAGAT GGAAAAAACT GTGTTGATCA TGATGAATGT 1860 

ACAACTACCA ACATGTGTTT GAATGGAATG TGCATCAATG AAGATGGCAG CTTCAAGTGC 1920 

ATCTGCAAAC CAGGATTTGT CTTGGCTCCA AATGGGCGTT ACTGTACTGA TGTTGATGAA 1980 

TGCCAGACCC CAGGAATCTG CATGAATGGG CACTGCATCA ACAGTGAAGG GTCCTTCCGC 2040 

TGTGACTGTC CCCCAGGCCT GGCTGTGGGC ATGGATGGAC GTGTGTGTGT TGATACTCAC 2100 

ATGCGCAGTA CCTGCTATGG AGGAATCAAG AAAGGAGTGT GTGTGCGTCC TTTCCCCGGT 2160 

GCAGTGACCA AGTCCGAATG CTGCTGTGCC AATCCAGACT ATGGTTTTGG AGAACCCTGC 2220 

CAGCCATGCC CTGCAAAAAA TTCAGCTGAA TTCCACGGCC TTTGTAGTAG TGGAGTAGGT 2280 

ATCACTGTGG ATGGAAGAGA TATCAATGAA TGTGCTTTGG ATCCTGATAT ATGTGCCAAT 2340 

GGGATTTGTG AAAACTTACG TGGTAGTTAC CGTTGTAATT GCAACAGTGG CTATGAACCA 2400 

GATGCCTCTG GAAGAAACTG TATTGACATT GATGAATGTT TAGTAAACAG ACTGCTTTGT 2460 

GATAACGGAT TGTGCCGAAA CACGCCAGGA AGTTACAGCT GTACGTGCCC ACCAGGGTAT 2520 

GTGTTCAGGA CTGAGACAGA GACCTGTGAA GATATAAATG AATGTGAAAG CAACCCATGT 2580 

GTCAATGGGG CCTGCAGAAA CAACCTTGGA TCTTTCAATT GTGAATGTTC GCCCGGCAGC 2640 

AAACTCAGCT CCACAGGATT GATCTGTATT GACAGCCTGA AGGGGACCTG TTGGCTCAAC 2700 

ATCCAGGACA GCCGCTGTGA GGTGAATATT AATGGAGCCA CTCTGAAATC TGAATGCTGT 2760 

GCCACCCTCG GAGCCGCCTG GGGGAGCCCC TGTGAGCGGT GTGAACTAGA TACAGCTTGC 2820 

CCAAGAGGGC TTGCCAGGAT TAAAGGTGTT ACGTGTGAAG ATGTTAATGA GTGTGAGGTG 2880 

TTCCCTGGCG TTTGTCCAAA TGGACGCTGT GTCAACAGTA AGGGATCTTT TCATTGCGAG 2940 

TGCCCTGAAG GCCTTACGTT GGATGGGACT GGCCGTGTAT GTTTGGATAT TCGCATGGAG 3000 

CAGTGTTACT TGAAGTGGGA TGAAGATGAA TGCATCCACC CCGTTCCTGG AAAGTTCCGC 3060 

ATGGATGCCT GCTGCTGTGC TGTCGGGGCG GCTTGGGGCA CCGAGTGTGA GGAGTGCCCC 3120 

AAACCTGGCA CCAAGGAATA CGAGACACTG TGCCCCCGCG GGGCTGGCTT TGCTAACCGA 3180 

GGGGATGTTC TTACTGGGCG GCCATTTTAC AAAGACATCA ATGAATGCAA AGCATTTCCT 3240 

GGGATGTGCA CTTATGGGAA GTGCAGAAAT ACAATCGGAA GCTTCAAATG CCGTTGCAAT 3300 

AGTGGCTTTG CTCTAGACAT GGAGGAAAGA AACTGCACGG ACATCGACGA GTGCAGGATT 3360 

TCTCCTGACC TCTGTGGCAG TGGAATCTGC GTCAATACAC CGGGCAGCTT TGAGTGCGAG 3420 

TGCTTCGAAG GCTATGAAAG TGGCTTCATG ATGATGAAGA ACTGCATGGA CATTGACGGA 3480 

TGTGAACGTA ACCCTCTCCT TTGTAGGGGT GGCACCTGTG TGAACACTGA GGGCAGCTTT 3540 

CAGTGTGACT GCCCACTGGG ACACGAGCTG TCACCATCCC GTGAGGACTG TGTGGATATT 3600 

AATGAATGCT CCCTGAGTGA CAATCTCTGC AGAAATGGAA AATGTGTGAA CATGATTGGA 3660 

ACCTATCAGT GCTCTTGCAA TCCTGGATAT CAGGCTACGC CAGACCGCCA GGGCTGTACA 3720 

GATATTGATG AATGTATGAT AATGAACGGA GGCTGTGACA CCCAGTGCAC AAATTCAGAG 3780 

GGAAGCTACG AATGCAGCTG CAGTGAGGGT TATGCCCTGA TGCCAGATGG GAGATCGTGT 3840 

GCAGACATTG ATGAATGTGA AAACAATCCT GATATCTGTG ATGGCGGCCA GTGTACCAAC 3900 

ATTCCTGGAG AGTATCGCTG CCTCTGCTAT GATGGCTTCA TGGCTTCCAT GGACATGAAA 3960 

ACATGCATTG ATGTCAATGA ATGTGACCTA AATTCAAATA TCTGCATGTT TGGGGAATGT 4020 

GAGAACACAA AGGGATCCTT CATTTGCCAC TGTCAGCTGG GTTACTCAGT GAAGAAGGGG 4080 

ACCACAGGAT GTACAGATGT GGATGAGTGT GAAATTGGTG CTCATAACTG CGACATGCAT 4140 

GCCTCATGTC TGAATATCCC AGGAAGCTTC AAGTGTAGCT GCAGAGAAGG CTGGATTGGA 4200 

AACGGCATCA AGTGTATTGA TCTGGACGAA TGTTCTAATG GAACCCACCA GTGTAGCATC 4260 

AATGCTCAGT GTGTAAATAC CCCGGGCTCA TACCGCTGTG CCTGCTCCGA AGGTTTCACT 4320 

GGTGATGGCT TTACCTGCTC AGATGTTGAT GAGTGTGCAG AAAACATAAA CCTCTGTGAG 4380 



41 51 
i I 

PSSQPSDEVL SEFELRLLSM 60 
RAASRANTVR SFHHEESLEE 120 
LGNNSSFHHR INIYEIIKPA 180 
AACGGACAGT GCCTTAATGT CCCGGGTGCA TATCGCTGCG AGTGTGAGAT GGGCTTCACT 4440 

CCAGCCTCAG ACAGCAGATC CTGCCAAGAT ATTGATGAAT GCTCCTTCCA AAACATTTGT 4500 

GTCTCTGGAA CATGTAATAA CCTGCCTGGA ATGTTTCATT GCATCTGCGA TGATGGTTAT 4560 

GAATTGGACA GAACAGGAGG GAACTGTACA GATATTGATG AGTGTGCAGA TCCTATAAAC 4620 

TGTGTCAATG GCCTATGTGT CAACACGCCT GGTCGCTATG AGTGTAACTG CCCACCGGAT 4680 

TTTCAGTTGA ACCCAACTGG TGTGGGTTGT GTTGACAACC GTGTGGGCAA CTGCTACCTG 4740 

AAGTTTGGAC CTCGAGGAGA TGGGAGTCTG TCTTGCAACA CCGAGATCGG GGTGGGCGTC 4800 

AGTCGCTCTT CATGCTGCTG CTCTCTGGGA AAGGCCTGGG GAAACCCCTG TGAGACATGC 4860 

CCCCCTGTCA ATAGCACTGA ATATTACACC CTGTGTCCCG GAGGTGAAGG CTTCAGACCT 4920 

AACCCCATCA CAATCATTTT AGAAGACATT GACGAATGCC AGGAGTTACC AGGTCTCTGC 4980 

CAGGGTGGAA ACTGCATCAA CACTTTTGGG AGCTTCCAGT GTGAGTGCCC ACAAGGCTAC 5040 

TACCTCAGCG AGGATACCCG CATCTGTGAG GATATTGATG AGTGTTTTGC ACATCCTGGT 5100 

GTGTGTGGGC CTGGGACCTG CTATAACACC CTGGGAAATT ACACCTGCAT TTGCCCACCT 5160 

GAGTACATGC AGGTCAATGG AGGCCACAAC TGCATGGACA TGAGAAAAAG CTTTTGCTAC 5220 

CGAAGCTATA ATGGAACCAC TTGTGAGAAT GAGTTGCCTT TCAATGTGAC AAAAAGGATG 5280 

TGCTGCTGCA CATATAATGT GGGCAAAGCT GGGAACAAAC CTTGTGAACC ATGCCCAACT 5340 

CCAGGAACAG CTGACTTTAA AACCATATGT GGAAATATTC CTGGATTCAC CTTTGACATT 5400 

CACACAGGAA AAGCTGTTGA CATTGATGAA TGTAAAGAGA TTCCAGGCAT TTGTGCAAAT 5460 

GGTGTGTGCA TTAACCAGAT TGGCAGTTTC CGCTGTGAAT GCCCTACAGG ATTCAGTTAC 5520 

AATGACCTGC TGTTGGTTTG TGAAGATATA GATGAGTGCA GCAATGGTGA TAATCTCTGC 5580 

CAGCGGAATG CAGACTGCAT CAATAGTCCT GGTAGTTACC GCTGTGAATG TGCCGCGGGT 5640 

TTCAAACTTT CACCCAATGG GGCCTGTGTA GATCGCAATG AATGTTTAGA AATTCCTAAC 5700 

GTTTGCAGTC ATGGCTTGTG TGTTGATCTG CAAGGAAGTT ACCAGTGCAT CTGCCACAAT 5760 

GGCTTTAAGG CTTCTCAGGA CCAGACCATG TGCATGGATG TTGATGAGTG CGAGCGGCAC 5820 

CCATGTGGAA ATGGAACTTG TAAAAACACC GTTGGATCCT ATAACTGTCT GTGCTACCCA 5880 

GGGTTTGAAC TCACTCATAA TAATGATTGC CTGGACATAG ATGAGTGCAG TTCCTTTTTT 5940 

GGTCAGGTGT GCAGAAATGG ACGTTGTTTT AATGAAATTG GTTCTTTCAA GTGTCTATGT 6000 

AACGAAGGTT ATGAACTTAC CCCAGATGGC AAAAACTGTA TAGACACTAA TGAGTGTGTC 6060 

GCCCTTCCCG GCTCTTGCTC TCCTGGTACC TGTCAGAATT TGGAGGGATC CTTCAGATGC 6120 

ATCTGTCCCC CAGGGTATGA AGTAAAAAGC GAGAACTGCA TTGATATAAA TGAATGTGAT 6180 

GAAGATCCCA ACATTTGTCT TTTTGGTTCC TGTACTAATA CTCCAGGGGG CTTCCAGTGC 6240 

CTCTGCCCCC CTGGCTTTGT ACTATCTGAT AATGGACGGA GATGCTTTGA TACTCGCCAG 6300 

AGCTTCTGCT TCACAAATTT TGAAAATGGA AAGTGTTCTG TACCCAAAGC TTTCAACACC 6360 

ACAAAAGCAA AATGCTGCTG TAGTAAGATG CCAGGAGAGG GCTGGGGGGA CCCCTGTGAG 6420 

CTGTGCCCCA AAGACGATGA AGTTGCATTT CAGGATTTGT GTCCATATGG CCATGGAACT 6480 

GTCCCTAGTC TTCATGATAC ACGTGAAGAT GTCAATGAGT GTCTTGAGAG CCCAGGCATT 6540 

TGTTCAAATG GTCAATGTAT CAACACCGAC GGATCTTTTC GCTGTGAATG TCCAATGGGC 6600 

TACAACCTTG ACTACACTGG AGTACGCTGT GTGGATACTG ATGAGTGTTC AATCGGCAAT 6660 

CCGTGTGGAA ATGGTACATG CACCAATGTT ATTGGGAGTT TTGAATGCAA TTGCAATGAA 6720 

GGCTTTGAGC CAGGGCCCAT GATGAATTGT GAAGATATCA ACGAATGTGC CCAGAACCCA 6780 

CTGCTGTGTG CTTTACGCTG CATGAACACT TTTGGGTCCT ATGAATGCAC GTGCCCGATT 6840 

GGCTATGCCC TCAGGGAAGA TCAAAAGATG TGCAAAGATC TGGATGAATG TGCTGAAGGG 6900 

TTACACGACT GTGAATCTAG GGGCATGATG TGTAAGAATC TAATCGGCAC CTTCATGTGC 6960 

ATCTGCCCTC CTGGAATGGC CCGAAGGCCC GATGGAGAAG GCTGTGTAGA TGAAAATGAA 7020 

TGCAGGACCA AGCCAGGAAT CTGTGAAAAT GGACGTTGTG TTAACATTAT TGGAAGCTAT 7080 

AGATGTGAGT GTAATGAAGG ATTCCAGTCA AGTTCTTCAG GCACTGAATG CCTTGACAAT 7140 

CGACAGGGTC TCTGCTTTGC AGAGGTACTG CAGACAATAT GTCAAATGGC ATCCAGTAGT 7200 

CGCAATCTCG TCACTAAGTC AGAATGCTGC TGTGATGGTG GGCGAGGCTG GGGCCACCAG 7260 

TGCGAGCTTT GCCCACTTCC TGGAACTGCC CAGTACAAAA AGATATGTCC TCATGGCCCA 7320 

GGATATACAA CTGATGGAAG AGATATTGAT GAATGTAAGG TAATGCCAAA CCTCTGCACC 7380 

AATGGTCAGT GCATCAATAC CATGGGCTCA TTCCGATGCT TCTGCAAGGT TGGCTACACC 7440 

ACAGACATCA GTGGAACCTC TTGTATAGAQ CTTGATGAAT GCTCCCAGTC CCCGAAACCA 7500 

TGCAACTACA TCTGCAAGAA CACTGAGGGG AGTTATCAGT GTTCATGTCC GAGGGGGTAT 7560 

GTCCTGCAAG AGGATGGAAA GACATGCAAA GACCTTGATG AATGTCAAAC AAAGCAGCAT 7620 

AACTGCCAGT TCCTCTGTGT CAACACCCTG GGGGGGTTTA CCTGTAAATG TCCACCTGGT 7680 

TTCACACAGC ATCACACTGC TTGTATCGAC AACAACGAAT GTGGGTCTCA ACCTTTGCTT 7740 

TGTGGAGGAA AGGGAATCTG TCAAAACACT CCAGGCAGTT TCAGCTGTGA ATGCCAAAGA 7800 

GGGTTCTCTC TTGATGCCAC CGGACTGAAC TGTGAAGATG TTGATGAATG TGATGGGAAC 7860 

CACAGGTGCC AACACGGCTG CCAGAACATC CTGGGTGGCT ACAGATGTGG CTGCCCCCAA 7920 

GGCTACATCC AGCACTACCA GTGGAATCAG TGTGTCGATG AGAATGAATG CTCCAATCCC 7980 

AATGCCTGTG GCTCTGCTTC CTGCTACAAC ACCCTGGGGA GTTACAAGTG CGCCTGCCCC 8040 

TCGGGGTTCT CCTTCGACCA GTTCTCCAGT GCCTGCCACG ACGTGAATGA GTGCTCGTCC 8100 

TCCAAGAACC CCTGCAATTA CGGCTGCTCT AACACGGAGG GGGGCTACCT CTGTGGCTGC 8160 

CCCCCTGGGT ATTACAGAGT GGGACAAGGC CACTGTGTCT CAGGAATGGG ATTTAACAAG 8220 

GGGCAGTACC TGTCACTGGA TACAGAGGTC GATGAGGAAA ATGCTCTGTC CCCAGAAGCA 8280 

TGCTACGAGT GCAAAATCAA CGGCTATCCT AAGAAAGACA GCAGGCAGAA GAGAAGTATT 8340 

CATGAACCTG ATCCCACTGC TGTTGAACAG ATCAGCCTAG AGAGTGTCGA CATGGACAGC 8400 

CCCGTCAACA TGAAGTTCAA CCTCTCCCAC CTCGGCTCTA AGGAGCACAT CCTGGAACTA 8460 

AGGCCCGCCA TCCAGCCCCT CAACAACCAC ATCCGTTATG TCATCTCTCA AGGGAACGAT 8520 

GACAGCGTCT TCCGCATCCA CCAAAGGAAT GGGCTCAGCT ACTTGCACAC GGCCAAGAAG 8580 

AAGCTCATGC CCGGCACATA CACACTGGAA ATCACTAGCA TCCCTCTCTA CAAGAAGAAG 8640 

GAGCTTAAGA AACTGGAAGA GAGCAATGAG GATGACTACC TCCTAGGGGA GCTTGGGGAG 8700 

GCTCTCAGAA TGAGGCTGCA GATTCAGCTC TATTAACCGT TCACAGACTT GGGCCCAGGC 8760 

TCAAATCCTA GCACAGCCAG TCTGCAGAAG CATTTGAAAA GTCAAGGACT AATTTTAAAG 8820 

AGGAAAAATA ATAATAACTC TTGTTTCTTT CCTCCCTGTC TTAGACTTTG AATGTTGACC 8880 

CTCACAGGGA GGGATAATTT AGACTCTGGT ATGGCCAAAG ATTTGAGCTC AAAGGCAACC 8940 

GTGGTTACTG TATTTTTTAT ATAACTTCAT TTTAAAATAT ATTAAAAGAA ACCTAAATGT 9000 

TCAAGATATC AGCATATGGC ACTAAATGCA CAAAAATAAT GTGAGCTTTT TTTTTTTTTT 9060 

CCTGTTAGCA GTCTGTAACA CTTTGGGTAT TTTGCTATAG TTGCTAATTA AAAAAATATA 9120 

GATGTTTATT TATTTTTAAT GCAGTAATAT ATGGAGAAAT GAACAAACTA TGTAAACAAA 9180 

AAGGGAAACT CACTTGTTTT TCTTTAGATT TATAAATTTG AGCTATTTTT TTTAGAGGTG 9240 

CTTTTTAAAA ATCCAATAGA TACAAGAGAT GTTTCCTTTG GTTTTCTGCC AGTCATCCAG 9300 

CTGATACACA CCTGATCGAT TTTAAAGAAA GCCACACAGA GCTGAATCGG GCAGTGCTAA 9360 

TCAATAATTT AAAAGACATG AATGTCATTA GATCCTTTAT AACGTAGATC GAAGCCAAAG 9420 

CAGCTCATTT GTGACAACAT TTCATATCAC CAGACACACC AGGCAACAGA AGTTGAAGCA 9480 

CAACCACTGT AGCAAAATAC CTTGACTGCT TGTGAGACCA TTAGCATTGC AGGCCAAACC 9540 

GTACTGTATT TCCTTCTCAT AACCTCAAGG AACCATATGT GCTACCCACA ACACCTCATT 9600 
CTTACCCAGG GTGCGCTGCG TCCTCATGGT ACTGTAGGCA GCTGAAGAAC CGCCGTTCCC 9660 

TTGAAAGGGA ACACCTGGCA TTCTGTGGTG TTTCGTGCTG TCTTAAATAA TGGTGCATTT 9720 

ATTATGTTCA AGTTATTTCA GGATTGCCAT ATGTGCAAAC AAATCATGCA ATGCAGCCAA 9780 

GGAATATATG TTGTTGTTGT TGTTTTAAAC CCATTTTTTT TTTAGAATTT TCATTAATAC 9840 

TGTAGTTATA CACCATATGC CTCATTTTAT CATAGCCTAT TGTGTATGAA AGATGTTTGT 9900 

ACAATGAATT GATGTTTAGT TTGCTTTAGT CATTTAAAAA GATATTGTAC CAGGATGTGC 9960 

TATTAAGAGC ACGTATCCAT TATTCTTCTC AACCCAAGAA CCTGTTTCCT GGACCAGTGA 10020 

CCAAACCTCA TATGTGAAAT GGCCAAAGCA CATGCAGGCT CCTGGTTGTT CCTCTCAAAC 10080 

CTGTGCTGAC CAAAGATTAG TAACCAGTTA TACCCAGTAT TTTGAGGTTT TATTGTTTTT 10140 
TTAATAACTA AAAAAAAACT CGTGCC 

Seq ID NO: 459 Protein sequence 
Protein Accession #: NP_001990.1 

1 11 21 31 41 51 

I I I I I I 

MGRRRRLCLQ LYFLWLGCVV LWAQGTAGQP QPPPPKPPRP QPPPQQVRSA TAGSEGGFLA 60 

PEYREEGAAV ASRVRRRGQQ DVLRGPNVCG SRFHSYCCPG WKTLPGGNQC IVPICRNSCG 120 

DGFCSRPNMC TCSSGQISST CGSKSIQQCS VRCMNGGTCA DDHCQCQKGY IGTYCGQPVC 180 

ENGCQNGGRC IAQPCACVYG FTGPQCERDY RTGPCFTQVN NQMCQGQLTG IVCTKTLCCA 240 

TTGRAWGHPC EMCPAQPQPC RRGFIPNIRT GACQDVDECQ AIPGICQGGN CINTVGSFEC 300 

RCPAGHKQSE TTQKCEDIDE CSIIPGICET GECSNTVGSY FCVCPRGYVT STDGSRCIDQ 360 

RTGMCFSGLV NGRCAQELPG RMTKMQCCCE PGRCWGIGTI PEACPVRGSE EYRRLCMDGL 420 

PMGGIPGSAG SRPGGTGGNG FAPSGNGNGY GPGGTGFIPI PGGNGFSPGV GGAGVGAGGQ 480 

GPIITGLTIL NQTIDICKHH ANLCLNGRCI PTVSSYRCEC NMGYKQDANG DCIDVDECTS 540 

NPCTNGDCVN TPGSYYCKCH AGFQRTPTKQ ACIDIDECIQ NGVLCKNGRC VNSDGSFQCI 600 

CNAGFELTTD GKNCVDHDEC TTTNMCLNGM CINEDGSFKC ICKPGFVLAP NGRYCTDVDE 660 

CQTPGICMNG HCINSEGSFR CDCPPGLAVG MDGRVCVDTH MRSTCYGGIK KGVCVRPFPG 720 

AVTKSECCCA NPDYGFGEPC QPCPAKNSAE FHGLCSSGVG ITVDGRDINE CALDPDICAN 780 

GICENLRGSY RCNCNSGYEP DASGRNCIDI DECLVNRLLC DNGLCRNTPG SYSCTCPPGY 840 

VFRTETETCE DINECESNPC VNGACRNNLG SFNCECSPGS KLSSTGLICI DSLKGTCWLN 900 

IQDSRCEVNI NGATLKSECC ATLGAAWGSP CERCELDTAC PRGLARIKGV TCEDVNECEV 960 

FPGVCPNGRC VNSKGSFHCE CPEGLTLDGT GRVCLDIRME QCYLKWDEDE CIHPVPGKFR 1020 

MDACCCAVGA AWGTECEECP KPGTKEYETL CPRGAGFANR GDVLTGRPFY KDINECKAFP 1080 

GMCTYGKCRN TIGSFKCRCN SGFALDMEER NCTDIDECRI SPDLCGSGIC VNTPGSFECE 1140 

CFEGYESGFM MMKNCMDIDG CERNPLLCRG GTCVNTEGSF QCDCPLGHEL SPSREDCVDI 1200 

NECSLSDNLC RNGKCVNMIG TYQCSCNPGY QATPDRQGCT DIDECMIMNG GCDTQCTNSE 1260 

GSYECSCSEG YALMPDGRSC ADIDECENNP DICDGGQCTN IPGEYRCLCY DGFMASMDMK 1320 

TCIDVNECDL NSNICMFGEC ENTKGSFICH CQLGYSVKKG TTGCTDVDEC EIGAHNCDMH 1380 

ASCLNIPGSF KCSCREGWIG NGIKCIDLDE CSNGTHQCSI NAQCVNTPGS YRCACSEGFT 1440 

GDGFTCSDVD ECAENINLCE NGQCLNVPGA YRCECEMGFT PASDSRSCQD IDECSFQNIC 1500 

VSGTCNNLPG MFHCICDDGY ELDRTGGNCT DIDECADPIN CVNGLCVNTP GRYECNCPPD 1560 

FQLNPTGVGC VDNRVGNCYL KFGPRGDGSL SCNTEIGVGV SRSSCCCSLG KAWGNPCETC 1620 

PPVNSTEYYT LCPGGEGFRP NPITIILEDI DECQELPGLC QGGNCINTFG SFQCECPQGY 1680 

YLSEDTRICE DIDECFAHPG VCGPGTCYNT LGNYTCICPP EYMQVNGGHN CMDMRKSFCY 1740 

RSYNGTTCEN ELPFNVTKRM CCCTYNVGKA GNKPCEPCPT PGTADFKTIC GNIPGFTFDI 1800 

HTGKAVDIDE CKEIPGICAN GVCINQIGSF RCECPTGFSY NDLLLVCEDI DECSNGDNLC 1860 

QRNADCINSP GSYRCECAAG FKLSPNGACV DRNECLEIPN VCSHGLCVDL QGSYQCICHN 1920 

GFKASQDQTM CMDVDECERH PCGNGTCKNT VGSYNCLCYP GFELTHNNDC LDIDECSSFF 1980 

GQVCRNGRCF NEIGSFKCLC NEGYELTPDG KNCIDTNECV ALPGSCSPGT CQNLEGSFRC 2040 

ICPPGYEVKS ENCIDINECD EDPNICLFGS CTNTPGGFQC LCPPGFVLSD NGRRCFDTRQ 2100 

SFCFTNFENG KCSVPKAFNT TKAKCCCSKM PGEGWGDPCE LCPKDDEVAF QDLCPYGHGT 2160 

VPSLHDTRED VNECLESPGI CSNGQCINTD GSFRCECPMG YNLDYTGVRC VDTDECSIGN 2220 

PCGNGTCTNV IGSFECNCNE GFEPGPMMNC EDINECAQNP LLCALRCMNT FGSYECTCPI 2280 

GYALREDQKM CKDLDECAEG LHDCESRGMM CKNLIGTFMC ICPPGMARRP DGEGCVDENE 2340 

CRTKPGICEN GRCVNIIGSY RCECNEGFQS SSSGTECLDN RQGLCFAEVL QTICQMASSS 2400 

RNLVTKSECC CDGGRGWGHQ CELCPLPGTA QYKKICPHGP GYTTDGRDID ECKVMPNLCT 2460 

NGQCINTMGS FRCFCKVGYT TDISGTSCID LDECSQSPKP CNYICKNTEG SYQCSCPRGY 2520 

VLQEDGKTCK DLDECQTKQH NCQFLCVNTL GGFTCKCPPG FTQHHTACID NNECGSQPLL 2580 

CGGKGICQNT PGSFSCECQR GFSLDATGLN CEDVDECDGN HRCQHGCQNI LGGYRCGCPQ 2640 

GYIQHYQWNQ CVDENECSNP NACGSASCYN TLGSYKCACP SGFSFDQFSS ACHDVNECSS 2700 

SKNPCNYGCS NTEGGYLCGC PPGYYRVGQG HCVSGMGFNK GQYLSLDTEV DEENALSPEA 2760 

CYECKINGYP KKDSRQKRSI HEPDPTAVEQ ISLESVDMDS PVNMKFNLSH LGSKEHILEL 2820 

RPAIQPLNNH IRYVISQGND DSVFRIHQRN GLSYLHTAKK KLMPGTYTLE ITSIPLYKKK 2880 
ELKKLEESNE DDYLLGELGE ALRMRLQIQL Y 

Seq ID NO: 460 DNA sequence 

Nucleic Acid Accession #: NM_013372.1 

Coding sequence: 63..617 

1 11 21 31 41 51 

I I I I I I 

GCGGCCGCAC TCAGCGCCAC GCGTCGAAAG CGCAGGCCCC GAGGACCCGC CGCACTGACA 60 

GTATGAGCCG CACAGCCTAC ACGGTGGGAG CCCTGCTTCT CCTCTTGGGG ACCCTGCTGC 120 

CGGCTGCTGA AGGGAAAAAG AAAGGGTCCC AAGGTGCCAT CCCCCCGCCA GACAAGGCCC 180 

AGCACAATGA CTCAGAGCAG ACTCAGTCGC CCCAGCAGCC TGGCTCCAGG AACCGGGGGC 240 

GGGGCCAAGG GCGGGGCACT GCCATGCCCG GGGAGGAGGT GCTGGAGTCC AGCCAAGAGG 300 

CCCTGCATGT GACGGAGCGC AAATACCTGA AGCGAGACTG GTGCAAAACC CAGCCGCTTA 360 

AGCAGACCAT CCACGAGGAA GGCTGCAACA GTCGCACCAT CATCAACCGC TTCTGTTACG 420 

GCCAGTGCAA CTCTTTCTAC ATCCCCAGGC ACATCCGGAA GGAGGAAGGT TCCTTTCAGT 480 

CCTGCTCCTT CTGCAAGCCC AAGAAATTCA CTACCATGAT GGTCACACTC AACTGCCCTG 540 

AACTACAGCC ACCTACCAAG AAGAAGAGAG TCACACGTGT GAAGCAGTGT CGTTGCATAT 600 

CCATCGATTT GGATTAAGCC AAATCCAGGT GCACCCAGCA TGTCCTAGGA ATGCAGCCCC 660 

AGGAAGTCCC AGACCTAAAA CAACCAGATT CTTACTTGGC TTAAACCTAG AGGCCAGAAG 720 

AACCCCCAGC TGCCTCCTGG CAGGAGCCTG CTTGTGCGTA GTTCGTGTGC ATGAGTGTGG 780 

ATGGGTGCCT GTGGGTGTTT TTAGACACCA GAGAAAACAC AGTCTCTGCT AGAGAGCACT 840 

CCCTATTTTG TAAACATATC TGCTTTAATG GGGATGTACC AGAAACCCAC CTCACCCCGG 900 
CTCACATCTA AAGGGGCGGG GCCGTGGTCT GGTTCTGACT TTGTGTTTTT GTGCCCTCCT 960 

GGGGACCAGA ATCTCCTTTC GGAATGAATG TTCATGGAAG AGGCTCCTCT GAGGGCAAGA 1020 

GACCTGTTTT AGTGCTGCAT TCGACATGGA AAAGTCCTTT TAACCTGTGC TTGCATCCTC 1080 

CTTTCCTCCT CCTCCTCACA ATCCATCTCT TCTTAAGTTG ATAGTGACTA TGTCAGTCTA 1140 

ATCTCTTGTT TGCCAAGGTT CCTAAATTAA TTCACTTAAC CATGATGCAA ATGTTTTTCA 1200 

TTTTGTGAAG ACCCTCCAGA CTCTGGGAGA GGCTGGTGTG GGCAAGGACA AGCAGGATAG 1260 

TGGAGTGAGA AAGGGAGGGT GGAGGGTGAG GCCAAATCAG GTCCAGCAAA AGTCAGTAGG 1320 

GACATTGCAG AAGCTTGAAA GGCCAATACC AGAACACAGG CTGATGCTTC TGAGAAAGTC 1380 

TTTTCCTAGT ATTTAACAGA ACCCAAGTGA ACAGAGGAGA AATGAGATTG CCAGAAAGTG 1440 

ATTAACTTTG GCCGTTGCAA TCTGCTCAAA CCTAACACCA AACTGAAAAC ATAAATACTG 1500 

ACCACTCCTA TGTTCGGACC CAAGCAAGTT AGCTAAACCA AACCAACTCC TCTGCTTTGT 1560 

CCCTCAGGTG GAAAAGAGAG GTAGTTTAGA ACTCTCTGCA TAGGGGTGGG AATTAATCAA 1620 

AAACCKCAGA GGCTGAAATT CCTAATACCT TTCCTTTATC GTGGTTATAG TCAGCTCATT 1680 

TCCATTCCAC TATTTCCCAT AATGCTTCTG AGAGCCACTA ACTTGATTGA TAAAGATCCT 1740 

GCCTCTGCTG AGTGTACCTG ACAGTAAGTC TAAAGATGAR AGAGTTTAGG GACTACTCTG 1800 

TTTTAGCAAG ARATATTKTG GGGGTCTTTT TGTTTTAACT ATTGTCAGGA GATTGGGCTA 1860 

RAGAGAAGAC GACGAGAGTA AGGAAATAAA GGGRATTGCC TCTGGCTAGA GAGTAAGTTA 1920 

GGTGTTAATA CCTGGTAGAA ATGTAAGGGA TATGACCTCC CTTTCTTTAT GTGCTCACTG 1980 

AGGATCTGAG GGGACCCTGT TAGGAGAGCA TAGCATCATG ATGTATTAGC TGTTCATCTG 2040 

CTACTGGTTG GATGGACATA ACTATTGTAA CTATTCAGTA TTTACTGGTA GGCACTGTCC 2100 

TCTGATTAAA CTTGGCCTAC TGGCAATGGC TACTTAGGAT TGATCTAAGG GCCAAAGTGC 2160 

AGGGTGGGTG AACTTTATTG TACTTTGGAT TTGGTTAACC TGTTTTCTTC AAGCCTGAGG 2220 

TTTTATATAC AAACTCCCTG AATACTCTTT TTGCCTTGTA TCTTCTCAGC CTCCTAGCCA 2280 

AGTCCTATGT AATATGGAAA ACAAACACTG CAGACTTGAG ATTCAGTTGC CGATCAAGGC 2340 

TCTGGCATTC AGAGAACCCT TGCAACTCGA GAAGCTGTTT TTATTTCGTT TTTGTTTTGA 2400 

TCCAGTGCTC TCCCATCTAA CAACTAAACA GGAGCCATTT CAAGGCGGGA GATATTTTAA 2460 

ACACCCAAAA TGTTGGGTCT GATTTTCAAA CTTTTAAACT CACTACTGAT GATTCTCACG 2520 

CTAGGCGAAT TTGTCCAAAC ACATAGTGTG TGTGTTTTGT ATACACTGTA TGACCCCACC 2580 

CCAAATCTTT GTATTGTCCA CATTCTCCAA CAATAAAGCA CAGAGTGGAT TTAATTAAGC 2640 

ACACAAATGC TAAGGCAGAA TTTTGAGGGT GGGAGAGAAG AAAAGGGAAA GAAGCTGAAA 2700 

ATGTAAAACC ACACCAGGGA GGAAAAATGA CATTCAGAAC CAGCAAACAC TGAATTTCTC 2760 

TTGTTGTTTT AACTCTGCCA CAAGAATGCA ATTTCGTTAA TGGAGATGAC TTAAGTTGGC 2820 

AGCAGTAATC TTCTTTTAGG AGCTTGTACC ACAGTCTTGC ACATAAGTGC AGATTTGGCT 2880 

CAAGTAAAGA GAATTTCCTC AACACTAACT TCACTGGGAT AATCAGCAGC GTAACTACCC 2940 

TAAAAGCATA TCACTAGCCA AAGAGGGAAA TATCTGTTCT TCTTACTGTG CCTATATTAA 3000 

GACTAGTACA AATGTGGTGT GTCTTCCAAC TTTCATTGAA AATGCCATAT CTATACCATA 3060 

TTTTATTCGA GTCACTGATG ATGTAATGAT ATATTTTTTC ATTATTATAG TAGAATATTT 3120 

TTATGGCAAG ATATTTGTGG TCTTGATCAT ACCTATTAAA ATAATGCCAA ACACCAAATA 3180 

TGAATTTTAT GATGTACACT TTGTGCTTGG CATTAAAAGA AAAAAACACA CATCCTGGAA 3240 

GTCTGTAAGT TGTTTTTTGT TACTGTAGGT CTTCAAAGTT AAGAGTGTAA GTGAAAAATC 3300 

TGGAGGAGAG GATAATTTCC ACTGTGTGGA ATGTGAATAG TTAAATGAAA AGTTATGGTT 3360 

ATTTAATGTA ATTATTACTT CAAATCCTTT GGTCACTGTG ATTTCAAGCA TGTTTTCTTT 3420 

TTCTCCTTTA TATGACTTTC TCTGAGTTGG GCAAAGAAGA AGCTGACACA CCGTATGTTG 3480 

TTAGAGTCTT TTATCTGGTC AGGGGAAACA AAATCTTGAC CCAGCTGAAC ATGTCTTCCT 3540 

GAGTCAGTGC CTGAATCTTT ATTTTTTAAA TTGAATGTTC CTTAAAGGTT AACATTTCTA 3600 

AAGCAATATT AAGAAAGACT TTAAATGTTA TTTTGGAAGA CTTACGATGC ATGTATACAA 3660 

ACGAATAGCA GATAATGATG ACTAGTTCAC ACATAAAGTC CTTTTAAGGA GAAAATCTAA 3720 

AATGAAAAGT GGATAAACAG AACATTTATA AGTGATCAGT TAATGCCTAA GAGTGAAAGT 3780 

AGTTCTATTG ACATTCCTCA AGATATTTAA TATCAACTGC ATTATGTATT ATGTCTGCTT 3840 

AAATCATTTA AAAACGGCAA AGAATTATAT AGACTATGAG GTACCTTGCT GTGTAGGAGG 3900 

ATGAAAGGGG AGTTGATAGT CTCATAAAAC TAATTTGGCT TCAAGTTTCA TGAATCTGTA 3960 

ACTAGAATTT AATTTTCACC CCAATAATGT TCTATATAGC CTTTGCTAAA GAGCAACTAA 4020 
TAAATTAAAC CTATTCTTTC AAAAAAAAA 

Seq ID NO: 461 Protein sequence 
Protein Accession #: NP_037504.1 

1 11 21 31 41 

I I I I I 

MSRTAYTVGA LLLLLGTLLP AAEGKKKGSQ GAIPPPDKAQ HNDSEQTQSP 
GQGRGTAMPG EEVLESSQEA LHVTERKYLK RDWCKTQPLK QTIHEEGCNS 
QCNSFYIPRH IRKEEGSFQS CSFCKPKKFT TMMVTLNCPE LQPPTKKKRV 
IDLD 

Seq ID NO: 462 DNA sequence 
Nucleic Acid Accession #: Eos sequence 
Coding sequence: 1..2733 

1 11 21 31 41 51 

I I I I I I 

ATGAAAGTTG GAGTGCTGTG GCTCATTTCT TTCTTCACCT TCACTGACGG CCACGGTGGC 60 

TTCCTGGGGA AAAATGATGG CATCAAAACA AAAAAAGAAC TCATTGTGAA TAAGAAAAAA 120 

CATCTAGGCC CAGTCGAAGA ATATCAGCTG CTGCTTCAGG TGACCTATAG AGATTCCAAG 180 

GAGAAAAGAG ATTTGAGAAA TTTTCTGAAG CTCTTGAAGC CTCCATTATT ATGGTCACAT 240 

GGGCTAATTA GAATTATCAG AGCAAAGGCT ACCACAGACT GCAACAGCCT GAATGGAGTC 300 

CTGCAGTGTA CCTGTGAAGA CAGCTACACC TGGTTTCCTC CCTCATGCCT TGATCCCCAG 360 

AACTGCTACC TTCACACGGC TGGAGCACTC CCAAGCTGTG AATGTCATCT CAACAACCTC 420 

AGCCAGAGTG TCAATTTCTG TGAGAGAACA AAGATTTGGG GCACTTTCAA AATTAATGAA 480 

AGGTTTACAA ATGACCTTTT GAATTCATCT TCTGCTATAT ACTCCAAATA TGCAAATGGA 540 

ATTGAAATTC AACTTAAAAA AGCATATGAA AGAATTCAAG GTTTTGAGTC GGTTCAGGTC 600 

ACCCAATTTC GAAATGGAAG CATCGTTGCT GGGTATGAAG TTGTTGGCTC CAGCAGTGCA 660 

TCTGAACTGC TGTCAGCCAT TGAACATGTT GCCGAGAAGG CTAAGACAGC CCTTCACAAG 720 

CTGTTTCCAT TAGAAGACGG CTCTTTCAGA GTGTTCGGAA AAGCCCAGTG TAATGACATT 780 

GTCTTTGGAT TTGGGTCCAA GGATGATGAA TATACCCTGC CCTGCAGCAG TGGCTACAGG 840 

GGAAACATCA CAGCCAAGTG TGAGTCCTCT GGGTGGCAGG TCATCAGGGA GACTTGTGTG 900 

CTCTCTCTGC TTGAAGAACT GAACAAGAAT TTCAGTATGA TTGTAGGCAA TGCCACTGAG 960 

GCAGCTGTGT CATCCTTCGT GCAAAATCTT TCTGTCATCA TTCGGCAAAA CCCATCAACC 1020 



51 
I 

QQPGSRNRGR 60 
RTIINRFCYG 120 
TRVKQCRCIS 180 
ACAGTGGGGA ATCTGGCTTC GGTGGTGTCG ATTCTGAGCA ATATTTCATC TCTGTCACTG 1080 

GCCAGCCATT TCAGGGTGTC CAATTCAACA ATGGAGGATG TCATCAGTAT AGCTGACAAT 1140 

ATCCTTAATT CAGCCTCAGT AACCAACTGG ACAGTCTTAC TGCGGGAAGA AAAGTATGCC 1200 

AGCTCACGGT TACTAGAGAC ATTAGAAAAC ATCAGCACTC TGGTGCCTCC GACAGCTCTT 1260 

CCTCTGAATT TTTCTCGGAA ATTCATTGAC TGGAAAGGGA TTCCAGTGAA CAAAAGCCAA 1320 

CTCAAAAGGG GTTACAGCTA TCAGATTAAA ATGTGTCCCC AAAATACATC TATTCCCATC 1380 

AGAGGCCGTG TGTTAATTGG GTCAGACCAA TTCCAGAGAT CCCTTCCAGA AACTATTATC 1440 

AGCATGGCCT CGTTGACTCT GGGGAACATT CTACCCGTTT CCAAAAATGG AAATGCTCAG 1500 

GTCAATGGAC CTGTGATATC CACGGTTATT CAAAACTATT CCATAAATGA AGTTTTCCTA 1560 

TTTTTTTCCA AGATAGAGTC AAACCTGAGC CAGCCTCATT GTGTGTTTTG GGATTTCAGT 1620 

CATTTGCAGT GGAACGATGC AGGCTGCCAC CTAGTGAATG AAACTCAAGA CATCGTGACG 1680 

TGCCAATGTA CTCACTTGAC CTCCTTCTCC ATATTGATGT CACCTTTTGT CCCCTCTACA 1740 

ATCTTCCCCG TTGTAAAATG GATCACCTAT GTGGGACTGG GTATCTCCAT TGGAAGTCTC 1800 

ATTTTATGCC TGATCATCGA GGCTTTGTTT TGGAAGCAGA TTAAAAAAAG CCAAACCTCT 1860 

CACACACGTC GTATTTGCAT GGTGAACATA GCCCTGTCCC TCTTGATTGC TGATGTCTGG 1920 

TTTATTGTTG GTGCCACAGT GGACACCACG GTGAACCCTT CTGGAGTCTG CACAGCTGCT 1980 

GTGTTCTTTA CACACTTCTT CTACCTCTCT TTGTTCTTCT GGATGCTCAT GCTTGGCATC 2040 

CTGCTGGCTT ACCGGATCAT CCTCGTGTTC CATCACATGG CCCAGCATTT GATGATGGCT 2100 

GTTGGATTTT GCCTGGGTTA TGGGTGCCCT CTCATTATAT CTGTCATTAC CATTGCTGTC 2160 

ACGCAACCTA GCAATACCTA CAAAAGGAAA GATGTGTGTT GGCTTAACTG GTCCAATGGA 2220 

AGCAAACCAC TCCTGGCTTT TGTTGTCCCT GCACTGGCTA TTGTGGCTGT GAACTTCGTT 2280 

GTGGTGCTGC TAGTTCTCAC AAAGCTCTGG AGGCCGACTG TTGGGGAAAG ACTGAGTCGG 2340 

GATGACAAGG CCACCATCAT CCGCGTGGGG AAGAGCCTCC TCATTCTGAC CCCTCTGCTA 2400 

GGGCTCACCT GGGGCTTTGG AATAGGAACA ATAGTGGACA GCCAGAATCT GGCTTGGCAT 2460 

GTTATTTTTG CTTTACTCAA TGCATTCCAG GGATTTTTTA TCTTATGCTT TGGAATACTC 2520 

TTGGACAGTA AGCTGCGACA ACTTCTGTTC AACAAGTTGT CTGCCTTAAG TTCTTGGAAG 2580 

CAAACAGAAA AGCAAAACTC ATCAGATTTA TCTGCCAAAC CCAAATTCTC AAAGCCTTTC 2640 

AACCCACTGC AAAACAAAGG CCATTATGCA TTTTCTCATA CTGGAGATTC CTCCGACAAC 2700 
ATCATGCTAA CTCAGTTTGT CTCAAATGAA TAA 

Seq ID NO: 463 Protein sequence 
Protein Accession #: Eos sequence 

1 11 21 31 41 51 

I I I I I I 

MKVGVLWLIS FFTFTDGHGG FLGKNDGIKT KKELIVNKKK HLGPVEEYQL LLQVTYRDSK 60 

EKRDLRNFLK LLKPPLLWSH GLIRIIRAKA TTDCNSLNGV LQCTCEDSYT WFPPSCLDPQ 120 

NCYLHTAGAL PSCECHLNNL SQSVNFCERT KIWGTFKINE RFTNDLLNSS SAIYSKYANG 180 

IEIQLKKAYE RIQGFESVQV TQFRNGSIVA GYEVVGSSSA SELLSAIEHV AEKAKTALHK 240 

LFPLEDGSFR VFGKAQCNDI VFGFGSKDDE YTLPCSSGYR GNITAKCESS GWQVIRETCV 300 

LSLLEELNKN FSMIVGNATE AAVSSFVQNL SVIIRQNPST TVGNLASVVS ILSNISSLSL 360 

ASHFRVSNST MEDVISIADN ILNSASVTNW TVLLREEKYA SSRLLETLEN ISTLVPPTAL 420 

PLNFSRKFID WKGIPVNKSQ LKRGYSYQIK MCPQNTSIPI RGRVLIGSDQ FQRSLPETII 480 

SMASLTLGNI LPVSKNGNAQ VNGPVISTVI QNYSINEVFL FFSKIESNLS QPHCVFWDFS 540 

HLQWNDAGCH LVNETQDIVT CQCTHLTSFS ILMSPFVPST IFPVVKWITY VGLGISIGSL 600 

ILCLIIEALF WKQIKKSQTS HTRRICMVNI ALSLLIADVW FIVGATVDTT VNPSGVCTAA 660 

VFFTHFFYLS LFFWMLMLGI LLAYRIILVF HHMAQHLMMA VGFCLGYGCP LIISVITIAV 720 

TQPSNTYKRK DVCWLNWSNG SKPLLAFVVP ALAIVAVNFV VVLLVLTKLW RPTVGERLSR 780 

DDKATIIRVG KSLLILTPLL GLTWGFGIGT IVDSQNLAWH VIFALLNAFQ GFFILCFGIL 840 

LDSKLRQLLF NKLSALSSWK QTEKQNSSDL SAKPKFSKPF NPLQNKGHYA FSHTGDSSDN 900 
IMLTQFVSNE 

Seq ID NO: 464 DNA sequence 

Nucleic Acid Accession #: AB035089.1 

Coding sequence: 9845..10219 

1 11 21 31 41 51 

I I I I I I 

GGGCATGCAG CCATCGGGGA AAATCCATAG TGCAGATAAA GCAAGGAGGA AGAAGAAGGA 60 

CAGTTCTAGT AAAAGGGAGA ACATCAATAT AGGATGTTTC TTAGCAATAG AAAAAGAAGG 120 

CCAAGAGGAA TTAGGGAGAG AGTTATAAGA GATCAGCAAG GGGACAGGGT TAGATTTGGT 180 

TTGGTTTGAA AGCATACAGT AAATATGATG TCTGTCCCTG GCAGTGTTGG CAGAGTAGGA 240 

AGGAGGAAGG GAGGCAAGAG ATAATATCAT TTTCTCTGTG CTCCAACTGT ACTTACATAT 300 

GAGACTATTT CCCTCTCTGC TTTTCAAACC TTACTGGAGT TGTTTTCCCT CATGAAAACC 360 

AAGAAAGGAA AGCTAGTTAG TCTTGTTCTG AGGTTGTTCA ATGTATACAT ATCTATATCT 420 

GTAGACAGAA TCCTTGGGAA TACAGTAATT GACATATATT CTGTTATTTG ATGCTTGAAA 480 

AATCTCCTCC ACTAACCAGT TTCCCTATAG ATTGCCACAA GCACATAATA AGAAACAATA 540 

AATAAAATGT TCTCTTGACT TTGTTACTTA ACAATGCTGA GAAAACTTTA CAGCCTTCAT 600 

AAGGAAGTGA GGTCCAGGAA AATCTAGGAG ATATTTCTTA ACCAATCTAT AAAGGCATTA 660 

GTAATGACAG GATATTTCCT GAAAGTGTAA TTTCCCATTG AGGATTTGTT TTTAATTTCT 720 

GGATTCCTGG AGCCAATGAA GTTGGTGTAT GTTTATGAAA TATCAAGAGA CATAAGTTGG 780 

CAAGTGTTCA TATGCAAAAA CTTCTTGGAA TTTCTGAGTT CTCTGTGGCA ATATATGACA 840 

TCAGGATATG TCCAGTCTCA CACACCAGGA TATGTCCTTT CTAGCCTGTC TATCACATGC 900 

TAGGAGAACT ATTTAGGAAC AGAAAAAAAT GCCTGAAATG ATTTCTCATT TGAACTCATC 960 

CAAGCTTTCT CTAAATTTAA GCAAACTCCT GGTCATTTTC AGTTAGTACC TTTCCTTAAG 1020 

TTCAACCTTC AGGGCAAACC TCCGTGCCTC AGACGTTTAG CCATAGTCTG AAATTCTCTT 1080 

CCATAGATTG GTCCCCTGTA ACCCCGGTTT GTCTCAGCTT GTTATCCTGT TTTTTTCTTC 1140 

CCTCCATTCC CAGGATGAGC TTGTTGCTTC TGTCCTATGA GACATTAGAT TCCTTTTCTT 1200 

TGGTACCCGA GTAAATCCAT CCTACTCCAA TAGAGGAAGG TCCATTTTTG TCTTATAGCG 1260 

CTGGATGCAG ACTCAGCTGA GAAGACCATT ATTCATTTTT GGAATTCTTT ATCTCAGATA 1320 

TTTCCTCTTC TTTCTTTTTC TTCTATCTTT GGATTTTTAG TCCATCAACG CCCCATTAGT 1380 

CTATTCCCCG ACTTCAATCA GGGAACTTAT ACCTCTTAAA CTCATTCAGA GACTCAAAAC 1440 

ATATATATTG ATACAGGAGA CCTAAGAAGA GCATGTCTTG GGGGTTGAGG AAACAGGCAG 1500 

GTGAGAAATT TCCAGATTGG AAACACAGCT TCCTTTCTCC CATCCAGCCC CTACTTTCAG 1560 

CCTATGTGTT TCTGGCACCT TGTTGTAGAT AAATCTCCCT TGACTTTGTG ATGTGCTGAG 1620 

AAAACAAACT CACGGCTGGT GTTAAAAAGG GCCCATGACA ATACCAAGTG TTGGGGAGAA 1680 

TGTGGAGAAA TCAGAACTCT ATTCACGGTC GGTTGGAATG CACACTTGTG CAGAATTCTA 1740 
TGGAGAAGAG TCTGGCATTT CCTCAAAATG TTAACCTGGA TTTACCATAT GACCCAGCGA 1800 
TTTCATTCAT AGGTTTATAC TCAAAAGAAA TGAAGAAATA TGCCATGCAA AAAAATGTAC 1860 
ATGAAAGGTC ACAACATCAT TATTCATAAT AGTAAAAGGA TGGAAACAAC ACAAATGTCC 1920 
ATCAACTTAT GATTAAAGAA AATCTGGTCT ATTCATAGAA TGGAATATTA TTCGACCACA 1980 
AAAAGGAATG ATGTACTGAT CCATGCAATG ATGTGGACAA ACCATGAAAA TAACACTAGA 2040 
TTAAAGAAGC CAGTCACAAA AGGACTTACT GTATGATTCC ATTTACCTGA AATGTTTGGA 2100 
ATAGGCAAAT CCATAGAAAC AGGAGGTAGA TTCCTGGTTT CCAGGGTCTC CAGGAAGGGA 2160 
AGAATGAAGT ACAAGATTTC TTTTGGAGGT AGTGAAATTG TTGTGGAATG AGATCATGAT 2220 
GATGATAGCA CAACTTTGTG AATATAATAA AATCATTGAA TTGTACAGTT GAATTTATGG 2280 
TATATAAATT ATATGTTAAT AAAAAGGGGG TCCACAAAAC AAACAGCCCC CCACTCTGGT 2340 
TGTCAGGGAG ATATTGGATT AAATGGCCTT GGACAACAAC CCCTCTCCCT GGCCACAGAC 2400 
ATTCTTCAGA TTACAAGATA TTCCAGGGGA AACACTGGAA TGAGTCTGAA GCCAGGTGCT 2460 
AAACAGAAGG ACCATTGAGA AATGTTGTGA TCCTGACAGG TCAAGCAATT TATTTTTCGG 2520 
CTTCATTTTT AAATGTAAAA TTAGAAAGCT GCCATTTAAA ATGGCCCGTC TGTTTCAATT 2580 
GCTCTTCTCA GTGTCAGCCT GTTAACTCAA TGTGTTAGTC TGTTTTCATG CTGCTGATAA 2640 
AAACATACCT GAGACTGGCA AGAAAAAGAG GTTTAATTGG GCTTAGAGTT CCACGTGATT 2700 
GGGGAGGCCT CAGAATCACA GTAGGAGGCA AAAGTTATTC TTACATGGTG GCTGCAAGAG 2760 
AAGATGAGGA AGAAGCAAAA GAAGAAACCC CTGATAAACC CATCGGATCT CCTGAGGCTT 2820 
ATTAACTATC ATGAGAATAG CACAAGAAAG ACCGGCCCCC ATGATTCAAT TACCTCTACC 2880 
TGGGTCCCTC CAATAACATG TGGAAATTCT GGTAGATACA ATTCAAGTTG AGATTTGGGT 2940 
GGGAACACAG CCAAACCATA TCACTCAGCA AGGCAGATAA CTTTCTCACT GAGCCTATGC 3000 
AACAGAAAAC CATCTGGGAT GGTTGTAAGG GGCACAGGAA GTGACTGGTA GGATCACTGC 3060 
CAAAGCTGAG CACTCAGGAG AAGGCAATAG AATCCTATTC TCCATAGTAT GCTATAAGAT 3120 
ACTGAAGTAC ACTTCTTCAC TATCTCTTTG GACTTAGAAT TAGCACTACA TTCCTTGTTA 3180 
TACAGAAAAA TTACTAAGGA AATTCATAGG ATGACAAAAA CTTTCAGAAC TGAAAAACAG 3240 
GAAATGTAAG CTTTTTAGTT CTTTGGTATT CGAAGTATGC CTAAAAGACA ATGCAAAATC 3300 
CAAGAAAAGA ATGGTGGGGT TTTTGTTTGT TTGGTTTTGT TTTTGTTTTA CAGCTGGAGT 3360 
AGAATACAAA GGGATGGAGT TGAAACAAAT GAGAGGAAAT TGGAATTCTA AACTTATTCT 3420 
CATTGGCATT AGAAAGGCAC CTACATGTAT TTCACATGAG CCGGTGACTG CTGACTTGCA 3480 
TTCTTATTTT TTCCCTATAG ATTAAAAAGG AGGTACAATG GTAGAACTGT AATCCTGTCC 3540 
TTTGTCATAA ATTTTCATAT TCATAAAGGT GAGTGTTAGC CCGCTTGTGA AATCTGAAGT 3600 
TGAGTAACTT CAAATACTAA CCACAGAGGG AAAGGCAGCA AGAGGAGAGG CATAAATTTA 3660 
GGATCTCACC CTTCATTCCA CAGACACACA CAGCCTCTCT GCCCACCTCT GCTTCCTCTA 3720 
GGAACACAGG TAAGAGCTTC AAGCCTCTCC AGCTTAATAA CATGAATTAT TTTTGAGAAT 3780 
AATAATGATA CTGTGTTCTA TATCATGCAT CTCCTGCATT CTGTCTGATT ATATTTTACT 3840 
TATTCTGCCA GAGCAAAATT AAAATACCTA TTTCATCTGA TTTGTCCTTT ATCTAAATTG 3900 
CTTAGTTCCA AGTAAACCAA GGCACTTTTA GGAACACAGA GGGAGAGTGC CTTGCAGCCA 3960 
GAGAGTCTTG AAGGAGATGT CAGGGACGCA TCTTAACAGC TGGTTGGATG TGATCCACAG 4020 
AGGTCTCCTG TTAGCATTCA TTGTAAAGCC ATCCTACCTA GCTCTAGTGT AACCAGCAAT 4080 
GAAAGAAAGA TAAAGAGGGT CGATTACTTA TTTACAATAG TCTTTAAAAA CGTAGTTTTG 4140 
TAAGCCTTCT AATTAGGACA TTAATATATT TAATATATGC ACATTGTAGA AAGATTGAAG 4200 
CGTTAAAAAT AAGAGAAAAA CTTTAAATGT CAAAATCTCA CAACCCAGAT ATATCATTTC 4260 
TTTAAGAAAA TTGTACTACA AAATACCATT CCATTTATTA AAGTCATTCT GACAGGAATC 4320 
TGATGCTTTT CCAGGAGTTC CAGATCACAT CGAGTTCACC ATGAATTCAC TCAGTGAAGC 4380 
CAACACCAAG TTCATGTTCG ATCTGTTCCA ACAGTTCAGA AAATCAAAAG AGAACAACAT 4440 
CTTCTATTCC CCTATCAGCA TCACATCAGC ATTAGGGATG GTCCTCTTAG GAGCCAAAGA 4500 
CAACACTGCA CAACAAATTA GCAAGGTAGC TATCAGCATC ATTACGTTGT CCTGTTGCAG 4560 
TTTTTCTCTG GTTCCGTCGG CTAGCACGCA GATGGTAATA GATGTGGTGG TCTGATGGGT 4620 
AGCACAGGGG GCTGTGCAGG AATTCCCATA ACTGTGAGAC CACTGACTTA AACAGATCTT 4680 
TTGAGTAAAG TTTTCTTGTC CCGCTTCATG TCTCTTCCAG GTTCTTCACT TTGATCAAGT 4740 
CACAGAGAAC ACCACAGAAA AAGCTGCAAC ATATCATGTG AGTCACAGAG CACTCTGATT 4800 
CAGCTTTAGA TCCCTGAACA GGTCATAGTT TAAACCTGGA ACTTCACAAA AACTAAGAAA 4860 
AGGCCAGTTT TAGGGAAAAT CTTGGACACA AAGATTGAGA CATACAGAGT GGGTTGGCAT 4920 
TTCATGGCAC ATAATTATTA TTCCTCATTT CTGCGTTACT AAAAGACAGT CAGCACTGTA 4980 
CCTCAGAGCA TAGGTCTGGA TCAGGATAGG CTGGGTTCAG ACTCCAGCTT TGCTCTTCAC 5040 
AAATGATGAA TAAGAGCAGG ACACAACTGC TCGGAGTCCC AGTGACCTCA TCCCAGAAAA 5100 
CTAAGGGTAA GAAAAAATCT GACTCAATAC ATGCAAATAC ATGCAAATGT TTACAACAGT 5160 
GCCTTGCCCA TAAAAGTCAT AATAAATGTT ATTATTATTA TAAAGTAGCT ATAATTATAC 5220 
TAATCATAAT AATGTGAAAA TAATTTAATT TTCATTGAGT CATTAATGAG ATTCAGAGGA 5280 
ATAAGCACAA GTCCAAGTAT ATTTTGGAAA ATGATTGCTA TGGAATATAT TGGTTTAGAG 5340 
CCTTAATAGT GCAAAATGCT TTGCTGGAAG GTAGAAAGTT CTAGATTTAA ACAGGCTTAG 5400 
GTTCAAAACT TGGCACTTCT AATTTATGTC TCTATAAACA GGGTTTTTTT CCCCATTCTC 5460 
TGAGCTTTCT TGTGTTCATC TGAATTGAAC TAAAGACTTA GAGTTACCCA TGTAAAGTCC 5520 
TTAGCCATGG ACCTGGCATA CACTCTTCTT ACGTGCAGAG AATGACCATC ATGAGGAAAG 5580 
AGCCACAGAT CAGTCAATGT GTCCTACAAG ATAATAGCAC CAACAGGTAT AACAGGGCTT 5640 
CCTGGCATAA TCTATTTAAA ATATCCAACC TTCAACATAC TCGTATCCTT GATGACTGTT 5700 
AGAAGTGAAA TATGGTCCTT GCCCATAAGG AGCTGAGAGT TTAACTGGGA AGCTAAACCT 5760 
AACCCTTTAA ACCAACAAGG AGAAAATCTA CTGGTAGACA GCGCTGCATC TTTAGTTCAG 5820 
AAGAGAAAAG ATTGCAGTAC GTTAGAGCAA GAAGAATTTT CTGGAAGAAG TCAAATATAA 5880 
GGTGGATTTT GAAGGGTATT TGAGGTGAAA TACACCAATT ATCAGGGAAT AACATCAAAG 5940 
GTCCTCAATG AGACTACCAG CATTTAGGGA CTGATCTAAC AGACTTAGCA TGGGTTTAGT 6000 
ATTTACATTG ATACAGCAAT TGAATGATCT CCTTTTTTGA TGTTTGAAGG TTGATAGGTC 6060 
AGGAAATGTT CATCACCAGT TTCAAAAGCT TCTGACTGAA TTCAACAAAT CCACTGATGC 6120 
ATATGAGCTG AAGATCGCCA ACAAGCTCTT CGGAGAAAAG ACGTATCAAT TTTTACAGGT 6180 
AATTTCACCT GGCCTACCCA CATTTCATTT GCATCCTGAT GTCTGTGTCT CTGAGTGGCC 6240 
AAATGGAAGA AAGCAAGGCA GATGAGCCTG GCCGACCCAG GTGGAGAGCA TTTACTCAGA 6300 
GTGCATTAGC TCCATTTCCA CAACTCTCCC CCACTGGAGT GTCCCAGACC CCAACGATAC 6360 
ATCACTGAAG TGTGGATTTA GGGATAATCT TGTGATAAAA GAGGAGGTTG TGTAATAGAG 6420 
TGAGTAAGAG TAATAAGTAA TAAGATACCA TCGATAAACT GGCACTGACT CAGTCACATA 6480 
CGATACATCT TGGTGGGAAA TGTATGACTA ATGGGATATT ATTGGAATGG GCAGGCTTGG 6540 
GTGAGTTCCT GAGAATAGTT GAGGAAGTAC CAGGAAATAT TGAATGCACA GGATGAAAGA 6600 
CAAAAACAAA GATCAGAAAC ATCATGGTTA AAATTACTGG AGAGAAGTCT GAGAAGCAAT 6660 
GAATCTCCTT CAGGGAAGCC TGCTCTGCAG TTTGCAAACC ACAGCCTCTT CTGCTTCTGC 6720 
CTTTTGCCAA GATGATATTG ACCTTCAGTG ACCTCTTTCT TGTGCCAGCC CACATTCCCC 6780 
TTTTGCATTG CCTACATGAC ACCTGTATAA AAATATCCAT GGACAGGAGA TACTGCATCT 6840 
ATTCAGGGTC TGGATTCAGC TTACTGTTGT TACAAATAAG TAAGTTTGGT AATATATAGT 6900 
TACATAAATT ACTCCTAATT CCTACTTCTT CCTTCATATC TCAAAGGAAT ATTTAGATGC 6960 
CATCAAGAAA TTTTACCAGA CCAGTGTGGA ATCTACTGAT TTTGCAAATG CTCCAGAAGA 7020 
AAGTCGAAAG AAGATTAACT CCTGGGTGGA AAGTCAAACG AATGGTAGGA GAGCCACCCA 7080 
TTATAGAAAC ACCTTTGAGA AACCTATGCC AGTGAGCCTT GTGCTTGACA CTGCATGGGG 7140 
GAACAGGTGT GGGGATTGAG ATGGGTTTGC AGGGAGGGCT GAAGAGGGCA CTCCAGATGA 7200 
AGGATTTGTC CAAATGAATA TGAAGAGAGC CTAGGGGAGC CAAGGAGGAA ATCACAGGAA 7260 
GCCAATTAGA TGGAAACACA TCTGGAGAAT TATTTGCTTA TGGCCCTGCA TGACAATAGC 7320 
TTTGTGGATC CCCTGTCTCC GCTCAGACCT ATTTTGAGAT CATATCCTTT ACTTTAAATC 7380 
AGACTCAAAT TTTTATGATG AATATTTAAT AGAAAACATT AGAAAGCGTC TCTCGTCTCC 7440 
TTTACTAATT GGGAAACAAG CAGCTCTCTG GTAAATCACC CTTTTGTCTC TGAGCTGGAG 7500 
CTGCCTGGAT CACATCTGTA GCCAATGTGT TCTGCAGGGA TTATCACAGC TCTCTTCCCC 7560 
ATCAAGGGCA AAGAGCTTGA CAAAGTCTCC ATTCTACAGA CATCTTTCTT ACCTCCCACC 7620 
TCTCATTACA GGCCAAACTT ACAGCAACTC AACATGAGAG TGAATAGGAA GATACCCCCG 7680 
GAAGTAGTGT CTGACAGCAC AGGACATGCG TTTCATATTA CAGAGCTCAA GTCACTCATC 7740 
CTAAAATGCA ATCAGGGCCT CCTTCCTCTG AATGGGGACC CCGTAGTTAA AAAAAAATAA 7800 
AAGTAGGAAG AGGAGGGAGG GAGAAAGGAA AGACACATGT TGGAAGAGTA GACAAAATCA 7860 
GTTTATCAGT ATTCCAAATC AGATGATTGG AGACATTCAT ACACAGAGAA CGTGAACTCC 7920 
TTCTCTATCA CAAGAAGTGA TGTCTCCATC AAGGGTAACT TTATACGACT GGAGCCTTGA 7980 
AGAAAGCTGC ATCTGGTGAA CCACTGGTCA GTGAGTCTAA CAATTCAAAG ATCAAAGTCA 8040 
GTGAGTCTCA AGCAGGGATT TGGGTCAATA ATTAACGATC AGTCACGAAC ATTTGCAAAG 8100 
CATCTTCCAG ACAAGCCATT TGTAGCTTGT GTAAAAGACT CTTTTATTCT TTCCCTTGCA 8160 
GAAAAAATTA AAAACCTATT TCCTGATGGG ACTATTGGCA ATGATACGAC ACTGGTTCTT 8220 
GTGAACGCAA TCTATTTCAA AGGGCAGTGG GAGAATAAAT TTAAAAAAGA AAACACTAAA 8280 
GAGGAAAAAT TTTGGCCAAA CAAGGTATTG TCTATATTTT ATTTATATAG TGTAATATGT 8340 
TAATACATGG AATGTTAAAC ATTTCTGATG GAATGTAACA TGATAAGTAA AAAATAAAAA 8400 
TTGTTCATGT CTGTTATTTT GTTGTTTTAC TCTTATAACT TTATTTAGTT AGGAATACCT 8460 
GAAAAACTAT TGTTTCTAAC TCATGGAATT CCTGGGTTAT TTCTTAGAAG AAGAAGGATG 8520 
TGTTGCTATC TCAATAATAT TATCTTTTTT GTCTTGTGTT TCACGTGTTA TTTGTTGGAC 8580 
ACATTGATTT ATTGCAGAAT ACATACAAAT CTGTACAGAT GATGAGGCAA TACAATTCCT 8640 
TTAATTTTGC CTTGCTGGAG GATGTACAGG CCAAGGTCCT GGAAATACCA TACAAAGGCA 8700 
AAGATCTAAG CATGATTGTG CTGCTGCCAA ATGAAATCGA TGGTCTGCAG AAGGTAAGAA 8760 
CTTGCATCTA CAACTCTTCC TTCTACTGCC GGACATTTTT CCAAAGATAC CAAGTTTAAA 8820 
CAAGGTAAAA GCTTATGACC GAGTTGCCTC AAAATGATGA AAAATTCTAA ATGAGGAATG 8880 
ATGACTCACC TTCATATTAC AAATATTTGA GCATAGGGCC TGACACAAAC TGAAAGCTTA 8940 
GTTTTTGTTT GTTTGTTTGT TTTTATTATT ATTATTATAA TACTTTAAGC TTTAGGGTAC 9000 
ATGTGCACAA TGTGCAGGTT AGTTACATAT GTATACATGT GCCATGCTGG TGTGCTGCAC 9060 
CCATTAACTC ATCATTTAGC GTTAGGTATA TCTCCTAATG CTATCCCTCC CCCCTCCCCC 9120 
CACCCCACAA CAGTCCTCAG AGTGTGATGT TACCTTCCTG TGTCCAAGTG TTCTCATTGT 9180 
TCAATTCCCA TCTATGATTT AATTCCATCT ATGGCTTAGT TAATGATTAA TTTATTAGAG 9240 
TTACATGCAT TGGATATCAA TTTGATGATA TTATTATGCA GCAATTTAAA CTTGACTGGG 9300 
AGAAATATAT ACCAATGTGA GGAAAGTTTA CAAATAGGCC GAGTAGAAAA GGGAATACAA 9360 
ATTTAGGAAT TTAGGGAATT ACAATTTAAT AATTGCAATG TGTACTAAAT AATGTATACA 9420 
GAAAAATATG ATGAGCCTAT TAAAAATTGA CACATGTAGT AGGCTGTTGG CACAAGAAAT 9480 
AGTGATACAT ACAGTTCATT GTGTACAAAA TAATGTAATC ATATTTTACA TGTGTATCAT 9540 
ACAGTTGTAT ACATACATAT GTACACATAT ACATATACGT AAAAACATGA TTCTGTTTTT 9600 
ACATACATGT ATATACATAT ACACATATAA CCCAATGTAT TTATATATTC AGGACTCATA 9660 
TTTTACCTAT TAGAATAATA ATGTCTATTA AAGTGAACCT TCTGTATTTC ACATTTATTG 9720 
CCAAAATAAC GAATCTCCAC ATAGTCAATT CATTGTTAAG GTGTATTAGA GATCGACAGT 9780 
TAGTCATATC AGTTTCTTTT TTCCATTTGT ATAGCTTGAA GAGAAACTCA CTGCTGAGAA 9840 
ATTGATGGAA TGGACAAGTT TGCAGAATAT GAGAGAGACA TGTGTCGATT TACACTTACC 9900 
TCGGTTCAAA ATGGAAGAGA GCTATGACCT CAAGGACACG TTGAGAACCA TGGGAATGGT 9960 
GAATATCTTC AATGGGGATG CAGACCTCTC AGGCATGACC TGGAGCCACG GTCTCTCAGT 10020 
ATCTAAAGTC CTACACAAGG CCTTTGTGGA GGTCACTGAG GAGGGAGTGG AAGCTGCAGC 10080 
TGCCACCGCT GTAGTAGTAG TCGAATTATC ATCTCCTTCA ACTAATGAAG AGTTCTGTTG 10140 
TAATCACCCT TTCCTATTCT TCATAAGGCA AAATAAGACC AACAGCATCC TCTTCTATGG 10200 
CAGATTCTCA TCCCCATAGA TGCAATTAGT CTGTCACTCC ATTTAGAAAA TGTTCACCTA 10260 
GAGGTGTTCT GGTAAACTGA TTGCTGGCAA CAACAGATTC TCTTGGCTCA TATTTCTTTT 10320 
CTATCTCATC TTGATGATGA TAGTCATCAT CAAGAATTTA ATGATTAAAA TAGCATGCCT 10380 
TTCTCTCTTT CTCTTAATAA GCCCACATAT AAATGTACTT TTCCTTCCAG AAAAATTTCC 10440 
CTTGAGGAAA AATGTCCAAG ATAAGATGAA TCATTTAATA CCGTGTCTTC TAAATTTGAA 10500 
ATATAATTCT GTTTCTGACC TGTTTTAAAT GAACCAAACC AAATCATACT TTCTCTTCAA 10560 
ATTTAGCAAC CTAGAAACAC ACATTTCTTT GAATTTAGGT GATACCTAAA TCCTTCTTAT 10620 
GTTTCTAAAT TTTGTGATTC TATAAAACAC ATCATCAATA AAATAATGAC ATAAAATCAT 10680 
TTTTGCTTTA CCTGTTTTCT CTCTGGAAAG GGCAAGTGTC CAGTTACACA TAGGAAAGAT 10740 
AATTTAGAGA TATATTAATC ATATATAAAG GAAAATTAAA AACAGAGTAG TTCATGATGA 10800 
GCCTGGAGTA GAAGGCATAT CCCAGAACAG GAGGAGCCTT GTAAACCACA TAGGAACTTC 10860 
CTATTTTATG CTAAAGGGAT AAGAAACTCA TTACAGGCTT TGATGGTTGT TTGTCAAAGA 10920 
GGGGCATAAA ATTATCATAT CCACATCTAG AAAATACATC TCTGGCTACG CTGATATCAA 10980 
TGGATGCGAG GAAAGAACAG TGTGGTTACC ATATATAAAT TAGGAAATCA TTAGAGTATT 11040 
GGGAGTGGAA ATGGAGAGAA AGAAAGAGCC TGGGGGAATT ATTTAGGAAA TAATAGTTAC 11100 
AGAAAGACAT CTAAGTTGCT GACCTATCTG ACTGGATGGA TGGAAGAATA TCTTGTTTCT 11160 
GAGAGAAAAA AAGACTTTGG GTTTAAATTT GTACTTGATG AATTAAGGTA CTTTTAATAT 11220 
TCAAATGGAT TTGCCTGGCA GGCACTTGAA GATATTAGTC TAAATCTCAG AAACAGAATA 11280 
TGATCTGAAG CTCTAAATTT GTGATATTCA ATATAAATAC TTTAGAGTCA TTGGGATAAA 11340 
TATGGTAGTT GTAGCTAAAA GCAAAAATAA GATACTAGGG AGAAAGGATA AAGTTAGAAG 11400 
AAAGAAGAAT CTAGAATTGA CCTTGAAGTA TATCAGCATG TGTAAAGATC AGGAATTGAT 11460 
CATTTTTATT TTCCAGAAAG TAGCTTTTCT TAGGGTTCCA TATTTACTCC CATAGATTCT 11520 
TCCC 

Seq ID NO: 465 Protein sequence 
Protein Accession #: BAB21525.1 

1 11 21 31 41 51 

I I I I I I 

MNSLSEANTK FMFDLFQQFR KSKENNIFYS PISITSALGM VLLGAKDNTA QQISKVLHFD 60 
QVTENTTEKA ATYHVDRSGN VHHQFQKLLT EFNKSTDAYE LKIANKLFGE KTYQFLQEYL 120 
DAIKKFYQTS VESTDFANAP EESRKKINSW VESQTNEKIK NLPPDGTIGN DTTLVLVNAI 180 
YFKGQWENKF KKENTKEEKF WPNKNTYKSV QMMRQYNSFN FALLEDVQAK VLEIPYKGKD 240 
LSMIVLLPNE IDGLQKLEEK LTAEKLMEWT SLQNMRETCV DLHLPRFKME ESYDLKDTLR 300 
TMGMVNIFNG DADLSGMTWS HGLSVSKVLH KAFVEVTEEG VEAAAATAVV VVELSSPSTN 360 
EEFCCNHPFL FFIRQNKTNS ILFYGRFSSP 

Seq ID NO: 466 DNA sequence 

Nucleic Acid Accession #: NM_001910.1 

Coding sequence: 50..1240 

1 11 21 31 41 51 

I I I I I I 

GGAGAGAAGA AAGGAGGGGG CAAGGGAGAA GCTGCTGGTC GGACTCACAA TGAAAACGCT 60 
CCTTCTTTTG CTGCTGGTGC TCCTGGAGCT GGGAGAGGCC CAAGGATCCC TTCACAGGGT 120 
GCCCCTCAGG AGGCATCCGT CCCTCAAGAA GAAGCTGCGG GCACGGAGCC AGCTCTCTGA 180 
GTTCTGGAAA TCCCATAATT TGGACATGAT CCAGTTCACC GAGTCCTGCT CAATGGACCA 240 
GAGTGCCAAG GAACCCCTCA TCAACTACTT GGATATGGAA TACTTCGGCA CTATCTCCAT 300 
TGGCTCCCCA CCACAGAACT TCACTGTCAT CTTCGACACT GGCTCCTCCA ACCTCTGGGT 360 
CCCCTCTGTG TACTGCACTA GCCCAGCCTG CAAGACGCAC AGCAGGTTCC AGCCTTCCCA 420 
GTCCAGCACA TACAGCCAGC CAGGTCAATC TTTCTCCATT CAGTATGGAA CCGGGAGCTT 480 
GTCCGGGATC ATTGGAGCCG ACCAAGTCTC TGTGGAAGGA CTAACCGTGG TTGGCCAGCA 540 
GTTTGGAGAA AGTGTCACAG AGCCAGGCCA GACCTTTGTG GATGCAGAGT TTGATGGAAT 600 
TCTGGGCCTG GGATACCCCT CCTTGGCTGT GGGAGGAGTG ACTCCAGTAT TTGACAACAT 660 
GATGGCTCAG AACCTGGTGG ACTTGCCGAT GTTTTCTGTC TACATGAGCA GTAACCCAGA 720 
AGGTGGTGCG GGGAGCGAGC TGATTTTTGG AGGCTACGAC CACTCCCATT TCTCTGGGAG 780 
CCTGAATTGG GTCCCAGTCA CCAAGCAAGC TTACTGGCAG ATTGCACTGG ATAACATCCA 840 
GGTGGGAGGC ACTGTTATGT TCTGCTCCGA GGGCTGCCAG GCCATTGTGG ACACAGGGAC 900 
TTCCCTCATC ACTGGCCCTT CCGACAAGAT TAAGCAGCTG CAAAACGCCA TTGGGGCAGC 960 
CCCCGTGGAT GGAGAATATG CTGTGGAGTG TGCCAACCTT AACGTCATGC CGGATGTCAC 1020 
CTTCACCATT AACGGAGTCC CCTATACCCT CAGCCCAACT GCCTACACCC TACTGGACTT 1080 
CGTGGATGGA ATGCAGTTCT GCAGCAGTGG CTTTCAAGGA CTTGACATCC ACCCTCCAGC 1140 
TGGGCCCCTC TGGATCCTGG GGGATGTCTT CATTCGACAG TTTTACTCAG TCTTTGACCG 1200 
TGGGAATAAC CGTGTGGGAC TGGCCCCAGC AGTCCCCTAA GGAGGGGCCT TGTGTCTGTG 1260 
CCTGCCTGTC TGACAGACCT TGAATATGTT AGGCTGGGGC ATTCTTTACA CCTACAAAAA 1320 
GTTATTTTCC AGAGAATGTA GCTGTTTCCA GGGTTGCAAC TTGAATTAAG ACCAAACAGA 1380 
ACATGAGAAT ACACACACAC ACACACATAT ACACACACAC ACACTTCACA CATACACACC 1440 
ACTCCCACCA CCGTCATGAT GGAGGAATTA CGTTATACAT TCATATTTTG TATTGATTTT 1500 
TGATTATGAA AATCAAAAAT TTTCACATTT GATTATGAAA ATCTCCAAAC ATATGCACAA 1560 
GCAGAGATCA TGGTATAATA AATCCCTTTG CAACTCCACT CAGCCCTGAC AACCCATCCA 1620 
CACACGGCCA GGCCTGTTTA TCTACACTGC TGCCCACTCC TCTCTCCAGC TCCACATGCT 1680 
GTACCTGGAT CATTCTGAAG CAAATTCCGA GCATTACATC ATTTTGTCCA TAAATATTTC 1740 
TAACATCCTT AAATATACAA TCGGAATTCA AGCATCTCCC ATTGTCCCAC AAATGTTTGG 1800 
CTGTTTTTGT AGTTGGATTG TTTGTATTAG GATTCAAGCA AGGCCCATAT ATTGCATTTA 1860 
TTTGAAATGT CTGTAAGTCT CTTTCCATCT ACAGAGTTTA GCACATTTGA ACGTTGCTGG 1920 
TTGAAATCCC GAGGTGTCAT TTGACATGGT TCTCTGAACT TATCTTTCCT ATAAAATGGT 1980 
AGTTAGATCT GGAGGTCTGA TTTTGTGGCA AAAATACTTC CTAGGTGGTG CTGGGTACTT 2040 
CTTGTTGCAT CCTGTCAGGA GGCAGATAAT GCTGGTGCCT CTCTATTGGT AATGTTAAGA 2100 
CTGCTGGGTG GGTTTGGAGT TCTTGGCTTT AATCATTCAT TACAAAGTTC AGCATTTT 



Seq ID NO: 467 Protein sequence 
Protein Accession #: NP_001901.1 



1 11 21 31 41 51 

I I I I I I 

MKTLLLLLLV LLELGEAQGS LHRVPLRRHP SLKKKLRARS QLSEFWKSHN LDMIQFTESC 60 
SMDQSAKEPL INYLDMEYFG TISIGSPPQN FTVIFDTGSS NLWVPSVYCT SPACKTHSRF 120 
QPSQSSTYSQ PGQSFSIQYG TGSLSGIIGA DQVSVEGLTV VGQQFGESVT EPGQTFVDAE 180 
FDGILGLGYP SLAVGGVTPV FDNMMAQNLV DLPMFSVYMS SNPEGGAGSE LIFGGYDHSH 240 
FSGSLNWVPV TKQAYWQIAL DNIQVGGTVM FCSEGCQAIV DTGTSLITGP SDKIKQLQNA 300 
IGAAPVDGEY AVECANLNVM PDVTFTINGV PYTLSPTAYT LLDFVDGMQF CSSGFQGLDI 360 
HPPAGPLWIL GDVFIRQFYS VFDRGNNRVG LAPAVP 

Seq ID NO: 468 DNA sequence 

Nucleic Acid Accession #: NM_018058.1 

Coding sequence: 319.. 1575 

1 11 21 31 41 51 

I I I I I I 

TACGCGCTGC GGGACCGGCA GGGGAACGCC ATCGGGGTCA CAGCCTGCGA CATCGACGGG 60 
GACGGCCGGG AGGAGATCTA CTTCCTCAAC ACCAATAATG CCTTCTCGGG GGTGGCCACG 120 
TACACCGACA AGTTGTTCAA GTTCCGCAAT AACCGGTGGG AAGACATCCT GAGCGATGAG 180 
GTCAACGTGG CCCGTGGTGT GGCCAGCCTC TTTGCCGGAC GCTCTGTGGC CTGTGTGGAC 240 
AGAAAGGGCT CTGGACGCTA CTCTATCTAC ATTGCCAATT ACGCCTACGG TAATGTGGGC 300 
CCTGATGCCC TCATTGAAAT GGACCCTGAG GCCAGTGACC TCTCCCGGGG CATTCTGGCG 360 
CTCAGAGATG TGGCTGCTGA GGCTGGGGTC AGCAAATATA CAGGGGGCCG AGGCGTCAGC 420 
GTGGGCCCCA TCCTCAGCAG CAGTGCCTCG GATATCTTCT GCGACAATGA GAATGGGCCT 480 
AACTTCCTTT TCCACAACCG GGGCGATGGC ACCTTTGTGG ACGCTGCGGC CAGTGCTGGT 540 
GTGGACGACC CCCACCAGCA TGGGCGAGGT GTCGCCCTGG CTGACTTCAA CCGTGATGGC 600 
AAAGTGGACA TCGTCTATGG CAACTGGAAT GGCCCCCACC GCCTCTATCT GCAAATGAGC 660 
ACCCATGGGA AGGTCCGCTT CCGGGACATC GCCTCACCCA AGTTCTCCAT GCCCTCCCCT 720 
GTCCGCACGG TCATCACCGC CGACTTTGAC AATGACCAGG AGCTGGAGAT CTTCTTCAAC 780 
AACATTGCCT ACCGCAGCTC CTCAGCCAAC CGCCTCTTCC GCGTCATCCG TAGAGAGCAC 840 
GGAGACCCCC TCATCGAGGA GCTCAATCCC GGCGACGCCT TGGAGCCTGA GGGCCGGGGC 900 
ACAGGGGGTG TGGTGACCGA CTTCGACGGA GACGGGATGC TGGACCTCAT CTTGTCCCAT 960 
GGAGAGTCCA TGGCTCAGCC GCTGTCCGTC TTCCGGGGCA ATCAGGGCTT CAACAACAAC 1020 
TGGCTGCGAG TGGTGCCACG CACCCGGGTT GGGGCCTTTG CCAGGGGAGC TAAGGTCGTG 1080 
CTCTACACCA AGAAGAGTGG GGCCCACCTG AGGATCATCG ACGGGGGCTC AGGCTACCTG 1140 
TGTGAGATGG AGCCCGTGGC ACACTTTGGC CTGGGGAAGG ATGAAGCCAG CAGTGTGGAG 1200 
GTGACGTGGC CAGATGGCAA GATGGTGAGC CGGAACGTGG CCAGCGGGGA GATGAACTCA 1260 
GTGCTGGAGA TCCTCTACCC CCGGGATGAG GACACACTTC AGGACCCAGC CCCACTGGAG 1320 
ACACCAATGA ATGCATCCAG TTCCCATTCG TGTGCCCTCG AGACAAGCCC GTATGTGTCA 1380 
ACACCTATGG AAGCTACAGG TGCCGGACCA ACAAGAAGTG CAGTCGGGGC TACGAGCCCA 1440 
ACGAGGATGG CACAGCCTGC GTGGGGACTC TCGGCCAGTC ACCGGGCCCC CGCCCCACCA 1500 
CCCCCACCGC TGCTGCTGCC ACTGCCGCTG CTGCTGCCGC TGCTGGAGCT GCCACTGCTG 1560 
CACCGGTCCT CGTAGATGGA GATCTCAATC TGGGGTCGGT GGTTAAGGAG AGCTGCGAGC 1620 
CCAGCTGCTG AGCAGGGGTG GGACATGAAC CAGCGGATGG AGTCCAGCAG GGGAGTGGGA 1680 
AAGTGGGCTT GTGCTGCTGC CTAGACAGTA GGGATGTAAA GGCCTGGGAG CTAGACCCTC 1740 
CCCAAGCCCA TCCATGCACA TTACTTAGCT AACAATTAGG GAGACTCGTA AGGCCAGGCC 1800 
CTGTGCTGGG CACATAGCTG TGATCACAGC AGACAGGGTC GCTGCCCTGA TGGCGCTTAC 1860 
ATTCCAGTGG GTCTAATGAC CATATCTTAG GACACAGATG TGCCCAGGGA GGTGGTGTCA 1920 
CTGCACAGGA AGTATGAGGA CTTTAGTGTC CTGAGTTCAA ATCCTGATTC AGGAACTCAC 1980 
AAAGCTATGT GACCTTACAC CAGTCACTTA ACTTGTTAGC CATCCATTAT CGCATCTGCA 2040 
AAATGGGGAT TAAGAATAGA ATCTTGGGGT TAGTGTGGAG ATTAGATTAA ATGTATGTAA 2100 
GACACTTGGC ACAAAACCTG GCACATAGTA AAGGCTCAAT AAAAACAAGT GCCTCTCACT 2160 
GGGCTTTGTC AACACGTG 

Seq ID NO: 469 Protein sequence 
Protein Accession #: NP_060528.1 



1 11 21 31 41 51 

I I I I I I 

MDPEASDLSR GILALRDVAA EAGVSKYTGG RGVSVGPILS SSASDIFCDN ENGPNFLFHN 60 
RGDGTFVDAA ASAGVDDPHQ HGRGVALADF NRDGKVDIVY GNWNGPHRLY LQMSTHGKVR 120 
FRDIASPKFS MPSPVRTVIT ADFDNDQELE IFFNNIAYRS SSANRLFRVI RREHGDPLIE 180 
ELNPGDALEP EGRGTGGVVT DFDGDGMLDL ILSHGESMAQ PLSVFRGNQG FNNNWLRVVP 240 
RTRVGAFARG AKVVLYTKKS GAHLRIIDGG SGYLCEMEPV AHFGLGKDEA SSVEVTWPDG 300 
KMVSRNVASG EMNSVLEILY PRDEDTLQDP APLETPMNAS SSHSCALETS PYVSTPMEAT 360 
GAGPTRSAVG ATSPTRMAQP AWGLSASHRA PAPPPPPLLL PLPLLLPLLE LPLLHRSS 



Seq ID NO: 470 DNA sequence 
Nucleic Acid Accession #: AJ279016 
Coding sequence: 1..1962 



1 11 21 31 41 51 

I I I I I I 

ATGTCCAGGA TGTTACCGTT CCTGCTGCTG CTCTGGTTTC TGCCCATCAC TGAGGGGTCC 60 
CAGCGGGCTG AACCCATGTT CACTGCAGTC ACCAACTCAG TTCTGCCTCC TGACTATGAC 120 
AGTAATCCCA CCCAGCTCAA CTATGGTGTG GCAGTTACTG ATGTGGACCA TGATGGGGAC 180 
TTTGAGATCG TCGTGGCGGG GTACAATGGA CCCAACCTGG TTCTGAAGTA TGACCGGGCC 240 
CAGAAGCGGC TGGTGAACAT CGCGGTCGAT GAGCGCAGCT CACCCTACTA CGCGCTGCGG 300 
GACCGGCAGG GGAACGCCAT CGGGGTCACA GCCTGCGACA TCGACGGGGA CGGCCGGGAG 360 
GAGATCTACT TCCTCAACAC CAATAATGCC TTCTCGGGGG TGGCCACGTA CACCGACAAG 420 
TTGTTCAAGT TCCGCAATAA CCGGTGGGAA GACATCCTGA GCGATGAGGT CAACGTGGCC 480 
CGTGGTGTGG CCAGCCTCTT TGCCGGACGC TCTGTGGCCT GTGTGGACAG AAAGGGCTCT 540 
GGACGCTACT CTATCTACAT TGCCAATTAC GCCTACGGTA ATGTGGGCCC TGATGCCCTC 600 
ATTGAAATGG ACCCTGAGGC CAGTGACCTC TCCCGGGGCA TTCTGGCGCT CAGAGATGTG 660 
GCTGCTGAGG CTGGGGTCAG CAAATATACA GGGGGCCGAG GCGTCAGCGT GGGCCCCATC 720 
CTCAGCAGCA GTGCCTCGGA TATCTTCTGC GACAATGAGA ATGGGCCTAA CTTCCTTTTC 780 
CACAACCGGG GCGATGGCAC CTTTGTGGAC GCTGCGGCCA GTGCTGGTGT GGACGACCCC 840 
CACCAGCATG GGCGAGGTGT CGCCCTGGCT GACTTCAACC GTGATGGCAA AGTGGACATC 900 
GTCTATGGCA ACTGGAATGG CCCCCACCGC CTCTATCTGC AAATGAGCAC CCATGGGAAG 960 
GTCCGCTTCC GGGACATCGC CTCACCCAAG TTCTCCATGC CCTCCCCTGT CCGCACGGTC 1020 
ATCACCGCCG ACTTTGACAA TGACCAGGAG CTGGAGATCT TCTTCAACAA CATTGCCTAC 1080 
CGCAGCTCCT CAGCCAACCG CCTCTTCCGC GTCATCCGTA GAGAGCACGG AGACCCCCTC 1140 
ATCGAGGAGC TCAATCCCGG CGACGCCTTG GAGCCTGAGG GCCGGGGCAC AGGGGGTGTG 1200 
GTGACCGACT TCGACGGAGA CGGGATGCTG GACCTCATCT TGTCCCATGG AGAGTCCATG 1260 
GCTCAGCCGC TGTCCGTCTT CCGGGGCAAT CAGGGCTTCA ACAACAACTG GCTGCGAGTG 1320 
GTGCCACGCA CCCGGTTTGG GGCCTTTGCC AGGGGAGCTA AGGTCGTGCT CTACACCAAG 1380 
AAGAGTGGGG CCCACCTGAG GATCATCGAC GGGGGCTCAG GCTACCTGTG TGAGATGGAG 1440 
CCCGTGGCAC ACTTTGGCCT GGGGAAGGAT GAAGCCAGCA GTGTGGAGGT GACGTGGCCA 1500 
GATGGCAAGA TGGTGAGCCG GAACGTGGCC AGCGGGGAGA TGAACTCAGT GCTGGAGATC 1560 
CTCTACCCCC GGGATGAGGA CACACTTCAG GACCCAGCCC CACTGGAGTG TGGCCAAGGA 1620 
TTCTCCCAGC AGGAAAATGG CCATTGCATG GACACCAATG AATGCATCCA GTTCCCATTC 1680 
GTGTGCCCTC GAGACAAGCC CGTATGTGTC AACACCTATG GAAGCTACAG GTGCCGGACC 1740 
AACAAGAAGT GCAGTCGGGG CTACGAGCCC AACGAGGATG GCACAGCCTG CGTGGGGACT 1800 
CTCGGCCAGT CACCGGGCCC CCGCCCCACC ACCCCCACCG CTGCTGCTGC CACTGCCGCT 1860 
GCTGCTGCCG CTGCTGGAGC TGCCACTGCT GCACCGGTCC TCGTAGATGG AGATCTCAAT 1920 
CTGGGGTCGG TGGTTAAGGA GAGCTGCGAG CCCAGCTGCT GAGCAGGGGT GGGACATGAA 1980 
CCAGCGGATG GAGTCCAGCA GGGGAGTGGG AAAGTGGGCT TGTGCTGCTG CCTAGACAGT 2040 
AGGGATGTAA AGGCCTGGGA GCTAGACCCT CCCCAAGCCC ATCCATGCAC ATTACTTAGC 2100 
TAACAATTAG GGAGACTCGT AAGGCCAGGC CCTGTGCTGG GCACATAGCT GTGATCACAG 2160 
CAGACAGGGT CGCTGCCCTG ATGGCGCTTA CATTCCAGTG GGTCTAATGA CCATATCTTA 2220 
GGACACAGAT GTGCCCAGGG AGGTGGTGTC ACTGCACAGG AAGTATGAGG ACTTTAGTGT 2280 
CCTGAGTTCA AATCCTGATT CAGGAACTCA CAAAGCTATG TGACCTTACA CCAGTCACTT 2340 
AACTTGTTAG CCATCCATTA TCGCATCTGC AAAATGGGGA TTAAGAATAG AATCTTGGGG 2400 
TTAGTGTGGA GATTAGATTA AATGTATGTA AGACACTTGG CACAAAACCT GGCACATAGT 2460 
AAAGGCTCAA TAAAAACAAG TGCCTCTCAC TGGGCTTTGT CAACACG 



Seq ID NO: 471 Protein sequence 
Protein Accession #: CAC08451 

1 11 21 31 41 51 

I I I I I I 

MSRMLPFLLL LWFLPITEGS QRAEPMFTAV TNSVLPPDYD SNPTQLNYGV AVTDVDHDGD 60 
FEIVVAGYNG PNLVLKYDRA QKRLVNIAVD ERSSPYYALR DRQGNAIGVT ACDIDGDGRE 120 
EIYFLNTNNA FSGVATYTDK LFKFRNNRWE DILSDEVNVA RGVASLFAGR SVACVDRKGS 180 
GRYSIYIANY AYGNVGPDAL IEMDPEASDL SRGILALRDV AAEAGVSKYT GGRGVSVGPI 240 

LSSSASDIFC DNENGPNFLF HNRGDGTFVD AAASAGVDDP HQHGRGVALA DFNRDGKVDI 300 

VYGNWNGPHR LYLQMSTHGK VRFRDIASPK FSMPSPVRTV ITADFDNDQE LEIFFNNIAY 360 

RSSSANRLFR VIRREHGDPL IEELNPGDAL EPEGRGTGGV VTDFDGDGML DLILSHGESM 420 

AQPLSVFRGN QGFNNNWLRV VPRTRFGAFA RGAKVVLYTK KSGAHLRIID GGSGYLCEME 480 

PVAHFGLGKD EASSVEVTWP DGKMVSRNVA SGEMNSVLEI LYPRDEDTLQ DPAPLECGQG 540 

FSQQENGHCM DTNECIQFPF VCPRDKPVCV NTYGSYRCRT NKKCSRGYEP NEDGTACVGT 600 
LGQSPGPRPT TPTAAAATAA AAAAAGAATA APVLVDGDLN LGSVVKESCE PSC 

Seq ID NO: 472 DNA sequence 
Nucleic Acid Accession #: FGENESHH 
Coding sequence: 1..4794 

1 11 21 31 41 51 

I I I I I I 

ATGGCGTGTC CGGGAGGACT CCCAGCCCGT TGCTCTGGTT GGATGGGACT GGGTGGGCCC 60 

AGCGGCTCCT CCCCAGCATC CCCTCCCCAT TCCTCCTCCA GGTACAATGG ACCCAACCTG 120 

GTTCTGAAGT ATGACCGGGC CCAGAAGCGG CTGGTGAACA TCGCGGTCGA TGAGCGCAGC 180 

TCACCCTACT ACGCGCTGCG GGACCGGCAG GGGAACGCCA TCGGGGTCAC AGCCTGCGAC 240 

ATCGACGGGG ACGGCCGGGA GGAGATCTAC TTCCTCAACA CCAATAATGC CTTCTCGGGC 300 

CACAGCAGCT CAGCGCAGGT CCCTTCTGGG CTCCACAGAA ACAGGCCTGT GCTGAAGCCT 360 

CCACCTACAA CCCCTGCAGG CCTCCTGGGT CTGCCTCCAC TCAGCGGAAG GGACTTTTCC 420 

TCCTCCCTGG GTCAGGCTTC TCCGGACAGC AGGCAGGGAG AGAGGGTGCC GGTTCCCTGC 480 

TGTCGGGGTG GACTGAGACC TACCCATGAA CCAGAACCAT TTCTTCTGAG ACCCAAATCA 540 

GGGGTGGCCA CGTACACCGA CAAGTTGTTC AAGTTCCGCA ATAACCGGTG GGAAGACATC 600 

CTGAGCGATG AGGTCAACGT GGCCCGTGGT GTGGCCAGCC TCTTTGCCGG ACGCTCTGTG 660 

GCCTGTGTGG ACAGAAAGGG CTCTGGACGC TACTCTATCT ACATTGCCAA TTACGCCTAC 720 

GGTAATGTGG GCCCTGATGC CCTCATTGAA ATGGACCCTG AGGCCAGTGA CCTCTCCCGG 780 

GGCATTCTGG CGCTCAGAGA TGTGGCTGCT GAGGCTGGGG TCAGCAAATA TACAGAAGGC 840 

TTCTCCCACA CTGCCTCTCC AAGCATTGGT GAGATATCTG GCAGAACCGA GGAGCGGGAA 900 

GGAGGAGACC CAGAGGAGGC AGATGAGGAG CACAGTGGGG ATGGAAGCAC CAGCCAACTG 960 

TGCCGGCTGG GCTGGAAGGA CGGGCAGTTC AAGGAAGAAG CAGCAGCTTT GGTGGAGGAA 1020 

CAGAGGGAGG CTGGGGCAGC TGGCGTGCCC AGAGGACGTG TTCGAACAGC TCTGCAGACT 1080 

TCCAAAAGCC ATTTGGCTGA CAAGAACCTA TTTGGCCCAC CATGTTACTA TTCTGTCTGC 1140 

GCGCCTTCTC CAGCCCACCC TTTCCCTGCC CGCCAAGCCC CCCAACACTA CCCTGTAGCC 1200 

CCCCTTGTCA CTCAGCTAAT GACACATGGA CGTCTGGCTG GAAAACTAGC CCGGAGTGTC 1260 

CCCCACCCCC GAGCCCCAGG AATGGACCCC AAATGTAAGG GCCGCCATGC TGAGCCCGGC 1320 

CTGATGGCTG AGGCTTTGGG CGCGTGGCCA GCGCTCAGCA CCACTGTGGT GCCAGGGGGC 1380 

CTGAGAAGCT GGGAGGAAAG CAGGCAGAAG GGGCAGGCCA TGTCCAGATG TGCACTCAGG 1440 

GAGCTGGGAG GTCCCTGGAG CCAAGCCACA CAGCACCTGC CTGCTAGAGA GCTGTATGAC 1500 

CTGGGAGAAC CTCCCATTTT ACAAAGAACA GACGGAGATC CAGGGAGGAG AAGGGACTCG 1560 

CCCAAGGTCA CACAGGAGTG CCATCTAGTG GCCACCATGC CAGCTCTCGG GGGACTCGAG 1620 

GGCCCCGGGA GGGTGGCCAA GCGAGAGATT GGGAGAGAGA CTGGGGCAGT AGGAAGACCA 1680 

CTCTCCCATC CCCTGGTCCC CAACTTCCCC AGCTGCTTGA GGCCTCTTGA AGCCGGGACA 1740 

GTGCCGGGAG CTGCCCTGCC TGGGAATCCT GGGAACTGGG TTCTGGACAT GGCCAAGGCC 1800 

CTGGCGTGGA ACCAGATGGA AAAAGAGGAG GGGAAGATTC ATGGAGACCA TGAGCCCAGA 1860 

TTTAGGCTCA GGAAAGCACG GGAAGCAGAA TTCCCCCCAG GCTCCTCTGA GGAGCCTCTG 1920 

CTGCAGTTCC CCTCAGGCCT CAGAGGCAGC CCTGTCCTCC AGGTGGGCCT GGGGCTTGCT 1980 

TCTGCCACTC ACTGTGGGTC GATGTCTTTT CTAGGGGGCC GAGGCGTCAG CGTGGGCCCC 2040 

ATCCTCAGCA GCAGTGCCTC GGATATCTTC TGCGACAATG AGAATGGGCC TAACTTCCTT 2100 

TTCCACAACC GGGGCGATGG CACCTTTGTG GACGCTGCGG CCAGTGCTGA ACGTCGTTTA 2160 

GCCTTCATCG TTCACCTCAA ATATCACCTC TGCAGAGATT TTCCTCACTC CCTGTGCCAC 2220 

CTAGCAGAAA CTGGTCCTTC CTCCTCCTGC TGCCCGTGGC ATGCACGTCT TCTTCAGGCT 2280 

CCACATTGCC ATCATGGTTT GTCTATGAGC TTTACAAGGA CCGGGTCACG GTTCTATTCA 2340 

TTCTTGACGC AAGGCTTGGC CTCCAGTGCC CACCGGAGGA CACTCAGCCT CCAGGGTTCT 2400 

CAGGGGGCCC CACCCTGCCT TCTGGCAAGA GCTCCCTGTG TCCTGGGGTC TCTGATCCCC 2460 

ACTGCCTATT ACATTGTCCT GTGGTCTGCC ATCCCAGAGA GCCTGATGAC CCACAGCTAT 2520 

TTGTCCTCTG AAAGAGTCAA CGTGGGTGTG GACGACCCCC ACCAGCATGG GCGAGGTGTC 2580 

GCCCTGGCTG ACTTCAACCG TGATGGCAAA GTGGACATCG TCTATGGCAA CTGGAATGGC 2640 

CCCCACCGCC TCTATCTGCA AATGAGCACC CATGGGAAGG TCCGCTTCCG GGACATCGCC 2700 

TCACCCAAGT TCTCCATGCC CTCCCCTGTC CGCACGGTCA TCACCGCCGA CTTTGACAAT 2760 

GACCAGGAGC TGGAGATCTT CTTCAACAAC ATTGCCTACC GCAGCTCCTC AGCCAACCGC 2820 

CTCTTCCGAT GCTCCATCCT GGCTCGTGGC TCTTCATCCT TGACAGCTGG TGGGAGGAAC 2880 

GGTCAGGGAG AAGGTTTAAG AATCAGAAGG GGAGGGTTCC CAGGGCCAGG GGGTCAGGCC 2940 

AAGGTCAACA CAGGTCCCCT GATGAAGAAA CAGAAAGGAA GGAAGGACGA GGACTGGGCA 3000 

AGAGGCTGTG GGAATGCAGG GCAAAGCCTG GCCAAGGAGC CGGCCTCTGC TATTGCAGGG 3060 

AAAGGGAAGG GAAATGTGGC CCAAAGTGTG CCCAGAACCC AAGCGCCACA AGATACAAAG 3120 

CCACACTACC ACAAAAAGGG GCTACAGGGT CCAATCACTA CCAGGAAAAG GGGCTACGGG 3180 

GTCCAATCAC TACCAGGAAA AGGGGCTACG GGGTCCAATC ACTACCAGGA AAAGGGGCTA 3240 

CGGGGTCCAA TCACTACCAG GAAAAGGGGC TACGGGGTCC AATCACTACC AGGAAAAGGG 3300 

GCTACGGGCT CCAATCACTA CCAGGAAAAG GGGCTACAGG GTCCAATCAC TACCAGGAAA 3360 

AGGGGCTACG GGCTCCAATC ACTACCAGGA AAAGGGGCTA CAGGGTCCAA TCACTACCAC 3420 

AGAAAGGGGC TACGGGCTCC AATCACTACC AGGAAAAGGG GCTACGGGGT CCAATCACTA 3480 

CCAGGAAAAG GGGCTACAGG GTCCAATCAC TACCAGGAAA AGGGGCTACG GGGTCCAATC 3540 

ACTACCAGGA AAAGGGGCTA CGGGCTCCAA TCACTACCAG GAAAAGGGGC TACGGGGTCC 3600 

AATCACTACC AGGAAAAGGG GCTACAGGGT CCAATCACTA CCAGGAAAAG GGGCTACAGG 3660 

GTCCAATCAC TACCACAGAA AGGGGCTACG GGCTCCAATC ACTACCAGGA AAAGGGGCTA 3720 

CGGGGTCCAA TCACTACCAG GAAAAGGGGC TACGGGCTCC AATCACTACC AGGAAAAGAG 3780 

GCTATGGGGT CCAATCACTA CCAGGAAAAG GGGCTACGGG GTCCAATCAC TACCAGGAAA 3840 

AGGGGCTATG GGGTCCAATC ACTACCACAG AAAGGGGCTA CGGGGTCCAA CGTCATCCGT 3900 

AGAGAGCACG GAGACCCCCT CATCGAGGAG CTCAATCCCG GCGACGCCTT GGAGCCTGAG 3960 

GGCCGGGGCA CAGGGGGTGT GGTGACCGAC TTCGACGGAG ACGGGATGCT GGACCTCATC 4020 

TTGTCCCATG GAGAGTCCAT GGCTCAGCCG CTGTCCGTCT TCCGGGGCAA TCAGGGCTTC 4080 

AACAACAACT GGCTGCGAGT GGTGCCACGC ACCCGGTTTG GGGCCTTTGC CAGGGGAGCT 4140 

AAGGTCGTGC TCTACACCAA GAAGAGTGGG GCCCACCTGA GGATCATCGA CGGGGGCTCA 4200 

GGCTACCTGT GTGAGATGGA GCCCGTGGCA CACTTTGGCC TGGGGAAGGA TGAAGCCAGC 4260 

AGTGTGGAGG TGACGTGGCC AGATGGCAAG ATGGTGAGCC GGAACGTGGC CAGCGGGGAG 4320 
ATGAACTCAG TGCTGGAGAT CCTCTACCCC CGGGATGAGG ACACACTTCA GGACCCAGCC 4380 
CCACTGGAGT GTGGCCAAGG ATTCTCCCAG CAGGAAAATG GCCATTGCAT GGACACCAAT 4440 
GAATGCATCC AGTTCCCATT CGTGTGCCCT CGAGACAAGC CCGTATGTGT CAACACCTAT 4500 
GGAAGCTACA GGTGCCGGAC CAACAAGAAG TGCAGTCGGG GCTACGAGCC CAACGAGGAT 4560 
GGCACAGCCT GCGTGGGTAC TGAGCTAGGC TCTAGGCATA CAATGACGTG GAAACCAAGG 4620 
CCCAAAAAGG AGCTGCAACT TTCCCAAGGC ATCTGCACCC CCGTCTGGTC CTTTTTCCTG 4680 
CCGGGTTGCC GGCTGCTCCT CAAAAGAGCT CAGCTCCAGG CTGCTCCCAG CACCCTTCTC 4740 
CAGAAAGCTC CAGGTATTCC AGAAGCCCAA GTGTATGAAC AAGATCAGGA ATAA 

Seq ID NO: 473 Protein sequence 
Protein Accession #: FGENESH predicted 

1 11 21 31 41 51 

I I I I I I 

MACPGGLPAR CSGWMGLGGP SGSSPASPPH SSSRYNGPNL VLKYDRAQKR LVNIAVDERS 60 
SPYYALRDRQ GNAIGVTACD IDGDGREEIY FLNTNNAFSG HSSSAQVPSG LHRNRPVLKP 120 
PPTTPAGLLG LPPLSGRDFS SSLGQASPDS RQGERVPVPC CRGGLRPTHE PEPFLLRPKS 180 
GVATYTDKLF KFRNNRWEDI LSDEVNVARG VASLFAGRSV ACVDRKGSGR YSIYIANYAY 240 
GNVGPDALIE MDPEASDLSR GILALRDVAA EAGVSKYTEG FSHTASPSIG EISGRTEERE 300 
GGDPEEADEE HSGDGSTSQL CRLGWKDGQF KEEAAALVEE QREAGAAGVP RGRVRTALQT 360 
SKSHLADKNL FGPPCYYSVC APSPAHPFPA RQAPQHYPVA PLVTQLMTHG RLAGKLARSV 420 
PHPRAPGMDP KCKGRHAEPG LMAEALGAWP ALSTTVVPGG LRSWEESRQK GQAMSRCALR 480 
ELGGPWSQAT QHLPARELYD LGEPPILQRT DGDPGRRRDS PKVTQECHLV ATMPALGGLE 540 
GPGRVAKREI GRETGAVGRP LSHPLVPNFP SCLRPLEAGT VPGAALPGNP GNWVLDMAKA 600 
LAWNQMEKEE GKIHGDHEPR FRLRKAREAE FPPGSSEEPL LQFPSGLRGS PVLQVGLGLA 660 
SATHCGSMSF LGGRGVSVGP ILSSSASDIF CDNENGPNFL FHNRGDGTFV DAAASAERRL 720 
AFIVHLKYHL CRDFPHSLCH LAETGPSSSC CPWHARLLQA PHCHHGLSMS FTRTGSRFYS 780 
FLTQGLASSA HRRTLSLQGS QGAPPCLLAR APCVLGSLIP TAYYIVLWSA IPESLMTHSY 840 
LSSERVNVGV DDPHQHGRGV ALADFNRDGK VDIVYGNWNG PHRLYLQMST HGKVRFRDIA 900 
SPKFSMPSPV RTVITADFDN DQELEIFFNN IAYRSSSANR LFRCSILARG SSSLTAGGRN 960 
GQGEGLRIRR GGFPGPGGQA KVNTGPLMKK QKGRKDEDWA RGCGNAGQSL AKEPASAIAG 1020 
KGKGNVAQSV PRTQAPQDTK PHYHKKGLQG PITTRKRGYG VQSLPGKGAT GSNHYQEKGL 1080 
RGPITTRKRG YGVQSLPGKG ATGSNHYQEK GLQGPITTRK RGYGLQSLPG KGATGSNHYH 1140 
RKGLRAPITT RKRGYGVQSL PGKGATGSNH YQEKGLRGPI TTRKRGYGLQ SLPGKGATGS 1200 
NHYQEKGLQG PITTRKRGYR VQSLPQKGAT GSNHYQEKGL RGPITTRKRG YGLQSLPGKE 1260 
AMGSNHYQEK GLRAPITTRK RGYGVQSLPQ KGATGSNVIR REHGDPLIEE LNPGDALEPE 1320 
GRGTGGVVTD FDGDGMLDLI LSHGESMAQP LSVFRGNQGF NNNWLRVVPR TRFGAFARGA 1380 
KVVLYTKKSG AHLRIIDGGS GYLCEMEPVA HFGLGKDEAS SVEVTWPDGK MVSRNVASGE 1440 
MNSVLEILYP RDEDTLQDPA PLECGQGFSQ QENGHCMDTN ECIQFPFVCP RDKPVCVNTY 1500 
GSYRCRTNKK CSRGYEPNED GTACVGTELG SRHTMTWKPR PKKELQLSQG ICTPVWSFFL 1560 
PGCRLLLKRA QLQAAPSTLL QKAPGIPEAQ VYEQDQE 



Seq ID NO: 474 DNA sequence 

Nucleic Acid Accession #: NMJD03661. 

Coding sequence: 1..1152 



ATGAGTGCAC 
CAAAACGTTC 
GCTGCTGGCA 
AAGGAAAAAG 
GGATTCGTGG 
GACAACCTTG 
TACAGAAACT 
AGAAGGCTCC 
AATGTGGTGT 
CTGGCACCCT 
ATCACAGCCG 
ACACAAGCCC 
GAGTTTTTGG 
ACACGAGGCA 
GTACCGCATG 
GAACAGGTGG 
ACGGATGTGG 
TCAAAGCACT 
CAGGAGCTGG 
CAAGAACTGT 



11 
I 

TTTTCCTTGG 
CAAGTGGGAC 
CCATGGACCC 
TGAGCACACA 
CTGCTGCTGA 
CAAGACAAAT 
GGTTTCTGAA 
GTGCCCTTGC 
CTGGCTCTCT 
TCACAGAGGG 
CTTTGACCGG 
AAGCCCACGA 
GTGAGAACAT 
TTGGGAAGGA 
CCTCAGCCTC 
AGAGGGTTAA 
CCCCTGTAAG 
TACATGAGGG 
AGGAGAAGCT 
GA 



21 
I 

TGTGGGAGTG 
AGATACTGGA 
AGAGAGCAGT 
GAATCTGCTA 
ACTGCCCAGG 
GATCATGAAA 
AGAGTTTCCT 
AGATGGGGTT 
CAGCATTTCC 
AGGCAGCCTT 
GATTACCAGC 
CCTGGTCATC 
ATCCAACTTT 
CATCCGTGCC 
ACGCCCCCGG 
TGAACCCAGC 
CTTCTTTCTT 
GGCAAAGTCA 
AAACATTCTC 



31 
I 

AGGGCAGAGG 
GATCCTCAAA 
ATCTTTATTG 
CTCCTGCTGA 
AATGAGGCAG 
GACAAAAACT 
CGGTTGAAAA 
CAGAAGGTCC 
TCTGGCATCC 
GTACTCTTGG 
AGTACCATGG 
AAAAGCCTTG 
CTTTCCTTAG 
CTCAGACGAG 
GTCACTGAGC 
ATCCTGGAAA 
GTGCTGGATG 
GAGACAGCTG 
AACAATAATT 



41 
I 

AAGCTGGAGC 
GTAAGCCCCT 
AGGATGCCAT 
CTGATAATGA 
ATGAGCTCCG 
GGCACGATAA 
GTGAGCTTGA 
ACAAAGGCAC 
TGACCCTCGT 
AACCTGGGAT 
ACTACGGAAA 
ACAAATTGAA 
CTGGCAATAC 
CCAGAGCCAA 
CAATCTCAGC 
TGAGCAGAGG 
TAGTCTACCT 
AGGAGCTGAA 
ATAAGATTCT 



51 
I 

GAGGGTGCAA 
CGGTGACTGG 
TAAGTATTTC 
GGCCTGGAAC 
TAAAGCTCTG 
AGGCCAGCAG 
GGATAACATA 
CACCATCGCC 
CGGCATGGGT 
GGAGTTGGGA 
GAAGTGGTGG 
GGAGGTGAGG 
TTACCAACTC 
TCTTCAGTCA 
TGAAAGCGGT 
AGTCAAGCTC 
CGTGTACGAA 
GAAGGTGGCT 
GCAGGCGGAC 



Seq ID NO: 475 Protein sequence 
Protein Accession #: NP 003652.1 



1 
I 

MSALFLGVGV 
KEKVSTQNLL 
YRNWFLKEFP 
LAPFTEGGSL 
EFLGENISNF 
EQVERVNEPS 
QELEEKLNIL 



11 
I 

RAEEAGARVQ 
LLLTDNEAWN 
RLKSELEDNI 
VLIjEPGMELG 
LSLAGNTYQL 
ILEMSRGVKL 
NNNYKILQAD 



21 
I 

QNVPSGTDTG 
GFVAAAELiPR 
RRLRALADGV 
ITAALTGITS 
TRGIGKDIRA 
TDVAPVSFFL 
QEL 



31 
I 

DPQSKPLGDW 
NEADELRKAL 
QKVHKGTTIA 
STMDYGKKWW 
LRRARANLQS 
VLDWYLVYE 



41 
I 

AAGTMDPESS 
DNLARQMIMK 
NWSGSLSIS 
TQAQAHDLVI 
VPHASASRPR 
SKHLHEGAKS 



51 
I 

IFIEDAIKYF 
DKNWHDKGQQ 
SGILTLVGMG 
KSLDKLKEVR 
VTEPISAESG 
ETAEELKKVA 



4380 
4440 
4500 
4560 
4620 
4680 
4740 
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60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1S60 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 



60 
120 
180 
240 
300 
360 



Seq ID NO: 476 DNA sequence

Nucleic Acid Accession #: NM_014452.1

Coding sequence: 1..1968

1 11 21 31 41 51

I I I I I I

ATGGGGACCT CTCCGAGCAG CAGCACCGCC CTCGCCTCCT GCAGCCGCAT CGCCCGCCGA 60 

GCCACAGCCA CGATGATCGC GGGCTCCCTT CTCCTGCTTG GATTCCTTAG CACCACCACA 120 

GCTCAGCCAG AACAGAAGGC CTCGAATCTC ATTGGCACAT ACCGCCATGT TGACCGTGCC 180 

ACCGGCCAGG TGCTAACCTG TGACAAGTGT CCAGCAGGAA CCTATGTCTC TGAGCATTGT 240

ACCAACACAA GCCTGCGCGT CTGCAGCAGT TGCCCTGTGG GGACCTTTAC CAGGCATGAG 300 

AATGGCATAG AGAAATGCCA TGACTGTAGT CAGCCATGCC CATGGCCAAT GATTGAGAAA 360

TTACCTTGTG CTGCCTTGAC TGACCGAGAA TGCACTTGCC CACCTGGCAT GTTCCAGTCT 420 

AACGCTACCT GTGCCCCCCA TACGGTGTGT CCTGTGGGTT GGGGTGTGCG GAAGAAAGGG 480 

ACAGAGACTG AGGATGTGCG GTGTAAGCAG TGTGCTCGGG GTACCTTCTC AGATGTGCCT 540

TCTAGTGTGA TGAAATGCAA AGCATACACA GACTGTCTGA GTCAGAACCT GGTGGTGATC 600 

AAGCCGGGGA CCAAGGAGAC AGACAACGTC TGTGGCACAC TCCCGTCCTT CTCCAGCTCC 660 

ACCTCACCTT CCCCTGGCAC AGCCATCTTT CCACGCCCTG AGCACATGGA AACCCATGAA 720 

GTCCCTTCCT CCACTTATGT TCCCAAAGGC ATGAACTCAA CAGAATCCAA CTCTTCTGCC 780 

TCTGTTAGAC CAAAGGTACT GAGTAGCATC CAGGAAGGGA CAGTCCCTGA CAACACAAGC 840 

TCAGCAAGGG GGAAGGAAGA CGTGAACAAG ACCCTCCCAA ACCTTCAGGT AGTCAACCAC 900 

CAGCAAGGCC CCCACCACAG ACACATCCTG AAGCTGCTGC CGTCCATGGA GGCCACTGGG 960 

GGCGAGAAGT CCAGCACGCC CATCAAGGGC CCCAAGAGGG GACATCCTAG ACAGAACCTA 1020 

CACAAGCATT TTGACATCAA TGAGCATTTG CCCTGGATGA TTGTGCTTTT CCTGCTGCTG 1080 

GTGCTTGTGG TGATTGTGGT GTGCAGTATC CGGAAAAGCT CGAGGACTCT GAAAAAGGGG 1140 

CCCCGGCAGG ATCCCAGTGC CATTGTGGAA AAGGCAGGGC TGAAGAAATC CATGACTCCA 1200 

ACCCAGAACC GGGAGAAATG GATCTACTAC TGCAATGGCC ATGGTATCGA TATCCTGAAG 1260 

CTTGTAGCAG CCCAAGTGGG AAGCCAGTGG AAAGATATCT ATCAGTTTCT TTGCAATGCC 1320 

AGTGAGAGGG AGGTTGCTGC TTTCTCCAAT GGGTACACAG CCGACCACGA GCGGGCCTAC 1380 

GCAGCTCTGC AGCACTGGAC CATCCGGGGC CCCGAGGCCA GCCTCGCCCA GCTAATTAGC 1440 

GCCCTGCGCC AGCACCGGAG AAACGATGTT GTGGAGAAGA TTCGTGGGCT GATGGAAGAC 1500 

ACCACCCAGC TGGAAACTGA CAAACTAGCT CTCCCGATGA GCCCCAGCCC GCTTAGCCCG 1560 

AGCCCCATCC CCAGCCCCAA CGCGAAACTT GAGAATTCCG CTGTCCTGAC GGTGGAGCCT 1620 

TCCCCACAGG ACAAGAACAA GGGCTTCTTC GTGGATGAGT CGGAGCCCCT TCTCCGCTGT 1680 

GACTCTACAT CCAGCGGCTC CTCCGCGCTG AGCAGGAACG GTTCCTTTAT TACCAAAGAA 1740 

AAGAAGGACA CAGTGTTGCG GCAGGTACGC CTGGACCCCT GTGACTTGCA GCCTATCTTT 1800 

GATGACATGC TCCACTTTCT AAATCCTGAG GAGCTGCGGG TGATTGAAGA GATTCCCCAG 1860 

GCTGAGGACA AACTAGACCG GCTATTCGAA ATTATTGGAG TCAAGAGCCA GGAAGCCAGC 1920 
CAGACCCTCC TGGACTCTGT TTATAGCCAT CTTCCTGACC TGCTGTAG 

Seq ID NO: 477 Protein sequence 
Protein Accession #: NP_055267.1

1 11 21 31 41 51

I I I I I I

MGTSPSSSTA LASCSRIARR ATATMIAGSL LLLGFLSTTT AQPEQKASNL IGTYRHVDRA 60 

TGQVLTCDKC PAGTYVSEHC TNTSLRVCSS CPVGTFTRHE NGIEKCHDCS QPCPWPMIEK 120 

LPCAALTDRE CTCPPGMFQS NATCAPHTVC PVGWGVRKKG TETEDVRCKQ CARGTFSDVP 180 

SSVMKCKAYT DCLSQNLVVI KPGTKETDNV CGTLPSFSSS TSPSPGTAIF PRPEHMETHE 240

VPSSTYVPKG MNSTESNSSA SVRPKVLSSI QEGTVPDNTS SARGKEDVNK TLPNLQVVNH 300

QQGPHHRHIL KLLPSMEATG GEKSSTPIKG PKRGHPRQNL HKHFDINEHL PWMIVLFLLL 360

VLVVIVVCSI RKSSRTLKKG PRQDPSAIVE KAGLKKSMTP TQNREKWIYY CNGHGIDILK 420

LVAAQVGSQW KDIYQFLCNA SEREVAAFSN GYTADHERAY AALQHWTIRG PEASLAQLIS 480

ALRQHRRNDV VEKIRGLMED TTQLETDKLA LPMSPSPLSP SPIPSPNAKL ENSALLTVEP 540

SPQDKNKGFF VDESEPLLRC DSTSSGSSAL SRNGSFITKE KKDTVLRQVR LDPCDLQPIF 600 
DDMLHFLNPE ELRVIEEIPQ AEDKLDRLFE IIGVKSQEAS QTLLDSVYSH LPDLL 

Seq ID NO: 478 DNA sequence 

Nucleic Acid Accession #: XM_044533

Coding sequence: 238..2751

1 11 21 31 41 51 

I I I I I I 

GCTCTGCCCA AGCCGAGGCT GCGGGGCCGG CGCCGGCGGG AGGACTGCGG TGCCCCGCGG 60 

AGGGGCTGAG TTTGCCAGGG CCCACTTGAC CCTGTTTCCC ACCTCCCGCC CCCCAGGTCC 120 

GGAGGCGGGG GCCCCCGGGG CGACTCGGGG GCGGACCGCG GGGCGGAGCT GCCGCCCGTG 180 

AGTCCGGCCG AGCCACCTGA GCCCGAGCCG CGGGACACCG TCGCTCCTGC TCTCCGAATG 240 

CTGCGCACCG CGATGGGCCT GAGGAGCTGG CTCGCCGCCC CATGGGGCGC GCTGCCGCCT 300 

CGGCCACCGC TGCTGCTGCT CCTGCTGCTG CTGCTCCTGC TGCAGCCGCC GCCTCCGACC 360 

TGGGCGCTCA GCCCCCGGAT CAGCCTGCCT CTGGGCTCTG AAGAGCGGCC ATTCCTCAGA 420 

TTCGAAGCTG AACACATCTC CAACTACACA GCCCTTCTGC TGAGCAGGGA TGGCAGGACC 480 

CTGTACGTGG GTGCTCGAGA GGCCCTCTTT GCACTCAGTA GCAACCTCAG CTTCCTGCCA 540 

GGCGGGGAGT ACCAGGAGCT GCTTTGGGGT GCAGACGCAG AGAAGAAACA GCAGTGCAGC 600 

TTCAAGGGCA AGGACCCACA GCGCGACTGT CAAAACTACA TCAAGATCCT CCTGCCGCTC 660 

AGCGGCAGTC ACCTGTTCAC CTGTGGCACA GCAGCCTTCA GCCCCATGTG TACCTACATC 720 

AACATGGAGA ACTTCACCCT GGCAAGGGAC GAGAAGGGGA ATGTCCTCCT GGAAGATGGC 780 

AAGGGCCGTT GTCCCTTCGA CCCGAATTTC AAGTCCACTG CCCTGGTGGT TGATGGCGAG 840 

CTCTACACTG GAACAGTCAG CAGCTTCCAA GGGAATGACC CGGCCATCTC GCGGAGCCAA 900 

AGCCTTCGCC CCACCAAGAC CGAGAGCTCC CTCAACTGGC TGCAAGACCC AGCTTTTGTG 960 

GCCTCAGCCT ACATTCCTGA GAGCCTGGGC AGCTTGCAAG GCGATGATGA CAAGATCTAC 1020 

TTTTTCTTCA GCGAGACTGG CCAGGAATTT GAGTTCTTTG AGAACACCAT TGTGTCCCGC 1080 

ATTGCCCGCA TCTGCAAGGG CGATGAGGGT GGAGAGCGGG TGCTACAGCA GCGCTGGACC 1140 

TCCTTCCTCA AGGCCCAGCT GCTGTGCTCA CGGCCCGACG ATGGCTTCCC CTTCAACGTG 1200 

CTGCAGGATG TCTTCACGCT GAGCCCCAGC CCCCAGGACT GGCGTGACAC CCTTTTCTAT 1260 

GGGGTCTTCA CTTCCCAGTG GCACAGGGGA ACTACAGAAG GCTCTGCCGT CTGTGTCTTC 1320 

ACAATGAAGG ATGTGCAGAG AGTCTTCAGC GGCCTCTACA AGGAGGTGAA CCGTGAGACA 1380 

CAGCAGTGGT ACACCGTGAC CCACCCGGTG CCCACACCCC GGCCTGGAGC GTGCATCACC 1440 

AACAGTGCCC GGGAAAGGAA GATCAACTCA TCCCTGCAGC TCCCAGACCG CGTGCTGAAC 1500 

TTCCTCAAGG ACCACTTCCT GATGGACGGG CAGGTCCGAA GCCGCATGCT GCTGCTGCAG 1560 

CCCCAGGCTC GCTACCAGCG CGTGGCTGTA CACCGCGTCC CTGGCCTGCA CCACACCTAC 1620 

GATGTCCTCT TCCTGGGCAC TGGTGACGGC CGGCTCCACA AGGCAGTGAG CGTGGGCCCC 1680 

CGGGTGCACA TCATTGAGGA GCTGCAGATC TTCTCATCGG GACAGCCCGT GCAGAATCTG 1740

CTCCTGGACA CCCACAGGGG GCTGCTGTAT GCGGCCTCAC ACTCGGGCGT AGTCCAGGTG 1800

CCCATGGCCA ACTGCAGCCT GTACAGGAGC TGTGGGGACT GCCTCCTCGC CCGGGACCCC 1860

TACTGTGCTT GGAGCGGCTC CAGCTGCAAG CACGTCAGCC TCTACCAGCC TCAGCTGGCC 1920 

ACCAGGCCGT GGATCCAGGA CATCGAGGGA GCCAGCGCCA AGGACCTTTG CAGCGCGTCT 1980 

TCGGTTGTGT CCCCGTCTTT TGTACCAACA GGGGAGAAGC CATGTGAGCA AGTCCAGTTC 2040 

CAGCCCAACA CAGTGAACAC TTTGGCCTGC CCGCTCCTCT CCAACCTGGC GACCCGACTC 2100 

TGGCTACGCA ACGGGGCCCC CGTCAATGCC TCGGCCTCCT GCCACGTGCT ACCCACTGGG 2160 

GACCTGCTGC TGGTGGGCAC CCAACAGCTG GGGGAGTTCC AGTGCTGGTC ACTAGAGGAG 2220 

GGCTTCCAGC AGCTGGTAGC CAGCTACTGC CCAGAGGTGG TGGAGGACGG GGTGGCAGAC 2280 

CAAACAGATG AGGGTGGCAG TGTACCCGTC ATTATCAGCA CATCGCGTGT GAGTGCACCA 2340 

GCTGGTGGCA AGGCCAGCTG GGGTGCAGAC AGGTCCTACT GGAAGGAGTT CCTGGTGATG 2400 

TGCACGCTCT TTGTGCTGGC CGTGCTGCTC CCAGTTTTAT TCTTGCTCTA CCGGCACCGG 2460 

AACAGCATGA AAGTCTTCCT GAAGCAGGGG GAATGTGCCA GCGTGCACCC CAAGACCTGC 2520 

CCTGTGGTGC TGCCCCCTGA GACCCGCCCA CTCAACGGCC TAGGGCCCCC TAGCACCCCG 2580 

CTCGATCACC GAGGGTACCA GTCCCTGTCA GACAGCCCCC CGGGGTCCCG AGTCTTCACT 2640 

GAGTCAGAGA AGAGGCCACT CAGCATCCAA GACAGCTTCG TGGAGGTATC CCCAGTGTGC 2700 

CCCCGGCCCC GGGTCCGCCT TGGCTCGGAG ATCCGTGACT CTGTGGTGTG AGAGCTGACT 2760 

TCCAGAGGAC GCTGCCCTGG CTTCAGGGGC TGTGAATGCT CGGAGAGGGT CAACTGGACC 2820 

TCCCCTCCGC TCTGCTCTTC GTGGAACACG ACCGTGGTGC CCGGCCCTTG GGAGCCTTGG 2880 

GGCCAGCTGG CCTGCTGCTC TCCAGTCAAG TAGCGAAGCT CCTACCACCC AGACACCCAA 2940 

ACAGCCGTGG CCCCAGAGGT CCTGGCCAAA TATGGGGGCC TGCCTAGGTT GGTGGAACAG 3000

TGCTCCTTAT GTAAACTGAG CCCTTTGTTT AAAAAACAAT TCCAAATGTG AAACTAGAAT 3060 

GAGAGGGAAG AGATAGCATG GCATGCAGCA CACACGGCTG CTCCAGTTCA TGGCCTCCCA 3120 

GGGGTGCTGG GGATGCATCC AAAGTGGTTG TCTGAGACAG AGTTGGAAAC CCTCACCAAC 3180 

TGGCCTCTTC ACCTTCCACA TTATCCCGCT GCCACCGGCT GCCCTGTCTC ACTGCAGATT 3240 

CAGGACCAGC TTGGGCTGCG TGCGTTCTGC CTTGCCAGTC AGCCGAGGAT GTAGTTGTTG 3300 

CTGCCGTCGT CCCACCACCT CAGGGACCAG AGGGCTAGGT TGGCACTGCG GCCCTCACCA 3360 

GGTCCTGGGC TCGGACCCAA CTCCTGGACC TTTCCAGCCT GTATCAGGCT GTGGCCACAC 3420 

GAGAGGACAG CGCGAGCTCA GGAGAGATTT CGTGACAATG TACGCCTTTC CCTCAGAATT 3480 

CAGGGAAGAG ACTGTCGCCT GCCTTCCTCC GTTGTTGCGT GAGAACCCGT GTGCCCCTTC 3540 

CCACCATATC CACCCTCGCT CCATCTTTGA ACTCAAACAC GAGGAACTAA CTGCACCCTG 3600 

GTCCTCTCCC CAGTCCCCAG TTCACCCTCC ATCCCTCACC TTCCTCCACT CTAAGGGATA 3660 

TCAACACTGC CCAGCACAGG GGCCCTGAAT TTATGTGGTT TTTATACATT TTTTAATAAG 3720 
ATGCACTTTA TGTCATTTTT TAATAAAGTC TGAAGAATTA CTGTTT 

Seq ID NO: 479 Protein sequence 
Protein Accession #: XP_044533.3 

1 11 21 31 41 51 

I I I I I I 

MLRTAMGLRS WLAAPWGALP PRPPLLLLLL LLLLLQPPPP TWALSPRISL PLGSEERPFL 60

RFEAEHISNY TALLLSRDGR TLYVGAREAL FALSSNLSFL PGGEYQELLW GADAEKKQQC 120

SFKGKDPQRD CQNYIKILLP LSGSHLFTCG TAAFSPMCTY INMENFTLAR DEKGNVLLED 180

GKGRCPFDPN FKSTALVVDG ELYTGTVSSF QGNDPAISRS QSLRPTKTES SLNWLQDPAF 240

VASAYIPESL GSLQGDDDKI YFFFSETGQE FEFFENTIVS RIARICKGDE GGERVLQQRW 300 

TSFLKAQLLC SRPDDGFPFN VLQDVFTLSP SPQDWRDTLF YGVFTSQWHR GTTEGSAVCV 360 

FTMKDVQRVF SGLYKEVNRE TQQWYTVTHP VPTPRPGACI TNSARERKIN SSLQLPDRVL 420 

NFLKDHFLMD GQVRSRMLLL QPQARYQRVA VHRVPGLHHT YDVLFLGTGD GRLHKAVSVG 480 

PRVHIIEELQ IFSSGQPVQN LLLDTHRGLL YAASHSGVVQ VPMANCSLYR SCGDCLLARD 540

PYCAWSGSSC KHVSLYQPQL ATRPWIQDIE GASAKDLCSA SSVVSPSFVP TGEKPCEQVQ 600

FQPNTVNTLA CPLLSNLATR LWLRNGAPVN ASASCHVLPT GDLLLVGTQQ LGEFQCWSLE 660

EGFQQLVASY CPEVVEPGVA DQTDEGGSVP VIISTSRVSA PAGGKASWGA DRSYWKEFLV 720

MCTLFVLAVL LPVLFLLYRH RNSMKVFLKQ GECASVHPKT CPVVLPPETR PLNGLGPPST 780
PLDHRGYQSL SDSPPGSRVF TESEKRPLSI QDSFVEVSPV CPRPRVRLGS EIRDSVV

Seq ID NO: 480 DNA sequence

Nucleic Acid Accession #: NM_004217.1

Coding sequence: 58..1092

1 11 21 31 41 51 

I I I I I I 

GGCCGGGAGA GTAGCAGTGC CTTGGACCCC AGCTCTCCTC CCCCTTTCTC TCTAAGGATG 60 

GCCCAGAAGG AGAACTCCTA CCCCTGGCCC TACGGCCGAC AGACGGCTCC ATCTGGCCTG 120 

AGCACCCTGC CCCAGCGAGT CCTCCGGAAA GAGCCTGTCA CCCCATCTGC ACTTGTCCTC 180 

ATGAGCCGCT CCAATGTCCA GCCCACAGCT GCCCCTGGCC AGAAGGTGAT GGAGAATAGC 240 

AGTGGGACAC CCGACATCTT AACGCGGCAC TTCACAATTG ATGACTTTGA GATTGGGCGT 300 

CCTCTGGGCA AAGGCAAGTT TGGAAACGTG TACTTGGCTC GGGAGAAGAA AAGCCATTTC 360

ATCGTGGCGC TCAAGGTCCT CTTCAAGTCC CAGATAGAGA AGGAGGGCGT GGAGCATCAG 420 

CTGCGCAGAG AGATCGAAAT CCAGGCCCAC CTGCACCATC CCAACATCCT GCGTCTCTAC 480 

AACTATTTTT ATGACCGGAG GAGGATCTAC TTGATTCTAG AGTATGCCCC CCGCGGGGAG 540 

CTCTACAAGG AGCTGCAGAA GAGCTGCACA TTTGACGAGC AGCGAACAGC CACGATCATG 600 

GAGGAGTTGG CAGATGCTCT AATGTACTGC CATGGGAAGA AGGTGATTCA CAGAGACATA 660 

AAGCCAGAAA ATCTGCTCTT AGGGCTCAAG GGAGAGCTGA AGATTGCTGA CTTCGGCTGG 720 

TCTGTGCATG CGCCCTCCCT GAGGAGGAAG ACAATGTGTG GCACCCTGGA CTACCTGCCC 780 

CCAGAGATGA TTGAGGGGCG CATGCACAAT GAGAAGGTGG ATCTGTGGTG CATTGGAGTG 840 

CTTTGCTATG AGCTGCTGGT GGGGAACCCA CCCTTTGAGA GTGCATCACA CAACGAGACC 900 

TATCGCCGCA TCGTCAAGGT GGACCTAAAG TTCCCCGCTT CTGTGCCCAC GGGAGCCCAG 960 

GACCTCATCT CCAAACTGCT CAGGCATAAC CCCTCGGAAC GGCTGCCCCT GGCCCAGGTC 1020 

TCAGCCCACC CTTGGGTCCG GGCCAACTCT CGGAGGGTGC TGCCTCCCTC TGCCCTTCAA 1080 

TCTGTCGCCT GATGGTCCCT GTCATTCACT CGGGTGCGTG TGTTTGTATG TCTGTGTATG 1140 

TATAGGGGAA AGAAGGGATC CCTAACTGTT CCCTTATCTG TTTTCTACCT CCTCCTTTGT 1200 
TTAATAAAGG CTGAAGCTTT TTGT 

Seq ID NO: 481 Protein sequence 
Protein Accession #: NP_004208 

1 11 21 31 41 51

I I I I I I

MAQKENSYPW PYGRQTAPSG LSTLPQRVLR KEPVTPSALV LMSRSNVQPT AAPGQKVMEN 60

SSGTPDILTR HFTIDDFEIG RPLGKGKFGN VYLAREKKSH FIVALKVLFK SQIEKEGVEH 120

QLRREIEIQA HLHHPNILRL YNYFYDRRRI YLILEYAPRG ELYKELQKSC TFDEQRTATI 180

MEELADALMY CHGKKVIHRD IKPENLLLGL KGELKIADFG WSVHAPSLRR KTMCGTLDYL 240

PPEMIEGRMH NEKVDLWCIG VLCYELLVGN PPFESASHNE TYRRIVKVDL KFPASVPTGA 300
QDLISKLLRH NPSERLPLAQ VSAHPWVRAN SRRVLPPSAL QSVA

Seq ID NO: 482 DNA sequence
Nucleic Acid Accession #: AK055663
Coding sequence: 38..1423

1 11 21 31 41 51

I I I I I I

AGAACGGCTT CCGGCGGGAG CTGTGCAGCT CCTTATCATG GGGACAATTC ATCTCTTTCG 60

AAAACCACAA AGATCCTTTT TTGGCAAGTT GTTACGGGAA TTTAGACTTG TAGCAGCTGA 120

CCGAAGGTCC TGGAAGATAC TGCTCTTTGG TGTAATAAAC TTGATATGTA CTGGCTTCCT 180

GCTTATGTGG TGCAGTTCTA CTAATAGTAT AGCTTTAACT GCCTATACTT ACCTGACCAT 240

TTTTGATCTT TTTAGTTTAA TGACATGTTT AATAAGTTAC TGGGTAACAT TGAGGAAACC 300

TAGCCCTGTC TATTCATTTG GGTTTGAAAG ATTAGAAGTC CTGGCTGTAT TTGCCTCCAC 360

AGTCTTGGCA CAGTTGGGAG CTCTCTTTAT ATTAAAAGAA AGTGCAGAAC GCTTTTTGGA 420

ACAGCCCGAG ATACACACGG GAAGATTATT AGTTGGTACT TTTGTGGCTC TTTGTTTCAA 480

CCTGTTCACG ATGCTTTCTA TTCGGAATAA ACCTTTTGCT TATGTCTCAG AAGCTGCTAG 540

TACGAGCTGG CTTCAAGAGC ATGTTGCAGA TCTTAGTCGA AGCTTGTGTG GAATTATTCC 600

GGGACTTAGC AGTATCTTCC TTCCCCGAAT GAATCCATTT GTTTTGATTG ATCTTGCTGG 660

AGCATTTGCT CTTTGTATTA CATATATGCT CATTGAAATT AATAATTATT TTGCCGTAGA 720

CACTGCCTCT GCTATAGCTA TTGCCTTGAT GACATTTGGC ACTATGTATC CCATGAGTGT 780

GTACAGTGGG AAAGTCTTAC TCCAGACAAC ACCACCCCAT GTTATTGGTC AGTTGGACAA 840

ACTCATCAGA GAGGTATCTA CCTTAGATGG AGTTTTAGAA GTCCGAAATG AACATTTTTG 900

GACCCTAGGT TTTGGCTCAT TGGCTGGATC AGTGCATGTA AGAATTCGAC GAGATGCCAA 960

TGAACAAATG GTTCTTGCTC ATGTGACCAA CAGGCTGTAC ACTCTAGTGT CTACTCTAAC 1020

TGTTCAAATT TTCAAGGATG ACTGGATTAG GCCTGCCTTA TTGTCTGGGC CTGTTGCAGC 1080

CAATGTCCTA AACTTTTCAG ATCATCACGT AATCCCAATG CCTCTTTTAA AGGGTACTGA 1140

TGATTTGAAC CCAGTTACAT CAACTCCAGC TAAACCTAGT AGTCCACCTC CAGAATTTTC 1200

ATTTAACACT CCTGGGAAAA ATGTGAACCC AGTTATTCTT CTAAACACAC AAACAAGGCC 1260

TTATGGTTTT GGTCTCAATC ATGGACACAC ACCTTACAGC AGCATGCTTA ATCAAGGACT 1320

TGGAGTTCCA GGAATTGGAG CAACTCAAGG ATTGAGGACT GGTTTTACAA ATATACCAAG 1380

TAGATATGGA ACTAATAATA GAATTGGACA ACCAAGACCA TGATAGACTC TAACTTATTT 1440

TTATAAGGAA TATTGACTCC TTGGCTTCCA ATTTATTTAG TAATCCAACT TTGCATTGAC 1500

TGTTTAATCA TTTACTCTAA ATGTTAGATA ATAGTAGTCT TGTTCACATT TCATGAAACC 1560

TATGAAACTA TATTTTTGTA AAATGTATTT GTGACAGTGA AATCCTCGTA AATGTTAAAG 1620

GCTTTAAATA GGCTTCCTTT AGAAAATGTG TTTCTTTAAA TTTGGATTTT GGTATCTTTG 1680

GTTTTGTAGT TGACTGCAGT GTGATGTGAC CTTACCTTTA TAAGAGCCAC TTGATGGAGT 1740

AGATCTGTCA CATTACTAAG ATACGATATT TCTTTTTCTT TCCGAGACGG AGTCTTGCTC 1800

TGCCACTGTG CCCGGCCAAT ACATTATTAT TAACTTAAGG CTGTACTTTA TTAAGGCTTC 1860

CTTAGTTTTT GTTTTGTTTT GTTTTTTGAG ATGGAGTCTC ACTCTGTCGC CCAGGCTGGA 1920

ATGCAGTGGC ATGATCTCAG CTCACTGCAA CCTCTGCCTC CTGAGTTCAA ATGATTCTCC 1980

TGCCTCAGCC TCCCGAGTAG CTGGGATTAC AGGCACCTGC CACCACGCCC AGCTAATTTT 2040

TGTATTTTTA GTAAAGACGG GGGATTTCAC CATGTTGGCC AGGCTGGTCT TGAACTCCTG 2100

ACCTCATGAT CCACCCACCT TAGCCTCCCA AAGTGCTGGG ATTAGGTGTG AGCCACCGCA 2160

CCTGGCCGAT ATTTTCTTTA ATGAAATTTA TAAATATGCT TCTTGAATAA TACACATTTT 2220

GGGAAAGGGA AAAATGTCTG TTCAAAAAGT AAAGGTCTCT TTTATAGCTT TTCCAAACTT 2280

AATTGCTAAA TTTTTCTTTG AGGTTCTCCT GAATTATGTC TTACAAACTA AAAGCAAAAA 2340

TTTTTAGCAG AAATTTTGGA ATACATTCTA TCTAGCACAA TTTGAATTTT TAATTATCAA 2400
GATTTTTGTT AAAGTTTCTC TCCTTTAAAA ATTTTAGTAC ATTTGTAAAT

Seq ID NO: 483 Protein sequence
Protein Accession #: BAB70980.1

1 11 21 31 41 51

I I I I I I

MGTIHLFRKP QRSFFGKLLR EFRLVAADRR SWKILLFGVI NLICTGFLLM WCSSTNSIAL 60

TAYTYLTIFD LFSLMTCLIS YWVTLRKPSP VYSFGFERLE VLAVFASTVL AQLGALFILK 120

ESAERFLEQP EIHTGRLLVG TFVALCFNLF TMLSIRNKPF AYVSEAASTS WLQEHVADLS 180

RSLCGIIPGL SSIFLPRMNP FVLIDLAGAF ALCITYMLIE INNYFAVDTA SAIAIALMTF 240

GTMYPMSVYS GKVLLQTTPP HVIGQLDKLI REVSTLDGVL EVRNEHFWTL GFGSLAGSVH 300

VRIRRDANEQ MVLAHVTNRL YTLVSTLTVQ IFKDDWIRPA LLSGPVAANV LNFSDHHVIP 360

MPLLKGTDDL NPVTSTPAKP SSPPPEFSFN TPGKNVNPVI LLNTQTRPYG FGLNHGHTPY 420
SSMLNQGLGV PGIGATQGLR TGFTNIPSRY GTNNRIGQPR P

Seq ID NO: 484 DNA sequence

Nucleic Acid Accession #: FGENESH predicted

Coding sequence: 1..900

1 11 21 31 41 51

I I I I I I

ATGCCGCCGC GGGAGCTGAG CGAGGCCGAG CCGCCCCCGC TCCGGGCCCC GACCCCTCCC 60

CCGCGGCGGC GTAGCGCGCC CCCAGAGCTG GGCATCAAGT GCGTGCTGGT GGGCGACGGC 120

GCCGTGGGCA AGAGCAGCCT CATCGTCAGC TACACCTGCA ATGGGTACCC CGCGCGCTAC 180

CGGCCCACTG CGCTGGACAC CTTCTCTGGT ACGTACGTTC AATCGCCCGT GCGGCCX5CGT 240

GGCTGCGGCG GGGCTGTGCA CCGGGGAGCT GGGGCGGGCG TCTCGGCGGG AGGGCGCAGA 300

GGACCCCGGG GAGGAGACTG GAGCAGGCCC CGAGGTGGCG CTGGTGCGGC CCAGGACGCT 360

CTTCCTAACT CAGGCTCTCC CCGCCCCGCC CCTGCAGTGC AAGTCCTGGT GGATGGAGCT 420

CCGGTGCGCA TTGAGCTCTG GGACACAGCG GGACAGGAGG ATTTTGACCG ACTTCGTTCC 480

CTTTGCTACC CGGATACCGA TGTCTTCCTG GCGTGCTTCA GCGTGGTGCA GCCCAGCTCC 540

TTTCAAAACA TCACAGAGAA ATGGCTGCCC GAGATCCGCA CGCACAACCC CCAGGCGCCT 600

GTGCTGCTGG TGGGCACCCA GGCCGACCTG AGGGACGATG TCAACGTACT AATTCAGCTG 660

GACCAGGGGG GCCGGGAGGG CCCCGTGCCC CAACCCCAGG CTCAGGGTCT GGCCGAGAAG 720

ATCCGAGCCT GCTGCTACCT TGAGTGCTCA GCCTTGACGC AGAAGAACTT GAAGGAAGTA 780 

TTTGACTCGG CTATTCTCAG TGCCATTGAG CACAAAGCCC GGCTGGAGAA GAAACTGAAT 840 
GCCAAAGGTG TGCGCACCCT CTCCCGCTGC CGCTGGAAGA AGTTCTTCTG CTTCGTTTGA 

Seq ID NO: 485 Protein sequence 
Protein Accession #: FGENESH predicted

1 11 21 31 41 51

I I I I I I

MPPRELSEAE PPPLRAPTPP PRRRSAPPEL GIKCVLVGDG AVGKSSLIVS YTCNGYPARY 60 

RPTALDTFSG TYVQSPVRPR GCGGAVHRGA GAGVSAGGRR GPRGGDWSRP RGGAGAAQDA 120 

LPNSGSPRPA PAVQVLVDGA PVRIELWDTA GQEDFDRLRS LCYPDTDVFL ACFSVVQPSS 180

FQNITEKWLP EIRTHNPQAP VLLVGTQADL RDDVNVLIQL DQGGREGPVP QPQAQGLAEK 240
IRACCYLECS ALTQKNLKEV FDSAILSAIE HKARLEKKLN AKGVRTLSRC RWKKFFCFV

Seq ID NO: 486 DNA sequence 

Nucleic Acid Accession #: XM_063832.2 

Coding sequence: 1..711 

1 11 21 31 41 51 

I I I I I I

ATGCCGCCGC GGGAGCTGAG CGAGGCCGAG CCGCCCCCGC TCCGGGCCCC GACCCCTCCC 60 

CCGCGGCGGC GTAGCGCGCC CCCAGAGCTG GGCATCAAGT GCGTGCTGGT GGGCGACGGC 120 

GCCGTGGGCA AGAGCAGCCT CATCGTCAGC TACACCTGCA ATGGGTACCC CGCGCGCTAC 180 

CGGCCCACTG CGCTGGACAC CTTCTCTGTG CAAGTCCTGG TGGATGGAGC TCCGGTGCGC 240 

ATTGAGCTCT GGGACACAGC GGGACAGGAG GATTTTGACC GACTTCGTTC CCTTTGCTAC 300 

CCGGATACCG ATGTCTTCCT GGCGTGCTTC AGCGTGGTGC AGCCCAGCTC CTTTCAAAAC 360 

ATCACAGAGA AATGGCTGCC CGAGATCCGC ACGCACAACC CCCAGGCGCC TGTGCTGCTG 420 

GTGGGCACCC AGGCCGACCT GAGGGACGAT GTCAACGTAC TAATTCAGCT GGACCAGGGG 480 

GGCCGGGAGG GCCCCGTGCC CCAACCCCAG GCTCAGGGTC TGGCCGAGAA GATCCGAGCC 540 

TGCTGCTACC TTGAGTGCTC AGCCTTGACG CAGAAGAACT TGAAGGAAGT ATTTGACTCG 600 

GCTATTCTCA GTGCCATTGA GCACAAAGCC CGGCTGGAGA AGAAACTGAA TGCCAAAGGT 660 
GTGCGCACCC TCTCCCGCTG CCGCTGGAAG AAGTTCTTCT GCTTCGTTTG A 

Seq ID NO: 487 Protein sequence 
Protein Accession #: XP_063832.1 

1 11 21 31 41 51 

I I I I I I 

MPPRELSEAE PPPLRAPTPP PRRRSAPPEL GIKCVLVGDG AVGKSSLIVS YTCNGYPARY 60 

RPTALDTFSV QVLVDGAPVR IELWDTAGQE DFDRLRSLCY PDTDVFLACF SVVQPSSFQN 120

ITEKWLPEIR THNPQAPVLL VGTQADLRDD VNVLIQLDQG GREGPVPQPQ AQGLAEKIRA 180
CCYLECSALT QKNLKEVFDS AILSAIEHKA RLEKKLNAKG VRTLSRCRWK KFFCFV 

Seq ID NO: 488 DNA sequence 

Nucleic Acid Accession #: NM_014398.1

Coding sequence: 64..1314

1 11 21 31 41 51 

I I I I I I 

GGCACCGATT CGGGGCCTGC CCGGACTTCG CCGCACGCTG CAGAACCTCG CCCAGCGCCC 60 

ACCATGCCCC GGCAGCTCAG CGCGGCGGCC GCGCTCTTCG CGTCCCTGGC CGTAATTTTG 120 

CACGATGGCA GTCAAATGAG AGCAAAAGCA TTTCCAGAAA CCAGAGATTA TTCTCAACCT 180 

ACTGCAGCAG CAACAGTACA GGACATAAAA AAACCTGTCC AGCAACCAGC TAAGCAAGCA 240

CCTCACCAAA CTTTAGCAGC AAGATTCATG GATGGTCATA TCACCTTTCA AACAGCGGCC 300 

ACAGTAAAAA TTCCAACAAC TACCCCAGCA ACTACAAAAA ACACTGCAAC CACCAGCCCA 360 

ATTACCTACA CCCTGGTCAC AACCCAGGCC ACACCCAACA ACTCACACAC AGCTCCTCCA 420 

GTTACTGAAG TTACAGTCGG CCCTAGCTTA GCCCCTTATT CACTGCCACC CACCATCACC 480 

CCACCAGCTC ATACAGCTGG AACCAGTTCA TCAACCGTCA GCCACACAAC TGGGAACACC 540 

ACTCAACCCA GTAACCAGAC CACCCTTCCA GCAACTTTAT CGATAGCACT GCACAAAAGC 600 

ACAACCGGTC AGAAGCCTGA TCAACCCACC CATGCCCCAG GAACAACGGC AGCTGCCCAC 660 

AATACCACCC GCACAGCTGC ACCTGCCTCC ACGGTTCCTG GGCCCACCCT TGCACCTCAG 720 

CCATCGTCAG TCAAGACTGG AATTTATCAG GTTCTAAACG GAAGCAGACT CTGTATAAAA 780 

GCAGAGATGG GGATACAGCT GATTGTTCAA GACAAGGAGT CGGTTTTTTC ACCTCGGAGA 840 

TACTTCAACA TCGACCCCAA CGCAACGCAA GCCTCTGGGA ACTGTGGCAC CCGAAAATCC 900 

AACCTTCTGT TGAATTTTCA GGGCGGATTT GTGAATCTCA CATTTACCAA GGATGAAGAA 960 

TCATATTATA TCAGTGAAGT GGGAGCCTAT TTGACCGTCT CAGATCCAGA GACAGTTTAC 1020 

CAAGGAATCA AACATGCGGT GGTGATGTTC CAGACAGCAG TCGGGCATTC CTTCAAGTGC 1080 

GTGAGTGAAC AGAGCCTCCA GTTGTCAGCC CACCTGCAGG TGAAAACAAC CGATGTCCAA 1140 

CTTCAAGCCT TTGATTTTGA AGATGACCAC TTTGGAAATG TGGATGAGTG CTCGTCTGAC 1200 

TACACAATTG TGCTTCCTGT GATTGGGGCC ATCGTGGTTG GTCTCTGCCT TATGGGTATG 1260 

GGTGTCTATA AAATCCGCCT AAGGTGTCAA TCATCTGGAT ACCAGAGAAT CTAATTGTTG 1320 

CCCGGGGGGA ATGAAAATAA TGGAATTTAG AGAACTCTTT CATCCCTTCC AGGATGGATG 1380 

TTGGGAAATT CCCTCAGAGT GTGGGTCCTT CAAACAATGT AAACCACCAT CTTCTATTCA 1440 

AATGAAGTGA GTCATGTGTG ATTTAAGTTC AGGCAGCACA TCAATTTCTA AATACTTTTT 1500 

GTTTATTTTA TGAAAGATAT AGTGAGCTGT TTATTTTCTA GTTTCCTTTA GAATATTTTA 1560 

GCCACTCAAA GTCAACATTT GAGATATGTT GAATTAACAT AATATATGTA AAGTAGAATA 1620 

AGCCTTCAAA TTATAAACCA AGGGTCAATT GTAACTAATA CTACTGTGTG TGCATTGAAG 1680 

ATTTTATTTT ACCCTTGATC TTAACAAAGC CTTTGCTTTG TTATCAAATG GACTTTCAGT 1740 

GCTTTTACTA TCTGTGTTTT ATGGTTTCAT GTAACATACA TATTCCTGGT GTAGCACTTA 1800 

ACTCCTTTTC CACTTTAAAT TTGTTTTTGT TTTTTGAGAC GGAGTTTCAC TCTTGTCACC 1860 

CAGGCTGGAG TACAGTGGCA CGATCTCGGC TTATGGCAAC CTCCGCCTCC CGGGTTCAAG 1920 

TGATTCTCCT GCTTCAGCTT CCCGAGTAGC TGGGATTACA GGCACACACT ACCACGCCTG 1980 

GCTAATTTTT GTATTTTTAT TATAGACGGG TTTCACCATG TTGGCCAGAC TGGTCTTGAA 2040 

CTCTTGACCT CAGGTGATCC ACCCACCTCA GCCTCCCAAA GTGCTGGGAT TACAGGCATG 2100 

AGCCATTGCG CCCGGCCTTA AATGTTTTTT TTAATCATCA AAAAGAACAA CATATCTCAG 2160 
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GTTGTCTAAG TGTTTTTATG TAAAACCAAC AAAAAGAACA AATCAGCTTA TATTTTTTAT 2220 

CTTGATGACT CCTGCTCCAG AATTGCTAGA CTAAGAATTA GGTGGCTACA GATGGTAGAA 2280 

CTAAACAATA AGCAAGAGAC AATAATAATG GCCCTTAATT ATTAACAAAG TGCCAGAGTC 2340 

TAGGCTAAGC ACTTTATCTA TATCTCATTT CATTCTCACA ACTTATAAGT GAATGAGTAA 2400 

ACTGAGACTT AAGGGAACTG AATCACTTAA ATGTCACCTG GCTAACTGAT GGCAGAGCCA 2460 

GAGCTTGAAT TCATGTTGGT CTGACATCAA GGTCTTTGGT CTTCTCCCTA CACCAAGTTA 2520 

CCTACAAGAA CAATGACACC ACACTCTGCC TGAAGGCTCA CACCTCATAC CAGCATACGC 2580 

TCACCTTACA GGGAAATGGG TTTATCCAGG ATCATGAGAC ATTAGGGTAG ATGAAAGGAG 2640 

AGCTTTGCAG ATAACAAAAT AGCCTATCCT TAATAAATCC TCCACTCTCT GGAAGGAGAC 2700 

TGAGGGGCTT TGTAAAACAT TAGTCAGTTG CTCATTTTTA TGGGATTGCT TAGCTGGGCT 2760 

GTAAAGATGA AGGCATCAAA TAAACTCAAA GTATTTTTAA ATTTTTTTGA TAATAGAGAA 2820 

ACTTCGCTAA CCAACTGTTC TTTCTTGAGT GTATAGCCCC ATCTTGTGGT AACTTGCTGC 2880 

TTCTGCACTT CATATCCATA TTTCCTATTG TTCACTTTAT TCTGTAGAGC AGCCTGCCAA 2940 

GAATTTTATT TCTGCTGTTT TTTTTGCTGC TAAAGAAAGG AACTAAGTCA GGATGTTAAC 3000 

AGAAAAGTCC ACATAACCCT AGAATTCTTA GTCAAGGAAT AATTCAAGTC AGCCTAGAGA 3060 

CCATGTTGAC TTTCCTCATG TGTTTCCTTA TGACTCAGTA AGTTGGCAAG GTCCTGACTT 3120 
TAGTCTTAAT AAAACATTGA ATTGTAGTAA AGGTTTTTGC AATAAAAACT TACTTTGG 

Seq ID NO: 489 Protein sequence 
Protein Accession #: NP_055213.1

1 11 21 31 41 51

I I I I I I

MPRQLSAAAA LFASLAVILH DGSQMRAKAF PETRDYSQPT AAATVQDIKK PVQQPAKQAP 60

HQTLAARFMD GHITFQTAAT VKIPTTTPAT TKNTATTSPI TYTLVTTQAT PNNSHTAPPV 120 

TEVTVGPSLA PYSLPPTITP PAHTAGTSSS TVSHTTGNTT QPSNQTTLPA TLSIALHKST 180 

TGQKPDQPTH APGTTAAAHN TTRTAAPAST VPGPTLAPQP SSVKTGIYQV LNGSRLCIKA 240 

EMGIQLIVQD KESVFSPRRY FNIDPNATQA SGNCGTRKSN LLLNFQGGFV NLTFTKDEES 300

YYISEVGAYL TVSDPETVYQ GIKHAVVMFQ TAVGHSFKCV SEQSLQLSAH LQVKTTDVQL 360
QAFDFEDDHF GNVDECSSDY TIVLPVIGAI VVGLCLMGMG VYKIRLRCQS SGYQRI

Seq ID NO: 490 DNA sequence 

Nucleic Acid Accession #: NM_005409.3

Coding sequence: 94..378

1 11 21 31 41 51

I I I I I I

TTCCTTTCAT GTTCAGCATT TCTACTCCTT CCAAGAAGAG CAGCAAAGCT GAAGTAGCAG 60 

CAACAGCACC AGCAGCAACA GCAAAAAACA AACATGAGTG TGAAGGGCAT GGCTATAGCC 120 

TTGGCTGTGA TATTGTGTGC TACAGTTGTT CAAGGCTTCC CCATGTTCAA AAGAGGACGC 180 

TGTCTTTGCA TAGGCCCTGG GGTAAAAGCA GTGAAAGTGG CAGATATTGA GAAAGCCTCC 240 

ATAATGTACC CAAGTAACAA CTGTGACAAA ATAGAAGTGA TTATTACCCT GAAAGAAAAT 300 

AAAGGACAAC GATGCCTAAA TCCCAAATCG AAGCAAGCAA GGCTTATAAT CAAAAAAGTT 360 

GAAAGAAAGA ATTTTTAAAA ATATCAAAAC ATATGAAGTC CTGGAAAAGG GCATCTGAAA 420 

AACCTAGAAC AAGTTTAACT GTGACTACTG AAATGACAAG AATTCTACAG TAGGAAACTG 480 

AGACTTTTCT ATGGTTTTGT GACTTTCAAC TTTTGTACAG TTATGTGAAG GATGAAAGGT 540 

GGGTGAAAGG ACCAAAAACA GAAATACAGT CTTCCTGAAT GAATGACAAT CAGAATTCCA 600 

CTGCCCAAAG GAGTCCAGCA ATTAAATGGA TTTCTAGGAA AAGCTACCTT AAGAAAGGCT 660 

GGTTACCATC GGAGTTTACA AAGTGCTTTC ACGTTCTTAC TTGTTGTATT ATACATTCAT 720 

GCATTTCTAG GCTAGAGAAC CTTCTAGATT TGATGCTTAC AACTATTCTG TTGTGACTAT 780 

GAGAACATTT CTGTCTCTAG AAGTTATCTG TCTGTATTGA TCTTTATGCT ATATTACTAT 840 

CTGTGGTTAC AGTGGAGACA TTGACATTAT TACTGGAGTC AAGCCCTTAT AAGTCAAAAG 900 

CATCTATGTG TCGTAAAGCA TTCCTCAAAC ATTTTTTCAT GCAAATACAC ACTTCTTTCC 960 

CCAAATATCA TGTAGCACAT CAATATGTAG GGAAACATTC TTATGCATCA TTTGGTTTGT 1020 

TTTATAACCA ATTCATTAAA TGTAATTCAT AAAATGTACT ATGAAAAAAA TTATACGCTA 1080 

TGGGATACTG GCAACAGTGC ACATATTTCA TAACCAAATT AGCAGCACCG GTCTTAATTT 1140 

GATGTTTTTC AACTTTTATT CATTGAGATG TTTTGAAGCA ATTAGGATAT GTGTGTTTAC 1200

TGTACTTTTT GTTTTGATCC GTTTGTATAA ATGATAGCAA TATCTTGGAC ACATTTGAAA 1260 

TACAAAATGT TTTTGTCTAC CAAAGAAAAA TGTTGAAAAA TAAGCAAATG TATACCTAGC 1320 

AATCACTTTT ACTTTTTGTA ATTCTGTCTC TTAGAAAAAT ACATAATCTA ATCAATTTCT 1380 

TTGTTCATGC CTATATACTG TAAAATTTAG GTATACTCAA GACTAGTTTA AAGAATCAAA 1440 
GTCATTTTTT TCTCTAATAA ACTACCACAA CCTTTCTTTT TTAAAAAAAA AAA 

Seq ID NO: 491 Protein sequence 
Protein Accession #: NP_005400.1 

1 11 21 31 41 51 

I I I I I I 

MSVKGMAIAL AVILCATVVQ GFPMFKRGRC LCIGPGVKAV KVADIEKASI MYPSNNCDKI 60
EVIITLKENK GQRCLNPKSK QARLIIKKVE RKNF

Seq ID NO: 492 DNA sequence 

Nucleic Acid Accession #: NM_000577.1 

Coding sequence: 41..520

1 11 21 31 41 51

I I I I I I

GGCACGAGGG GAAGACCTCC TGTCCTATCA GGCCCTCCCC ATGGCTTTAG AGACGATCTG 60 

CCGACCCTCT GGGAGAAAAT CCAGCAAGAT GCAAGCCTTC AGAATCTGGG ATGTTAACCA 120 

GAAGACCTTC TATCTGAGGA ACAACCAACT AGTTGCCGGA TACTTGCAAG GACCAAATGT 180 

CAATTTAGAA GAAAAGATAG ATGTGGTACC CATTGAGCCT CATGCTCTGT TCTTGGGAAT 240

CCATGGAGGG AAGATGTGCC TGTCCTGTGT CAAGTCTGGT GATGAGACCA GACTCCAGCT 300 

GGAGGCAGTT AACATCACTG ACCTGAGCGA GAACAGAAAG CAGGACAAGC GCTTCGCCTT 360 

CATCCGCTCA GACAGTGGCC CCACCACCAG TTTTGAGTCT GCCGCCTGCC CCGGTTGGTT 420 

CCTCTGCACA GCGATGGAAG CTGACCAGCC CGTCAGCCTC ACCAATATGC CTGACGAAGG 480 

CGTCATGGTC ACCAAATTCT ACTTCCAGGA GGACGAGTAG TACTGCCCAG GCCTGCCTGT 540 

TCCCATTCTT GCATGGCAAG GACTGCAGGG ACTGCCAGTC CCCCTGCCCC AGGGCTCCCG 600
GCTATGGGGG CACTGAGGAC CAGCCATTGA GGGGTGGACC CTCAGAAGGC GTCACAACAA 660

CCTGGTCACA GGACTCTGCC TCCTCTTCAA CTGACCAGCC TCCATGCTGC CTCCAGAATG 720 

GTCTTTCTAA TGTGTGAATC AGAGCACAGC AGCCCCTGCA CAAAGCCCTT CCATGTCGCC 780 

TCTGCATTCA GGATCAAACC CCGACCACCT GCCCAACCTG CTCTCCTCTT GCCACTGCCT 840 

CTTCCTCCCT CATTCCACCT TCCCATGCCC TGGATCCATC AGGCCACTTG ATGACCCCCA 900 

ACCAAGTGGC TCCCACACCC TGTTTTACAA AAAAGAAAAG ACCAGTCCAT GAGGGAGGTT 960 

TTTAAGGGTT TGTGGAAAAT GAAAATTAGG ATTTCATGAT TTTTTTTTTT CAGTCCCCGT 1020 

GAAGGAGAGC CCTTCATTTG GAGATTATGT TCTTTCGGGG AGAGGCTGAG GACTTAAAAT 1080 

ATTCCTGCAT TTGTGAAATG ATGGTGAAAG TAAGTGGTAG CTTTTCCCTT CTTTTTCTTC 1140 

TTTTTTTGTG ATGTCCCAAC TTGTAAAAAT TAAAAGTTAT GGTACTATGT TAGCCCCATA 1200 

ATTTTTTTTT TCCTTTTAAA ACACTTCCAT AATCTGGACT CCTCTGTCCA GGCACTGCTG 1260 

CCCAGCCTCC AAGCTCCATC TCCACTCCAG ATTTTTTACA GCTGCCTGCA GTACTTTACC 1320 

TCCTATCAGA AGTTTCTCAG CTCCCAAGGC TCTGAGCAAA TGTGGCTCCT GGGGGTTCTT 1380 

TCTTCCTCTG CTGAAGGAAT AAATTGCTCC TTGACATTGT AGAGCTTCTG GCACTTGGAG 1440 

ACTTGTATGA AAGATGGCTG TGCCTCTGCC TGTCTCCCCC ACCAGGCTGG GAGCTCTGCA 1500 

GAGCAGGAAA CATGACTCGT ATATGTCTCA GGTCCCTGCA GGGCCAAGCA CCTAGCCTCX5 1560 

CTCTTGGCAG GTACTCAGCG AATGAATGCT GTATATGTTG GGTGCAAAGT TCCCTACTTC 1620 

CTGTGACTTC AGCTCTGTTT TACAATAAAA TCTTGAAAAT GCCTAAAAAA AAAAAAAAAA 1680 
AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAA 

Seq ID NO: 493 Protein sequence 
Protein Accession #: NP_000568.1



PCT/US02/12476



1 11 21 31 41 51 

I I I I I I

MALETICRPS GRKSSKMQAF RIWDVNQKTF YLRNNQLVAG YLQGPNVNLE EKIDVVPIEP 60
HALFLGIHGG KMCLSCVKSG DETRLQLEAV NITDLSENRK QDKRFAFIRS DSGPTTSFES 120
AACPGWFLCT AMEADQPVSL TNMPDEGVMV TKFYFQEDE

Seq ID NO: 494 DNA sequence

Nucleic Acid Accession #: NM_002081.1

Coding sequence: 222..1898



1 11 21 31 41 51

I I I I I I

GGCTGCCCGA GCGAGCGTTC GGACCTCGCA CCCCGCGCGC CCCGCGCCGC CGCCGCCGCC 60

GGCTTTTGTT GTCTCCGCCT CCTCGGCCGC CGCCGCCTCT GGACCGCGAG CCGCGCGCGC 120

CGGGACCTTG GCTCTGCCCT TCGCGGGCGG GAACTGCGCA GGACCCGGCC AGGATCCGAG 180

AGAGGCGCGG GCGGGTGGCC GGGGGCGCCG CCGGCCCCGC CATGGAGCTC CGGGCCCGAG 240

GCTGGTGGCT GCTATGTGCG GCCGCAGCGC TGGTCGCCTG CGCCCGCGGG GACCCGGCCA 300

GCAAGAGCCG GAGCTGCGGC GAGGTCCGCC AGATCTACGG AGCCAAGGGC TTCAGCCTGA 360

GCGACGTGCC CCAGGCGGAG ATCTCGGGTG AGCACCTGCG GATCTGTCCC CAGGGCTACA 420

CCTGCTGCAC CAGCGAGATG GAGGAGAACC TGGCCAACCG CAGCCATGCC GAGCTGGAGA 480

CCGCGCTCCG GGACAGCAGC CGCGTCCTGC AGGCCATGCT TGCCACCCAG CTGCGCAGCT 540

TCGATGACCA CTTCCAGCAC CTGCTGAACG ACTCGGAGCG GACGCTGCAG GCCACCTTCC 600

CCGGCGCCTT CGGAGAGCTG TACACGCAGA ACGCGAGGGC CTTCCGGGAC CTGTACTCAG 660

AGCTGCGCCT GTACTACCGC GGTGCCAACC TGCACCTGGA GGAGACGCTG GCCGAGTTCT 720

GGGCCCGCCT GCTCGAGCGC CTCTTCAAGC AGCTGCACCC CCAGCTGCTG CTGCCTGATG 780

ACTACCTGGA CTGCCTGGGC AAGCAGGCCG AGGCGCTGCG GCCCTTCGGG GAGGCCCCGA 840

GAGAGCTGCG CCTGCGGGCC ACCCGTGCCT TCGTGGCTGC TCGCTCCTTT GTGCAGGGCC 900

TGGGCGTGGC CAGCGACGTG GTCCGGAAAG TGGCTCAGGT CCCCCTGGGC CCGGAGTGCT 960

CGAGAGCTGT CATGAAGCTG GTCTACTGTG CTCACTGCCT GGGAGTCCCC GGCGCCAGGC 1020

CCTGCCCTGA CTATTGCCGA AATGTGCTCA AGGGCTGCCT TGCCAACCAG GCCGACCTGG 1080

ACGCCGAGTG GAGGAACCTC CTGGACTCCA TGGTGCTCAT CACCGACAAG TTCTGGGGTA 1140

CATCGGGTGT GGAGAGTGTC ATCGGCAGCG TGCACACGTG GCTGGCGGAG GCCATCAACG 1200

CCCTCCAGGA CAACAGGGAC ACGCTCACGG CCAAGGTCAT CCAGGGCTGC GGGAACCCCA 1260

AGGTCAACCC CCAGGGCCCT GGGCCTGAGG AGAAGCGGCG CCGGGGCAAG CTGGCCCCGC 1320

GGGAGAGGCC ACCTTCAGGC ACGCTGGAGA AGCTGGTCTC TGAAGCCAAG GCCCAGCTCC 1380

GCGACGTCCA GGACTTCTGG ATCAGCCTCC CAGGGACACT GTGCAGTGAG AAGATGGCCC 1440

TGAGCACTGC CAGTGATGAC CGCTGCTGGA ACGGGATGGC CAGAGGCCGG TACCTCCCCG 1500

AGGTCATGGG TGACGGCCTG GCCAACCAGA TCAACAACCC CGAGGTGGAG GTGGACATCA 1560

CCAAGCCGGA CATGACCATC CGGCAGCAGA TCATGCAGCT GAAGATCATG ACCAACCGGC 1620

TGCGCAGCGC CTACAACGGC AACGACGTGG ACTTCCAGGA CGCCAGTGAC GACGGCAGCG 1680

GCTCGGGCAG CGGTGATGGC TGTCTGGATG ACCTCTGCGG CCGGAAGGTC AGCAGGAAGA 1740

GCTCCAGCTC CCGGACGCCC TTGACCCATG CCCTCCCAGG CCTGTCAGAG CAGGAAGGAC 1800

AGAAGACCTC GGCTGCCAGC TGCCCCCAGC CCCCGACCTT CCTCCTGCCC CTCCTCCTCT 1860

TCCTGGCCCT TACAGTAGCC AGGCCCCGGT GGCGGTAACT GCCCCAAGGC CCCAGGGACA 1920

GAGGCCAAGG ACTGACTTTG CCAAAAATAC AACACAGACG ATATTTAATT CACCTCAGCC 1980

TGGAGAGGCC TGGGGTGGGA CAGGGAGGGC CGGCGGCTCT GAGCAGGGGC AGGCGCAGAG 2040

GTCCCAGCCC CAGGCCTGGC CTCGCCTGCC TTTCTGCCTT TTAATTTTGT ATGAGGTCCT 2100

CAGGTCAGCT GGGAGCCAGT GTGCCCAAAA GCCATGTATT TCAGGGACCT CAGGGGCACC 2160

TCCGGCTGCC TAGCCCTCCC CCCAGCTCCC TGCACCGCCG CAGAAGCAGC CCCTCGAGGC 2220

CTACAGAGGA GGCCTCAAAG CAACCCGCTG GAGCCCACAG CGAGCCTGTG CCTTCCTCCC 2280

CGCCTCCTCC CACTGGGACT CCCAGCAGAG CCCACCAGCC AGCCCTGGCC CACCCCCCAG 2340

CCTCCAGAGA AGCCCCGCAC GGGCTGTCTG GGTGTCCGCC ATCCAGGGTC TGGCAGAGCC 2400

TCTGAGATGA TGCATGATGC CCTCCCCTCA GCGCAGGCTG CAGAGCCCGG CCCCACCTCC 2460

CTGCGCCCTT GAGGGGCCCC AGCGTCTGCA GGGTGACGCC TGAGACAGCA CCACTGCTGA 2520

GGAGTCTGAG GACTGTCCTC CCACAGACCC TGCAGTGAGG GGCCCTCCAT GCGCAGATGA 2580

GGGGCCACTG ACCCACCTGC GCTTCTGCTG GAGGAGGGGA AGCTGGGCCC AAAGGCCCAG 2640

GGAGGCAGCG TGGGCTCTGC CAATGTGGGC TGCCCCTCGC ACACAGGGCT CACAGGGCAG 2700

GCCTTGCTGG GGTCCAGGGC TGTTGGAGGA CCCCGAGGGC TGAGGAGCAG CCAGGACCCG 2760

CCTGCTCCCA TCCTCACCCA GATCAGGAAC CAGGGCCTCC CTGTTCACGG TGACACAGGT 2820

CAGGGCTCAG AGTGACCCTC GGCTGTCACC TGCTCACAGG GATGCTGGTG GCTGGTGAGA 2880

CCCCGCACTG CACACGGGAA TGCCTAGGTC CCTTCCCGAC CCAGCCAGCT GCACTGCAGG 2940

GCACGGGGAC CTGGATAGTT AAGGGCTTTT CCAAACATGC ATCCATTTAC TGACACTTCC 3000

TGTCCTTGTT CATGGAGAGC TGTTCGCTCC TCCCAGATGG CTTCGGAGGC CCGCAGGGCC 3060

CACCTTGGAC CCTGGTGACC TCCTGTCACT CACTGAGGCC ATCAGGGCCC TGCCCCAGGC 3120



WO 02/086443 

CTGGACGGGC CCTCCTTCCC TCCTGTGCCC CAGCTGCCAG GTGGCCCTGG GGAGGGGTGG 3180 

TGTGGTGTTG GGAAGGGGTC CTGCAGGGGG AGGAGGACTT GGAGGGTCTG GGGGCAGCTG 3240 

TCCTGAACCG ACTGACCCTG AGGAGGCCGC TTAGTGCTGC TTTGCTTTTC ATCACCGTCC 3300 

CGCACAGTGG ACGGAGGTCC CCGGTTGCTG GTCAGGTCCC CATGGCTTGT TCTCTGGAAC 3360 

CTGACTTTAG ATGTTTTGGG ATCAGGAGCC CCCAACACAG GCAAGTCCAC CCCATAATAA 3420 

CCCTGCCAGT GCCAGGGTGG GCTGGGGACT CTGGCACAGT GATGCCGGGC GCCAGGACAG 3480 

CAGCACTCCC GCTGCACACA GACGGCCTAG GGGTGGCGCT CAGACCCCAC CCTACGCTCA 3540

TCTCTGGAAG GGGCAGCCCT GAGTGGTCAC TGGTCAGGGC AGTGGCCAAG CCTGCTGTGT 3600 

CCTTCCTCCA CAAGGTCCCC CCACCGCTCA GTGTCAGCGG GTGACGTGTG TTCTTTTGAG 3660 
TCCTTGTATG AATAAAAGGC TGGAAACCTA AA 

Seq ID NO: 495 Protein sequence
Protein Accession #: NP_002072.1

1 11 21 31 41 51

I I I I I I

MELRARGWWL LCAAAALVAC ARGDPASKSR SCGEVRQIYG AKGFSLSDVP QAEISGEHLR 60
ICPQGYTCCT SEMEENLANR SHAELETALR DSSRVLQAML ATQLRSFDDH FQHLLNDSER 120
TLQATFPGAF GELYTQNARA FRDLYSELRL YYRGANLHLE ETLAEFWARL LERLFKQLHP 180
QLLLPDDYLD CLGKQAEALR PFGEAPRELR LRATRAFVAA RSFVQGLGVA SDVVRKVAQV 240
PLGPECSRAV MKLVYCAHCL GVPGARPCPD YCRNVLKGCL ANQADLDAEW RNLLDSMVLI 300
TDKFWGTSGV ESVIGSVHTW LAEAINALQD NRDTLTAKVI QGCGNPKVNP QGPGPEEKRR 360
RGKLAPRERP PSGTLEKLVS EAKAQLRDVQ DFWISLPGTL CSEKMALSTA SDDRCWNGMA 420
RGRYLPEVMG DGLANQINNP EVEVDITKPD MTIRQQIMQL KIMTNRLRSA YNGNDVDFQD 480
ASDDGSGSGS



Seq ID NO: 496 DNA sequence

Nucleic Acid Accession #: NM_001650.

Coding sequence: 40..1011

1 11 21 31 41 51

I I I I I I

GGGGCAGGCA ATGAGAGCTG CACTCTGGCT GGGGAAGGCA TGAGTGACAG ACCCACAGCA 60
AGGCGGTGGG GTAAGTGTGG ACCTTTGTGT ACCAGAGAGA ACATCATGGT GGCTTTCAAA 120
GGGGTCTGGA CTCAAGCTTT CTGGAAAGCA GTCACAGCGG AATTTCTGGC CATGCTTATT 180
TTTGTTCTCC TCAGCCTGGG ATCCACCATC AACTGGGGTG GAACAGAAAA GCCTTTACCG 240
GTCGACATGG TTCTCATCTC CCTTTGCTTT GGACTCAGCA TTGCAACCAT GGTGCAGTGC 300
TTTGGCCATA TCAGCGGTGG CCACATCAAC CCTGCAGTGA CTGTGGCCAT GGTGTGCACC 360
AGGAAGATCA GCATCGCCAA GTCTGTCTTC TACATCGCAG CCCAGTGCCT GGGGGCCATC 420
ATTGGAGCAG GAATCCTCTA TCTGGTCACA CCTCCCAGTG TGGTGGGAGG CCTGGGAGTC 480
ACCATGGTTC ATGGAAATCT TACCGCTGGT CATGGTCTCC TGGTTGAGTT GATAATCACA 540
TTTCAATTGG TGTTTACTAT CTTTGCCAGC TGTGATTCCA AACGGACTGA TGTCACTGGC 600
TCAATAGCTT TAGCAATTGG ATTTTCTGTT GCAATTGGAC ATTTATTTGC AATCAATTAT 660
ACTGGTGCCA GCATGAATCC CGCCCGATCC TTTGGACCTG CAGTTATCAT GGGAAATTGG 720
GAAAACCATT GGATATATTG GGTTGGGCCC ATCATAGGAG CTGTCCTCGC TGGTGGCCTT 780
TATGAGTATG TCTTCTGTCC AGATGTTGAA TTCAAACGTC GTTTTAAAGA AGCCTTCAGC 840
AAAGCTGCCC AGCAAACAAA AGGAAGCTAC ATGGAGGTGG AGGACAACAG GAGTCAGGTA 900
GAGACGGATG ACCTGATTCT AAAACCTGGA GTGGTGCATG TGATTGACGT TGACCGGGGA 960
GAGGAGAAGA AGGGGAAAGA CCAATCTGGA GAGGTATTGT CTTCAGTATG ACTAGAAGAT 1020
CGCACTGAAA GCAGACAAGA CTCCTTAGAA CTGTCCTCAG ATTTCCTTCC ACCCATTAAG 1080
GAAACAGATT TGTTATAAAT TAGAAATGTG CAGGTTTGTT GTTTCATGTC ATATTACTCA 1140
GTCTAAACAA TAAATATTTC ATAATTTACA AAGGAGGAAC GGAAGAAACC TATTGTGAAT 1200
TCCAAATCTA AAAAAAGAAA TATTTTTAAG ATGTTCTTAA GCAAATATAT ACCTATTTTA 1260
TCTAGTTACC TTTCATTAAC AACCAATTTT AACCGTGTGT CAAGATTTGG TTAAGTCTTG 1320
CCTGACAGAA CTCAAAGACA CGTCTATCAG CTTATTCCTT CTCTACTGGA ATATTGGTAT 1380
AGTCAATTCT TATTTGAATA TTTATTCTAT TAAACTGAGT TTAACAATGG C

Seq ID NO: 497 Protein sequence
Protein Accession #: NP_001641.1

1 11 21 31 41 51

I I I I I I

MSDRPTARRW GKCGPLCTRE NIMVAFKGVW TQAFWKAVTA EFLAMLIFVL LSLGSTINWG 60
GTEKPLPVDM VLISLCFGLS IATMVQCFGH ISGGHINPAV TVAMVCTRKI SIAKSVFYIA 120
AQCLGAIIGA GILYLVTPPS VVGGLGVTMV HGNLTAGHGL LVELIITFQL VFTIFASCDS 180
KRTDVTGSIA LAIGFSVAIG HLFAINYTGA SMNPARSFGP AVIMGNWENH WIYWVGPIIG 240
AVLAGGLYEY VFCPDVEFKR RFKEAFSKAA QQTKGSYMEV EDNRSQVETD DLILKPGVVH 300
VIDVDRGEEK KGKDQSGEVL SSV

Seq ID NO: 498 DNA sequence

Nucleic Acid Accession #: AB020684.1

Coding sequence: 1..1744

1 11 21 31 41 51

I I I I I I

CCCCCTTGTC ATTAATACAT TAAAAAGATT CAATCTTTAC CCTGAGGTAA TTTTGGCCAG 60

TTGGTACCGG ATTTATACCA AAATAATGGA CTTGATTGGT ATTCAAACCA AGATATGTTG 120

GACGGTTACC AGAGGAGAAG GACTCAGTCC TATTGAAAGC TGTGAAGGAT TGGGAGATCC 180

TGCTTGCTTT TATGTTGCTG TAATTTTTAT TTTAAATGGA CTAATGATGG CATTATTCTT 240

CATATATGGC ACATATTTAA GTGGCAGCCG ATTAGGAGGC CTGGTTACAG TGTTGTGCTT 300

CTTTTTCAAT CATGGAGAGT GTACCCGTGT AATGTGGACA CCACCTCTCC GTGAAAGCTT 360

CTCATATCCA [illegible] TTCAGATGTT GCTAGTGACT CATATTCTCA GGGCTACAAA 420

ACTTTATAGA GGAAGCTTGA TTGCACTCTG CATTTCCAAT GTATTTTTCA TGCTTCCTTG 480

GCAGTTTGCT CAGTTTGTAC TTCTTACTCA GATTGCATCA TTATTTGCAG TATATGTTGT 540

CGGGTACATT GATATATGTA AATTACGGAA GATCATTTAT ATACACATGA TTTCTCTTGC 600

ACTTTGTTTT GTTTTGATGT TTGGGAACTC AATGTTATTA ACTTCTTATT ATGCTTCTTC 660

TTTGGTAATT ATTTGGGGTA TTCTGGCAAT GAAACCACAT TTCCTGAAAA TAAATGTATC 720

TGAACTTAGT TTATGGGTTA TTCAAGGATG TTTTTGGTTA TTTGGAACTG TCATACTTAA 780

ATACTTGACA TCTAAAATTT TTGGTATTGC AGATGACGCT CATATTGGCA ACTTACTAAC 840 

ATCAAAATTC TTTAGTTATA AGGATTTTGA TACTTTATTG TATACCTGTG CAGCGGAGTT 900 

TGACTTTATG GAAAAAGAGA CTCCACTGAG ATACACAAAG ACATTATTGC TTCCAGTTGT 960

TCTTGTAGTG TTTGTTGCTA TTGTTAGAAA GATTATTAGT GATATGTGGG GTGTCTTAGC 1020 

TAAACAACAG ACACATGTAA GAAAACACCA GTTTGATCAT GGAGAGCTGG TTTACCATGC 1080 

ATTGCAATTG TTAGCATATA CAGCCCTTGG TATTTTAATT ATGAGACTAA AACTCTTCTT 1140 

GACACCACAC ATGTGTGTTA TGGCATCACT GATCTGCTCA AGACAGCTAT TTGGATGGCT 1200 

CTTTTGCAAA GTACATCCTG GTGCTATTGT GTTTGCTATA TTAGCAGCAA TGTCAATACA 1260 

AGGTTCAGCA AATCTGCAAA CCCAGTGGAA TATTGTAGGG GAGTTCAGCA ATTTGCCCCA 1320 

AGAAGAACTT ATAGAATGGA TCAAATATAG TACTAAACCA GATGCAGTGT TTGCGGGTGC 1380 

CATGCCCACG ATGGCAAGTG TTAAGCTCTC TGCACTTCGG CCCATTGTGA ATCATCCACA 1440 

TTATGAAGAC GCAGGCTTGA GAGCCAGAAC AAAAATAGTA TACTCAATGT ATAGTCGGAA 1500 

AGCAGCCGAA GAAGTGAAGC GAGAACTGAT AAAGTTAAAA GTGAACTATT ACATTCTAGA 1560 

AGAGTCATGG TGTGTAAGAA GATCCAAGCC TGGTTGCAGT ATGCCTGAAA TTTGGGATGT 1620 

AGAAGATCCT GCCAATGCTG GGAAAACTCC CTTATGTAAC CTCTTGGTGA AGGATTCCAA 1680 

ACCTCACTTC ACCACTGTAT TCCAGAACAG TGTTTACAAA GTCCTAGAAG TTGTAAAAGA 1740 

ATGACTGCTA CATGACCTGC TGCCTACGGA GAACTACATC TGTAATGGTT TTAATGTTTT 1800 

GCTAAGTCAT GTGTTGTTCA TATCCCAAAA ACTTTTATAG GTAACTGTTT TCAAATAGAA 1860 

AACGTTTTAT TTGGTCAATT TGAATGTCAT TCTAATTATA AAAATGACTT ACACCTTTAT 1920 

CAATTGGTTA CTATTTCAAT GCACCCTTTA AAATTTGCTA TGCAAATGAG TATATGCTTG 1980 

TACTTGACTT TAATATTTGT GCTAAAGTGA GCAAAGCTAC CTGTATAAAG AAAACACAGT 2040 

GGGTTGTGAC AAGGATGACA TGAAAATACA GGACAATTCT GACAATGTAG GGGCTGATTT 2100 

TATAGTGTAA GAACTATTAA TGCCCCTTGC TTCTTTTTTC TGCCTCTTGC TCTTGTCTTT 2160 

TGGACATTTC AGTGATTGTA AGTTCTTCGG TCATGTCAGC CCCTGTCATC AACTTGAGTT 2220

ACAGTAGATG GGGCAGACAT GGAGTGTTTG CTATATAAAA CTATCTGTTT GTTTTACTTC 2280

CTTGTGCGCT TTTTGTTCTC TGTTCTCTTG TTAATGAAGC TTTTCCTGCC CATTATTAAT 2340

CCAAACTCTT GGACCTTGTG GTTAGGAAAT TCCCTTAACT TCCAGCCATA TGGCATTATC 2400 

GTGTCTCTTT CTCTCTCTCT CTTGCTCTCT CTCTTCTCCT CTTCCCCATA TTTTCTGTCA 2460 

AATAAGTACT GTTTACTCAT TTAGTTGCTT ATCAAGTACT TATTCTTGGT TTTAAAAAAA 2520 

ATTAATGGTA ACTGTATTTT TCTCATTTTT AGCATTATTC AAATGTTTAT ATTTTAATAC 2580 

CTTTAAACCA CTTTAAAGTT TTTTCATGTT TAATTATAGT TTTAAGAAAA ACTATTTTGA 2640 

ACAACCCCAA ATATAGTGCA TCTAGAAACT AATGTATATT TGATTAGACA TCATTTATAG 2700 

TGGAACAGTA GACTGTAGTA CATGGTAATT TTTCTTTTAC TATTAAGATA CAATAAAACA 2760 

TGACTAATTT TGCTGTCAAA AATGTAAAGA ATAATGATAA ATGGAGTTTT TTATATTTTA 2820 

CTTTTAAGAT TGCCTGTCTT TAATAAGACA AAGCCTTAAG CCTTATGTTA TAATTTTGGT 2880 

TCTAAAAACC ATCATTTCAG TATAAGGAAT AAGTATATTT CGTCCTCCTC TTTAGTTTTT 2940

TTCTTCCTAT TTATTTTTAT TTTGAAAAAT TTCTACACCT TCTTTGAATT CCTTGTATGA 3000 

ATTTTTGTTT CTTAGAAGTT AATTTGTGTG AAATGAGATT CTTCAAAACG ATGAAACCTC 3060

ATAGCTCTGA GAAAAGGTTT TAGGGTTTTA AATTCTAAGC AAAGCGTGAC TATGGCTGAC 3120 

AGACTACACA TTTAATTATA CAGCTTCTCT TTCTTAACCA CAGGCAGATT AACCTCATTG 3180 

TGGATTGTCC TTCAGACCTT AGTCCTCAGG CATGGTTTCT GGTGCCCACT CCTGGAAGCC 3240 

GCTGTTCCCT TTCTACCTTC TTACCAGAGC CCAAGGGCAG GCCTGGTCCC GGGGAAGCAG 3300 

CAGCTTGCTG ACATAAGTCA GCTGCAAAGG CTGAGGAGTG TGCCCTCAGA GAAGCACCGC 3360 

CCCCCAGTCT TGTGCCAGCG CCTAGAGCCG CAGCTCCCAG GGATGCTCCT TCCCTGGAGG 3420 

CAGCCCAGGA GAGGGACTCT GGCAGCGTTC TTCAGATTTG TGGCCACTGT TTCTCATTTG 3480 

CTGGTTGACT GTTTTTATTT CTTAGGCTTT TGCTAGTTTT AGAAAATAGG GAAGCAGCCC 3540

TTGATTTGTG GATTAAAAGC AACATTTGAG CGATGATGCA CAACAGTCCA GGAAAATGGG 3600 

CGGTGGACAC TTGAGGCTGA GGATGGGAGT TGACATGAGC AGGGAGAGGG AGGTGCGCGC 3660 

TGCTTATCTG TGATTGTTGC TCACCTGAGT GTGGCTGATT GTGTACATCC AGCAGTTACA 3720 

ATTTTTAAAA ATTATACTTT TACATTTATT TTATATTTTT CTCACCCCCA GTAATTTCCT 3780 

TCCAAAGAAG TTCACATGTA ATAAGTAGAA ATTCTGTATA GGAAAAAAGC ATTAAAAATA 3840 

CTATTATAAC TGCTTCATTT GCTGGGAACC ATTAAAAGTA ATATAAATTA GCTTTTTCCA 3900 

GAAGGATCCT TTTGTAGCAG TGTTTATGAA TGTAACCCCC AGCAAAATAT GGCTATATAT 3960 

TAGGGGAGCC AGTTTGGAGC AGAGGCCTGA AGGTCCCTGC TATGCAGCCG TGGCCACAGC 4020 

TCGCAGCCCA AGCACTGTGG AGCATCCACA CCTTTGATGG CAATGCAGAT TGGTAGCAGG 4080 

TTCCATAGGC GTACAAAACA GTATTAAAGC TCAGTGTTTT GCATATTGTT AGCATTTACA 4140 

AATATTTTTG CTTTAGTATG AGGAAAGTAA GGATGGGCAA AGAAGCGATC AAAATAGCTA 4200 

TTGCTACAAC ATTTTCGAAA ACAAAGTTGG GGCTGTATTT CTTTAAAAAG ATAAGCCTCT 4260 

AAAAATGCTT GGCAAAAAAA ATATAGTGTT AAAATAGGCC AGTGATATTA ATGAGAAAAT 4320 

GAAAGTATGT ATCAGGAATA AAGTGATATT GCATAGGAGT ATTGTATTTT TATGAATTTT 4380 
ATGCCAGTTG TTTACATGTA CTATATATGT TAAATTAAAA AAAATCATGA GAAATG 

Seq ID NO: 499 Protein sequence 
Protein Accession #: BAA74900.1

1 11 21 31 41 51 

I I I I I I 

PLVINTLKRF NLYPEVILAS WYRIYTKIMD LIGIQTKICW TVTRGEGLSP IESCEGLGDP 60 

ACFYVAVIFI LNGLMMALFF IYGTYLSGSR LGGLVTVLCF FFNHGECTRV MWTPPLRESF 120 

SYPFLVLQML LVTHILRATK LYRGSLIALC ISNVFFMLPW QFAQFVLLTQ IASLFAVYVV 180

GYIDICKLRK IIYIHMISLA LCFVLMFGNS MLLTSYYASS LVIIWGILAM KPHFLKINVS 240 

ELSLWVIQGC FWLFGTVILK YLTSKIFGIA DDAHIGNLLT SKFFSYKDFD TLLYTCAAEF 300

DFMEKETPLR YTKTLLLPVV LVVFVAIVRK IISDMWGVLA KQQTHVRKHQ FDHGELVYHA 360

LQLLAYTALG ILIMRLKLFL TPHMCVMASL ICSRQLFGWL FCKVHPGAIV FAILAAMSIQ 420 

GSANLQTQWN IVGEFSNLPQ EELIEWIKYS TKPDAVFAGA MPTMASVKLS ALRPIVNHPH 480 

YEDAGLRART KIVYSMYSRK AAEEVKRELI KLKVNYYILE ESWCVRRSKP GCSMPEIWDV 540 
EDPANAGKTP LCNLLVKDSK PHFTTVFQNS VYKVLEVVKE

Seq ID NO: 500 DNA sequence 

Nucleic Acid Accession #: NM_001276.1

Coding sequence: 127..1278

1 11 21 31 41 51 

I I I I I I 

AGTGGAGTGG GACAGGTATA TAAAGGAAGT ACAGGGCCTG GGGAAGAGGC CCTGTCTAGG 60 

TAGCTGGCAC CAGGAGCCGT GGGCAAGGGA AGAGGCCACA CCCTGCCCTG CTCTGCTGCA 120 




GCCAGAATGG GTGTGAAGGC GTCTCAAACA GGCTTTGTGG TCCTGGTGCT GCTCCAGTGC 180 

TGCTCTGCAT ACAAACTGGT CTGCTACTAC ACCAGCTGGT CCCAGTACCG GGAAGGCGAT 240 

GGGAGCTGCT TCCCAGATGC CCTTGACCGC TTCCTCTGTA CCCACATCAT CTACAGCTTT 300 

GCCAATATAA GCAACGATCA CATCGACACC TGGGAGTGGA ATGATGTGAC GCTCTACGGC 360

ATGCTCAACA CACTCAAGAA CAGGAACCCC AACCTGAAGA CTCTCTTGTC TGTCGGAGGA 420

TGGAACTTTG GGTCTCAAAG ATTTTCCAAG ATAGCCTCCA ACACCCAGAG TCGCCGGACT 480 

TTCATCAAGT CAGTACCGCC ATTCCTGCGC ACCCATGGCT TTGATGGGCT GGACCTTGCC 540 

TGGCTCTACC CTGGACGGAG AGACAAACAG CATTTTACCA CCCTAATCAA GGAAATGAAG 600 

GCCGAATTTA TAAAGGAAGC CCAGCCAGGG AAAAAGCAGC TCCTGCTCAG CGCAGCACTG 660 

TCTGCGGGGA AGGTCACCAT TGACAGCAGC TATGACATTG CCAAGATATC CCAACACCTG 720

GATTTCATTA GCATCATGAC CTACGATTTT CATGGAGCCT GGCGTGGGAC CACAGGCCAT 780 

CACAGTCCCC TGTTCCGAGG TCAGGAGGAT GCAAGTCCTG ACAGATTCAG CAACACTGAC 840 

TATGCTGTGG GGTACATGTT GAGGCTGGGG GCTCCTGCCA GTAAGCTGGT GATGGGCATC 900 

CCCACCTTCG GGAGGAGCTT CACTCTGGCT TCTTCTGAGA CTGGTGTTGG AGCCCCAATC 960 

TCAGGACCGG GAATTCCAGG CCGGTTCACC AAGGAGGCAG GGACCCTTGC CTACTATGAG 1020

ATCTGTGACT TCCTCCGCGG AGCCACAGTC CATAGAACCC TCGGCCAGCA GGTCCCCTAT 1080 

GCCACCAAGG GCAACCAGTG GGTAGGATAC GACGACCAGG AAAGCGTCAA AAGCAAGGTG 1140

CAGTACCTGA AGGATAGGCA GCTGGCAGGC GCCATGGTAT GGGCCCTGGA CCTGGATGAC 1200 

TTCCAGGGCT CCTTCTGCGG CCAGGATCTG CGCTTCCCTC TCACCAATGC CATCAAGGAT 1260 

GCACTCGCTG CAACGTAGCC CTCTGTTCTG CACACAGCAC GGGGGCCAAG GATGCCCCGT 1320

CCCCCTCTGG CTCCAGCTGG CCGGGAGCCT GATCACCTGC CCTGCTGAGT CCCAGGCTGA 1380 

GCCTCAGTCT CCCTCCCTTG GGGCCTATGC AGAGGTCCAC AACACACAGA TTTGAGCTCA 1440 

GCCCTGGTGG GCAGAGAGGT AGGGATGGGG CTGTGGGGAT AGTGAGGCAT CGCAATGTAA 1500

GACTCGGGAT TAGTACACAC TTGTTGATGA TTAATGGAAA TGTTTACAGA TCCCCAAGCC 1560 

TGGCAAGGGA ATTTCTTCAA CTCCCTGCCC CCTAGCCCTC CTTATCAAAG GACACCATTT 1620

TGGCAAGCTC TATCACCAAG GAGCCAAACA TCCTACAAGA CACAGTGACC ATACTAATTA 1680 

TACCCCCTGC AAAGCCAGCT TGAAACCTTC ACTTAGGAAC GTAATCGTGT CCCCTATCCT 1740 

ACTTCCCCTT CCTAATTCCA CAGCTGCTCA ATAAAGTACA AGAGTTTAAC AGTGTGTTGG 1800 

CGCTTTGCTT TGGTCTATCT TTGAGCGCCC ACTAGACCCA CTGGACTCAC CTCCCCCATC 1860 

TCTTCTGGGT TCCTTCCTCT GAGCCTTGGG ACCCCTGAGC TTGCAGAGAT GAAGGCCGCC 1920
ATGTT



Seq ID NO: 501 Protein sequence 
Protein Accession #: NP_001267.1 



1 11 21 31 41 51

I I I I I I

MGVKASQTGF VVLVLLQCCS AYKLVCYYTS WSQYREGDGS CFPDALDRFL CTHIIYSFAN 60
ISNDHIDTWE WNDVTLYGML NTLKNRNPNL KTLLSVGGWN FGSQRFSKIA SNTQSRRTFI 120
KSVPPFLRTH GFDGLDLAWL YPGRRDKQHF TTLIKEMKAE FIKEAQPGKK QLLLSAALSA 180
GKVTIDSSYD IAKISQHLDF ISIMTYDFHG AWRGTTGHHS PLFRGQEDAS PDRFSNTDYA 240
VGYMLRLGAP ASKLVMGIPT FGRSFTLASS ETGVGAPISG PGIPGRFTKE AGTLAYYEIC 300
DFLRGATVHR TLGQQVPYAT KGNQWVGYDD QESVKSKVQY LKDRQLAGAM VWALDLDDFQ 360
GSFCGQDLRF PLTNAIKDAL AAT


Seq ID NO: 502 DNA sequence

Nucleic Acid Accession #: NM_006474.1

Coding sequence: 181..669

1 11 21 31 41 51

I I I I I I

GCTGCCTAGG GTCTGGAAAG CTCGGGCACC CTCCCTCTCC GGGGCTCCTG CTCCCACCCC 60
TCCGGCCCCC CCACCGTCGC GCTCCTCCAG GCTGGGCCTG TGGCCGCGGT GCTTTTAATT 120
TTCCCCCAGC TCAGAATCTT GCTGCTCGGC CCCCAGGAGA GCAACAACTC AACGGGAACG 180
ATGTGGAAGG TGTCAGCTCT GCTCTTCGTT TTGGGAAGCG CGTCGCTCTG GGTCCTGGCA 240
GAAGGAGCCA GCACAGGCCA GCCAGAAGAT GACACTGAGA CTACAGGTTT GGAAGGCGGC 300
GTTGCCATGC CAGGTGCCGA AGATGATGTG GTGACTCCAG GAACCAGCGA AGACCGCTAT 360
AAGTCTGGCT TGACAACTCT GGTGGCAACA AGTGTCAACA GTGTAACAGG CATTCGCATC 420
GAGGATCTGC CAACTTCAGA AAGCACAGTC CACGCGCAAG AACAAAGTCC AAGCGCCACA 480
GCCTCAAACG TGGCCACCAG TCACTCCACG GAGAAAGTGG ATGGAGACAC ACAGACAACA 540
GTTGAGAAAG ATGGTTTGTC AACAGTGACC CTGGTTGGAA TCATAGTTGG GGTCTTACTA 600
GCCATCGGTT TCATTGGTGG AATCATCGTT GTGGTTATGC GAAAAATGTC GGGAAGGTAC 660
TCGCCCTAAA GAGCTGAAGG GTTACGCCCT GCTTGCCAAC GTGCTTTAAA AAAAGACCGT 720
TTCTGACTCT GTGGCCCTGT CCCTGAGCTC GTGGGGAGAA GATGACCCTG GGAACATTTG 780
CGGGCCCATT CAGATTCCAC GGTGACTTTC CGTTTGCCAA ATTAACCGAG GAAAGACCTT 840
TCACCAGATT TGGTTCTTAA ACTTT



Seq ID NO: 503 Protein sequence 
Protein Accession #: NP_006465.1 

1 11 21 31 41 51 

I I I I I I 

MWKVSALLFV LGSASLWVLA EGASTGQPED DTETTGLEGG VAMPGAEDDV VTPGTSEDRY 60
KSGLTTLVAT SVNSVTGIRI EDLPTSESTV HAQEQSPSAT ASNVATSHST EKVDGDTQTT 120
VEKDGLSTVT LVGIIVGVLL AIGFIGGIIV VVMRKMSGRY SP

Seq ID NO: 504 DNA sequence

Nucleic Acid Accession #: Eos sequence

Coding sequence: 62..895

1 11 21 31 41 51

I I I I I I

CACTGCTCTG AGAATTTGTG AGCAGCCCCT AACAGGCTGT TACTTCACTA CAACTGACGA 60
TATGATCATC TTAATTTACT TATTTCTCTT GCTATGGGAA GACACTCAAG GATGGGGATT 120
CAAGGATGGA ATTTTTCATA ACTCCATATG GCTTGAACGA GCAGCCGGTG TGTACCACAG 180
AGAAGCACGG TCTGGCAAAT ACAAGCTCAC CTACGCAGAA GCTAAGGCGG TGTGTGAATT 240
TGAAGGCGGC CATCTCGCAA CTTACAAGCA GCTAGAGGCA GCCAGAAAAA TTGGATTTCA 300
TGTCTGTGCT GCTGGATGGA TGGCTAAGGG CAGAGTTGGA TACCCCATTG TGAAGCCAGG 360
GCCCAACTGT GGATTTGGAA AAACTGGCAT TATTGATTAT GGAATCCGTC TCAATAGGAG 420
TGAAAGATGG GATGCCTATT GCTACAACCC ACACGCAAAG GAGTGTGGTG GCGTCTTTAC 480
AGATCCAAAG CAAATTTTTA AATCTCCAGG CTTCCCAAAT GAGTACGAAG ATAACCAAAT 540
CTGCTACTGG CACATTAGAC TCAAGTATGG TCAGCGTATT CACCTGAGTT TTTTAGATTT 600
TGACCTTGAA GATGACCCAG GTTGCTTGGC TGATTATGTT GAAATATATG ACAGTTACGA 660
TGATGTCCAT GGCTTTGTGG GAAGATACTG TGGAGATGAG CTTCCAGATG ACATCATCAG 720
TACAGGAAAT GTCATGACCT TGAAGTTTCT AAGTGATGCT TCAGTGACAG CTGGAGGTTT 780
CCAAATCAAA TATGTTGCAA TGGATCCTGT ATCCAAATCC AGTCAAGGAA AAAATACAAG 840
TACTACTTCT ACTGGAAATA AAAACTTTTT AGCTGGAAGA TTTAGCCACT TATAAAAAAA 900
AAAAAAAGGA TGATCAAAAC ACACAGTGTT TATGTTGGAA TCTTTTGGAA CTCCTTTGAT 960
CTCACTGTTA TTATTAACAT TTATTTATTA TTTTTCTAAA TGTGAAAGCA ATACATAATT 1020
TAGGGAAAAT TGGAAAATAT AGGAAACTTT AAACGAGAAA ATGAAACCTC TCATAATCCC 1080
ACTGCATAGA AATAACAAGC GTTAACATTT TCATATTTTT TTCTTTCAGT CATTTTTCTA 1140
TTTGTGGTAT ATGTATATAT GTACCTATAT GTATTTGCAT TTGAAATTTT GGAATCCTGC 1200
TCTATGTACA GTTTTGTATT ATACTTTTTA AATCTTGAAC TTTATAAACA TTTTCTGAAA 1260
TCATTGATTA TTCTACAAAA ACATGATTTT AAACAGCTGT AAAATATTCT ATGATATGAA 1320
TGTTTTATGC ATTATTTAAG CCTGTCTCTA TTGTTGGAAT TTCAGGTCAT TTTCATAAAT 1380
ATTGTTGCAA TAAATATCCT TGAACACACA AAAAAAAAAA AA

Seq ID NO: 505 Protein sequence 
Protein Accession #: Eos sequence 

1 11 21 31 41 51

I I I I I I

MIILIYLFLL LWEDTQGWGF KDGIFHNSIW LERAAGVYHR EARSGKYKLT YAEAKAVCEF 60
EGGHLATYKQ LEAARKIGFH VCAAGWMAKG RVGYPIVKPG PNCGFGKTGI IDYGIRLNRS 120
ERWDAYCYNP HAKECGGVFT DPKQIFKSPG FPNEYEDNQI CYWHIRLKYG QRIHLSFLDF 180
DLEDDPGCLA DYVEIYDSYD DVHGFVGRYC GDELPDDIIS TGNVMTLKFL SDASVTAGGF 240
QIKYVAMDPV SKSSQGKNTS TTSTGNKNFL AGRFSHL

Seq ID NO: 506 DNA sequence 

Nucleic Acid Accession #: NM_007115.1

Coding sequence: 69..902

1 11 21 31 41 51

I I I I I I

GAATTCGCAC TGCTCTGAGA ATTTGTGAGC AGCCCCTAAC AGGCTGTTAC TTCACTACAA 60
CTGACGATAT GATCATCTTA ATTTACTTAT TTCTCTTGCT ATGGGAAGAC ACTCAAGGAT 120
GGGGATTCAA GGATGGAATT TTTCATAACT CCATATGGCT TGAACGAGCA GCCGGTGTGT 180
ACCACAGAGA AGCACGGTCT GGCAAATACA AGCTCACCTA CGCAGAAGCT AAGGCGGTGT 240
GTGAATTTGA AGGCGGCCAT CTCGCAACTT ACAAGCAGCT AGAGGCAGCC AGAAAAATTG 300
GATTTCATGT CTGTGCTGCT GGATGGATGG CTAAGGGCAG AGTTGGATAC CCCATTGTGA 360
AGCCAGGGCC CAACTGATGA TTTGGAAAAA CTGGCATTAT TGATTATGGA ATCCGTCTCA 420
ATAGGAGTGA AAGATGGGAT GCCTATTGCT ACAACCCACA CGCAAAGGAG TGTGGTGGCG 480
TCTTTACAGA TCCAAAGCGA ATTTTTAAAT CTCCAGGCTT CCCAAATGAG TACGAAGATA 540
ACCAAATCTG CTACTGGCAC ATTAGACTCA AGTATGGTCA GCGTATTCAC CTGAGTTTTT 600
TAGATTTTGA CCTTGAAGAT GACCCAGGTT GCTTGGCTGA TTATGTTGAA ATATATGACA 660
GTTACGATGA TGTCCATGGC TTTGTGGGAA GATACTGTGG AGATGAGCTT CCAGATGACA 720
TCATCAGTAC AGGAAATGTC ATGACCTTGA AGTTTCTAAG TGATGCTTCA GTGACAGCTG 780
GAGGTTTCCA AATCAAATAT GTTGCAATGG ATCCTGTATC CAAATCCAGT CAAGGAAAAA 840
ATACAAGTAC TACTTCTACT GGAAATAAAA ACTTTTTAGC TGGAAGATTT AGCCACTTAT 900
AAAAAAAAAA AAGGATGATC AAAACACACA GTGTTTATGT TGGAATCTTT TGGAACTCCT 960
TTGATCTCAC TGTTATTATT AACATTTATT TATTATTTTT CTAAATGTGA AAGAAATACA 1020
TAATTTAGGG AAAATTGGAA AATATAGGAA ACTTTAAACG AGAAAATGAA ACCTCTCATA 1080
ATCCCACTGC ATAGAAATAA CAAGCGTTAA CATTTTCATA TTTTTTTCTT TCAGTCATTT 1140
TTGTATTTGT GGTATATGTA TATATGTACC TATATGTATT TGCATTTGAA ATTTTGGAAT 1200
CCTGCTCTAT GTACAGTTTT GTATTATACT TTTTAAATCT TGAACTTTAT GAACATTTTC 1260
TGAAATCATT GATTATTCTA CAAAAACATG ATTTTAAACA GCTGTAAAAT ATTCTATGAT 1320
ATGAATGTTT TATGCATTAT TTAAGCCTGT CTCTATTGTT GGAATTTCAG GTCATTTTCA 1380
TAAATATTGT TGCAATAAAT ATCCTTCGGA ATTC



Seq ID NO: 507 Protein sequence 
Protein Accession #: NP_009046.1

1 11 21 31 41 51

I I I I I I

MIILIYLFLL LWEDTQGWGF KDGIFHNSIW LERAAGVYHR EARSGKYKLT YAEAKAVCEF 60
EGGHLATYKQ LEAARKIGFH VCAAGWMAKG RVGYPIVKPG PNXXFGKTGI IDYGIRLNRS 120
ERWDAYCYNP HAKECGGVFT DPKRIFKSPG FPNEYEDNQI CYWHIRLKYG QRIHLSFLDF 180
DLEDDPGCLA DYVEIYDSYD DVHGFVGRYC GDELPDDIIS TGNVMTLKFL SDASVTAGGF 240
QIKYVAMDPV SKSSQGKNTS TTSTGNKNFL AGRFSHL



Seq ID NO: 508 DNA sequence 

Nucleic Acid Accession #: NM_001044.1

Coding sequence: 129..1991



1 11 21 31 41 51

I I I I I I

ACCGCTCCGG AGCGGGAGGG GAGGCTTCGC GGAACGCTCT CGGCGCCAGG ACTCGCGTGC 60
AAAGCCCAGG CCCGGGCGGC CAGACCAAGA GGGAAGAAGC ACAGAATTCC TCAACTCCCA 120
GTGTGCCCAT GAGTAAGAGC AAATGCTCCG TGGGACTCAT GTCTTCCGTG GTGGCCCCGG 180
CTAAGGAGCC CAATGCCGTG GGCCCGAAGG AGGTGGAGCT CATCCTTGTC AAGGAGCAGA 240
ACGGAGTGCA GCTCACCAGC TCCACCCTCA CCAACCCGCG GCAGAGCCCC GTGGAGGCCC 300
AGGATCGGGA GACCTGGGGC AAGAAGATCG ACTTTCTCCT GTCCGTCATT GGCTTTGCTG 360
TGGACCTGGC CAACGTCTGG CGGTTCCCCT ACCTGTGCTA CAAAAATGGT GGCGGTGCCT 420




TCCTGGTCCC CTACCTGCTC
TGGCCCTCGG CCAGTTCAAC
TGAAAGGTGT GGGCTTCACG
TCATCATCGC CTGGGCGCTG
TCCACTGCAA CAACTCCTGG
GTGGAGACAG CTCGGGCCTC
AACGTGGCGT GCTGCACCTC
GGCAGCTCAC AGCCTGCCTG
GCGTGAAGAC CTCAGGGAAG
CTGCCCTGCT CCTGCGTGGG
TGAGCGTTGA CTTCTACCGG
TGTGCTTCTC CCTGGGCGTG
TCACCAACAA CTGCTACAGG
TCTCCTCCGG CTTCGTCGTC
CCATCGGGGA CGTGGCCAAG
TCGCCACGCT CCCTCTGTCC
TGGGTATCGA CAGCGCCATG
TCCAGCTGCT GCACAGACAC
TCCTGTCCCT GTTCTGCGTC
TTGCAGCCGG CACGTCCATC
TCTATGGTGT TGGGCAGTTC
TGTACTGGCG GCTGTGCTGG
TCAGCATTGT GACCTTCAGA
ACGCGCTGGG CTGGGTCATC
ACAAGTTCTG CAGCCTGCCT
AGAAGGACCG TGAGCTGGTG
TCAAGGTGTA GAGGGAGCAG
ACGAACAAAC CAAGGAAATC
CTCTACTGAA AACACAAACA
TCCGTGCCGG GAGCGCACCT
GCAGCGAGGT CCACCCCGTT
TCTGTGTGAG GCTCCCTCCC
CACTGTGTTC TCAGGCGGGG
CTCACAGTAG CTTCCTAGAC
GGTTTAGCTG TGCAGAAGGT
GATGCGTGGC TCCCAGCAGA
GTCTGTTCAG AGGCATTGGA
ATGTTCTTGC AGCAGAGAGA
GGCAGCCTGT GGGTCCTTGT
GACGCATGCA GGGCCCCCAC
AGGAGCATGT CCTATCCCTG
AACGCATGCA GGGCCCCCAC
TGGAGCGTGT ACTACCCCAG
GACCGGACGC ATGCAGGGCC
CCCACAGGAG CGTGTACTAC
CCCAGGACGC ATGCAGGGCC
TGAGCCGTGA CCTCCAGGAA
CATCAATAAC AACAGTTTTT
ATCAAAACAA ATTCAAGAAT
TTTACAAGAA TAATTAGCAA
CACACTGCCC TCTGCCACTG
TCGGTGGGCC TGCCTACGTG
TCCCCTGCAA GTGGACGTGG
GCCTGTGAAC TGCCAGGCAG
GGGAGGGACA CAGAGGACGG
GCGGCTTCCC CATTGCCTTC
GTTGTTGAAG ACAGCACAGA
AGCAACCCAG GTGTTGTCCG
TAAGCACAAT AAAAGACATC



TTCATGGTCA 
AGGGAAGGGG 
GTCATCCTCA 
CACTATCTCT 
AACAGCCCCA 
AACGACACTT 
CACCAGAGCC 
GTGCTGGTCA 
GTGGTATGGA 
GTCACCCTCC 
CTCTGCGAGG
GGGTTCGGGG 
GACGCGATTG 
TTCTCCTTCC 
GACGGGCCAG 
TCAGCCTGGG 
GGTGGTATGG 
CGTGAGCTCT 
ACCAACGGTG
CTCTTTGGAG 
AGCGACGACA 
AAGCTGGTCA 
CCCCCCCACT 
GCCACATCCT 
GGGTCCTTTC 
GACAGAGGGG 
AGACGAAGAC 
TAAGTTTCGA 
ACAAAGCAGA 
CGCCGTGTCT 



TCCCTGCTCC 
ATCACGATCC 
CATTTACTTT 
GAAATGGAGG 
GGCCGTAAAT 
GGATGGGGGT 
AATAAAACTC 
GGTGTAGGGA 
AGGAGCGTGT 
GACGCATGCA 
AGGAGCGTGT 
GACGCATGCA 
CCCACAGGAG 
CCCAGGATGC 
CCCATGCAGG 
GGGACCCCAC 
ATGTTTGCGA 
GCAGTATCCG 
TACTGAGTGA 
ACAGGAAAGT 
CTGCCCGAGG 
GCTCCAGGGA 
CTGCAGTTAG 
CTTCCCCATC 
TGGGGAGGGA 
GAGCGGCTTC 
TGTCTGTTGA 
CACAATGGAA 



TTGCTGGGAT 
CCGCTGGTGT 
TCTCACTGTA 
TCTCCTCCTT 
ACTGCTCGGA 
TTGGGACCAC 
ATGGCATCGA 
TCGTGCTGCT 
TCACAGCCAC 
CTGGAGCCAT 
CGTCTGTTTG 
TGCTGATCGC 
TCACCACCTC 
TGGGGTACAT 
GGCTGATCTT 
CCGTGGTCTT 
AGTCAGTGAT 
TCACGCTCTT 
GCATCTACGT 
TGCTCATCGA 
TCCAGCAGAT 
GCCCCTGCTT 
ACGGAGCCTA 
CCATGGCCAT 
GAGAGAAACT 
AGGTGCGCCA 
CCCAGGAAGT 
GAGAAAGGAG 
AGACTCCTCT 
TGTGTTGCTG 
CAGGGCAGAA 
CTGCTCCCGG 
TTGTAGACGC
GCCCATATTA 
AAACCACAAA 
TGAGCGTTCA 
CCTGGTATGT 
CTTGAAACCA 
ACGGCCTGAG 
CCTATCCCCG 
GGGCCCCCAC 
ACTACCCCAG 
GGGCCCCCAC 
CGTGTACTAC 
ATGCAGGGCC 
CAGCCTGCAG 
TGGAATTTTA 
ATGGCTTTTT 
CGAGCCTGCT 
AGGATGTTGG 
GGATGCCATA 
GCAGGGGCCG 
CTGGAGTGTA 
CACAGAGGAT 
GCCTTCTGGC 
CACAGAGGAC 
CCCATCGCCT 
CCAATCTCTA 
AAAAAAAAAG

GCCACTTTTC
CTGGAAGATC 
TGTCGGCTTC 
CACCACGGAG 
TGCCCATCCT 
ACCTGCTGCC 
CGACCTGGGG 
CTACTTCAGC 
CATGCCATAC 
AGACGGCATC 
GATTGACGCG 
CTTCTCCAGC 
CATCAACTCC 
GGCACAGAAG 
CATCATCTAC 
CTTCATCATG 
CACCGGGCTC 
CATCGTCCTG 
CTTCACGCTC 
AGCCATCGGA 
GACCGGGCAG 

CATCTTCCCC 
GGTGCCCATC 
GGCCTACGCC 
GTTCACGCTC 
CATCCTGCAA 
GGCAACTTCT 
CTTCTGACTG 
TAATAACGAC 
AAACGTCTAA 
CTCTGAGGCT 
ACCTGCTGAG 
AAAAGCCAAG 
TTCATGCAAA 
GTTGACACAT 
CTCACCAGGA 
GCTCAGGCTA 
AGGAGCGTGT 
GACGCATGCA 
AGGAGCGTGT 
GACGCATGCA 
AGGAGCGTGT 
CCCAGGACGC 
CCCACAGGAG 
ACCAACACTC 
TTTCTCTCAG 
AAAATCATAT 
TGCTGATATT 
CCAAAAGCTG 
GTTTGAATTC 
TGCAGGGCCA 
ATGCTCGGTG 
GGCTTCCCCA 
CGCTGCAGTC 
AGTTTCCCCA 
TCTGGGGAGG 
TTCAGCATCG 
GAATTC 



TACATGGAGC 
TGCCCCATAC 
TTCTACAACG 
CTCCCCTGGA 
GGTGACTCCA 
GAGTACTTTG 
CCTCCGCGGT 
CTCTGGAAGG 
GTGGTCCTCA 
AGAGCATACC 
GCCACCCAGG 
TACAACAAGT 
CTGACGAGCT 
CACAGTGTGC 
CCGGAAGCCA 
CTGCTCACCC 
ATCGATGAGT 
GCGACCTTCC 
CTGGACCATT 
GTGGCCTGGT 
CGGCCCAGCC 
GTGGTCGTGG 
GACTGGGCCA 
TATGCGGCCT 
ATTGCACCCG 
CGCCACTGGC 
TGGGAGAGAC 
ACTCTTCAAC 
TTTACACCTT 
GTAGATCTGT 
CTTCATGCTG 
GCCCCAGGGG 
AATCCCCGTG 
TGTCCTGCTT 
GTCCTTTCCC 
TGCACACACA 
AATTCTGTTT 
CTGCCACTCA 
CCTATCCCCG 
GGGCCCCCAC 
ACTACCCCAG 
GGGCCCCCAC 
CCTATCCCCG 
ATGCAGGGCC 
CGTGTACTAC 
TGCCTGGCCT 
GTGCGTGCCA 
TTACCTGTGA 
GCAGTTTTTG 
CTTTCCATGG 
ATGCCTCAAG 
GTCATGGCTG 
GGAGCCGTCA 
TTGCCTTCTG 
AGCACAGAGA 
TCGCCTTCTG 
GGCTCCGTGT 
TGTGGGTCCC 



480
540 
600 
660 
720 
780
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 



Seq ID NO: 509 Protein sequence 
Protein Accession #: NP_001035.1

1 11 21 31 41 51

I I I I I I

MSKSKCSVGL MSSVVAPAKE PNAVGPKEVE LILVKEQNGV QLTSSTLTNP RQSPVEAQDR 60
ETWGKKIDFL LSVIGFAVDL ANVWRFPYLC YKNGGGAFLV PYLLFMVIAG MPLFYMELAL 120
GQFNREGAAG VWKICPILKG VGFTVILISL YVGFFYNVII AWALHYLFSS FTTELPWIHC 180
NNSWNSPNCS DAHPGDSSGD SSGLNDTFGT TPAAEYFERG VLHLHQSHGI DDLGPPRWQL 240
TACLVLVIVL LYFSLWKGVK TSGKVVWITA TMPYVVLTAL LLRGVTLPGA IDGIRAYLSV 300
DFYRLCEASV WIDAATQVCF SLGVGFGVLI AFSSYNKFTN NCYRDAIVTT SINSLTSFSS 360
GFVVFSFLGY MAQKHSVPIG DVAKDGPGLI FIIYPEAIAT LPLSSAWAVV FFIMLLTLGI 420
DSAMGGMESV ITGLIDEFQL LHRHRELFTL FIVLATFLLS LFCVTNGGIY VFTLLDHFAA 480
GTSILFGVLI EAIGVAWFYG VGQFSDDIQQ MTGQRPSLYW RLCWKLVSPC FLLFVVVVSI 540
VTFRPPHYGA YIFPDWANAL GWVIATSSMA MVPIYAAYKF CSLPGSFREK LAYAIAPEKD 600
RELVDRGEVR QFTLRHWLKV



Seq ID NO: 510 DNA sequence 

Nucleic Acid Accession #: NM_001216.1

Coding sequence: 43..1422

1 11 21 31 41 51

I I I I I I

GCCCGTACAC ACCGTGTGCT GGGACACCCC ACAGTCAGCC GCATGGCTCC CCTGTGCCCC 60

AGCCCCTGGC TCCCTCTGTT GATCCCGGCC CCTGCTCCAG GCCTCACTGT GCAACTGCTG 120

CTGTCACTGC TGCTTCTGAT GCCTGTCCAT CCCCAGAGGT TGCCCCGGAT GCAGGAGGAT 180

TCCCCCTTGG GAGGAGGCTC TTCTGGGGAA GATGACCCAC TGGGCGAGGA GGATCTGCCC 240

AGTGAAGAGG ATTCACCCAG AGAGGAGGAT CCACCCGGAG AGGAGGATCT ACCTGGAGAG 300

GAGGATCTAC CTGGAGAGGA GGATCTACCT GAAGTTAAGC CTAAATCAGA AGAAGAGGGC 360 

TCCCTGAAGT TAGAGGATCT ACCTACTGTT GAGGCTCCTG GAGATCCTCA AGAACCCCAG 420 

AATAATGCCC ACAGGGACAA AGAAGGGGAT GACCAGAGTC ATTGGCGCTA TGGAGGCGAC 480 

CCGCCCTGGC CCCGGGTGTC CCCAGCCTGC GCGGGCCGCT TCCAGTCCCC GGTGGATATC 540 

CGCCCCCAGC TCGCCGCCTT CTGCCCGGCC CTGCGCCCCC TGGAACTCCT GGGCTTCCAG 600 

CTCCCGCCGC TCCCAGAACT GCGCCTGCGC AACAATGGCC ACAGTGTGCA ACTGACCCTG 660 

CCTCCTGGGC TAGAGATGGC TCTGGGTCCC GGGCGGGAGT ACCGGGCTCT GCAGCTGCAT 720 

CTGCACTGGG GGGCTGCAGG TCGTCCGGGC TCGGAGCACA CTGTGGAAGG CCACCGTTTC 780 

CCTGCCGAGA TCCACGTGGT TCACCTCAGC ACCGCCTTTG CCAGAGTTGA CGAGGCCTTG 840 

GGGCGCCCGG GAGGCCTGGC CGTGTTGGCC GCCTTTCTGG AGGAGGGCCC GGAAGAAAAC 900

AGTGCCTATG AGCAGTTGCT GTCTCGCTTG GAAGAAATCG CTGAGGAAGG CTCAGAGACT 960 

CAGGTCCCAG GACTGGACAT ATCTGCACTC CTGCCCTCTG ACTTCAGCCG CTACTTCCAA 1020 

TATGAGGGGT CTCTGACTAC ACCGCCCTGT GCCCAGGGTG TCATCTGGAC TGTGTTTAAC 1080 

CAGACAGTGA TGCTGAGTGC TAAGCAGCTC CACACCCTCT CTGACACCCT GTGGGGACCT 1140 

GGTGACTCTC GGCTACAGCT GAACTTCCGA GCGACGCAGC CTTTGAATGG GCGAGTGATT 1200 

GAGGCCTCCT TCCCTGCTGG AGTGGACAGC AGTCCTCGGG CTGCTGAGCC AGTCCAGCTG 1260

AATTCCTGCC TGGCTGCTGG TGACATCCTA GCCCTGGTTT TTGGCCTCCT TTTTGCTGTC 1320 

ACCAGCGTCG CGTTCCTTGT GCAGATGAGA AGGCAGCACA GAAGGGGAAC CAAAGGGGGT 1380 

GTGAGCTACC GCCCAGCAGA GGTAGCCGAG ACTGGAGCCT AGAGGCTGGA TCTTGGAGAA 1440 

TGTGAGAAGC CAGCCAGAGG CATCTGAGGG GGAGCCGGTA ACTGTCCTGT CCTGCTCATT 1500 
ATGCCACTTC CTTTTAACTG CCAAGAAATT TTTTAAAATA AATATTTATA AT 

Seq ID NO: 511 Protein sequence 
Protein Accession #: NP_001207.1

1 11 21 31 41 51

I I I I I I

MAPLCPSPWL PLLIPAPAPG LTVQLLLSLL LLMPVHPQRL PRMQEDSPLG GGSSGEDDPL 60 

GEEDLPSEED SPREEDPPGE EDLPGEEDLP GEEDLPEVKP KSEEEGSLKL EDLPTVEAPG 120

DPQEPQNNAH RDKEGDDQSH WRYGGDPPWP RVSPACAGRF QSPVDIRPQL AAFCPALRPL 180 

ELLGFQLPPL PELRLRNNGH SVQLTLPPGL EMALGPGREY RALQLHLHWG AAGRPGSEHT 240

VEGHRFPAEI HVVHLSTAFA RVDEALGRPG GLAVLAAFLE EGPEENSAYE QLLSRLEEIA 300

EEGSETQVPG LDISALLPSD FSRYFQYEGS LTTPPCAQGV IWTVFNQTVM LSAKQLHTLS 360 

DTLWGPGDSR LQLNFRATQP LNGRVIEASF PAGVDSSPRA AEPVQLNSCL AAGDILALVF 420 
GLLFAVTSVA FLVQMRRQHR RGTKGGVSYR PAEVAETGA 

Seq ID NO: 512 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 1..3978

1 11 21 31 41 51

I I I I I I

ATGGTGGGTG AAGGACCCTA CCTTATCTCA GATCTGGACC AGCGAGGCCG GCGGAGATCC 60 

TTTGCAGAAA GATATGACCC CAGCCTGAAG ACCATGATCC CAGTGCGACC CTGTGCAAGG 120 

TTAGCACCCA ACCCGGTGGA TGATGCCGGG CTACTCTCCT TCGCCACATT TTCCTGGCTC 180 

ACGCCGGTGA TGGTGAAAGG CTACCGGCAA AGGCTGACCG TAGACACCCT GCCCCCATTG 240 

TCGACATATG ACTCATCTGA CACCAATGCC AAAAGATTTC GAGTCCTTTG GGATGAAGAG 300 

GTAGCAAGGG TGGGTCCTGA GAAGGCCTCT CTGAGCCACG TGGTGTGGAA ATTCCAGAGG 360

ACACGCGTGT TGATGGACAT CGTGGCCAAC ATCCTGTGCA TCATCATGGC AGCCATAGGG 420 

CCGACAGTTC TCATTCACCA AATCCTCCAG CAGACTGAGA GGACCTCTGG GAAAGTCTGG 480 

GTTGGCATTG GACTGTGCAT AGCCCTTTTT GCCACCGAGT TTACCAAAGT CTTCTTTTGG 540 

GCCCTTGCCT GGGCCATCAA CTACCGCACG GCCATCCGGT TGAAGGTGGC GCTCTCCACC 600 

TTGGTTTTTG AAAACCTAGT GTCCTTCAAG ACATTGACCC ACATCTCTGT TGGCGAGGTG 660 

CTCAATATAC TGTCAAGTGA TAGCTATTCT TTGTTTGAAG CTGCCTTGTT TTGTCCTTTG 720 

CCAGCCACCA TCCCGATCCT AATGGTCTTT TGTGCGGCGT ACGCCTTTTT CATTCTGGGG 780 

CCCACAGCTC TCATCGGGAT ATCAGTGTAT GTCATATTCA TACCCGTCCA GATGTTTATG 840 

GCCAAGCTCA ATTCAGCTTT CCGAAGGTCA GCAATTTTGG TGACAGACAA GCGAGTTCAG 900 

ACAATGAATG AGTTTCTGAC CTGCATCAGG CTGATCAAAA TGTATGCCTG GGAGAAATCT 960 

TTTACCAACA CTATCCAAGA TATAAGAAGG AGGGAAAGAA AATTACTGGA AAAAGCTGGA 1020 

TTTGTCCAAA GTGGAAACTC TGCCCTGGCC CCCATCGTGT CCACCATAGC CATCGTGCTG 1080 

ACATTATCCT GCCACATCCT CCTGAGACGC AAACTCACCG CACCCGTGGC ATTTAGTGTG 1140 

ATTGCCATGT TTAATGTAAT GAAGTTTTCC ATTGCAATCT TGCCCTTCTC CATCAAAGCA 1200 

ATGGCTGAAG CGAATGTCTC TCTAAGGAGA ATGAAGAAAA TTCTCATAGA TAAAAGCCCC 1260 

CCATCTTACA TCACCCAACC AGAAGACCCA GATACTGTCT TGCTTTTAGC AAATGCCACC 1320 

TTGACATGGG AGCATGAAGC CAGCAGGAAA AGTACCCCAA AGAAATTGCA GAACCAGAAA 1380 

AGGCATTTAT GCAAGAAACA GAGGTCAGAG GCATACAGTG AGAGGAGTCC ACCAGCCAAG 1440 

GGAGCCACTG GCCCAGAGGA GCAAAGTGAC AGCCTCAAAT CGGTTCTGCA CAGCATAAGC 1500 

TTTGTGGTGA GAAAGTTATG TCGTTATCCC GAAGCCCAGC TCCTGGCTTG GAGGTGGCCA 1560 

GCAGTGTTTG TTGGGAGAAT CATCAGAGGA TACAGGCCTC ATGGATTTTC TGCTAAAGAC 1620 

AAGGATGAAT CTAGAAGGCT TCTTACTTGG CCCCAAGAAG TGGATAGGAC TCAAAGGGCA 1680 

GCCAAATACC TGGGGAAGAT CTTGGGAATA TGTGGGAATG TGGGAAGTGG AAAGAGCTCC 1740 

CTCCTTGCAG CTCTCCTAGG ACAGATGCAG CTGCAGAAAG GGGTGGTGGC AGTCAATGGA 1800 

ACTTTGGCCT ACGTTTCACA GCAGGCATGG ATCTTTCATG GAAATGTGAG AGAAAACATA 1860 

CTCTTTGGAG AAAAGTATGA TCACCAAAGG TATCAGCACA CAGTCCGCGT CTGTGGCCTC 1920 

CAGAAGGACC TGAGCAACCT CCCCTATGGA GACCTGACTG AGATTGGGGA GCGGGGCCTC 1980 

AACCTCTCTG GGGGGCAGAG GCAGAGGATT AGCCTGGCCC GCGCTGTCTA CTCCGACCGT 2040 

CAGCTCTACC TGCTGGACGA CCCCCTGTCG GCCGTGGACG CCCACGTGGG GAAGCACGTC 2100 

TTTGAGGAGT GCATTAAGAA GACGCTCAGG GGAAAGACAG TCGTCCTGGT GACCCACCAG 2160 

CTACAGTTCT TAGAGTCTTG TGATGAAGTT ATTTTATTAG AAGATGGAGA GATTTGTGAA 2220 

AAGGGAACCC ACAAGGAGTT AATGGAGGAG AGAGGGCGCT ATGCAAAACT GATTCACAAC 2280 

CTGCGAGGAT TGCAGTTCAA GGATCCTGAA CACCTTTACA ATGCAGCAAT GGTGGAAGCC 2340 

TTCAAGGAGA GCCCTGCTGA GAGAGAGGAA GATGCTGGTA TAATCGGGTA CCTCCTTTCT 2400 

CTCTTCACTG TGTTCCTCTT CCTCCTGATG ATTGGCAGCG CTGCCTTCAG CAACTGGTGG 2460 

CTGGGTCTCT GGTTGGACAA GGGCTCACGG ATGACCTGTG GGCCCCAGGG CAACAGGACC 2520 

ATGTGTGAGG TCGGCGCGGT GCTGGCAGAC ATCGGTCAGC ATGTGTACCA GTGGGTGTAC 2580 

ACTGCAAGCA TGGTGTTCAT GCTGGTGTTT GGCGTCACCA AAGGCTTCGT CTTCACCAAG 2640

ACCACACTGA TGGCATCCTC CTCTCTGCAT GACACGGTGT TTGATAAGAT CTTAAAGAGC 2700

CCAATGAGTT TCTTTGACAC GACTCCCACT GGCAGGCTAA TGAACCGTTT TTCCAAGGAT 2760

ATGGACGAGC TGGATGTGAG GCTGCCGTTT CACGCAGAGA ACTTTCTGCA GCAGTTTTTT 2820

ATGGTGGTGT TTATTCTCGT GATCTTGGCT GCTGTGTTTC CTGCTGTCCT TTTAGTCGTG 2880

GCCAGCCTTG CTGTAGGCTT CTTCATTCTG TTACGCATTT TCCACAGAGG AGTCCAGGAG 2940

CTCAAGAAGG TGGAGAATGT CAGCCGGTCA CCCTGGTTCA CCCACATCAC CTCCTCCATG 3000

CAGGGCCTGG GCATCATTCA CGCCTATGGC AAGAAGGAGA GCTGCATCAC CTATACTTCA 3060

TCCAAAGGCC TGTCATTGTC ATACATCATC CAGCTGAGCG GACTGCTCCA AGTGTGTGTG 3120

CGAACGGGAA CAGAGACGCA AGCCAAATTC ACCTCCGTGG AGCTGCTCAG GGAATACATT 3180

TCGACCTGTG TTCCTGAATG CACTCATCCC CTCAAAGTGG GGACCTGTCC CAAGGACTGG 3240

CCCAGCTGTG GGGAGATCAC CTTCAGAGAC TATCAGATGA GATACAGAGA CAACACCCCC 3300

CTTGTTCTCG ACAGCCTGAA CTTGAACATA CAAAGTGGGC AGACAGTCGG GATTGTTGGA 3360

AGAACAGGTT CCGGAAAGTC ATCGTTAGGA ATGGCTTTGT TTCGTCTGGT GGAGCCAGCC 3420

AGTGGCACAA TCTTTATTGA TGAGGTGGAT ATCTGCATTC TCAGCTTGGA AGACCTCAGA 3480

ACCAAGCTGA CTGTGATCCC ACAGGATCCT GTCCTGTTTG TAGGTACAGT AAGGTACAAC 3540

TTGGATCCCT TTGAGAGTCA CACCGATGAG ATGCTCTGGC AGGTTCTGGA GAGAACATTC 3600

ATGAGAGACA CAATAATGAA ACTCCCAGAA AAATTACAGG CAGAAGTCAC AGAAAATGGA 3660

GAAAACTTCT CAGTAGGGGA ACGTCAGCTG CTTTGTGTGG CCCGAGCTCT TCTCCGTAAT 3720

TCAAAGATCA TTCTCCTTGA TGAAGCCACC GCCTCTATGG ACTCCAAGAC TGACACCCTG 3780

GTTCAGAACA CCATCAAAGA TGCCTTCAAG GGCTGCACTG TGCTGACCAT CGCCCACCGC 3840

CTCAACACAG TTCTCAACTG CGATCACGTC CTGGTTATGG AAAATGGGAA GGTGATTGAG 3900

TTTGACAAGC CTGAAGTCCT TGCAGAGAAG CCAGATTCTG CATTTGCGAT GTTACTAGCA 3960
GCAGAAGTCA GATTGTAG

Seq ID NO: 513 Protein sequence
Protein Accession #: Eos sequence

1 11 21 31 41 51

I I I I I I

MVGEGPYLIS DLDQRGRRRS FAERYDPSLK TMIPVRPCAR LAPNPVDDAG LLSFATFSWL 60

TPVMVKGYRQ RLTVDTLPPL STYDSSDTNA KRFRVLWDEE VARVGPEKAS LSHVVWKFQR 120

TRVLMDIVAN ILCIIMAAIG PTVLIHQILQ QTERTSGKVW VGIGLCIALF ATEFTKVFFW 180

ALAWAINYRT AIRLKVALST LVFENLVSFK TLTHISVGEV LNILSSDSYS LFEAALFCPL 240

PATIPILMVF CAAYAFFILG PTALIGISVY VIFIPVQMFM AKLNSAFRRS AILVTDKRVQ 300

TMNEFLTCIR LIKMYAWEKS FTNTIQDIRR RERKLLEKAG FVQSGNSALA PIVSTIAIVL 360

TLSCHILLRR KLTAPVAFSV IAMFNVMKFS IAILPFSIKA MAEANVSLRR MKKILIDKSP 420

PSYITQPEDP DTVLLLANAT LTWEHEASRK STPKKLQNQK RHLCKKQRSE AYSERSPPAK 480

GATGPEEQSD SLKSVLHSIS FVVRKLCRYP EAQLLAWRWP AVFVGRIIRG YRPHGFSAKD 540

KDESRRLLTW PQEVDRTQRA AKYLGKILGI CGNVGSGKSS LLAALLGQMQ LQKGVVAVNG 600

TLAYVSQQAW IFHGNVRENI LFGEKYDHQR YQHTVRVCGL QKDLSNLPYG DLTEIGERGL 660

NLSGGQRQRI SLARAVYSDR QLYLLDDPLS AVDAHVGKHV FEECIKKTLR GKTVVLVTHQ 720

LQFLESCDEV ILLEDGEICE KGTHKELMEE RGRYAKLIHN LRGLQFKDPE HLYNAAMVEA 780

FKESPAEREE DAGIIGYLLS LFTVFLFLLM IGSAAFSNWW LGLWLDKGSR MTCGPQGNRT 840

MCEVGAVLAD IGQHVYQWVY TASMVFMLVF GVTKGFVFTK TTLMASSSLH DTVFDKILKS 900

PMSFFDTTPT GRLMNRFSKD MDELDVRLPF HAENFLQQFF MVVFILVILA AVFPAVLLVV 960

ASLAVGFFIL LRIFHRGVQE LKKVENVSRS PWFTHITSSM QGLGIIHAYG KKESCITYTS 1020

SKGLSLSYII QLSGLLQVCV RTGTETQAKF TSVELLREYI STCVPECTHP LKVGTCPKDW 1080

PSCGEITFRD YQMRYRDNTP LVLDSLNLNI QSGQTVGIVG RTGSGKSSLG MALFRLVEPA 1140

SGTIFIDEVD ICILSLEDLR TKLTVIPQDP VLFVGTVRYN LDPFESHTDE MLWQVLERTF 1200

MRDTIMKLPE KLQAEVTENG ENFSVGERQL LCVARALLRN SKIILLDEAT ASMDSKTDTL 1260

VQNTIKDAFK GCTVLTIAHR LNTVLNCDHV LVMENGKVIE FDKPEVLAEK PDSAFAMLLA 1320
AEVRL



Seq ID NO: 514 DNA sequence 
Nucleic Acid Accession #: Z31560 
Coding sequence: 1..966

1 11 21 31 41 51

I I I I I I

CACAGCGCCC GCATGTACAA CATGATGGAG ACGGAGCTGA AGCCGCCGGG CCCGCAGCAA 60

ACTTCGGGGG GCGGCGGCGG CAACTCCACC GCGGCGGCGG CCGGCGGCAA CCAGAAAAAC 120

AGCCCGGACC GCGTCAAGCG GCCCATGAAT GCCTTCATGG TGTGGTCCCG CGGGCAGCGG 180

CGCAAGATGG CCCAGGAGAA CCCCAAGATG CACAACTCGG AGATCAGCAA GCGCCTGGGC 240

GCCGAGTGGA AACTTTTGTC GGAGACGGAG AAGCGGCCGT TCATCGACGA GGCTAAGCGG 300

CTGCGAGCGC TGCACATGAA GGAGCACCCG GATTATAAAT ACCGGCCCCG GCGGAAAACC 360

AAGACGCTCA TGAAGAAGGA TAAGTACACG CTGCCCGGCG GGCTGCTGGC CCCCGGCGGC 420

AATAGCATGG CGAGCGGGGT CGGGGTGGGC GCCGGCCTGG GCGCGGGCGT GAACCAGCGC 480

ATGGACAGTT ACGCGCACAT GAACGGCTGG AGCAACGGCA GCTACAGCAT GATGCAGGAC 540

CAGCTGGGCT ACCCGCAGCA CCCGGGCCTC AATGCGCACG GCGCAGCGCA GATGCAGCCC 600

ATGCACCGCT ACGACGTGAG CGCCCTGCAG TACAACTCCA TGACCAGCTC GCAGACCTAC 660

ATGAACGGCT CGCCCACCTA CAGCATGTCC TACTCGCAGC AGGGCACCCC TGGCATGGCT 720

CTTGGCTCCA TGGGTTCGGT GGTCAAGTCC GAGGCCAGCT CCAGCCCCCC TGTGGTTACC 780

TCTTCCTCCC ACTCCAGGGC GCCCTGCCAG GCCGGGGACC TCCGGGACAT GATCAGCATG 840

TATCTCCCCG GCGCCGAGGT GCCGGAACCC GCCGCCCCCA GCAGACTTCA CATGTCCCAG 900

CACTACCAGA GCGGCCCGGT GCCCGGCACG GCCATTAACG GCACACTGCC CCTCTCACAC 960

ATGTGAGGGC CGGACAGCGA ACTGGAGGGG GGAGAAATTT TCAAAGAAAA ACGAGGGAAA 1020

TGGGAGGGGT GCAAAAGAGG AGAGTAAGAA ACAGCATGGA GAAAACCCGG TACGCTCAAA 1080
AAAAA



Seq ID NO: 515 Protein sequence 
Protein Accession #: CAA83435

1 11 21 31 41 51

I I I I I I

HSARMYNMME TELKPPGPQQ TSGGGGGNST AAAAGGNQKN SPDRVKRPMN AFMVWSRGQR 60

RKMAQENPKM HNSEISKRLG AEWKLLSETE KRPFIDEAKR LRALHMKEHP DYKYRPRRKT 120

KTLMKKDKYT LPGGLLAPGG NSMASGVGVG AGLGAGVNQR MDSYAHMNGW SNGSYSMMQD 180

QLGYPQHPGL NAHGAAQMQP MHRYDVSALQ YNSMTSSQTY MNGSPTYSMS YSQQGTPGMA 240

LGSMGSVVKS EASSSPPVVT SSSHSRAPCQ AGDLRDMISM YLPGAEVPEP AAPSRLHMSQ 300
HYQSGPVPGT AINGTLPLSH M

Seq ID NO: 516 DNA sequence 
Nucleic Acid Accession #: U91618
Coding sequence: 29..541

1 11 21 31 41 51

I I I I I I

CGGACTTGGC TTGTTAGAAG GCTGAAAGAT GATGGCAGGA ATGAAAATCC AGCTTGTATG 60 

CATGCTACTC CTGGCTTTCA GCTCCTGGAG TCTGTGCTCA GATTCAGAAG AGGAAATGAA 120 

AGCATTAGAA GCAGATTTCT TGACCAATAT GCATACATCA AAGATTAGTA AAGCACATGT 180 

TCCCTCTTGG AAGATGACTC TGCTAAATGT TTGCAGTCTT GTAAATAATT TGAACAGCCC 240 

AGCTGAGGAA ACAGGAGAAG TTCATGAAGA GGAGCTTGTT GCAAGAAGGA AACTTCCTAC 300 

TGCTTTAGAT GGCTTTAGCT TGGAAGCAAT GTTGACAATA TACCAGCTCC ACAAAATCTG 360 

TCACAGCAGG GCTTTTCAAC ACTGGGAGTT AATCCAGGAA GATATTCTTG ATACTGGAAA 420 

TGACAAAAAT GGAAAGGAAG AAGTCATAAA GAGAAAAATT CCTTATATTC TGAAACGGCA 480 

GCTGTATGAG AATAAACCCA GAAGACCCTA CATACTCAAA AGAGATTCTT ACTATTACTG 540 

AGAGAATAAA TCATTTATTT ACATGTGATT GTGATTCATC ATCCCTTAAT TAAATATCAA 600

ATTATATTTG TGTGAAAATG TGACAAACAC ACTTATCTGT CTCTTCTACA ATTGTGGTTT 660 

ATTGAATGTG TTTTTCTGCA CTAATAGAAA TTAGACTAAG TGTTTTCAAA TAAATCTAAA 720 
TCTTCAAAAA AAAAAAAAAA AAATGGGGCC GCAATT

Seq ID NO: 517 Protein sequence 
Protein Accession #: AAB50564

1 11 21 31 41 51

I I I I I I

MMAGMKIQLV CMLLLAFSSW SLCSDSEEEM KALEADFLTN MHTSKISKAH VPSWKMTLLN 60 

VCSLVNNLNS PAEETGEVHE EELVARRKLP TALDGFSLEA MLTIYQLHKI CHSRAFQHWE 120 
LIQEDILDTG NDKNGKEEVI KRKIPYILKR QLYENKPRRP YILKRDSYYY 

Seq ID NO: 518 DNA sequence 

Nucleic Acid Accession #: NM_006536.2 

Coding sequence: 109..2940

1 11 21 31 41 51 

I I I I I I 

ACCTAAAACC TTGCAAGTTC AGGAAGAAAC CATCTGCATC CATATTGAAA ACCTGACACA 60 

ATGTATGCAG CAGGCTCAGT GTGAGTGAAC TGGAGGCTTC TCTACAACAT GACCCAAAGG 120 

AGCATTGCAG GTCCTATTTG CAACCTGAAG TTTGTGACTC TCCTGGTTGC CTTAAGTTCA 180 

GAACTCCCAT TCCTGGGAGC TGGAGTACAG CTTCAAGACA ATGGGTATAA TGGATTGCTC 240 

ATTGCAATTA ATCCTCAGGT ACCTGAGAAT CAGAACCTCA TCTCAAACAT TAAGGAAATG 300 

ATAACTGAAG CTTCATTTTA CCTATTTAAT GCTACCAAGA GAAGAGTATT TTTCAGAAAT 360 

ATAAAGATTT TAATACCTGC CACATGGAAA GCTAATAATA ACAGCAAAAT AAAACAAGAA 420 

TCATATGAAA AGGCAAATGT CATAGTGACT GACTGGTATG GGGCACATGG AGATGATCCA 480 

TACACCCTAC AATACAGAGG GTGTGGAAAA GAGGGAAAAT ACATTCATTT CACACCTAAT 540 

TTCCTACTGA ATGATAACTT AACAGCTGGC TACGGATCAC GAGGCCGAGT GTTTGTCCAT 600 

GAATGGGCCC ACCTCCGTTG GGGTGTGTTC GATGAGTATA ACAATGACAA ACCTTTCTAC 660 

ATAAATGGGC AAAATCAAAT TAAAGTGACA AGGTGTTCAT CTGACATCAC AGGCATTTTT 720 

GTGTGTGAAA AAGGTCCTTG CCCCCAAGAA AACTGTATTA TTAGTAAGCT TTTTAAAGAA 780 

GGATGCACCT TTATCTACAA TAGCACCCAA AATGCAACTG CATCAATAAT GTTCATGCAA 840 

AGTTTATCTT CTGTGGTTGA ATTTTGTAAT GCAAGTACCC ACAACCAAGA AGCACCAAAC 900 

CTACAGAACC AGATGTGCAG CCTCAGAAGT GCATGGGATG TAATCACAGA CTCTGCTGAC 960 

TTTCACCACA GCTTTCCCAT GAATGGGACT GAGCTTCCAC CTCCTCCCAC ATTCTCGCTT 1020 

GTACAGGCTG GTGACAAAGT GGTCTGTTTA GTGCTGGATG TGTCCAGCAA GATGGCAGAG 1080 

GCTGACAGAC TCCTTCAACT ACAACAAGCC GCAGAATTTT ATTTGATGCA GATTGTTGAA 1140 

ATTCATACCT TCGTGGGCAT TGCCAGTTTC GACAGCAAAG GAGAGATCAG AGCCCAGCTA 1200 

CACCAAATTA ACAGCAATGA TGATCGAAAG TTGCTGGTTT CATATCTGCC CACCACTGTA 1260 

TCAGCTAAAA CAGACATCAG CATTTGTTCA GGGCTTAAGA AAGGATTTGA GGTGGTTGAA 1320 

AAACTGAATG GAAAAGCTTA TGGCTCTGTG ATGATATTAG TGACCAGCGG AGATGATAAG 1380 

CTTCTTGGCA ATTGCTTACC CACTGTGCTC AGCAGTGGTT CAACAATTCA CTCCATTGCC 1440 

CTGGGTTCAT CTGCAGCCCC AAATCTGGAG GAATTATCAC GTCTTACAGG AGGTTTAAAG 1500 

TTCTTTGTTC CAGATATATC AAACTCCAAT AGCATGATTG ATGCTTTCAG TAGAATTTCC 1560 

TCTGGAACTG GAGACATTTT CCAGCAACAT ATTCAGCTTG AAAGTACAGG TGAAAATGTC 1620 

AAACCTCACC ATCAATTGAA AAACACAGTG ACTGTGGATA ATACTGTGGG CAACGACACT 1680 

ATGTTTCTAG TTACGTGGCA GGCCAGTGGT CCTCCTGAGA TTATATTATT TGATCCTGAT 1740 

GGACGAAAAT ACTACACAAA TAATTTTATC ACCAATCTAA CTTTTCGGAC AGCTAGTCTT 1800 

TGGATTCCAG GAACAGCTAA GCCTGGGCAC TGGACTTACA CCCTGAACAA TACCCATCAT 1860 

TCTCTGCAAG CCCTGAAAGT GACAGTGACC TCTCGCGCCT CCAACTCAGC TGTGCCCCCA 1920 

GCCACTGTGG AAGCCTTTGT GGAAAGAGAC AGCCTCCATT TTCCTCATCC TGTGATGATT 1980 

TATGCCAATG TGAAACAGGG ATTTTATCCC ATTCTTAATG CCACTGTCAC TGCCACAGTT 2040 

GAGCCAGAGA CTGGAGATCC TGTTACGCTG AGACTCCTTG ATGATGGAGC AGGTGCTGAT 2100 

GTTATAAAAA ATGATGGAAT TTACTCGAGG TATTTTTTCT CCTTTGCTGC AAATGGTAGA 2160 

TATAGCTTGA AAGTGCATGT CAATCACTCT CCCAGCATAA GCACCCCAGC CCACTCTATT 2220 

CCAGGGAGTC ATGCTATGTA TGTACCAGGT TACACAGCAA ACGGTAATAT TCAGATGAAT 2280 

GCTCCAAGGA AATCAGTAGG CAGAAATGAG GAGGAGCGAA AGTGGGGCTT TAGCCGAGTC 2340 

AGCTCAGGAG GCTCCTTTTC AGTGCTGGGA GTTCCAGCTG GCCCCCACCC TGATGTGTTT 2400 

CCACCATGCA AAATTATTGA CCTGGAAGCT GTAAAAGTAG AAGAGGAATT GACCCTATCT 2460 

TGGACAGCAC CTGGAGAAGA CTTTGATCAG GGCCAGGCTA CAAGCTATGA AATAAGAATG 2520 

AGTAAAAGTC TACAGAATAT CCAAGATGAC TTTAACAATG CTATTTTAGT AAATACATCA 2580 

AAGCGAAATC CTCAGCAAGC TGGCATCAGG GAGATATTTA CGTTCTCACC CCAGATTTCC 2640 

ACGAATGGAC CTGAACATCA GCCAAATGGA GAAACACATG AAAGCCACAG AATTTATGTT 2700

GCAATACGAG CAATGGATAG GAACTCCTTA CAGTCTGCTG TATCTAACAT TGCCCAGGCG 2760 

CCTCTGTTTA TTCCCCCCAA TTCTGATCCT GTACCTGCCA GAGATTATCT TATATTGAAA 2820 

GGAGTTTTAA CAGCAATGGG TTTGATAGGA ATCATTTGCC TTATTATAGT TGTGACACAT 2880

CATACTTTAA GCAGGAAAAA GAGAGCAGAC AAGAAAGAGA ATGGAACAAA ATTATTATAA 2940

ATAAATATCC AAAGTGTCTT CCTTCTTAGA TATAAGACCC ATGGCCTTCG ACTACAAAAA 3000 

CATACTAACA AAGTCAAATT AACATCAAAA CTGTATTAAA ATGCATTGAG TTTTTGTACA 3060 

ATACAGATAA GATTTTTACA TGGTAGATCA ACAATTCTTT TTGGGGGTAG ATTAGAAAAC 3120 

CCTTACACTT TGGCTATGAA CAAATAATAA AAATTATTCT TTAAAGTAAT GTCTTTAAAG 3180 

GCAAAGGGAA GGGTAAAGTC GGACCAGTGT CAAGGAAAGT TTGTTTTATT GAGGTGGAAA 3240 

AATAGCCCCA AGCAGAGAAA AGGAGGGTAG GTCTGCATTA TAACTGTCTG TGTGAAGCAA 3300 

TCATTTAGTT ACTTTGATTA ATTTTTCTTT TCTCCTTATC TGTGCAGTAC AGGTTGCTTG 3360 

TTTACATGAA GATCATGCTA TATTTTATAT ATGTAGCCCC TAATGCAAAG CTCTTTACCT 3420 

CTTGCTATTT TGTTATATAT ATTTCAGATG ACATCTCCCT GCTAATGCTC AGAGATCTTT 3480 

TTTCACTGTA AGAGGTAACC TTTAACAATA TGGGTATTAC CTTTGTCTCT TCATACCGGT 3540

TTTATGACAA AGGTCTATTG AATTTATTTG TNTGTAAGTT TCTACTCCCA TCAAAGCAGC 3600 

TTTCTAAGTT TATTGCCTTG GGTTATTATG GAATGATAGT TATAGCCCCN TATAATGCCT 3660 
TACCTAGGAA A 

Seq ID NO: 519 Protein sequence 
Protein Accession #: NP_006527.1

1 11 21 31 41 51 

I I I I I I 

MTQRSIAGPI CNLKFVTLLV ALSSELPFLG AGVQLQDNGY NGLLIAINPQ VPENQNLISN 60 

IKEMITEASF YLFNATKRRV FFRNIKILIP ATWKANNNSK IKQESYEKAN VIVTDWYGAH 120 

GDDPYTLQYR GCGKEGKYIH FTPNFLLNDN LTAGYGSRGR VFVHEWAHLR WGVFDEYNND 180 

KPFYINGQNQ IKVTRCSSDI TGIFVCEKGP CPQENCIISK LFKEGCTFIY NSTQNATASI 240 

MFMQSLSSVV EFCNASTHNQ EAPNLQNQMC SLRSAWDVIT DSADFHHSFP MNGTELPPPP 300

TFSLVQAGDK VVCLVLDVSS KMAEADRLLQ LQQAAEFYLM QIVEIHTFVG IASFDSKGEI 360

RAQLHQINSN DDRKLLVSYL PTTVSAKTDI SICSGLKKGF EVVEKLNGKA YGSVMILVTS 420

GDDKLLGNCL PTVLSSGSTI HSIALGSSAA PNLEELSRLT GGLKFFVPDI SNSNSMIDAF 480

SRISSGTGDI FQQHIQLEST GENVKPHHQL KNTVTVDNTV GNDTMFLVTW QASGPPEIIL 540 

FDPDGRKYYT NNFITNLTFR TASLWIPGTA KPGHWTYTLN NTHHSLQALK VTVTSRASNS 600 

AVPPATVEAF VERDSLHFPH PVMIYANVKQ GFYPILNATV TATVEPETGD PVTLRLLDDG 660 

AGADVIKNDG IYSRYFFSFA ANGRYSLKVH VNHSPSISTP AHSIPGSHAM YVPGYTANGN 720

IQMNAPRKSV GRNEEERKWG FSRVSSGGSF SVLGVPAGPH PDVFPPCKII DLEAVKVEEE 780 

LTLSWTAPGE DFDQGQATSY EIRMSKSLQN IQDDFNNAIL VNTSKRNPQQ AGIREIFTFS 840 

PQISTNGPEH QPNGETHESH RIYVAIRAMD RNSLQSAVSN IAQAPLFIPP NSDPVPARDY 900 
LILKGVLTAM GLIGIICLII VVTHHTLSRK KRADKKENGT KLL

Seq ID NO: 520 DNA sequence 

Nucleic Acid Accession #: NM_000228.1 

Coding sequence: 82..3600

1 11 21 31 41 51 

I I I I I I 

GCTTTCAGGC GATCTGGAGA AAGAACGGCA GAACACACAG CAAGGAAAGG TCCTTTCTGG 60 

GGATCACCCC ATTGGCTGAA GATGAGACCA TTCTTCCTCT TGTGTTTTGC CCTGCCTGGC 120 

CTCCTGCATG CCCAACAAGC CTGCTCCCGT GGGGCCTGCT ATCCACCTGT TGGGGACCTG 180 

CTTGTTGGGA GGACCCGGTT TCTCCGAGCT TCATCTACCT GTGGACTGAC CAAGCCTGAG 240

ACCTACTGCA CCCAGTATGG CGAGTGGCAG ATGAAATGCT GCAAGTGTGA CTCCAGGCAG 300 

CCTCACAACT ACTACAGTCA CCGAGTAGAG AATGTGGCTT CATCCTCCGG CCCCATGCGC 360 

TGGTGGCAGT CCCAGAATGA TGTGAACCCT GTCTCTCTGC AGCTGGACCT GGACAGGAGA 420 

TTCCAGCTTC AAGAAGTCAT GATGGAGTTC CAGGGGCCCA TGCCCGCCGG CATGCTGATT 480 

GAGCGCTCCT CAGACTTCGG TAAGACCTGG CGAGTGTACC AGTACCTGGC TGCCGACTGC 540 

ACCTCCACCT TCCCTCGGGT CCGCCAGGGT CGGCCTCAGA GCTGGCAGGA TGTTCGGTGC 600 

CAGTCCCTGC CTCAGAGGCC TAATGCACGC CTAAATGGGG GGAAGGTCCA ACTTAACCTT 660 

ATGGATTTAG TGTCTGGGAT TCCAGCAACT CAAAGTCAAA AAATTCAAGA GGTGGGGGAG 720 

ATCACAAACT TGAGAGTCAA TTTCACCAGG CTGGCCCCTG TGCCCCAAAG GGGCTACCAC 780 

CCTCCCAGCG CCTACTATGC TGTGTCCCAG CTCCGTCTGC AGGGGAGCTG CTTCTGTCAC 840 

GGCCATGCTG ATCGCTGCGC ACCCAAGCCT GGGGCCTCTG CAGGCCCCTC CACCGCTGTG 900 

CAGGTCCACG ATGTCTGTGT CTGCCAGCAC AACACTGCCG GCCCAAATTG TGAGCGCTGT 960 

GCACCCTTCT ACAACAACCG GCCCTGGAGA CCGGCGGAGG GCCAGGACGC CCATGAATGC 1020 

CAAAGGTGCG ACTGCAATGG GCACTCAGAG ACATGTCACT TTGACCCCGC TGTGTTTGCC 1080 

GCCAGCCAGG GGGCATATGG AGGTGTGTGT GACAATTGCC GGGACCACAC CGAAGGCAAG 1140 

AACTGTGAGC GGTGTCAGCT GCACTATTTC CGGAACCGGC GCCCGGGAGC TTCCATTCAG 1200 

GAGACCTGCA TCTCCTGCGA GTGTGATCCG GATGGGGCAG TGCCAGGGGC TCCCTGTGAC 1260 

CCAGTGACCG GGCAGTGTGT GTGCAAGGAG CATGTGCAGG GAGAGCGCTG TGACCTATGC 1320 

AAGCCGGGCT TCACTGGACT CACCTACGCC AACCCGCAGG GCTGCCACCG CTGTGACTGC 1380 

AACATCCTGG GGTCCCGGAG GGACATGCCG TGTGACGAGG AGAGTGGGCG CTGCCTTTGT 1440 

CTGCCCAACG TGGTGGGTCC CAAATGTGAC CAGTGTGCTC CCTACCACTG GAAGCTGGCC 1500 

AGTGGCCAGG GCTGTGAACC GTGTGCCTGC GACCCGCACA ACTCCCCTCA GCCCACAGTG 1560 

CAACCAGTTC ACAGGGCAGT GCCCTGTCGG GAAGGCTTTG GTGGCCTGAT GTGCAGCGCT 1620 

GCAGCCATCC GCCAGTGTCC AGACCGGACC TATGGAGACG TGGCCACAGG ATGCCGAGCC 1680 

TGTGACTGTG ATTTCCGGGG AACAGAGGGC CCGGGCTGCG ACAAGGCATC AGGCCGCTGC 1740 

CTCTGCCGCC CTGGCTTGAC CGGGCCCCGC TGTGACCAGT GCCAGCGAGG CTACTGCAAT 1800 

CGCTACCCGG TGTGCGTGGC CTGCCACCCT TGCTTCCAGA CCTATGATGC GGACCTCCGG 1860 

GAGCAGGCCC TGCGCTTTGG TAGACTCCGC AATGCCACCG CCAGCCTGTG GTCAGGGCCT 1920 

GGGCTGGAGG ACCGTGGCCT GGCCTCCCGG ATCCTAGATG CAAAGAGTAA GATTGAGCAG 1980 

ATCCGAGCAG TTCTCAGCAG CCCCGCAGTC ACAGAGCAGG AGGTGGCTCA GGTGGCCAGT 2040 

GCCATCCTCT CCCTCAGGCG AACTCTCCAG GGCCTGCAGC TGGATCTGCC CCTGGAGGAG 2100 

GAGACGTTGT CCCTTCCGAG AGACCTGGAG AGTCTTGACA GAAGCTTCAA TGGTCTCCTT 2160 

ACTATGTATC AGAGGAAGAG GGAGCAGTTT GAAAAAATAA GCAGTGCTGA TCCTTCAGGA 2220 

GCCTTCCGGA TGCTGAGCAC AGCCTACGAG CAGTCAGCCC AGGCTGCTCA GCAGGTCTCC 2280 

GACAGCTCGC GCCTTTTGGA CCAGCTCAGG GACAGCCGGA GAGAGGCAGA GAGGCTGGTG 2340 

CGGCAGGCGG GAGGAGGAGG AGGCACCGGC AGCCCCAAGC TTGTGGCCCT GAGGCTGGAG 2400 

ATGTCTTCGT TGCCTGACCT GACACCCACC TTCAACAAGC TCTGTGGCAA CTCCAGGCAG 2460 

ATGGCTTGCA CCCCAATATC ATGCCCTGGT GAGCTATGTC CCCAAGACAA TGGCACAGCC 2520 

TGTGGCTCCC GCTGCAGGGG TGTCCTTCCC AGGGCCGGTG GGGCCTTCTT GATGGCGGGG 2580 

CAGGTGGCTG AGCAGCTGCG GGGCTTCAAT GCCCAGCTCC AGCGGACCAG GCAGATGATT 2640

AGGGCAGCCG AGGAATCTGC CTCACAGATT CAATCCAGTG CCCAGCGCTT GGAGACCCAG 2700

GTGAGCGCCA GCCGCTCCCA GATGGAGGAA GATGTCAGAC GCACACGGCT CCTAATCCAG 2760 

CAGGTCCGGG ACTTCCTAAC AGACCCCGAC ACTGATGCAG CCACTATCCA GGAGGTCAGC 2820

GAGGCCGTGC TGGCCCTGTG GCTGCCCACA GACTCAGCTA CTGTTCTGCA GAAGATGAAT 2880 

GAGATCCAGG CCATTGCAGC CAGGCTCCCC AACGTGGACT TGGTGCTGTC CCAGACCAAG 2940 

CAGGACATTG CGCGTGCCCG CCGGTTGCAG GCTGAGGCTG AGGAAGCCAG GAGCCGAGCC 3000 

CATGCAGTGG AGGGCCAGGT GGAAGATGTG GTTGGGAACC TGCGGCAGGG GACAGTGGCA 3060

CTGCAGGAAG CTCAGGACAC CATGCAAGGC ACCAGCCGCT CCCTTCGGCT TATCCAGGAC 3120 

AGGGTTGCTG AGGTTCAGCA GGTACTGCGG CCAGCAGAAA AGCTGGTGAC AAGCATGACC 3180 

AAGCAGCTGG GTGACTTCTG GACACGGATG GAGGAGCTCC GCCACCAAGC CCGGCAGCAG 3240 

GGGGCAGAGG CAGTCCAGGC CCAGCAGCTT GCGGAAGGTG CCAGCGAGCA GGCATTGAGT 3300 

GCCCAAGAGG GATTTGAGAG AATAAAACAA AAGTATGCTG AGTTGAAGGA CCGGTTGGGT 3360 

CAGAGTTCCA TGCTGGGTGA GCAGGGTGCC CGGATCCAGA GTGTGAAGAC AGAGGCAGAG 3420 

GAGCTGTTTG GGGAGACCAT GGAGATGATG GACAGGATGA AAGACATGGA GTTGGAGCTG 3480 

CTGCGGGGCA GCCAGGCCAT CATGCTGCGC TCGGCX5GACC TGACAGGACT GGAGAAGCGT 3540 

GTGGAGCAGA TCCGTGACCA CATCAATGGG CGCGTGCTCT ACTATGCCAC CTGCAAGTGA 3600 

TGCTACAGCT TCCAGCCCGT TGCCCCACTC ATCTGCCGCC TTTGCTTTTG GTTGGGGGCA 3660 

GATTGGGTTG GAATGCTTTC CATCTCCAGG AGACTTTCAT GCAGCCTAAA GTACAGCCTG 3720 

GACCACCCCT GGTGTGTAGC TAGTAAGATT ACCCTGAGCT GCAGCTGAGC CTGAGCCAAT 3780 

GGGACAGTTA CACTTGACAG ACAAAGATGG TGGAGATTGG CATGCCATTG AAACTAAGAG 3840 

CTCTCAAGTC AAGGAAGCTG GGCTGGGCAG TATCCCCCGC CTTTAGTTCT CCACTGGGGA 3900 

GGAATCCTGG ACCAAGCACA AAAACTTAAC AAAAGTGATG TAAAAATGAA AAGCCAAATA 3960 
AAAATCTTTG G 

Seq ID NO: 521 Protein sequence 
Protein Accession #: NP_000219.1

1 11 21 31 41 51

I I I I I I

MRPFFLLCFA LPGLLHAQQA CSRGACYPPV GDLLVGRTRF LRASSTCGLT KPETYCTQYG 60 

EWQMKCCKCD SRQPHNYYSH RVENVASSSG PMRWWQSQND VNPVSLQLDL DRRFQLQEVM 120

MEFQGPMPAG MLIERSSDFG KTWRVYQYLA ADCTSTFPRV RQGRPQSWQD VRCQSLPQRP 180

NARLNGGKVQ LNLMDLVSGI PATQSQKIQE VGEITNLRVN FTRLAPVPQR GYHPPSAYYA 240 

VSQLRLQGSC FCHGHADRCA PKPGASAGPS TAVQVHDVCV CQHNTAGPNC ERCAPFYNNR 300 

PWRPAEGQDA HECQRCDCNG HSETCHFDPA VFAASQGAYG GVCDKCRDHT EGKNCERCQL 360 

HYFRNRRPGA SIQETCISCE CDPDGAVPGA PCDPVTGQCV CKEHVQGERC DLCKPGFTGL 420 

TYANPQGCHR CDCNILGSRR DMPCDEESGR CLCLPNVVGP KCDQCAPYHW KLASGQGCEP 480

CACDPHNSPQ PTVQPVHRAV PCREGFGGLM CSAAAIRQCP DRTYGDVATG CRACDCDFRG 540 

TEGPGCDKAS GRCLCRPGLT GPRCDQCQRG YCNRYPVCVA CHPCFQTYDA DLREQALRFG 600 

RLRNATASLW SGPGLEDRGL ASRILDAKSK IEQIRAVLSS PAVTEQEVAQ VASAILSLRR 660 

TLQGLQLDLP LEEETLSLPR DLESLDRSFN GLLTMYQRKR EQFEKISSAD PSGAFRMLST 720 

AYEQSAQAAQ QVSDSSRLLD QLRDSRREAE RLVRQAGGGG GTGSPKLVAL RLEMSSLPDL 780 

TPTFNKLCGN SRQMACTPIS CPGELCPQDN GTACGSRCRG VLPRAGGAFL MAGQVAEQLR 840 

GFNAQLQRTR QMIRAAEESA SQIQSSAQRL ETQVSASRSQ MEEDVRRTRL LIQQVRDFLT 900 

DPDTDAATIQ EVSEAVLALW LPTDSATVLQ KMNEIQAIAA RLPNVDLVLS QTKQDIARAR 960

RLQAEAEEAR SRAHAVEGQV EDVVGNLRQG TVALQEAQDT MQGTSRSLRL IQDRVAEVQQ 1020

VLRPAEKLVT SMTKQLGDFW TRMEELRHQA RQQGAEAVQA QQLAEGASEQ ALSAQEGFER 1080 

IKQKYAELKD RLGQSSMLGE QGARIQSVKT EAEELFGETM EMMDRMKDME LELLRGSQAI 1140 
MLRSADLTGL EKRVEQIRDH INGRVLYYAT CK 

Seq ID NO: 522 DNA sequence 

Nucleic Acid Accession #: NM_001944.1 

Coding sequence: 84.. 3083 

1 11 21 31 41 51 

I I I I I I 

TTTTCTTAGA CATTAACTGC AGACGGCTGG CAGGATAGAA GCAGCGGCTC ACTTGGACTT 60 

TTTCACCAGG GAAATCAGAG ACAATGATGG GGCTCTTCCC CAGAACTACA GGGGCTCTGG 120

CCATCTTCGT GGTGGTCATA TTGGTTCATG GAGAATTGCG AATAGAGACT AAAGGTCAAT 180 

ATGATGAAGA AGAGATGACT ATGCAACAAG CTAAAAGAAG GCAAAAACGT GAATGGGTGA 240 

AATTTGCCAA ACCCTGCAGA GAAGGAGAAG ATAACTCAAA AAGAAACCCA ATTGCCAAGA 300 

TTACTTCAGA TTACCAAGCA ACCCAGAAAA TCACCTACCG AATCTCTGGA GTGGGAATCG 360 

ATCAGCCGCC TTTTGGAATC TTTGTTGTTG ACAAAAACAC TGGAGATATT AACATAACAG 420 

CTATAGTCGA CCGGGAGGAA ACTCCAAGCT TCCTGATCAC ATGTCGGGCT CTAAATGCCC 480 

AAGGACTAGA TGTAGAGAAA CCACTTATAC TAACGGTTAA AATTTTGGAT ATTAATGATA 540 

ATCCTCCAGT ATTTTCACAA CAAATTTTCA TGGGTGAAAT TGAAGAAAAT AGTGCCTCAA 600 

ACTCACTGGT GATGATACTA AATGCCACAG ATGCAGATGA ACCAAACCAC TTGAATTCTA 660 

AAATTGCCTT CAAAATTGTC TCTCAGGAAC CAGCAGGCAC ACCCATGTTC CTCCTAAGCA 720 

GAAACACTGG GGAAGTCCGT ACTTTGACCA ATTCTCTTGA CCGAGAGCAA GCTAGCAGCT 780 

ATCGTCTGGT TGTGAGTGGT GCAGACAAAG ATGGAGAAGG ACTATCAACT CAATGTGAAT 840 

GTAATATTAA AGTGAAAGAT GTCAACGATA ACTTCCCAAT GTTTAGAGAC TCTCAGTATT 900 

CAGCACGTAT TGAAGAAAAT ATTTTAAGTT CTGAATTACT TCGATTTCAA GTAACAGATT 960 

TGGATGAAGA GTACACAGAT AATTGGCTTG CAGTATATTT CTTTACCTCT GGGAATGAAG 1020 

GAAATTGGTT TGAAATACAA ACTGATCCTA GAACTAATGA AGGCATCCTG AAAGTGGTGA 1080 

AGGCTCTAGA TTATGAACAA CTACAAAGCG TGAAACTTAG TATTGCTGTC AAAAACAAAG 1140 

CTGAATTTCA CCAATCAGTT ATCTCTCGAT ACCGAGTTCA GTCAACCCCA GTCACAATTC 1200 

AGGTAATAAA TGTAAGAGAA GGAATTGCAT TCCGTCCTGC TTCCAAGACA TTTACTGTGC 1260 

AAAAAGGCAT AAGTAGCAAA AAATTGGTGG ATTATATCCT GGGAACATAT CAAGCCATCG 1320 

ATGAGGACAC TAACAAAGCT GCCTCAAATG TCAAATATGT CATGGGACGT AACGATGGTG 1380 

GATACCTAAT GATTGATTCA AAAACTGCTG AAATCAAATT TGTCAAAAAT ATGAACCGAG 1440 

ATTCTACTTT CATAGTTAAC AAAACAATCA CAGCTGAGGT TCTGGCCATA GATGAATACA 1500 

CGGGTAAAAC TTCTACAGGC ACGGTATATG TTAGAGTACC CGATTTCAAT GACAATTGTC 1560

CAACAGCTGT CCTCGAAAAA GATGCAGTTT GCAGTTCTTC ACCTTCCGTG GTTGTCTCCG 1620 

CTAGAACACT GAATAATAGA TACACTGGCC CCTATACATT TGCACTGGAA GATCAACCTG 1680 

TAAAGTTGCC TGCCGTATGG AGTATCACAA CCCTCAATGC TACCTCGGCC CTCCTCAGAG 1740 

CCCAGGAACA GATACCTCCT GGAGTATACC ACATCTCCCT GGTACTTACA GACAGTCAGA 1800 

ACAATCGGTG TGAGATGCCA CGCAGCTTGA CACTGGAAGT CTGTCAGTGT GACAACAGGG 1860

GCATCTGTGG AACTTCTTAC CCAACCACAA GCCCTGGGAC CAGGTATGGC AGGCCGCACT 1920

CAGGGAGGCT GGGGCCTGCC GCCATCGGCC TGCTGCTCCT TGGTCTCCTG CTGCTGCTGT 1980

TGGCCCCCCT TCTGCTGTTG ACCTGTGACT GTGGGGCAGG TTCTACTGGG GGAGTGACAG 2040

GTGGTTTTAT CCCAGTTCCT GATGGCTCAG AAGGAACAAT TCATCAGTGG GGAATTGAAG 2100

GAGCCCATCC TGAAGACAAG GAAATCACAA ATATTTGTGT GCCTCCTGTA ACAGCCAATG 2160

GAGCCGATTT CATGGAAAGT TCTGAAGTTT GTACAAATAC GTATGCCAGA GGCACAGCGG 2220

TGGAAGGCAC TTCAGGAATG GAAATGACCA CTAAGCTTGG AGCAGCCACT GAATCTGGAG 2280

GTGCTGCAGG CTTTGCAACA GGGACAGTGT CAGGAGCTGC TTCAGGATTC GGAGCAGCCA 2340

CTGGAGTTGG CATCTGTTCC TCAGGGCAGT CTGGAACCAT GAGAACAAGG CATTCCACTG 2400

GAGGAACCAA TAAGGACTAC GCTGATGGGG CGATAAGCAT GAATTTTCTG GACTCCTACT 2460

TTTCTCAGAA AGCATTTGCC TGTGCGGAGG AAGACGATGG CCAGGAAGCA AATGACTGCT 2520

TGTTGATCTA TGATAATGAA GGCGCAGATG CCACTGGTTC TCCTGTGGGC TCCGTGGGTT 2580

GTTGCAGTTT TATTGCTGAT GACCTGGATG ACAGCTTCTT GGACTCACTT GGACCCAAAT 2640

TTAAAAAACT TGCAGAGATA AGCCTTGGTG TTGATGGTGA AGGCAAAGAA GTTCAGCCAC 2700

CCTCTAAAGA CAGCGGTTAT GGGATTGAAT CCTGTGGCCA TCCCATAGAA GTCCAGCAGA 2760

CAGGATTTGT TAAGTGCCAG ACTTTGTCAG GAAGTCAAGG AGCTTCTGCT TTGTCCGCCT 2820

CTGGGTCTGT CCAGCCAGCT GTTTCCATCC CTGACCCTCT GCAGCATGGT AACTATTTAG 2880

TAACGGAGAC TTACTCGGCT TCTGGTTCCC TCGTGCAACC TTCCACTGCA GGCTTTGATC 2940

CACTTCTCAC ACAAAATGTG ATAGTGACAG AAAGGGTGAT CTGTCCCATT TCCAGTGTTC 3000

CTGGCAACCT AGCTGGCCCA ACGCAGCTAC GAGGGTCACA TACTATGCTC TGTACAGAGG 3060

ATCCTTGCTC CCGTCTAATA TGACCAGAAT GAGCTGGAAT ACCACACTGA CCAAATCTGG 3120

ATCTTTGGAC TAAAGTATTC AAAATAGCAT AGCAAAGCTC ACTGTATTGG GCTAATAATT 3180

TGGCACTTAT TAGCTTCTCT CATAAACTGA TCACGATTAT AAATTAAATG TTTGGGTTCA 3240

TACCCCAAAA GCAATATGTT GTCACTCCTA ATTCTCAAGT ACTATTCAAA TTGTAGTAAA 3300
TCTTAAAGTT TTTCAAAACC CTAAAATCAT ATTCGC

Seq ID NO: 523 Protein sequence
Protein Accession #: NP_001935.1

1 11 21 31 41 51

I I I I I I

MMGLFPRTTG ALAIFVVVIL VHGELRIETK GQYDEEEMTM QQAKRRQKRE WVKFAKPCRE 60

GEDNSKRNPI AKITSDYQAT QKITYRISGV GIDQPPFGIF VVDKNTGDIN ITAIVDREET 120

PSFLITCRAL NAQGLDVEKP LILTVKILDI NDNPPVFSQQ IFMGEIEENS ASNSLVMILN 180

ATDADEPNHL NSKIAFKIVS QEPAGTPMFL LSRNTGEVRT LTNSLDREQA SSYRLVVSGA 240

DKDGEGLSTQ CECNIKVKDV NDNFPMFRDS QYSARIEENI LSSELLRFQV TDLDEEYTDN 300

WLAVYFFTSG NEGNWFEIQT DPRTNEGILK VVKALDYEQL QSVKLSIAVK NKAEFHQSVI 360

SRYRVQSTPV TIQVINVREG IAFRPASKTF TVQKGISSKK LVDYILGTYQ AIDEDTNKAA 420

SNVKYVMGRN DGGYLMIDSK TAEIKFVKNM NRDSTFIVNK TITAEVLAID EYTGKTSTGT 480

VYVRVPDFND NCPTAVLEKD AVCSSSPSVV VSARTLNNRY TGPYTFALED QPVKLPAVWS 540

ITTLNATSAL LRAQEQIPPG VYHISLVLTD SQNNRCEMPR SLTLEVCQCD NRGICGTSYP 600

TTSPGTRYGR PHSGRLGPAA IGLLLLGLLL LLLAPLLLLT CDCGAGSTGG VTGGFIPVPD 660

GSEGTIHQWG IEGAHPEDKE ITNICVPPVT ANGADFMESS EVCTNTYARG TAVEGTSGME 720

MTTKLGAATE SGGAAGFATG TVSGAASGFG AATGVGICSS GQSGTMRTRH STGGTNKDYA 780

DGAISMNFLD SYFSQKAFAC AEEDDGQEAN DCLLIYDNEG ADATGSPVGS VGCCSFIADD 840

LDDSFLDSLG PKFKKLAEIS LGVDGEGKEV QPPSKDSGYG IESCGHPIEV QQTGFVKCQT 900

LSGSQGASAL SASGSVQPAV SIPDPLQHGN YLVTETYSAS GSLVQPSTAG FDPLLTQNVI 960
VTERVICPIS SVPGNLAGPT QLRGSHTMLC TEDPCSRLI


Seq ID NO: 524 DNA sequence 

Nucleic Acid Accession #: XM_058069.2

Coding sequence: 1..1413

1 11 21 31 41 51

I I I I I I

ATGAAGTTTC TTCTAATACT GCTCCTGCAG GCCACTGCTT CTGGAGCTCT TCCCCTGAAC 60

AGCTCTACAA GCCTGGAAAA AAATAATGTG CTATTTGGTG AAAGATACTT AGAAAAATTT 120

TATGGCCTTG AGATAAACAA ACTTCCAGTG ACAAAAATGA AATATAGTGG AAACTTAATG 180

AAGGAAAAAA TCCAAGAAAT GCAGCACTTC TTGGGTCTGA AAGTGACCGG GCAACTGGAC 240

ACATCTACCC TGGAGATGAT GCACGCACCT CGATGTGGAG TCCCCGATGT CCATCATTTC 300

AGGGAAATGC CAGGGGGGCC CGTATGGAGG AAACATTATA TCACCTACAG AATCAATAAT 360

TACACACCTG ACATGAACCG TGAGGATGTT GACTACGCAA TCCGGAAAGC TTTCCAAGTA 420

TGGAGTAATG TTACCCCCTT GAAATTCAGC AAGATTAACA CAGGCATGGC TGACATTTTG 480

GTGGTTTTTG CCCGTGGAGC TCATGGAGAC TTCCATGCTT TTGATGGCAA AGGTGGAATC 540

CTAGCCCATG CTTTTGGACC TGGATCTGGC ATTGGAGGGG ATGCACATTT CGATGAGGAC 600

GAATTCTGGA CTACACATTC AGGAGGCACA AACTTGTTCC TCACTGCTGT TCACGAGATT 660

GGCCATTCCT TAGGTCTTGG CCATTCTAGT GATCCAAAGG CCGTAATGTT CCCCACCTAC 720

AAATATGTTG ACATCAACAC ATTTCGCCTC TCTGCTGATG ACATACGTGG CATTCAGTCC 780

CTGTATGGAG ACCCAAAAGA GAACCAACGC TTGCCAAATC CTGACAATTC AGAACCAGCT 840

CTCTGTGACC CCAATTTGAG TTTTGATGCT GTCACTACCG TGGGAAATAA GATCTTTTTC 900

TTCAAAGACA GGTTCTTCTG GCTGAAGGTT TCTGAGAGAC CAAAGACCAG TGTTAATTTA 960

ATTTCTTCCT TATGGCCAAC CTTGCCATCT GGCATTGAAG CTGCTTATGA AATTGAAGCC 1020

AGAAATCAAG TTTTTCTTTT TAAAGATGAC AAATACTGGT TAATTAGCAA TTTAAGACCA 1080

GAGCCAAATT ATCCCAAGAG CATACATTCT TTTGGTTTTC CTAACTTTGT GAAAAAAATT 1140

GATGCAGCTG TTTTTAACCC ACGTTTTTAT AGGACCTACT TCTTTGTAGA TAACCAGTAT 1200

TGGAGGTATG ATGAAAGGAG ACAGATGATG GACCCTGGTT ATCCCAAACT GATTACCAAG 1260

AACTTCCAAG GAATCGGGCC TAAAATTGAT GCAGTCTTCT ACTCTAAAAA CAAATACTAC 1320

TATTTCTTCC AAGGATCTAA CCAATTTGAA TATGACTTCC TACTCCAACG TATCACCAAA 1380
ACACTGAAAA GCAATAGCTG GTTTGGTTGT TGA

Seq ID NO: 525 Protein sequence
Protein Accession #: P39900

1 11 21 31 41 51

I I I I I I

MKFLLILLLQ ATASGALPLN SSTSLEKNNV LFGERYLEKF YGLEINKLPV TKMKYSGNLM 60

KEKIQEMQHF LGLKVTGQLD TSTLEMMHAP RCGVPDVHHF REMPGGPVWR KHYITYRINN 120

YTPDMNREDV DYAIRKAFQV WSNVTPLKFS KINTGMADIL VVFARGAHGD FHAFDGKGGI 180

LAHAFGPGSG IGGDAHFDED EFWTTHSGGT NLFLTAVHEI GHSLGLGHSS DPKAVMFPTY 240

KYVDINTFRL SADDIRGIQS LYGDPKENQR LPNPDNSEPA LCDPNLSFDA VTTVGNKIFF 300

FKDRFFWLKV SERPKTSVNL ISSLWPTLPS GIEAAYEIEA RNQVFLFKDD KYWLISNLRP 360

EPNYPKSIHS FGFPNFVKKI DAAVFNPRFY RTYFFVDNQY WRYDERRQMM DPGYPKLITK 420
NFQGIGPKID AVFYSKNKYY YFFQGSNQFE YDFLLQRITK TLKSNSWFGC

Seq ID NO: 526 DNA sequence

Nucleic Acid Accession #: NM_024423.1 

Coding sequence: 64..2590

1 11 21 31 41 51 

I I I I I I 

GGCAGGTCTC GCTCTCGGCA CCCTCCCGGC GCCCGCGTTC TCCTGGCCCT GCCCGGCATC 60 

CCGATGGCCG CCGCTGGGCC CCGGCGCTCC GTGCGCGGAG CCGTCTGCCT GCATCTGCTG 120

CTGACCCTCG TGATCTTCAG TCGTGATGGT GAAGCCTGCA AAAAGGTGAT ACTTAATGTA 180 

CCTTCTAAAC TAGAGGCAGA CAAAATAATT GGCAGAGTTA ATTTGGAAGA GTGCTTCAGG 240 

TCTGCAGACC TCATCCGGTC AAGTGATCCT GATTTCAGAG TTCTAAATGA TGGGTCAGTG 300 

TACACAGCCA GGGCTGTTGC GCTGTCTGAT AAGAAAAGAT CATTTACCAT ATGGCTTTCT 360 

GACAAAAGGA AACAGACACA GAAAGAGGTT ACTGTGCTGC TAGAACATCA GAAGAAGGTA 420 

TCGAAGACAA GACACACTAG AGAAACTGTT CTCAGGCGTG CCAAGAGGAG ATGGGCACCT 480 

ATTCCTTGCT CTATGCAAGA GAATTCCTTG GGCCCTTTCC CATTGTTTCT TCAACAAGTT 540 

GAATCTGATG CAGCACAGAA CTATACTGTC TTCTACTCAA TAAGTGGACG TGGAGTTGAT 600 

AAAGAACCTT TAAATTTGTT TTATATAGAA AGAGACACTG GAAATCTATT TTGCACTCGG 660 

CCTGTGGATC GTGAAGAATA TGATGTTTTT GATTTGATTG CTTATGCGTC AACTGCAGAT 720 

GGATATTCAG CAGATCTGCC CCTCCCACTA CCCATCAGGG TAGAGGATGA AAATGACAAC 780 

CACCCTGTTT TCACAGAAGC AATTTATAAT TTTGAAGTTT TGGAAAGTAG TAGACCTGGT 840

ACTACAGTGG GGGTGGTTTG TGCCACAGAC AGAGATGAAC CGGACACAAT GCATACGCGC 900

CTGAAATACA GCATTTTGCA GCAGACACCA AGGTCACCTG GGCTCTTTTC TGTGCATCCC 960 

AGCACAGGCG TAATCACCAC AGTCTCTCAT TATTTGGACA GAGAGGTTGT AGACAAGTAC 1020 

TCATTGATAA TGAAAGTACA AGACATGGAT GGCCAGTTTT TTGGATTGAT AGGCACATCA 1080 

ACTTGTATCA TAACAGTAAC AGATTCAAAT GATAATGCAC CCACTTTCAG ACAAAATGCT 1140 

TATGAAGCAT TTGTAGAGGA AAATGCATTC AATGTGGAAA TCTTACGAAT ACCTATAGAA 1200 

GATAAGGATT TAATTAACAC TGCCAATTGG AGAGTCAATT TTACCATTTT AAAGGGAAAT 1260 

GAAAATGGAC ATTTCAAAAT CAGCACAGAC AAAGAAACTA ATGAAGGTGT TCTTTCTGTT 1320 

GTAAAGCCAC TGAATTATGA AGAAAACCGT CAAGTGAACC TGGAAATTGG AGTAAACAAT 1380 

GAAGCGCCAT TTGCTAGAGA TATTCCCAGA GTGACAGCCT TGAACAGAGC CTTGGTTACA 1440 

GTTCATGTGA GGGATCTGGA TGAGGGGCCT GAATGCACTC CTGCAGCCCA ATATGTGCGG 1500 

ATTAAAGAAA ACTTAGCAGT GGGGTCAAAG ATCAACGGCT ATAAGGCATA TGACCCCGAA 1560 

AATAGAAATG GCAATGGTTT AAGGTACAAA AAATTGCATG ATCCTAAAGG TTGGATCACC 1620 

ATTGATGAAA TTTCAGGGTC AATCATAACT TCCAAAATCC TGGATAGGGA GGTTGAAACT 1680 

CCCAAAAATG AGTTGTATAA TATTACAGTC CTGGCAATAG ACAAAGATGA TAGATCATGT 1740 

ACTGGAACAC TTGCTGTGAA CATTGAAGAT GTAAATGATA ATCCACCAGA AATACTTCAA 1800 

GAATATGTAG TCATTTGCAA ACCAAAAATG GGGTATACCG ACATTTTAGC TGTTGATCCT 1860 

GATGAACCTG TCCATGGAGC TCCATTTTAT TTCAGTTTGC CCAATACTTC TCCAGAAATC 1920 

AGTAGACTGT GGAGCCTCAC CAAAGTTAAT GATACAGCTG CCCGTCTTTC ATATCAGAAA 1980 

AATGCTGGAT TTCAAGAATA TACCATTCCT ATTACTGTAA AAGACAGGGC CGGCCAAGCT 2040 

GCAACAAAAT TATTGAGAGT TAATCTGTGT GAATGTACTC ATCCAACTCA GTGTCGTGCG 2100 

ACTTCAAGGA GTACAGGAGT AATACTTGGA AAATGGGCAA TCCTTGCAAT ATTACTGGGT 2160 

ATAGCACTGC TCTTTTCTGT ATTGCTAACT TTAGTATGTG GAGTTTTTGG TGCAACTAAA 2220 

GGGAAACGTT TTCCTGAAGA TTTAGCACAG CAAAACTTAA TTATATCAAA CACAGAAGCA 2280 

CCTGGAGACG ATAGAGTGTG CTCTGCCAAT GGATTTATGA CCCAAACTAC CAACAACTCT 2340 

AGCCAAGGTT TTTGTGGTAC TATGGGATCA GGAATGAAAA ATGGAGGGCA GGAAACCATT 2400 

GAAATGATGA AAGGAGGAAA CCAGACCTTG GAATCCTGCC GGGGGGCTGG GCATCATCAT 2460 

ACCCTGGACT CCTGCAGGGG AGGACACACG GAGGTGGACA ACTGCAGATA CACTTACTCG 2520 

GAGTGGCACA GTTTTACTCA ACCCCGTCTC GGTGAAGAAT CCATTAGAGG ACACACTGGT 2580 

TAAAAATTAA ACATAAAAGA AATTGCATCG ATGTAATCAG AATGAAGACC GCATGCCATC 2640 

CCAAGATTAT GTCCTCACTT ATAACTATGA GGGAAGAGGA TCTCCAGCTG GTTCTGTGGG 2700 

CTGCTGCAGT GAAAAGCAGG AAGAAGATGG CCTTGACTTT TTAAATAATT TGGAACCCAA 2760 

ATTTATTACA TTAGCAGAAG CATGCACAAA GAGATAATGT CACAGTGCTA CAATTAGGTC 2820 

TTTGTCAGAC ATTCTGGAGG TTTCCAAAAA TAATATTGTA AAGTTCAATT TCAACATGTA 2880 

TGTATATGAT GATTTTTTTC TCAATTTTGA ATTATGCTAC TCACCAATTT ATATTTTTAA 2940 

AGCCAGTTGT TGCTTATCTT TTCCAAAAAG TGAAAAATGT TAAAACAGAC AACTGGTAAA 3000 

TCTCAAACTC CAGCACTGGA ATTAAGGTCT CTAAAGCATC TGCTCTTTTT TTTTTTTACG 3060 

GATATTTTAG TAATAAATAT GCTGGATAAA TATTAGTCCA ACAATAGCTA AGTTATGCTA 3120 

ATATCACATT ATTATGTATT CACTTTAAGT GATAGTTTAA AAAATAAACA AGAAATATTG 3180 

AGTATCACTA TGTGAAGAAA GTTTTGGAAA AGAAACAATG AAGACTGAAT TAAATTAAAA 3240 

ATGTTGCAGC TCATAAAGAA TTGGGACTCA CCCCTACTGC ACTACCAAAT TCATTTGACT 3300 

TTGGAGGCAA AATGTGTTGA AGTGCCCTAT GAAGTAGCAA TTTTCTATAG GAATATAGTT 3360 

GGAAATAAAT GTGTGTGTGT ATATTATTAT TAATCAATGC AATATTTAAA ATGAAATGAG 3420 

AACAAAGAGG AAAATGGTAA AAACTTGAAA TGAGGCTGGG GTATAGTTTG TCCTACAATA 3480 

GAAAAAAGAG AGAGCTTCCT AGGCCTGGGC TCTTAAATGC TGCATTATAA CTGAGTCTAT 3540 

GAGGAAATAG TTCCTGTCCA ATTTGTGTAA TTTGTTTAAA ATTGTAAATA AATTAAACTT 3600 

TTCTGGTTTC TGTGGGAAGG AAATAGGGAA TCCAATGGAA CAGTAGCTTT GCTTTGCAGT 3660 

CTGTTTCAAG ATTTCTGCAT CCACAAGTTA GTAGCAAACT GGGGAATACT CGCTGCAGCT 3720 

GGGGTTCCCT GCTTTTTGGT AGCAAGGGTC CAGAGATGAG GTGTTTTTTT CGGGGAGCTA 3780 

ATAACAAAAA CATTTTAAAA CTTACCTTTA CTGAAGTTAA ATCCTCTATT GCTGTTTCTA 3840 

TTCTCTCTTA TAGTGACCAA CATCTTTTTA ATTTAGATCC AAATAACCAT GTCCTCCTAG 3900 

AGTTTAGAGG CTAGAGGGAG CTGAGGGGAG GATCTTACTG AAAGCACCCT GGGGAGATTG 3960 

ATTGTCCTTA AACCTAAGCC CCACAAACTT GACACCTGAT CAGGTCTGGG AGCTACAAAA 4020 

TTTCATTTTT CTCCTCACTG CCCTTCTTCT GAGTGGCATT GGCCTGAATC AAGGAAAGCC 4080 

AGGCCTTGTG GGCCCCCTTC TTTCGGCTTT CTGCTAAAGC AACACCTCCA GCAGAGATTC 4140 

CCTTAAGTGA CTCCAGGTTT TCCACCATCC TTCAGCGTGA ATTAATTTTT AATCAGTTTG 4200 

CTTTCTCCAG AGAAATTTTA AAATAATAGA AGAAATAGAA ATTTTGAATG TATAAAAGAA 4260 

AAAGATCAAG TTGTCATTTT AGAACAGAGG GAACTTTGGG AGAAAGCAGC CCAAGTAGGT 4320 

TATTTGTACA GTCAGAGGGC AACAGGAAGA TGCAGGCCTT CAAGGGCAAG GAGAGGCCAC 4380 

AAGGAATATG GGTGGGAGTA AAAGCAACAT CGTCTGCTTC ATACTTTTTC CTAGGCTTGG 4440 
CACTGCCTTT TCCTTTCTCA GGCCAATGGC AACTGCCATT TGAGTCCGGT GAGGGATCAG 4500

CCAACCTCTT CTCTATGGCT CACCTTATTT GGAGTGAGAA ATCAAGGAGA CAGAGCTGAC 4560 

TGCATGATGA GTCTGAAGGC ATTTGCAGGA TGAGCCTGAA CTGGTTGTGC AGAACAAACA 4620 

AGGCATTCAT GGGAATTGTT GTATTCCTTC TGCAGCCCTC CTTCTGGGCA CTAAGAAGGT 4680 

CTATGAATTA AATGCCTATC TAAAATTCTG ATTTATTCCT ACATTTTCTG TTTTCTAATT 4740

TGACCCTAAA ATCTATGTGT TTTAGACTTA GACTTTTTAT TGCCCCCCCC CCCTTTTTTT 4800 

TTGAGACGGA GTCTCGCTCT GACGCACAGG CTGGAGTGCA GTGGCTCCGA TCTCTGCTCA 4860 

CTGAAAGCTC CGCCTCCCGG GTTCATGCCA TTCTCCTGCC TCAGCCTCCT GAGTAGCTGG 4920 

GACTAGAGGC GCCCACCACC ACGCCCGGCT AATTTTTTGT ATTTTTAATA GAGACGGGGT 4980 

TTCACTGTGT TAGCCAGGAT GGTCTCGATC TCCTGACCTC GTGATCCGCC TGCCTCGGCC 5040 

TCCCAAAGTG CTGGGATTAC AGGCATGACC CACCGCTCCC GGCCTTGTTT TCCGTTTAAA 5100 

GTCGTCTTCT TTTAATGTAA TCATTTTGAA CATGTGTGAA AGTTGATCAT ACGAATTGGA 5160 

TCAATCTTGA AATACTCAAC CAAAAGACAG TCGAGAAGCC AGGGGGAGAA AGAACTCAGG 5220

GCACAAAATA TTGGTCTGAG AATGGAATTC TCTGTAAGCC TAGTTGCTGA AATTTCCTGC 5280 

TGTAACCAGA AGCCAGTTTT ATCTAACGGC TACTGAAACA CCCACTGTGT TTTGCTCACT 5340 

CCCACTCACC GATCAAAACC TGCTACCTCC CCAAGACTTT ACTAGTGCCG ATAAACTTTC 5400 

TCAAAGAGCA ACCAGTATCA CTTCCCTGTT TATAAAACCT CTAACCATCT CTTTGTTCTT 5460 

TGAACATGCT GAAAACCACC TGGTCTGCAT GTATGCCCGA ATTTGTAATT CTTTTCTCTC 5520 

AAATGAAAAT TTAATTTTAG GGATTCATTT CTATATTTTC ACATATGTAG TATTATTATT 5580 

TCCTTATATG TGTAAGGTGA AATTTATGGT ATTTGAGTGT GCAAGAAAAT ATATTTTTAA 5640 

AGCTTTCATT TTTCCCCCAG TGAATGATTT AGAATTTTTT ATGTAAATAT ACAGAATGTT 5700 

TTTTCTTACT TTTATAAGGA AGCAGCTGTC TAAAATGCAG TGGGGTTTGT TTTGCAATGT 5760 

TTTAAACAGA GTTTTAGTAT TGCTATTAAA AGAAGTTACT TTGCTTTTAA AGAAACTTGG 5820 

CTGCTTAAAA TAAGCAAAAA TTGGATGCAT AAAGTAATAT TTACAGATGT GGGGAGATGT 5880 

AATAAAACAA TATTAACTTG GCTGCTTAAA ATAAGCAAAA ATTGGATGCA TAAAGTAATA 5940 

TTTACAGATG TGGGGAGATG TAATAAAACA ATATTAACTT GGTTTCTTGT TTTTGCTGTA 6000 

TTTAGAGATT AAATAATTCT AAGATGATCA CTTTGCAAAA TTATGCTTAT GGCTGGCATG 6060

GAAATAGAAA TACTCAATTA TGTCTTTGTT GTATTAATGG GGAATATTTT GGACAATGTT 6120 

TCATTATCAA ATTGTCGACA TCATTAATAT ATATTGTAAT GTTGGGAAGA GATCACTATT 6180 

TTGAAGCACA GCTTTACAGA TGAGTATCTA TGATACATAT GTATAATAAA TTTTGATCGG 6240 

GTATTAAAAG TATTAGAAGG TGGTTATAAT TGCAGAGTAT TCCATGAATA GTACACTGAC 6300 

ACAGGGGTTT TACTTTGAGG ACCAGTGTAG TCAAGGGAAA ACATGAGTTA AAAAGAAAAG 6360 

CAGGCAATAT TGCAGTCTTG ATTCTGCCAC TTACAGGATA GATAATGCCT GAACTTTAAT 6420 

GACAAGATGA TCCAACCATA AAGGTGCTCT GTGCTTCACA GTGAATCTTT TCCCCATGCA 6480 

GGAGTGTGCT CCCCTACAAA CGTTAAGACT GATCATTTCA AAAATCTATT AGCTATATCA 6540 

AAAGCCTTAC ATTTTAATAT AGGTTGAACC AAAATTTCAA TTCCAGTAAC TTCTATTGTA 6600 

ACCATTATTT TTGTGTATGT CTTCAAGAAT GTTCATTGGA TTTTTGTTTG TAATAGTAAA 6660 

ATACCGGATA CATTTCACGT GTCCTTCAGT ATTGATTTGG TTGAATATTG GGTCATAATG 6720 

GTTGAGAAGC ATGGACACTA GAGCCAGAAT GCTTGGATAT GAATCCTGGA TCTGTCACTT 6780 

ACTTCTGTGT GACCTTTGAA AGGCTACTTA TTTCCTCTCT TAGCTTTCTC ATTAAAATCA 6840 

ATGAACAATG CCAGCCTCAT GGGGTTGTTG AATGATTAAA TTAGTTAATA TACCTAAAGT 6900 

ACATAGAACA CTGCCTGCAC ATAGTAAAAG AATTATAAGT GTGAGGTAGT TGGTAAAATT 6960 

ATGTAGTTGG ATATACTACC GAACAATATC TAATCTCTTT TTAGGGAAAT AAAGTTTGTG 7020 
CATATATATA ATCCCGAAAC ATG 

Seq ID NO: 527 Protein sequence 
Protein Accession #: NP_077741.1 

1 11 21 31 41 51 

I I I I I I 

MAAAGPRRSV RGAVCLHLLL TLVIFSRDGE ACKKVILNVP SKLEADKIIG RVNLEECFRS 60 

ADLIRSSDPD FRVLNDGSVY TARAVALSDK KRSFTIWLSD KRKQTQKEVT VLLEHQKKVS 120

KTRHTRETVL RRAKRRWAPI PCSMQENSLG PFPLFLQQVE SDAAQNYTVF YSISGRGVDK 180

EPLNLFYIER DTGNLFCTRP VDREEYDVFD LIAYASTADG YSADLPLPLP IRVEDENDNH 240

PVFTEAIYNF EVLESSRPGT TVGVVCATDR DEPDTMHTRL KYSILQQTPR SPGLFSVHPS 300

TGVITTVSHY LDREVVDKYS LIMKVQDMDG QFFGLIGTST CIITVTDSND NAPTFRQNAY 360

EAFVEENAFN VEILRIPIED KDLINTANWR VNFTILKGNE NGHFKISTDK ETNEGVLSVV 420

KPLNYEENRQ VNLEIGVNNE APFARDIPRV TALNRALVTV HVRDLDEGPE CTPAAQYVRI 480

KENLAVGSKI NGYKAYDPEN RNGNGLRYKK LHDPKGWITI DEISGSIITS KILDREVETP 540

KNELYNITVL AIDKDDRSCT GTLAVNIEDV NDNPPEILQE YVVICKPKMG YTDILAVDPD 600

EPVHGAPFYF SLPNTSPEIS RLWSLTKVND TAARLSYQKN AGFQEYTIPI TVKDRAGQAA 660 

TKLLRVNLCE CTHPTQCRAT SRSTGVILGK WAILAILLGI ALLFSVLLTL VCGVFGATKG 720 

KRFPEDLAQQ NLIISNTEAP GDDRVCSANG FMTQTTNNSS QGFCGTMGSG MKNGGQETIE 780 
MMKGGNQTLE SCRGAGHHHT LDSCRGGHTE VDNCRYTYSE WHSFTQPRLG EESIRGHTG 

Seq ID NO: 528 DNA sequence 

Nucleic Acid Accession #: NM_001941.2

Coding sequence: 64..2754

1 11 21 31 41 51

I I I I I I 

GGCAGGTCTC GCTCTCGGCA CCCTCCCGGC GCCCGCGTTC TCCTGGCCCT GCCCGGCATC 60 

CCGATGGCCG CCGCTGGGCC CCGGCGCTCC GTGCGCGGAG CCGTCTGCCT GCATCTGCTG 120 

CTGACCCTCG TGATCTTCAG TCGTGATGGT GAAGCCTGCA AAAAGGTGAT ACTTAATGTA 180 

CCTTCTAAAC TAGAGGCAGA CAAAATAATT GGCAGAGTTA ATTTGGAAGA GTGCTTCAGG 240 

TCTGCAGACC TCATCCGGTC AAGTGATCCT GATTTCAGAG TTCTAAATGA TGGGTCAGTG 300 

TACACAGCCA GGGCTGTTGC GCTGTCTGAT AAGAAAAGAT CATTTACCAT ATGGCTTTCT 360 

GACAAAAGGA AACAGACACA GAAAGAGGTT ACTGTGCTGC TAGAACATCA GAAGAAGGTA 420 

TCGAAGACAA GACACACTAG AGAAACTGTT CTCAGGCGTG CCAAGAGGAG ATGGGCACCT 480 

ATTCCTTGCT CTATGCAAGA GAATTCCTTG GGCCCTTTCC CATTGTTTCT TCAACAAGTT 540 

GAATCTGATG CAGCACAGAA CTATACTGTC TTCTACTCAA TAAGTGGACG TGGAGTTGAT 600 

AAAGAACCTT TAAATTTGTT TTATATAGAA AGAGACACTG GAAATCTATT TTGCACTCGG 660 

CCTGTGGATC GTGAAGAATA TGATGTTTTT GATTTGATTG CTTATGCGTC AACTGCAGAT 720 

GGATATTCAG CAGATCTGCC CCTCCCACTA CCCATCAGGG TAGAGGATGA AAATGACAAC 780 

CACCCTGTTT TCACAGAAGC AATTTATAAT TTTGAAGTTT TGGAAAGTAG TAGACCTGGT 840 

ACTACAGTGG GGGTGGTTTG TGCCACAGAC AGAGATGAAC CGGACACAAT GCATACGCGC 900 

CTGAAATACA GCATTTTGCA GCAGACACCA AGGTCACCTG GGCTCTTTTC TGTGCATCCC 960 
AGCACAGGCG TAATCACCAC AGTCTCTCAT TATTTGGACA GAGAGGTTGT AGACAAGTAC 1020

TCATTGATAA TGAAAGTACA AGACATGGAT GGCCAGTTTT TTGGATTGAT AGGCACATCA 1080

ACTTGTATCA TAACAGTAAC AGATTCAAAT GATAATGCAC CCACTTTCAG ACAAAATGCT 1140

TATGAAGCAT TTGTAGAGGA AAATGCATTC AATGTGGAAA TCTTACGAAT ACCTATAGAA 1200

GATAAGGATT TAATTAACAC TGCCAATTGG AGAGTCAATT TTACCATTTT AAAGGGAAAT 1260

GAAAATGGAC ATTTCAAAAT CAGCACAGAC AAAGAAACTA ATGAAGGTGT TCTTTCTGTT 1320

GTAAAGCCAC TGAATTATGA AGAAAACCGT CAAGTGAACC TGGAAATTGG AGTAAACAAT 1380

GAAGCGCCAT TTGCTAGAGA TATTCCCAGA GTGACAGCCT TGAACAGAGC CTTGGTTACA 1440

GTTCATGTGA GGGATCTGGA TGAGGGGCCT GAATGCACTC CTGCAGCCCA ATATGTGCGG 1500

ATTAAAGAAA ACTTAGCAGT GGGGTCAAAG ATCAACGGCT ATAAGGCATA TGACCCCGAA 1560

AATAGAAATG GCAATGGTTT AAGGTACAAA AAATTGCATG ATCCTAAAGG TTGGATCACC 1620

ATTGATGAAA TTTCAGGGTC AATCATAACT TCCAAAATCC TGGATAGGGA GGTTGAAACT 1680

CCCAAAAATG AGTTGTATAA TATTACAGTC CTGGCAATAG ACAAAGATGA TAGATCATGT 1740

ACTGGAACAC TTGCTGTGAA CATTGAAGAT GTAAATGATA ATCCACCAGA AATACTTCAA 1800

GAATATGTAG TCATTTGCAA ACCAAAAATG GGGTATACCG ACATTTTAGC TGTTGATCCT 1860

GATGAACCTG TCCATGGAGC TCCATTTTAT TTCAGTTTGC CCAATACTTC TCCAGAAATC 1920

AGTAGACTGT GGAGCCTCAC CAAAGTTAAT GATACAGCTG CCCGTCTTTC ATATCAGAAA 1980

AATGCTGGAT TTCAAGAATA TACCATTCCT ATTACTGTAA AAGACAGGGC CGGCCAAGCT 2040

GCAACAAAAT TATTGAGAGT TAATCTGTGT GAATGTACTC ATCCAACTCA GTGTCGTGCG 2100

ACTTCAAGGA GTACAGGAGT AATACTTGGA AAATGGGCAA TCCTTGCAAT ATTACTGGGT 2160

ATAGCACTGC TCTTTTCTGT ATTGCTAACT TTAGTATGTG GAGTTTTTGG TGCAACTAAA 2220

GGGAAACGTT TTCCTGAAGA TTTAGCACAG CAAAACTTAA TTATATCAAA CACAGAAGCA 2280

CCTGGAGACG ATAGAGTGTG CTCTGCCAAT GGATTTATGA CCCAAACTAC CAACAACTCT 2340

AGCCAAGGTT TTTGTGGTAC TATGGGATCA GGAATGAAAA ATGGAGGGCA GGAAACCATT 2400

GAAATGATGA AAGGAGGAAA CCAGACCTTG GAATCCTGCC GGGGGGCTGG GCATCATCAT 2460

ACCCTGGACT CCTGCAGGGG AGGACACACG GAGGTGGACA ACTGCAGATA CACTTACTCG 2520

GAGTGGCACA GTTTTACTCA ACCCCGTCTC GGTGAAAAAT TGCATCGATG TAATCAGAAT 2580

GAAGACCGCA TGCCATCCCA AGATTATGTC CTCACTTATA ACTATGAGGG AAGAGGATCT 2640

CCAGCTGGTT CTGTGGGCTG CTGCAGTGAA AAGCAGGAAG AAGATGGCCT TGACTTTTTA 2700

AATAATTTGG AACCCAAATT TATTACATTA GCAGAAGCAT GCACAAAGAG ATAATGTCAC 2760

AGTGCTACAA TTAGGTCTTT GTCAGACATT CTGGAGGTTT CCAAAAATAA TATTGTAAAG 2820

TTCAATTTCA ACATGTATGT ATATGATGAT TTTTTTCTCA ATTTTGAATT ATGCTACTCA 2880

CCAATTTATA TTTTTAAAGC CAGTTGTTGC TTATCTTTTC CAAAAAGTGA AAAATGTTAA 2940

AACAGACAAC TGGTAAATCT CAAACTCCAG CACTGGAATT AAGGTCTCTA AAGCATCTGC 3000

TCTTTTTTTT TTTTACGGAT ATTTTAGTAA TAAATATGCT GGATAAATAT TAGTCCAACA 3060

ATAGCTAAGT TATGCTAATA TCACATTATT ATGTATTCAC TTTAAGTGAT AGTTTAAAAA 3120

ATAAACAAGA AATATTGAGT ATCACTATGT GAAGAAAGTT TTGGAAAAGA AACAATGAAG 3180

ACTGAATTAA ATTAAAAATG TTGCAGCTCA TAAAGAATTG GGACTCACCC CTACTGCACT 3240

ACCAAATTCA TTTGACTTTG GAGGCAAAAT GTGTTGAAGT GCCCTATGAA GTAGCAATTT 3300

TCTATAGGAA TATAGTTGGA AATAAATGTG TGTGTGTATA TTATTATTAA TCAATGCAAT 3360

ATTTAAAATG AAATGAGAAC AAAGAGGAAA ATGGTAAAAA CTTGAAATGA GGCTGGGGTA 3420

TAGTTTGTCC TACAATAGAA AAAAGAGAGA GCTTCCTAGG CCTGGGCTCT TAAATGCTGC 3480

ATTATAACTG AGTCTATGAG GAAATAGTTC CTGTCCAATT TGTGTAATTT GTTTAAAATT 3540

GTAAATAAAT TAAACTTTTC TGGTTTCTGT GGGAAGGAAA TAGGGAATCC AATGGAACAG 3600

TAGCTTTGCT TTGCAGTCTG TTTCAAGATT TCTGCATCCA CAAGTTAGTA GCAAACTGGG 3660

GAATACTCGC TGCAGCTGGG GTTCCCTGCT TTTTGGTAGC AAGGGTCCAG AGATGAGGTG 3720

TTTTTTTCGG GGAGCTAATA ACAAAAACAT TTTAAAACTT ACCTTTACTG AAGTTAAATC 3780

CTCTATTGCT GTTTCTATTC TCTCTTATAG TGACCAACAT CTTTTTAATT TAGATCCAAA 3840

TAACCATGTC CTCCTAGAGT TTAGAGGCTA GAGGGAGCTG AGGGGAGGAT CTTACTGAAA 3900

GCACCCTGGG GAGATTGATT GTCCTTAAAC CTAAGCCCCA CAAACTTGAC ACCTGATCAG 3960

GTCTGGGAGC TACAAAATTT CATTTTTCTC CTCACTGCCC TTCTTCTGAG TGGCATTGGC 4020

CTGAATCAAG GAAAGCCAGG CCTTGTGGGC CCCCTTCTTT CGGCTTTCTG CTAAAGCAAC 4080

ACCTCCAGCA GAGATTCCCT TAAGTGACTC CAGGTTTTCC ACCATCCTTC AGCGTGAATT 4140

AATTTTTAAT CAGTTTGCTT TCTCCAGAGA AATTTTAAAA TAATAGAAGA AATAGAAATT 4200

TTGAATGTAT AAAAGAAAAA GATCAAGTTG TCATTTTAGA ACAGAGGGAA CTTTGGGAGA 4260

AAGCAGCCCA AGTAGGTTAT TTGTACAGTC AGAGGGCAAC AGGAAGATGC AGGCCTTCAA 4320

GGGCAAGGAG AGGCCACAAG GAATATGGGT GGGAGTAAAA GCAACATCGT CTGCTTCATA 4380

CTTTTTCCTA GGCTTGGCAC TGCCTTTTCC TTTCTCAGGC CAATGGCAAC TGCCATTTGA 4440

GTCCGGTGAG GGATCAGCCA ACCTCTTCTC TATGGCTCAC CTTATTTGGA GTGAGAAATC 4500

AAGGAGACAG AGCTGACTGC ATGATGAGTC TGAAGGCATT TGCAGGATGA GCCTGAACTG 4560

GTTGTGCAGA ACAAACAAGG CATTCATGGG AATTGTTGTA TTCCTTCTGC AGCCCTCCTT 4620

CTGGGCACTA AGAAGGTCTA TGAATTAAAT GCCTATCTAA AATTCTGATT TATTCCTACA 4680

TTTTCTGTTT TCTAATTTGA CCCTAAAATC TATGTGTTTT AGACTTAGAC TTTTTATTGC 4740

CCCCCCCCCC TTTTTTTTTG AGACGGAGTC TCGCTCTGAC GCACAGGCTG GAGTGCAGTG 4800

GCTCCGATCT CTGCTCACTG AAAGCTCCGC CTCCCGGGTT CATGCCATTC TCCTGCCTCA 4860

GCCTCCTGAG TAGCTGGGAC TACAGGCGCC CACCACCACG CCCGGCTAAT TTTTTGTATT 4920

TTTAATAGAG ACGGGGTTTC ACTGTGTTAG CCAGGATGGT CTCGATCTCC TGACCTCGTG 4980

ATCCGCCTGC CTCGGCCTCC CAAAGTGCTG GGATTACAGG CATGACCCAC CGCTCCCGGC 5040

CTTGTTTTCC GTTTAAAGTC GTCTTCTTTT AATGTAATCA TTTTGAACAT GTGTGAAAGT 5100

TGATCATACG AATTGGATCA ATCTTGAAAT ACTCAACCAA AAGACAGTCG AGAAGCCAGG 5160

GGGAGAAAGA ACTCAGGGCA CAAAATATTG GTCTGAGAAT GGAATTCTCT GTAAGCCTAG 5220

TTGCTGAAAT TTCCTGCTGT AACCAGAAGC CAGTTTTATC TAACGGCTAC TGAAACACCC 5280

ACTGTGTTTT GCTCACTCCC TCACTCACCG ATCAAAACCT GCTACCTCCC CAAGACTTTA 5340

CTAGTGCCGA TAAACTTTCT CAAAGAGCAA CCAGTATCAC TTCCCTGTTT ATAAAACCTC 5400

TAACCATCTC TTTGTTCTTT GAACATGCTG AAAACCACCT GGTCTGCATG TATGCCCGAA 5460

TTTGTAATTC TTTTCTCTCA AATGAAAATT TAATTTTAGG GATTCATTTC TATATTTTCA 5520

CATATGTAGT ATTATTATTT CCTTATATGT GTAAGGTGAA ATTTATGGTA TTTGAGTGTG 5580

CAAGAAAATA TATTTTTAAA GCTTTCATTT TTCCCCCAGT GAATGATTTA GAATTTTTTA 5640

TGTAAATATA CAGAATGTTT TTTCTTACTT TTATAAGGAA GCAGCTGTCT AAAATGCAGT 5700

GGGGTTTGTT TTGCAATGTT TTAAACAGAG TTTTAGTATT GCTATTAAAA GAAGTTACTT 5760

TGCTTTTAAA GAAACTTGGC TGCTTAAAAT AAGCAAAAAT TGGATGCATA AAGTAATATT 5820

TACAGATGTG GGGAGATGTA ATAAAACAAT ATTAACTTGG TTTCTTGTTT TTGCTGTATT 5880

TAGAGATTAA ATAATTCTAA GATGATCACT TTGCAAAATT ATGCTTATGG CTGGCATGGA 5940

AATAGAAATA CTCAATTATG TCTTTGTTGT ATTAATGGGG AATATTTTGG ACAATGTTTC 6000

ATTATCAAAT TGTCGACATC ATTAATATAT ATTGTAATGT TGGGAAGAGA TCACTATTTT 6060

GAAGCACAGC TTTACAGATG AGTATCTATG ATACATATGT ATAATAAATT TTGATCGGGT 6120

ATTAAAAGTA TTAGAAGGTG GTTATAATTG CAGAGTATTC CATGAATAGT ACACTGACAC 6180

AGGGGTTTTA CTTTGAGGAC CAGTGTAGTC AAGGGAAAAC ATGAGTTAAA AAGAAAAGCA 6240

GGCAATATTG CAGTCTTGAT TCTGCCACTT ACAGGATAGA TAATGCCTGA ACTTTAATGA 6300

CAAGATGATC CAACCATAAA GGTGCTCTGT GCTTCACAGT GAATCTTTTC CCCATGCAGG 6360

AGTGTGCTCC CCTACAAACG TTAAGACTGA TCATTTCAAA AATCTATTAG CTATATCAAA 6420

AGCCTTACAT TTTAATATAG GTTGAACCAA AATTTCAATT CCAGTAACTT CTATTGTAAC 6480

CATTATTTTT GTGTATGTCT TCAAGAATGT TCATTGGATT TTTGTTTGTA ATAGTAAAAT 6540

ACCGGATACA TTTCACGTGT CCTTCAGTAT TGATTTGGTT GAATATTGGG TCATAATGGT 6600

TGAGAAGCAT GGACACTAGA GCCAGAATGC TTGGATATGA ATCCTGGATC TGTCACTTAC 6660

TTCTGTGTGA CCTTTGAAAG GCTACTTATT TCCTCTCTTA GCTTTCTCAT TAAAATCAAT 6720

GAACAATGCC AGCCTCATGG GGTTGTTGAA TGATTAAATT AGTTAATATA CCTAAAGTAC 6780

ATAGAACACT GCCTGCACAT AGTAAAAGAA TTATAAGTGT GAGGTAGTTG GTAAAATTAT 6840

GTAGTTGGAT ATACTACCGA ACAATATCTA ATCTCTTTTT AGGGAAATAA AGTTTGTGCA 6900
TATATATAAT CCCGAAACAT G

Seq ID NO: 529 Protein sequence
Protein Accession #: NP_001932.1

1 11 21 31 41 51

I I I I I I

MAAAGPRRSV RGAVCLHLLL TLVIFSRDGE ACKKVILNVP SKLEADKIIG RVNLEECFRS 60
ADLIRSSDPD FRVLNDGSVY TARAVALSDK KRSFTIWLSD KRKQTQKEVT VLLEHQKKVS 120
KTRHTRETVL RRAKRRWAPI PCSMQENSLG PFPLFLQQVE SDAAQNYTVF YSISGRGVDK 180
EPLNLFYIER DTGNLFCTRP VDREEYDVFD LIAYASTADG YSADLPLPLP IRVEDENDNH 240
PVFTEAIYNF EVLESSRPGT TVGVVCATDR DEPDTMHTRL KYSILQQTPR SPGLFSVHPS 300
TGVITTVSHY LDREVVDKYS LIMKVQDMDG QFFGLIGTST CIITVTDSND NAPTFRQNAY 360
EAFVEENAFN VEILRIPIED KDLINTANWR VNFTILKGNE NGHFKISTDK ETNEGVLSVV 420
KPLNYEENRQ VNLEIGVNNE APFARDIPRV TALNRALVTV HVRDLDEGPE CTPAAQYVRI 480
KENLAVGSKI NGYKAYDPEN RNGNGLRYKK LHDPKGWITI DEISGSIITS KILDREVETP 540
KNELYNITVL AIDKDDRSCT GTLAVNIEDV NDNPPEILQE YVVICKPKMG YTDILAVDPD 600
EPVHGAPFYF SLPNTSPEIS RLWSLTKVND TAARLSYQKN AGFQEYTIPI TVKDRAGQAA 660
TKLLRVNLCE CTHPTQCRAT SRSTGVILGK WAILAILLGI ALLFSVLLTL VCGVFGATKG 720
KRFPEDLAQQ NLIISNTEAP GDDRVCSANG FMTQTTNNSS QGFCGTMGSG MKNGGQETIE 780
MMKGGNQTLE SCRGAGHHHT LDSCRGGHTE VDNCRYTYSE WHSFTQPRLG EKLHRCNQNE 840
DRMPSQDYVL TYNYEGRGSP AGSVGCCSEK QEEDGLDFLN NLEPKFITLA EACTKR

Seq ID NO: 530 DNA sequence

Nucleic Acid Accession #: NM_016583.

Coding sequence: 72..842

1 11 21 31 41 51

I I I I I I

GGAGTGGGGG AGAGAGAGGA GACCAGGACA GCTGCTGAGA CCTCTAAGAA GTCCAGATAC 60
TAAGAGCAAA GATGTTTCAA ACTGGGGGCC TCATTGTCTT CTACGGGCTG TTAGCCCAGA 120
CCATGGCCCA GTTTGGAGGC CTGCCCGTGC CCCTGGACCA GACCCTGCCC TTGAATGTGA 180
ATCCAGCCCT GCCCTTGAGT CCCACAGGTC TTGCAGGAAG CTTGACAAAT GCCCTCAGCA 240
ATGGCCTGCT GTCTGGGGGC CTGTTGGGCA TTCTGGAAAA CCTTCCGCTC CTGGACATCC 300
TGAAGCCTGG AGGAGGTACT TCTGGTGGCC TCCTTGGGGG ACTGCTTGGA AAAGTGACGT 360
CAGTGATTCC TGGCCTGAAC AACATCATTG ACATAAAGGT CACTGACCCC CAGCTGCTGG 420
AACTTGGCCT TGTGCAGAGC CCTGATGGCC ACCGTCTCTA TGTCACCATC CCTCTCGGCA 480
TAAAGCTCCA AGTGAATACG CCCCTGGTCG GTGCAAGTCT GTTGAGGCTG GCTGTGAAGC 540
TGGACATCAC TGCAGAAATC TTAGCTGTGA GAGATAAGCA GGAGAGGATC CACCTGGTCC 600
TTGGTGACTG CACCCATTCC CCTGGAAGCC TGCAAATTTC TCTGCTTGAT GGACTTGGCC 660
CCCTCCCCAT TCAAGGTCTT CTGGACAGCC TCACAGGGAT CTTGAATAAA GTCCTGCCTG 720
AGTTGGTTCA GGGCAACGTG TGCCCTCTGG TCAATGAGGT TCTCAGAGGC TTGGACATCA 780
CCCTGGTGCA TGACATTGTT AACATGCTGA TCCACGGACT ACAGTTTGTC ATCAAGGTCT 840
AAGCCTTCCA GGAAGGGGCT GGCCTCTGCT GAGCTGCTTC CCAGTGCTCA CAGATGGCTG 900
GCCCATGTGC TGGAAGATGA CACAGTTGCC TTCTCTCCGA GGAACCTGCC CCCTCTCCTT 960
TCCCACCAGG CGTGTGTAAC ATCCCATGTG CCTCACCTAA TAAAATGGCT CTTCTTCTGC 1020
AAAAAAAAAA AAAAAAAAAA AAAAAAAAA



Seq ID NO: 531 Protein sequence 
Protein Accession #: NP_057667.1

1 11 21 31 41 51 

I I I I I I 

MFQTGGLIVF YGLLAQTMAQ FGGLPVPLDQ TLPLNVNPAL PLSPTGLAGS LTNALSNGLL 60
SGGLLGILEN LPLLDILKPG GGTSGGLLGG LLGKVTSVIP GLNNIIDIKV TDPQLLELGL 120
VQSPDGHRLY VTIPLGIKLQ VNTPLVGASL LRLAVKLDIT AEILAVRDKQ ERIHLVLGDC 180 
THSPGSLQIS LLDGLGPLPI QGLLDSLTGI LNKVLPELVQ GNVCPLVNEV LRGLDITLVH 240 
DIVNMLIHGL QFVIKV 






Seq ID NO: 532 DNA sequence 

Nucleic Acid Accession #: NM_004363.1 

Coding sequence: 115.. 2223 



1 11 21 31 41 51

I I I I I I

CTCAGGGCAG AGGGAGGAAG GACAGCAGAC CAGACAGTCA CAGCAGCCTT GACAAAACGT 60

TCCTGGAACT CAAGCTCTTC TCCACAGAGG AGGACAGAGC AGACAGCAGA GACCATGGAG 120

TCTCCCTCGG CCCCTCCCCA CAGATGGTGC ATCCCCTGGC AGAGGCTCCT GCTCACAGCC 180

TCACTTCTAA CCTTCTGGAA CCCGCCCACC ACTGCCAAGC TCACTATTGA ATCCACGCCG 240

TTCAATGTCG CAGAGGGGAA GGAGGTGCTT CTACTTGTCC ACAATCTGCC CCAGCATCTT 300

TTTGGCTACA GCTGGTACAA AGGTGAAAGA GTGGATGGCA ACCGTCAAAT TATAGGATAT 360

GTAATAGGAA CTCAACAAGC TACCCCAGGG CCCGCATACA GTGGTCGAGA GATAATATAC 420

CCCAATGCAT CCCTGCTGAT CCAGAACATC ATCCAGAATG ACACAGGATT CTACACCCTA 480

CACGTCATAA AGTCAGATCT TGTGAATGAA GAAGCAACTG GCCAGTTCCG GGTATACCCG 540

GAGCTGCCCA AGCCCTCCAT CTCCAGCAAC AACTCCAAAC CCGTGGAGGA CAAGGATGCT 600 

GTGGCCTTCA CCTGTGAACC TGAGACTCAG GACGCAACCT ACCTGTGGTG GGTAAACAAT 660 

CAGAGCCTCC CGGTCAGTCC CAGGCTGCAG CTGTCCAATG GCAACAGGAC CCTCACTCTA 720 

TTCAATGTCA CAAGAAATGA CACAGCAAGC TACAAATGTG AAACCCAGAA CCCAGTGAGT 780 

GCCAGGCGCA GTGATTCAGT CATCCTGAAT GTCCTCTATG GCCCGGATGC CCCCACCATT 840 

TCCCCTCTAA ACACATCTTA CAGATCAGGG GAAAATCTGA ACCTCTCCTG CCACGCAGCC 900 

TCTAACCCAC CTGCACAGTA CTCTTGGTTT GTCAATGGGA CTTTCCAGCA ATCCACCCAA 960 

GAGCTCTTTA TCCCCAACAT CACTGTGAAT AATAGTGGAT CCTATACGTG CCAAGCCCAT 1020 

AACTCAGACA CTGGCCTCAA TAGGACCACA GTCACGACGA TCACAGTCTA TGCAGAGCCA 1080 

CCCAAACCCT TCATCACCAG CAACAACTCC AACCCCGTGG AGGATGAGGA TGCTGTAGCC 1140 

TTAACCTGTG AACCTGAGAT TCAGAACACA ACCTACCTGT GGTGGGTAAA TAATCAGAGC 1200 

CTCCCGGTCA GTCCCAGGCT GCAGCTGTCC AATGACAACA GGACCCTCAC TCTACTCAGT 1260 

GTCACAAGGA ATGATGTAGG ACCCTATGAG TGTGGAATCC AGAACGAATT AAGTGTTGAC 1320 

CACAGCGACC CAGTCATCCT GAATGTCCTC TATGGCCCAG ACGACCCCAC CATTTCCCCC 1380 

TCATACACCT ATTACCGTCC AGGGGTGAAC CTCAGCCTCT CCTGCCATGC AGCCTCTAAC 1440 

CCACCTGCAC AGTATTCTTG GCTGATTGAT GGGAACATCC AGCAACACAC ACAAGAGCTC 1500 

TTTATCTCCA ACATCACTGA GAAGAACAGC GGACTCTATA CCTGCCAGGC CAATAACTCA 1560 

GCCAGTGGCC ACAGCAGGAC TACAGTCAAG ACAATCACAG TCTCTGCGGA GCTGCCCAAG 1620 

CCCTCCATCT CCAGCAACAA CTCCAAACCC GTGGAGGACA AGGATGCTGT GGCCTTCACC 1680 

TGTGAACCTG AGGCTCAGAA CACAACCTAC CTGTGGTGGG TAAATGGTCA GAGCCTCCCA 1740 

GTCAGTCCCA GGCTGCAGCT GTCCAATGGC AACAGGACCC TCACTCTATT CAATGTCACA 1800 

AGAAATGACG CAAGAGCCTA TGTATGTGGA ATCCAGAACT CAGTGAGTGC AAACCGCAGT 1860

GACCCAGTCA CCCTGGATGT CCTCTATGGG CCGGACACCC CCATCATTTC CCCCCCAGAC 1920 

TCGTCTTACC TTTCGGGAGC GAACCTCAAC CTCTCCTGCC ACTCGGCCTC TAACCCATCC 1980 

CCGCAGTATT CTTGGCGTAT CAATGGGATA CCGCAGCAAC ACACACAAGT TCTCTTTATC 2040 

GCCAAAATCA CGCCAAATAA TAACGGGACC TATGCCTGTT TTGTCTCTAA CTTGGCTACT 2100 

GGCCGCAATA ATTCCATAGT CAAGAGCATC ACAGTCTCTG CATCTGGAAC TTCTCCTGGT 2160 

CTCTCAGCTG GGGCCACTGT CGGCATCATG ATTGGAGTGC TGGTTGGGGT TGCTCTGATA 2220 

TAGCAGCCCT GGTGTAGTTT CTTCATTTCA GGAAGACTGA CAGTTGTTTT GCTTCTTCCT 2280 

TAAAGCATTT GCAACAGCTA CAGTCTAAAA TTGCTTCTTT ACCAAGGATA TTTACAGAAA 2340 

AGACTCTGAC CAGAGATCGA GACCATCCTA GCCAACATCG TGAAACCCCA TCTCTACTAA 2400 

AAATACAAAA ATGAGCTGGG CTTGGTGGCG CGCACCTGTA GTCCCAGTTA CTCGGGAGGC 2460 

TGAGGCAGGA GAATCGCTTG AACCCGGGAG GTGGAGATTG CAGTGAGCCC AGATCGCACC 2520 

ACTGCACTCC AGTCTGGCAA CAGAGCAAGA CTCCATCTCA AAAAGAAAAG AAAAGAAGAC 2580 

TCTGACCTGT ACTCTTGAAT ACAAGTTTCT GATACCACTG CACTGTCTGA GAATTTCCAA 2640 

AACTTTAATG AACTAACTGA CAGCTTCATG AAACTGTCCA CCAAGATCAA GCAGAGAAAA 2700 

TAATTAATTT CATGGGACTA AATGAACTAA TGAGGATTGC TGATTCTTTA AATGTCTTGT 2760 

TTCCCAGATT TCAGGAAACT TTTTTTCTTT TAAGCTATCC ACTCTTACAG CAATTTGATA 2820 

AAATATACTT TTGTGAACAA AAATTGAGAC ATTTACATTT TCTCCCTATG TGGTCGCTCC 2880 

AGACTTGGGA AACTATTCAT GAATATTTAT ATTGTATGGT AATATAGTTA TTGCACAAGT 2940 
TCAATAAAAA TCTGCTCTTT GTATAACAGA AAAA 

Seq ID NO: 533 Protein sequence 
Protein Accession #: NP_004354.1

1 11 21 31 41 51 

I I I I I I 

MESPSAPPHR WCIPWQRLLL TASLLTFWNP PTTAKLTIES TPFNVAEGKE VLLLVHNLPQ 60 

HLFGYSWYKG ERVDGNRQII GYVIGTQQAT PGPAYSGREI IYPNASLLIQ NIIQNDTGFY 120 

TLHVIKSDLV NEEATGQFRV YPELPKPSIS SNNSKPVEDK DAVAFTCEPE TQDATYLWWV 180 

NNQSLPVSPR LQLSNGNRTL TLFNVTRNDT ASYKCETQNP VSARRSDSVI LNVLYGPDAP 240 

TISPLNTSYR SGENLNLSCH AASNPPAQYS WFVNGTFQQS TQELFIPNIT VNNSGSYTCQ 300 

AHNSDTGLNR TTVTTITVYA EPPKPFITSN NSNPVEDEDA VALTCEPEIQ NTTYLWWVNN 360

QSLPVSPRLQ LSNDNRTLTL LSVTRNDVGP YECGIQNELS VDHSDPVILN VLYGPDDPTI 420

SPSYTYYRPG VNLSLSCHAA SNPPAQYSWL IDGNIQQHTQ ELFISNITEK NSGLYTCQAN 480 

NSASGHSRTT VKTITVSAEL PKPSISSNNS KPVEDKDAVA FTCEPEAQNT TYLWWVNGQS 540 

LPVSPRLQLS NGNRTLTLFN VTRNDARAYV CGIQNSVSAN RSDPVTLDVL YGPDTPIISP 600 

PDSSYLSGAN LNLSCHSASN PSPQYSWRIN GIPQQHTQVL FIAKITPNNN GTYACFVSNL 660 
ATGRNNSIVK SITVSASGTS PGLSAGATVG IMIGVLVGVA LI 

Seq ID NO: 534 DNA sequence 

Nucleic Acid Accession #: NM_006952.1 

Coding sequence: 11.. 793 

1 11 21 31 41 51 

I I I I I I 

AATCCCGACA ATGGCGAAAG ACAACTCAAC TGTTCGTTGC TTCCAGGGCC TGCTGATTTT 60 

TGGAAATGTG ATTATTGGTT GTTGCGGCAT TGCCCTGACT GCGGAGTGCA TCTTCTTTGT 120 

ATCTGACCAA CACAGCCTCT ACCCACTGCT TGAAGCCACC GACAACGATG ACATCTATGG 180 

GGCTGCCTGG ATCGGCATAT TTGTGGGCAT CTGCCTCTTC TGCCTGTCTG TTCTAGGCAT 240 

TGTAGGCATC ATGAAGTCCA GCAGGAAAAT TCTTCTGGCG TATTTCATTC TGATGTTTAT 300 

AGTATATGCC TTTGAAGTGG CATCTTGTAT CACAGCAGCA ACACAACGAG ACTTTTTCAC 360 

ACCCAACCTC TTCCTGAAGC AGATGCTAGA GAGGTACCAA AACAACAGCC CTCCAAACAA 420 

TGATGACCAG TGGAAAAACA ATGGAGTCAC CAAAACCTGG GACAGGCTCA TGCTCCAGGA 480 

CAATTGCTGT GGCGTAAATG GTCCATCAGA CTGGCAAAAA TACACATCTG CCTTCCGGAC 540 

TGAGAATAAT GATGCTGACT ATCCCTGGCC TCGTCAATGC TGTGTTATGA ACAATCTTAA 600 

AGAACCTCTC AACCTGGAGG CTTGTAAACT AGGCGTGCCT GGTTTTTATC ACAATCAGGG 660 

CTGCTATGAA CTGATCTCTG GTCCAATGAA CCGACACGCC TGGGGGGTTG CCTGGTTTGG 720 

ATTTGCCATT CTCTGCTGGA CTTTTTGGGT TCTCCTGGGT ACCATGTTCT ACTGGAGCAG 780 
AATTGAATAT TAAGAA 

Seq ID NO: 535 Protein sequence 
Protein Accession #: NP_008883.1 

1 11 21 31 41 51

I I I I I I

MAKDNSTVRC FQGLLIFGNV IIGCCGIALT AECIFFVSDQ HSLYPLLEAT DNDDIYGAAW 60
IGIFVGICLF CLSVLGIVGI MKSSRKILLA YFILMFIVYA FEVASCITAA TQRDFFTPNL 120
FLKQMLERYQ NNSPPNNDDQ WKNNGVTKTW DRLMLQDNCC GVNGPSDWQK YTSAFRTENN 180
DADYPWPRQC CVMNNLKEPL NLEACKLGVP GFYHNQGCYE LISGPMNRHA WGVAWFGFAI 240
LCWTFWVLLG TMFYWSRIEY

Seq ID NO: 536 DNA sequence

Nucleic Acid Accession #: NM_002638.1

Coding sequence: 120..473



1 11 21 31 41 51

I I I I I I

CAATACAGCT AAGGAATTAT CCCTTGTAAA TACCACAGAC CCGCCCTGGA GCCAGGCCAA 60
GCTGGACTGC ATAAAGATTG GTATGGCCTT AGCTCTTAGC CAAACACCTT CCTGACACCA 120
TGAGGGCCAG CAGCTTCTTG ATCGTGGTGG TGTTCCTCAT CGCTGGGACG CTGGTTCTAG 180
AGGCAGCTGT CACGGGAGTT CCTGTTAAAG GTCAAGACAC TGTCAAAGGC CGTGTTCCAT 240
TCAATGGACA AGATCCCGTT AAAGGACAAG TTTCAGTTAA AGGTCAAGAT AAAGTCAAAG 300
CGCAAGAGCC AGTCAAAGGT CCAGTCTCCA CTAAGCCTGG CTCCTGCCCC ATTATCTTGA 360
TCCGGTGCGC CATGTTGAAT CCCCCTAACC GCTGCTTGAA AGATACTGAC TGCCCAGGAA 420
TCAAGAAGTG CTGTGAAGGC TCTTGCGGGA TGGCCTGTTT CGTTCCCCAG TGAAGGGAGC 480
CGGTCCTTGC TGCACCTGTG CCGTCCCCAG AGCTACAGGC CCCATCTGGT CCTAAGTCCC 540
TGCTGCCCTT CCCCTTCCCA CACTGTCCAT TCTTCCTCCC ATTCAGGATG CCCACGGCTG 600
GAGCTGCCTC TCTCATCCAC TTTCCAATAA A


Seq ID NO: 537 Protein sequence
Protein Accession #: NP_002629.1

1 11 21 31 41 51

I I I I I I

MRASSFLIVV VFLIAGTLVL EAAVTGVPVK GQDTVKGRVP FNGQDPVKGQ VSVKGQDKVK 60
AQEPVKGPVS TKPGSCPIIL IRCAMLNPPN RCLKDTDCPG IKKCCEGSCG MACFVPQ

Seq ID NO: 538 DNA sequence 

Nucleic Acid Accession #: NM_001793.2

Coding sequence: 71.. 2560 



AAAGGGGCAA 
CTCTGCAGCC 
CTGGCTGCAG 
CTTGGAGGCG 
CTGCCCTGGG 
TGGCGAGACA 
ATCCAAACGT 
TGAAAATGGC 
AGACACCAAG 
CTTCGCTGTA 
GATTGCCAAG 
CCCCATGAAC 
GGACACCTTC 
GACAGCCACG 
CCATAGCCAA 
CACCATCAGC 
CATCCAGGCC 
GATCCTTGAT 
GCCTGAGAAT 
CAACTCACCA 
TACCATCACC 
TTTTGAGGCC 
GCTGAAGCTC 
ACCTGTGTTT 
GCCTGTGTGT 
CATCCTGAGA 
TGTGGGCACC 
GGTCTTGGCC 
ACTGATTGAT 
CCAAAGCCCT 
CCCTTTCCAG 
GGAAGGTGAC 
GCACCTTTCT 
GTGCGACTGC 
CCCTGTGCTG 
GAGAAAGAAG 
CGTCTTCTAC 
GCTCCACCGA 
CATCATCCCG 
TATAATTGAG 
CTTGGTGTTC 
CTCCGCCTCC 
GAAGCTGGCA 
GGGACCAAAC 
GACTTCGGAG 
ACGTTAGAGT 
AGCACTGAAA 
TCTTACCTGC 
TACAGTGGAC 



11 
I 

GAGCTGAGCG 
ATGGGGCTCC 
TGCGCGGCCT 
GGAGGCGCGG 
CAAGAGCCAG 
GTCCAGGAAA 
ATCTTACGAA 
AAGGGTCCCT 
ATTTTCTACA 
GAGAAGGAGA 
TATGAGCTCT 
ATCTCCATCA 
CGAGGGAGTG 
GATGAGGATG 
GAACCAAAGG 
GTCATCTCCA 
ACAGACATGG 
GCCAATGACA 
GCAGTGGGCC 
GCGTGGCGTG 
ACCCACCCTG 
AAAAACCAGC 
CCAACCTCCA 
GTCCCACCCT 
GTCTACACTG 
GACCCAGCAG 
CTCGACCGTG 
ATGGACAATG 
GTCAATGACC 
GTGCGCCAGG 
GCCCAGCTCA 
ACAGTGGTCT 
CTGTCTGACC 
CATGGCCATG 
GGGGCTGTCC 
CGGAAGATCA 
TATGGCGAAG 
GGTCTGGAGG 
ACACCCATGT 
AACCTGAAGG 
GACTATGAGG 
GACCAAGACC 
GACATGTACG 
GTCAGGCCAC 
CTTGTCAGGA 
GGTTGCTTCC 
ACCTCTCCAC 
CGTAAAATGC 
TTTCTCTCTG 



21 
I 

GAACACCGGC 
CTCGTGGACC 
CCGAGCCGTG 
AGCAGGAGCC 
CTCTGTTTAG 
GAAGGTCACT 
GACACAAGAG 
TCCCCCAGAG 
GCATCACGGG 
CAGGCTGGTT 
TTGGCCACGC 
TCGTGACCGA 
TCTTAGAGGG 
ATGCCATCTA 
ACCCACACGA 
GTGGCCTGGA 
ATGGGGACGG 
ATGCTCCCAT 
ATGAGGTGCA 
CCACCTACCT 
AGAGCAACCA 
ACACCCTGTA 
CAGCCACCAT 
CCAAAGTCGT 
CAGAAGACCC 
GGTGGCTAGC 
AGGATGAGCA 
GAAGCCCTCC 
ATGGCCCAGT 
TGCTGAACAT 
CAGATGACTC 
TGTCCCTGAA 
ATGGCAACAA 
TCGAAACCTG 
TGGCTCTGCT 
AGGAGCCCCT 
AGGGGGGTGG 
CCAGGCCGGA 
ACCGTCCTCG 
CGGCTAACAC 
GCAGCGGCTC 
AAGATTACGA 
GTGGCGGGGA 
AGAGCATCTC 
AGTGGCCGTA 
TTAGCCTTTC 
CTGGGCCAGG 
TCAACCCTGT 
GAATGGAACC 



31 
I 

CCGCCGTCGC 
TCTCGCGTCT 
CCGGGCGGTC 
CGGCCAGGCG 
CACTGATAAT 
GAAGGAAAGG 
AGATTGGGTG 
ACTGAATCAG 
GCCGGGGGCA 
GTTGTTGAAT 
TGTGTCAGAG 
CCAGAATGAC 
AGTCCTACCA 
CACCTACAAT 
CCTCATGTTC 
CCGGGAAAAA 
CTCCACCACC 
GTTTGACCCC 
GAGGCTGACG 
TATCATGGGC 
GGGCATCCTG 
CGTTGAAGTG 
AGTGGTCCAC 
TGAGGTCCAG 
TGACAAGGAG 
CATGGACCCA 
GTTTGTGAGG 
CACCACTGGC 
CCCTGAGCCC 
CACGGACAAG 
AGACATCTAC 
GAAGTTCCTG 
AGAGCAGCTG 
CCCTGGACCC 
GTTCCTCCTG 
CCTACTCCCA 
CGAAGAGGAC 
GGTGGTTCTC 
GCCAGCCAAC 
AGACCCCACA 
CGACGCCGCG 
TTATCTGAAC 
GGACGACTAG 
CAAGGGGTCT 
GCAACTTGGC 
AGGATGGAGG 
GTTGCCTCAG 
GTCCTGGGCC 
TTCTTAGGCC 



41 

! 

GGCAGCTGCT 
CTCCTCCTTC 
TTCAGGGAGG 
CTGGGGAAAG 
GATGACTTCA 
AATCCATTGA 
GTTGCTCCAA 
CTCAAGTCTA 
GACAGCCCCC 
AAGCCACTGG 
AATGGTGCCT 
CACAAGCCCA 
GGTACTTCTG 
GGGGTGGTTG 
ACCATTCACC 
GTCCCTGAGT 
ACGGCAGTGG 
CAGAAGTACG 
GTCACTGATC 
GGTGACGACG 
ACAACCAGGA 
ACCAACGAGG 
GTGGAGGATG 
GAGGGCATCC 
AATCAAAAGA 
GACAGTGGGC 
AACAACATCT 
ACGGGAACCC 
CGTCAGATCA 
GACCTGTCTC 
TGGACGGCAG 
AAGCAGGATA 
ACGGTGATCA 
TGGAAGGGAG 
CTGGTGCTGC 
GAAGATGACA 
CAGGACTATG 
CGCAATGACG
CCAGATGAAA 
GCCCCGCCCT 
TCCCTGAGCT 
GAGTGGGGCA 
GCGGCCTGCC 
CAGTTCCCCC 
GGAGACAGGC 
AATGTGGGCA 
AGGCCAAGTT 
TGGGCCTGCT 
TCCTGGTGCA 



51 
I 

TCACCCCTCT 
TCCAGGTTTG 
CTGAAGTGAC 
TATTCATGGG 
CTGTGCGGAA 
AGATCTTCCC 
TATCTGTCCC 
ATAAAGATAG 
CTGAGGGTGT 
ACCGGGAGGA 
CAGTGGAGGA 
AGTTTACCCA 
TGATGCAGGT 
CTTACTCCAT 
GGAGCACAGG 
ACACACTGAC 
CAGTAGTGGA 
AGGCCCATGT 
TGGACGCCCC 
GGGACCATTT 
AGGGTTTGGA 
CCCCTTTTGT 
TGAATGAGGC 
CCACTGGGGA 
TCAGCTACCG 
AGGTCACAGC 
ATGAAGTCAT 
TTCTGCTAAC 
CCATCTGCAA 
CCCACACCTC 
AGGTCAACGA 
CATATGACGT 
GGGCCACTGT 
GTTTCATCCT 
TTTTGTTGGT 
CCCGTGACAA 
ACATCACCCA 
TGGCACCAAC 
TCGGCAACTT 
ACGACACCCT 
CCCTCACCTC 
GCCGCTTCAA 
TGCAGGGCTG 
TTCAGCTGAG 
TATGAGTCTG 
GTTTGACTTC 
TCCAGAAGCC 
GTGACTGACC 
ACTTAATTTT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 



60 



60 
120 
180
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
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1320 
1380 
1440 
1500 
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1800 
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2400 
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2580 
2640 
2700 
2760 
2820 
2880 
2940 






WO 02/086443 

TTTTTTTAAT GCTATCTTCA AAACGTTAGA GAAAGTTCTT CAAAAGTGCA GCCCAGAGCT 3000 

GCTGGGCCCA CTGGCCGTCC TGCATTTCTG GTTTCCAGAC CCCAATGCCT CCCATTCGGA 3060 

TGGATCTCTG CGTTTTTATA CTGAGTGTGC CTAGGTTGCC CCTTATTTTT TATTTTCCCT 3120 

GTTGCGTTGC TATAGATGAA GGGTGAGGAC AATCGTGTAT ATGTACTAGA ACTTTTTTAT 3180 
TAAAGAAACT TTTCCCAGAA AAAAA 



PCT/US02/12476 



Seq ID NO: 539 Protein sequence 
Protein Accession #: NP_001784.2



10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 
85 



MGLPRGPLAS 
QEPALFSTDN 
KGPFPQRLNQ 
YELFGHAVSE 
DEDDAIYTYN 
TDMDGDGSTT 
AWRATYLIMG 
PTSTATIVVH
DPAGWLAMDP 
VNDHGPVPEP 
TVVLSLKKFL
GAVLALLFLL 
GLEARPEVVL
DYEGSGSDAA 



11 

I 

LLLLQVCWLQ 
DDFTVRNGET 
LKSNKDRDTK
NGASVEDPMN 
GVVAYSIHSQ
TAVAVVEILD
GDDGDHFTIT 
VEDVNEAPVF 
DSGQVTAVGT 
RQITICNQSP 
KQDTYDVHLS 
LVLLLLVRKK 
RNDVAPTIIP 
SLSSLTSSAS 



21 
I 

CAASEPCRAV 
VQERRSLKER 
IFYSITGPGA 
ISIIVTDQND 
EPKDPHDLMF 
ANDNAPMFDP 
THPESNQGIL 
VPPSKVVEVQ
LDREDEQFVR 
VRQVLNITDK
LSDHGNKEQL 
RKIKEPLLLP 
TPMYRPRPAN 
DQDQDYDYLN 



31 
I 

FREAEVTLEA 
NPLKIFPSKR 
DSPPEGVFAV 
HKPKFTQDTF 
TIHRSTGTIS 
QKYEAHVPEN 
TTRKGLDFEA 
EGIPTGEPVC 
NNIYEVMVLA 
DLSPHTSPFQ 
TVIRATVCDC 
EDDTRDNVFY 
PDEIGNFIIE 
EWGSRFKKLA 



41 
I 

GGAEQEPGQA 
ILRRHKRDWV 
EKETGWLLLN
RGSVLEGVLP 
VISSGLDREK 
AVGHEVQRLT 
KNQHTLYVEV 
VYTAEDPDKE 
MDNGSPPTTG 
AQLTDDSDIY
HGHVETCPGP 
YGEEGGGEED 
NLKAANTDPT 
DMYGGGEDD 



51 
I 

LGKVPMGCPG 
VAPISVPENG 
KPLDREEIAK 
GTSVMQVTAT 
VPEYTLTIQA 
VTDLDAPNSP 
TNEAPFVLKL 
NQKISYRILR 
TGTLLLTLID 
WTAEVNEEGD 
WKGGFILPVL 
QDYDITQLHR 
APPYDTLLVF 



Seq ID NO: 540 DNA sequence
Nucleic Acid Accession #: Eos sequence
Coding sequence: 1..672 



1 
I 

ATGAGGCTCC 
CGGGGCTCCC 
AAGGGCGGGG 
CTGCTCGCCT 
GCGAGACAAC 
TGTCATGTTT 
ACAGAGCCAT 
AAGCAGTGCT 
CTCCTGGAAG 
TTAGAGGGGC 
AGCTGTGGTG 
AGCCTGTCTT 



11 
I 

AAAGACCCCG 
CCTACGGGCC 
AGGGGGCGCC 
TGCTGCTGGT 
GAGATCCAGA 
GTGAGAGAGA 
ACTGCGTTAT
CCGCTGGTTG 
AGCCCATGCC 
CACCTATCAA 
GGCTGTGGCT 
GA 



21 
I 

ACAGGCCCCG 
AGACCCGGGG 
GCGCGCTGAC 
CGTGGCCCTA 
GGACTCCCAG 
AAACACTTTC 
AGCGGCCGTG 
TGCAGCGATG 
CTTCTTTTAC 
CTCATCAGTG 
GGCCATCCTC 



31 

! 

GCGGGTGGGA 
AGAGGCGCGC 
CCTCCCTGGG 
CCGCGGGTGT 
CGAACGGACG 
GAGTGCCAGA 
AAAATATTTC 
GAGAGACCCA 
CTCAAGTGTT 
TTCAAAGAAT 
CTGCTGCTGG 



41 
I 

GGCGCGCGCC 
GGAGGCTGCG 
CACCGCTGGG 
GGACAGACGC 
AGGGTGACAA 
ACCCAAGGAG 
CACGTTTTTT 
AGCCAGAGGA 
GTAAAATTCG 
ATGCTGGGAG 
CCTCCATTGC 



51 
I 

CCGGGGCGGG 
AAGGTTCCAG 
GACGATGGCG 
CAACCTGACT 
TAGAGTGTGG 
GTGCAAATGG 
CATGGTTGCG 
GAAGCGGTTT 
CTACTGCAAT 
CATGGGTGAG 
AGCCGGCCTC 



Seq ID NO: 541 Protein sequence 
Protein Accession #: Eos sequence 



11 



21 



51 



31 41 

! ! I I 1 I 

MRLQRPRQAP AGGRRAPRGG RGSPYRPDPG RGARRLRRFQ KGGEGAPRAD PPWAPLGTMA 
LLALLLVVAL PRVWTDANLT ARQRDPEDSQ RTDEGDNRVW CHVCERENTF ECQNPRRCKW
TEPYCVIAAV KIFPRFFMVA KQCSAGCAAM ERPKPEEKRF LLEEPMPFFY LKCCKIRYCN 
LEGPPINSSV FKEYAGSMGE SCGGLWLAIL LLLASIAAGL SLS 

Seq ID NO: 542 DNA sequence 

Nucleic Acid Accession #: XM_035292.2 

Coding sequence: 53.. 1576 



I 

GCTCGCTGGG 
TGCGGGCCCG 
GGAGAAGATG 
CGTGACCCTG 
TATCGGCTCG 
GCTGGCGCTG 
CGCGGAGCTC 
CTACGGCTCG 
ATCGCAGTAC 
CTGCCCGGTG 
GGCCGTGAAC 
CAAGCTCCTG 
TGTGTCCAAT 
TGTGCTGGCA 
CACAGAGGAA 
CATCGTGACG 
GCAGATGCTG 
GTCCTGGATC 
GTTCACATCC 
CTCCATGATC 
GACGCTGCTC 
CAACTGGCTC 
TGAGCTTGAG 
CCTCTTCCTG 
CATCATCCTC 
GTGGCTCCTC 



11 

I 

CCGCGGCTCC 
AAGCGGCGCG 
CTGGCCGCCA 
CAGCGGAACA 
GGCATCTTCG 

GGCACCACCA 
CTGCCCGCCT 
ATCGTGGCCC 
CCCGAGGAGG 
TGCTACAGCG 
GCCCTGGCCC 
CTAGATCCCA 
TTATACAGCG 
ATGATCAACC 
CTGGTGTACG 
TCGTCCGAGG 
ATCCCCGTCT 
TCCAGGCTCT 
CACCCACAGC 
TACGCCTTCT 
TGCGTGGCCC 
CGGCCCATCA 
ATCGCCGTCT 
AGCGGGCTGC 
CAGGGCATCT 



21 
I 

CGGGTGTCCC 
CGCTAGCGGC 
AGAGCGCGGA 
TCACGCTGCT 
TGACGCCCAC 
CCGCGTGCGG 
TCTCCAAATC 
TCCTCAAGCT 
TGGTCTTCGC 
CAGCCAAGCT 
TGAAGGCCGC 
TGATCATCCT 
ACTTCTCATT 
GCCTCTTTGC 
CCTACAGAAA 
TGCTGACCAA 
CCGTGGCCGT 
TCGTGGGCCT 
TCTTCGTGGG 
TCCTCACCCC 
CCAAGGACAT 
TGGCCATCAT 
AGGTGAACCT 
CCTTCTGGAA 
CCGTCTACTT 
TCTCCACGAC 



31 
I 

AGGCCCGGCC 
GCCGGCGGCC 
CGGCTCGGCG 
CAACGGCGTG 
GGGCGTGCTC 
CGTCTTCTCC 
GGGCGGCGAC 
CTGGATCGAG 
CACCTACCTG 
CGTGGCCTGC 
CACCCGGGTC 
GCTGGGCTTC 
TGAAGGCACC 
CTATGGAGGA 
CCTGCCCCTG 
CCTGGCCTAC 
GGACTTCGGG 
GTCCTGCTTC 
GTCCCGGGAA 
CGTGCCGTCC 
CTTCTCCGTC 
CGGCATGATC 
GGCCCTGCCT 
GACACCCGTG 
CTTCGGGGTC 
CGTCCTGTGT 



41 
I 

GGTGCGCAGA 
GAGGAGAAGG 
CCGGCAGGCG 
GCCATCATCG 
AAGGAGGCAG 
ATCGTGGGCG 
TACGCCTACA 
CTGCTCATCA 
CTCAAGCCGC 
CTCTGCGTGC 
CAGGATGCCT 
GTCCAGATCG 
AAACTGGATG 
TGGAATTACT 
GCCATCATCA 
TTCACCACCC 
AACTATCACC 
GGCTCCGTCA 
GGCCACCTGC 
CTCGTGTTCA 
ATCAACTTCT 
TGGCTGCGCC 
GTGTTCTTCA 
GAGTGTGGCA 
TGGTGGAAAA 
CAGAAGCTCA 



51 
I 

GCATGGCGGG 
AAGAGGCGCG 
AGGGCGAGGG 
TGGGGACCAT 
GCTCGCCGGG 
CGCTCTGCTA 
TGCTGGAGGT 
TCCGGCCTTC 
TCTTCCCCAC 
TGCTGCTCAC 
TTGCCGCCGC 
GAAAGGGTGA 
TGGGGAACAT 
TGAATTTCGT 
TCTCCCTGCC 
TGTCCACCGA 
TGGGCGTCAT 
ATGGGTCCCT 
CCTCCATCCT 
CGTGTGTGAT 
TCAGCTTCTT 
ACAGAAAGCC 
TCCTGGCCTG 
TCGGCTTCAC 
ACAAGCCCAA 
TGCAGGTGGT 



60 
120 
180
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
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420 
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60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
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720 
780 
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1020 
1080 
1140 
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1380 
1440 
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1560 
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WO 02/086443 

CCCCCAGGAG ACATAGCCAG GAGGCCGAGT GGCTGCOGGA GGAGCATGC 
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Seq ID NO: 543 Protein sequence 
Protein Accession #: XP_035292.2



MAGAGPKRRA 
GTIIGSGIFV 
LEVYGSLPAF 
LLTAVNCYSV 
GNIVLALYSG 
STEQMLSSEA 
SILSMIHPQL 
RKPELERPIK 
KPKWLLQGIF 



11 
I 

LAAPAAEEKE 
TPTGVLKEAG 
LKLWIELLII
KAATRVQDAF 
LFAYGGWNYL 
VAVDFGNYHL 
LTPVPSLVPT 
VNLALPVFFI 
STTVLCQKLM 



21 

t 

EAREKMLAAK 
SPGLALWWA 
RPSSQYIVAL 
AAAKLLALAL 
NFVTEEMINP 
GVMSWIIPVF 
CVMTLLYAFS 
LACLFLIAVS 
QWPQET 



31 
I 

SADGSAPAGE 
ACGVFSIVGA 
VFATYLLKPL 
IILLGFVQIG
YRNLPLAIII 
VGLSCFGSVN
KDIFSVINFF 
FWKTPVECGI 



41 
I 

GEGVTLQRNI 
LCYAELGTTI 
FPTCPVPEEA 
KGDVSNLDPN 
SLPIVTLVYV 
GSLFTSSRLF 
SFFNWLCVAL 
GFTIILSGLP 



51 
I 

TLLNGVAIIV 
SKSGGDYAYM 
AKLVACLCVL
FSFEGTKLDV 
LTNLAYFTTL
FVGSREGHLP 
AIIGMIWLRH 
VYFFGVWWKN 



Seq ID NO: 544 DNA sequence 

Nucleic Acid Accession #: NM_005268.1 

Coding sequence: 168.. 989 



TAAAAAGCAA 
TCTGGATATG 
AGCCCTGAGG 
TCTTTGAGGG 
TGTCTCTGGT 
GTGATGACCA 
TTGATGAGTT 
CATGCCCCTC 
ACCGAGAAGC 
GTGGGCTCTG 
TTCTCTATGT 
ACGCAGATCC 
TTTTCACCCT 
TCATCTACCT 
TGTGCACAGG 
CGGGTGACCT 
GAGACCATGT 
CCTGGATGGG 
CATGAGGTAG 
TCAACTCCAG 
GCTCGGTTTC 



11 
I 

AAGAATTCGC 
AAATTCAAGC 
AGTAGTCACT 
ACTCCTGAGT 
CTTCATCTTC 
CAAGGACTTC 
CTTCCCTGTG 
ACTGCTCGTG 
CCATGGGGAG
GTGGACATAT 
GTTCCACTCA 
ATGTCCCAAT 
CTTCATGGTG 
GGTGAGCAAG 
TCATCACCCC 
CATCTTTCTG 
GAAGAAAACC 
GAGGCTCTAG 
GGGCAGGCAA 
CCACCTGCCC 
CTTTTCTAGA 



21 

! 

GGCCGCGTCG 
TGCTTGCTGA 
CAGTAGCAGC 
GGGGTCAACA 
CGCGTGCTGG 
GACTGCAATA 
TCCCATGTGC 
GTCATGCACG 
AACAGTGGGC 
GTCTGCAGCC 
TTCTACCCCA 
ATAGTGGACT 
GCCACAGCTG 
AGATGCCACG 
CACGGTACCA 
GGCTCAGACA 
ATCTTGTGAG 
CATCTCTCAT 
GAGAGAGGAT 
CAGCTCGACG 
ATGGAAATAG 



31 
I 

ACACGGGCTT 
GTCCTATTGC 
TGACGCGTGG 
AGTACTCCAC 
TGTACCTGGT 
CTCGCCAGCC 
GCCTCTGGGC 
TGGCCTACCG 
GCCTCTACCT 
TAGTGTTCAA 
AATATATCCT 
GCTTCATCTC 
CCATCTGCAT 
AGTGCCTGGC 
CCTCTTCCTG 
GTCATCCTCC 
GGGCTGCCTG 
AGGTGCAACC 
TCAGACGCTC 
GCACTGGGCC 
TGAGGGCCAA 



41 

I 

CCCCGAAAAC 
CGGCTGCTGG 
GTCCACCATG 
AGCCTTTGGG 
GACGGCCGAG 
CGGCTGCTCC 
CCTGCAGCTT 
GGAGGTTCAG 
GAACCCCGGC 
GGCGAGCGTG 
CCCTCCTGTG 
CAAGCCCTCA 
CCTGCTCAAC 
AGCAAGGAAA 
CAAACAAGAC 
TCTCTTACCA 
GACTGGTCTG 
TGAGAGTGGG 
TGGGAGCCAG 
AGTTCCCCCT 
TGC 



51 
I 

CTTCCCCGCT 
GAGCCAGGAG 
AACTGGAGTA 
CGCATCTGGC 
CGTGTGTGGA 
AACGTCTGCT 
ATCCTGGTGA 
GAGAAGAGGC 
AAGAAGCGGG 
GACATCGCCT 
GTCAAGTGCC 
GAGAAGAACA 
CTCGTGGAGC 
GCTCAAGCCA 
GACCTCCTTT 
GACCGCCCCC 
GCAGGTTGGG 
GGAGCTAAGC 
TTCCTAGTCC 
CTGCTCTGCA 



Seq ID NO: 545 Protein sequence 
Protein Accession #: NP_005259.1



21 



31 



41 



51 



MNWSIFEGLL 
SNVCFDEFFP 
GKKRGGLWWT 
SEKNIFTLFM 
DDLLSGDLIF 



11 

111(1 
SGVNKYSTAF GRIWLSLVFI FRVLVYLVTA ERVWSDDHKD FDCNTRQPGC 
VSHVRLWALQ LILVTCPSLL VVMHVAYREV QEKRHREAHG ENSGRLYLNP
YVCSLVFKAS VDIAFLYVFH SFYPKYILPP VVKCHADPCP NIVDCFISKP
VATAAICILL NLVELIYLVS KRCHECLAAR KAQAMCTGHH PHGTTSSCKQ 
LGSDSHPPLL PDRPRDHVKK TIL 



Seq ID NO: 546 DNA sequence 

Nucleic Acid Accession #: NM_002391. 

Coding sequence: 26.. 457 



CGGGCGAAGC 
CGCCCTGCTG 
CCCGGGGAGC 
CGGCGTGGGT 
GCCCTGCAAC 
TGCGTGTGAT 
CAATGCTCAG 
AAAGGCCAAA 
GCCCCTGGTG 
CACCAGTGCC 
ACTCCCCAGC 
TGAGCCTCCC 
ATTACTAAGA 
TAATAT 



11 
I 

AGCGCGGGCA 
GCGCTCACCT 
GAGTGCGCTG 
TTCCGCGAGG 
TGGAAGAAGG 
GGGGGCACAG 
TGCCAGGAGA 
GCCAAGAAAG 
TCACATGGGG 
TTCTGTCTGC 
CCCACCCCTA 
CCAAAGCAAT 
AACACATCAA 



21 
I 

GCGAGATGCA 
CCGCGGTCGC 
AGTGGGCCTG 
GCACCTGCGG 
AGTTTGGAGC 
GCACCAAAGT 
CCATCCGCGT 
GGAAGGGAAA 
CCTGGCCACG 
TCGTTAGCTT 
AGTGCCCAAA 
GTGAGTCCCA 
ATAAACTGAC 



31 
I 

GCACCGAGGC 
CAAAAAGAAA 
GGGGCCCTGC 
GGCCCAGACC 
CGACTGCAAG 
CCGCCAAGGC 
CACCAAGCCC 
GGACTAGACG 
CCCTCCCTCT 
TAATCAATCA 
GTGGGGAGGG 
GAGCCCGCTT 
TTTTTCCCCC 



41 
I 

TTCCTCCTCC 
GATAAGGTGA 
ACCCCCAGCA 
CAGCGCATCC 
TACAAGTTTG 
ACCCTGAAGA 
TGCACCCCCA 
CCAAGCCTGG 
CCCAGGCCCG 
TGCCCTGCCT 
ACAAGGGATT 
TTGTTCTTCC 
CAATAAAAGC 



51 
I 

TCACCCTCCT 
AGAAGGGCGG 
GCAAGGATTG 
GGTGCAGGGT 
AGAACTGGGG 
AGGCGCGCTA 
AGACCAAAGC 
ATGCCAAGGA 
AGATGTGACC 
TGTCCCTCTC 
CTGGGAAGCT 
CCACAATTCC 
TCTTCTTTTT 



Seq ID NO: 547 Protein sequence 
Protein Accession #: NP_002382.1 



11 



21 
I 



31 



41 



51 



60
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420 
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60 
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MQHRGFLLLT LLALLALTSA VAKKKDKVKK GGPGSECAEW AWGPCTPSSK DCGVGFREGT 
CGAQTQRIRC RVPCNWKKEF GADCKYKFEN WGACDGGTGT KVRQGTLKKA RYNAQCQETI 
RVTKPCTPKT KAKAKAKKGK GKD 



60 
120 



Seq ID NO: 548 DNA sequence
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WO 02/086443 

Nucleic Acid Accession #: NM_006783.1
Coding sequence: 1..786
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I 

ATGGATTGGG 
GGGAAGGTGT 
CAGGAAGTGT 
AAAAATGTGT 
CTGATCTTCG 
GAAACCACTC 
ATTAAAAAGC 
TTTTTCCGAA 
TACCACCTGC 
TTTATTTCTA 
ATTTGCATGC 
AGATCAAAGA 
CAGAATGAAA 
AGCTAA 



11 
I 

GGACGCTGCA 
GGATCACAGT 
GGGGTGACGA 
GCTATGACCA 
TCTCCACCCC 
GCAAGTTCAG 
ACAAGGTTCG 
TCATCTTTGA 
CCTGGGTGTT 
GGCCAACAGA 
TGCTTAACGT 
GAGCACAGAC 
TGAATGAGCT 



21 
I 

CACTTTCATC 
CATCTTTATT 
GCAAGAGGAC 
CTTTTTCCCG 
AGCGCTGCTG 
GCGAGGAGAG 
GATAGAGGGG 
AGCAGCCTTT 
GAAATGTGGG 
GAAGACCGTG 
GGCAGAGTTG 
GCAAAAAAAT 
GATTTCAGAT 



31 
I 

GGGGGTGTCA 
TTCCGAGTCA 
TTCGTCTGCA 
GTGTCCCACA 
GTGGCCATGC 
AAGAGGAATG 
TCGCTGTGGT 
ATGTATGTGT 
ATTGACCCCT 
TTTACCATTT 
TGCTACCTGC 
CACCCCAATC 
AGTGGTCAAA 



41 

I 

ACAAACACTC 
TGATCCTAGT 
ACACACTGCA 
TCCGGCTGTG 
ATGTGGCCTA 
ATTTCAAAGA 
GGACGTACAC 
TTTACTTCCT 
GCCCCAACCT 
TTATGATTTC 
TGCTGAAAGT 
ATGCCCTAAA 
ATGCAATCAC 



51 
I 

CACCAGCATC 
GGTGGCTGCC 
ACCGGGATGC 
GGCCCTCCAG 
CTACAGGCAC 
CATAGAGGAC 
CAGCAGCATC 
TTACAATGGG 
TGTTGACTGC 
TGCGTCTGTG 
GTGTTTTAGG 
GGAGAGTAAG 
AGGTTTCCCA 



Seq ID NO: 549 Protein sequence 
Protein Accession #: NP_006774.1 



MDWGTLHTFI 
KNVCYDHFFP 
IKKHKVRIEG 
FISRPTEKTV 
QNEMNELISD



11 21 31 41 51 

I I I I I 

GGVNKHSTSI GKVWITVIFI FRVMILVVAA QEVWGDEQED FVCNTLQPGC
VSHIRLWALQ LIFVSTPALL VAMHVAYYRH ETTRKFRRGE KRNDFKDIED
SLWWTYTSSI FFRIIFEAAF MYVFYFLYNG YHLPWVLKCG IDPCPNLVDC 
FTIFMISASV ICMLLNVAEL CYLLLKVCFR RSKRAQTQKN HPNHALKESK 
SGQNAITGFP S 



Seq ID NO: 550 DNA sequence 

Nucleic Acid Accession #: NM_002571.1

Coding sequence: 99..587



CATCCCTCTG 
TCACCCTGGG 
AGGACCTGGA 
ACATCTCCCT 
CCACCCCCGA 
AGAAGAAGGT 
TGGCGAACGA 
AGGACACCAC 
AGGACGATGA 
GGTACTTGCT 
CCAGGAAGAC 
TTTCAAAGAA 
TCCTGCTGCA 
GCAGAGGTTA 



11 

I 

GCTCCAGAGC 
CGTGGCCCTG 
GCTCCCAAAG 
CATGGCGACA 
GGACAACCTG 
CCTTGGAGAG 
GGCCACGCTG 
CACCCCCATC 
GATCATGCAG 
GGACTTGAAA 
CAGACTCCCA 
TAACCACAGC 
CACCTGCACC 
TTAATAAACC 



21 
I 

TCAGAGCCAC 
GTCTGTGGTG 
TTGGCAGGGA 
CTGAAGGCCC 
GAGATCGTTC 
AAGACTGGGA 
CTCGATACTG 
CAGAGCATGA 
GGATTCATCA 
CAGATGGAAG 
CCCTTCCACA 
TCAGAAGACG 
ATTGCCATGG 
CTTGGAGCAT 



31 
I 

CCACAGCCGC 
TCCCGGCCAT 
CCTGGCACTC 
CTCTGAGGGT 
TGCACAGATG 
ATCCAAAGAA 
ACTACGACAA 
TGTGCCAGTA 
GGGCTTTCAG 
AGCCGTGCCG 
GCTCCAGAGC 
ATGACGTGGT 
GGAGGCTGCT 
G 



41 
I 

AGCCATGCTG 
GGACATCCCC 
CATGGCCATG 
CCACATCACC 
GGAGAACAAC 
GTTCAAGATC 
TTTCCTGTTT 
CCTGGCCAGA 
GCCCCTGCCC 
TTTCTAGCTC 
AGTGGGACTT 
CATCTGTGTC 
CCCTGGGGGC 



51 

I . 

TGCCTCCTGC 
CAGACCAAGC 
GCGACCAACA 
TCACTGTTGC 
AGCTGTGTTG 
AACTATACGG 
CTCTGCCTAC 
GTCCTGGTGG 
AGGCACCTAT 
ACCTCCGCCT 
CCTCCTGCCC 
GCCATCCCCT 
AGAGTCTCTG 



Seq ID NO: 551 Protein sequence 
Protein Accession #: NP_002562.1 

21 



1 11 21 31 41 51 

I I I I I I 

MDIPQTKQDL ELPKLAGTWH SMAMATNNIS LMATLKAPLR VHITSLLPTP EDNLEIVLHR 
WENNSCVEKK VLGEKTGNPK KFKINYTVAN EATLLDTDYD NFLFLCLQDT TTPIQSMMCQ 
YLARVLVEDD EIMQGFIRAF RPLPRHLWYL LDLKQMEEPC RF 

Seq ID NO: 552 DNA sequence 

Nucleic Acid Accession #: NM_006500.1

Coding sequence: 27..1967



1 
I 

ACTTGCGTCT 
TCGCCGCCTG 
CGCCTGAGCT 
AGTCCCAAGG 
TCATCTTCCG 
TCAGCCTCCA 
GCATCTTCTT 
TCTACAAAGC 
GTAAGGAGCC 
TCATCTGGTA 
CGTCCCAGAC 
TGGTTAAAGA 
GGAACCACAT 
TGTGGCTGGA 
GTTTGGCTGA 
GGGAGGCAGA 
AGGAACACAG 
TGAGTGAACC 
CCCCTGAGAG 
ACCTCGAGTT 



11 
I 

CGCCCTCCGG 
CTGCTGCTGT 
GGTGGAGGTG 
CAACCTCAGC 
TGTGCGCCAG 
GGACAGAGGG 
GTGCCAGGGC 
TCCGGAGGAG 
TGAGGAGGTC 
CAAGAATGGC 
TGTGGAGTCG 
AGACAAAGAT 
GAAGGAGTCC 
AGTGGAGCCC 
TGGCAACCCT 
GGAAGAGACA 
TGGGCGCTAT 
ACAGGAACTA 
ACAGGAAGGC 
CCAGTGGCTG 



21 
I 

CCAAGCATGG 
CCTCGCGTCG 
GAAGTGGGCA 
CATGTCGACT 
GGCCAGGGCC 
GCTACTCTGG 
AAGCGCCCTC 
CCAAACATCC 
GCTACCTGTG 
CGGCCTCTGA 
AGTGGTTTGT 
GCCCAGTTTT 
AGGGAAGTCA 
GTGGGAATGC 
CCACCACACT 
ACCAACGACA 
GAATGTCAGG 
CTGGTGAACT 
AGCAGCCTCA 
AGAGAAGAGA 



31 
I 

GGCTTCCCAG 
CGGGTGTGCC 
GCACAGCCCT 
GGTTTTCTGT 
AGAGCGAACC 
CCCTGACTCA 
GGTCCCAGGA 
AGGTCAACCC 
TAGGGAGGAA 
AGGAGGAGAA 
ACACCTTGCA 
ACTGTGAGCT 
CCGTCCCTGT 
TGAAGGAAGG 
TCAGCATCAG 
ACGGGGTCCT 
CCTGGAACTT 
ATGTGTCTGA 
CCCTGACCTG 
CAGACCAGGT 



41 
I 

GCTGGTCTGC 
CGGAGAGGCT 
TCTGAAGTGC 
CCACAAGGAG 
TGGGGAGTAC 
AGTCACCCCC 
GTACCGCATC 
CCTGGGCATC 
CGGGTACCCC 
GAACCGGGTC 
GAGTATTCTG 
CAACTACCGG 
TTTCTACCCG 
GGACCGCGTG 
CAAGCAGAAC 
GGTGCTGGAG 
GGACACCATG 
CGTCCGAGTG 
TGAGGCAGAG 
GCTGGAAAGG 



51 
I 

GCCTTCTTGC 
GAGCAGCCTG 
GGCCTCTCCC 
AAGCGGACGC 
GAGCAGCGGC 
CAAGACGAGC 
CAGCTCCGCG 
CCTGTGAACA 
ATTCCTCAAG 
CACATTCAGT 
AAGGCACAGC 
CTGCCCAGTG 
ACAGAAAAAG 
GAAATCAGGT 
CCCAGCACCA 
CCTGCCCGGA 
ATATCGCTGC 
AGTCCCGCAG 
AGTAGCCAGG 
GGGCCTGTGC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 



60 
120 
180 
240 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 



60 
120 



60 
120 
180 
240 
300 
360 
420 
480
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 




TTCAGTTGCA TGACCTGAAA CGGGAGGCAG GAGGCGGCTA TCGCTGCGTG GCGTCTGTGC 1260 

CCAGCATACC CGGCCTGAAC CGCACACAGC TGGTCAAGCT GGCCATTTTT GGCCCCCCTT 1320 

GGATGGCATT CAAGGAGAGG AAGGTGTGGG TGAAAGAGAA TATGGTGTTG AATCTGTCTT 1380 

GTGAAGCGTC AGGGCACCCC CGGCCCACCA TCTCCTGGAA CGTCAACGGC ACGGCAAGTG 1440 

AACAAGACCA AGATCCACAG CGAGTCCTGA GCACCCTGAA TGTCCTCGTG ACCCCGGAGC 1500 

TGTTGGAGAC AGGTGTTGAA TGCACGGCCT CCAACGACCT GGGCAAAAAC ACCAGCATCC 1560 

TCTTCCTGGA GCTGGTCAAT TTAACCACCC TCACACCAGA CTCCAACACA ACCACTGGCC 1620 

TCAGCACTTC CACTGCCAGT CCTCATACCA GAGCCAACAG CACCTCCACA GAGAGAAAGC 1680 

TGCCGGAGCC GGAGAGCCGG GGCGTGGTCA TCGTGGCTGT GATTGTGTGC ATCCTGGTCC 1740 

TGGCGGTGCT GGGCGCTGTC CTCTATTTCC TCTATAAGAA GGGCAAGCTG CCGTGCAGGC 1800 

GCTCAGGGAA GCAGGAGATC ACGCTGCCCC CGTCTCGTAA GACCGAACTT GTAGTTGAAG 1860 

TTAAGTCAGA TAAGCTCCCA GAAGAGATGG GCCTCCTGCA GGGCAGCAGC GGTGACAAGA 1920 

GGGCTCCGGG AGACCAGGGA GAGAAATACA TCGATCTGAG GCATTAGCCC CGAATCACTT 1980 

CAGCTCCCTT CCCTGCCTGG ACCATTCCCA GCTCCCTGCT CACTCTTCTC TCAGCCAAAG 2040 

CCTCCAAAGG GACTAGAGAG AAGCCTCCTG CTCCCCTCAC CTGCACACCC CCTTTCAGAG 2100 

GGCCACTGGG TTAGGACCTG AGGACCTCAC TTGGCCCTGC AAGCCGCTTT TCAGGGACCA 2160 

GTCCACCACC ATCTCCTCCA CGTTGAGTGA AGCTCATCCC AAGCAAGGAG CCCCAGTCTC 2220 

CCGAGCGGGT AGGAGAGTTT CTTGCAGAAC GTGTTTTTTC TTTACACACA TTATGGCTGT 2280 

AAATACCTGG CTCCTGCCAG CAGCTGAGCT GGGTAGCCTC TCTGAGCTGG TTTCCTGCCC 2340 

CAAAGGCTGG CTTCCACCAT CCAGGTGCAC CACTGAAGTG AGGACACACC GGAGCCAGGC 2400 

GCCTGCTCAT GTTGAAGTGC GCTGTTCACA CCCGCTCCGG AGAGCACCCC AGCGGCATCC 2460 

AGAAGCAGCT GCAGTGTTGC TGCCACCACC CTCCTGCTCG CCTCTTCAAA GTCTCCTGTG 2520 

ACATTTTTTC TTTGGTCAGA AGCCAGGAAC TGGTGTCATT CCTTAAAAGA TACGTGCCGG 2580 

GGCCAGGTGT GGTGGCTCAC GCCTGTAATC CCAGCACTTT GGGAGGCCGA GGCGGGCGGA 2640 

TCACAAAGTC AGGACGAGAC CATCCTGGCT AACACGGTGA AACCCTGTCT CTACTAAAAA 2700 

TACAAAAAAA AATTAGCTAG GCGTAGTGGT TGGCACCTAT AGTCCCAGCT ACTCGGAAGG 2760 

CTGAAGCAGG AGAATGGTAT GAATCCAGGA GGTGGAGCTT GCAGTGAGCC GAGACCGTGC 2820 

CACTGCACTC CAGCCTGGGC AACACAGCGA GACTCCGTCT CGAGGAAAAA AAAAGAAAAG 2880 

ACGCGTACCT GCGGTGAGGA AGCTGGGCGC TGTTTTCGAG TTCAGGTGAA TTAGCCTCAA 2940 

TCCCCGTGTT CACTTGCTCC CATAGCCCTC TTGATGGATC ACGTAAAACT GAAAGGCAGC 3000 

GGGGAGCAGA CAAAGATGAG GTCTACACTG TCCTTCATGG GGATTAAAGC TATGGTTATA 3060 

TTAGCACCAA ACTTCTACAA ACCAAGCTCA GGGCCCCAAC CCTAGAAGGG CCCAAATGAG 3120 

AGAATGGTAC TTAGGGATGG AAAACGGGGC CTGGCTAGAG CTTCGGGTGT GTGTGTCTGT 3180 

CTGTGTGTAT GCATACATA7 GTGTGTATAT ATGGTTTTGT CAGGTGTGTA AATTTGCAAA 3240 

TTGTTTCCTT TATATATGTA TGTATATATA TATATGAAAA TATATATATA TATGAAAAAT 3300 

AAAGCTTAAT TGTCCCAGAA AATCATACAT TGCTTTTTTA TTCTACATGG GTACCACAGG 3360 

AACCTGGGGG CCTGTGAAAC TACAACCAAA AGGCACACAA AACCGTTTCC AGTTGGCAGC 3420 

AGAGATCAGG GGTTACCTCT GCTTCTGAGC AAATGGCTCA AGCTCTACCA GAGCAGACAG 3480 

CTACCCTACT TTTCAGCAGC AAAACGTCCC GTATGACGCA GCACGAAGGG CCTGGCAGGC 3540 
TGTTAGCAGG AGCTATGTCC CTTCCTATCG TTTCCGTCCA CTT 

Seq ID NO: 553 Protein sequence 
Protein Accession #: NP_006491.1

1 11 21 31 41 51 

I I I I I I 

GLPRLVCAFL LAACCCCPRV AGVPGEAEQP APELVEVEVG STALLKCGLS QSQGNLSHVD 60 

WFSVHKEKRT LIFRVRQGQG QSEPGEYEQR LSLQDRGATL ALTQVTPQDE RIFLCQGKRP 120

RSQEYRIQLR VYKAPEEPNI QVNPLGIPVN SKEPEEVATC VGRNGYPIPQ VIWYKNGRPL 180

KEEKNRVHIQ SSQTVESSGL YTLQSILKAQ LVKEDKDAQF YCELNYRLPS GNHMKESREV 240

TVPVFYPTEK VWLEVEPVGM LKEGDRVEIR CIADGNPPPH FSISKQNPST REAEEETTND 300 

NGVLVLEPAR KEHSGRYECQ AWNLDTMISL LSEPQELLVN YVSDVRVSPA APERQEGSSL 360

TLTCEAESSQ DLEFQWLREE TDQVLERGPV LQLHDLKREA GGGYRCVASV PSIPGLNRTQ 420 

LVKLAIFGPP WMAFKERKVW VKENMVLNLS CEASGHPRPT ISWNVKGTAS EQDQDPQRVL 480 

STLNVLVTPE LLETGVECTA SNDLGKNTSI LFLELVNLTT LTPDSNTTTG LSTSTASPHT 540 

RANSTSTERK LPEPESRGVV IVAVIVCILV LAVLGAVLYF LYKKGKLPCR RSGKQEITLP 600
PSRKTELVVE VKSDKLPEEM GLLQGSSGDK RAPGDQGEKY IDLRH

Seq ID NO: 554 DNA sequence 

Nucleic Acid Accession #: NM_003183.3

Coding sequence: 165..2639

1 11 21 31 41 51 

I I I I I I 

TCGAGCCTGG CGGTAGAATC TTCCCAGTAG GCGGCGCGGG AGGGAAAAGA GGATTGAGGG 60 

GCTAGGCCGG GCGGATCCCG TCCTCCCCCG ATGTGAGCAG TTTTCCGAAA CCCCGTCAGG 120 

CGAAGGCTGC CCAGAGAGGT GGAGTCGGTA GCGGGGCCGG GAACATGAGG CAGTCTCTCC 180 

TATTCCTGAC CAGCGTGGTT CCTTTCGTGC TGGCGCCGCG ACCTCCGGAT GACCCGGGCT 240

TCGGCCCCCA CCAGAGACTC GAGAAGCTTG ATTCTTTGCT CTCAGACTAC GATATTCTCT 300 

CTTTATCTAA TATCCAGCAG CATTCGGTAA GAAAAAGAGA TCTACAGACT TCAACACATG 360 

TAGAAACACT ACTAACTTTT TCAGCTTTGA AAAGGCATTT TAAATTATAC CTGACATCAA 420 

GTACTGAACG TTTTTCACAA AATTTCAAGG TCGTGGTGGT GGATGGTAAA AACGAAAGCG 480

AGTACACTGC AAAATGGCAG GACTTCTTCA CTGGACACGT GGTTGGTGAG CCTGACTCTA 540 

GGGTTCTAGC CCACATAAGA GATGATGATG TTATAATCAG AATCAACACA GATGGGGCCG 600 

AATATAACAT AGAGCCACTT TGGAGATTTG TTAATGATAC CAAAGACAAA AGAATGTTAG 660 

TTTATAAATC TGAAGATATC AAGAATGTTT CACGTTTGCA GTCTCCAAAA GTGTGTGGTT 720 

ATTTAAAAGT GGATAATGAA GAGTTGCTCC CAAAAGGGTT AGTAGACAGA GAACCACCTG 780 

AAGAGCTTGT TCATCGAGTG AAAAGAAGAG CTGACCCAGA TCCCATGAAG AACACGTGTA 840 

AATTATTGGT GGTAGCAGAT CATCGCTTCT ACAGATACAT GGGCAGAGGG GAAGAGAGTA 900 

CAACTACAAA TTACTTAATA GAGCTAATTG ACAGAGTTGA TGACATCTAT CGGAACACTT 960 

CATGGGATAA TGCAGGTTTT AAAGGCTATG GAATACAGAT AGAGCAGATT CGCATTCTCA 1020 

AGTCTCCACA AGAGGTAAAA CCTGGTGAAA AGCACTACAA CATGGCAAAA AGTTACCCAA 1080 

ATGAAGAAAA GGATGCTTGG GATGTGAAGA TGTTGCTAGA GCAATTTAGC TTTGATATAG 1140 

CTGAGGAAGC ATCTAAAGTT TGCTTGGCAC ACCTTTTCAC ATACCAAGAT TTTGATATGG 1200 

GAACTCTTGG ATTAGCTTAT GTTGGCTCTC CCAGAGCAAA CAGCCATGGA GGTGTTTGTC 1260 

CAAAGGCTTA TTATAGCCCA GTTGGGAAGA AAAATATCTA TTTGAATAGT GGTTTGACGA 1320 

GCACAAAGAA TTATGGTAAA ACCATCCTTA CAAAGGAAGC TGACCTGGTT ACAACTCATG 1380 
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WO 02/086443 

AATTGGGACA TAATTTTGGA GCAGAACATG ATCCGGATGG 
ATGAGGACCA GGGAGGGAAA TATGTCATGT ATCCCATAGC 
ACAATAAGAT GTTTTCAAAC TGCAGTAAAC AATCAATCTA 
CCCAGGAGTG TTTTCAAGAA CGCAGCAATA AAGTTTGTGG 
GAGAAGAGTG TGATCCTGGC ATCATGTATC TGAACAACGA 
GCACGTTGAA GGAAGGTGTC CAGTGCAGTG ACAGGAACAG 
AGTTTGAGAC TGCCCAGAAG AAGTGCCAGG AGGCGATTAA 
CCTACTGCAC AGGTAATAGC AGTGAGTGCC CGCCTCCAGG 
TTTGCTTGGA TCTTGGCAAG TGTAAGGATG GGAAATGCAT 
AGCAGCTGGA GTCCTGTGCA TGTAATGAAA CTGACAACTC 
ACCTTTCTGG CCGCTGTGTG CCCTATGTCG ATGCTGAACA 
AAGGAAAGCC CTGTACAGTA GGATTTTGTG ACATGAATGG 
AGGATGTAAT TGAACGATTT TGGGATTTCA TTGACCAGCT 
AGTTTTTAGC AGACAACATC GTTGGGTCTG TCCTGGTTTT 
CTTTCAGCAT TCTTGTCCAT TGTGTGGATA AGAAATTGGA 
CTCTGTTTCA CCCCAGTAAC GTCGAAATGC TGAGCAGCAT 
TTATCAAACC CTTTCCTGCG CCCCAGACTC CAGGCCGCCT 
CTTCGGCGCC AGCAGCTCCA AAACTGGACC ACCAGAGAAT 
CCAGCACAGA CTCCCATATG GACGAGGATG GGTTTGAGAA 
GCACAGCTGC CAAGTCATTT GAGGATCTCA CGGACCATCC
CTGCCTCCTT TAAACTGCAG CGTCAGAATC GTGTTAACAG 
TTAGTTCTCA GCTCTTCTGA CTTAAGTGTG CAAAATATTT 
TCAATCACAG CTTGTATTTT GTGAAGACTG GGAAGTGACT 
TTTGAACTTC CTGCAGGTAA ACAGTTCTTG TGTGGTTTGG 
TAAGGTGAAA GTGAATCTAC TTATTTTGAG GCTTTCAGGT 
TTGACCTGTG GTGCAAAAGC AGAAAATACA GCTGGATTGG 
TGTAAATTAA TCTTTTATAT TGATAACAGC ACTGACTAGG 
ATACACTGTA ATGAACCGCT GAATATGAAG CATTTGGCAT 
ATAGTTTTTT TTTTTTTTTT TTTTTTTTGC CTTCAACTAA 
GTATACATTG TATCTAAATT GTGGGTCTAT TTCTAGTTAT 
CAGGGAAAAT ATATATCTAA ATTTAGAAAT CATTTGGGTT 
TAAGACTAAT GCTCAGAACC TAACCACTAC CTTACAGTGA 
TTGAATTTAT GGAATCTACC AACTGTTTAG GGCCCTGATT 
TTTTATAAGT ATCTTCATGT ATCCCTGTTA CTGATAGGGA 
CTATTGGCTG GGAGTGGTGG CTCATGCCTG TAATCCCAGC 
CGCCACTACA CTCCAGCCTG GGTGACAGAG TGAGATCTGC 

Seq ID NO: 555 Protein sequence 
Protein Accession #: NP_003174.2






I 

MRQSLLFLTS 
QTSTHVETLL 
GEPDSRVLAH 
PKVCGYLKVD 
RGEESTTTNY 
AKSYPNEEKD 
HGGVCPKAYY 
AECAPNEDQG 
SRVDEGEECD 
TCKGVSYCTG 
KVCCRDLSGR 
INTFGKFLAD 
SASVRIIKPF 
PFPNSSTAAK 



11 

I 

VVPFVLAPRP
TFSALKRHFK 
IRDDDVIIRI 
NEELLPKGLV 
LIELIDRVDD 
AWDVKMLLEQ 
SPVGKKNIYL 
GKYVMYPIAV
PGIMYLNNDT 
NSSECPPPGN 
CVPYVDAEQK 
NIVGSVLVFS 
PAPQTPGRLQ 
SFEDLTDHPV 



21 

I 

PDDPGFGPHQ 
LYLTSSTERF 
NTDGAEYNIE 
DREPPEELVH 
IYRNTSWDNA 
FSFDIAEEAS 
NSGLTSTKNY 
SGDHENNKMF 
CCNSDCTLKE 
AENDTVCLDL 
NLFLRKGKPC 
LIFWIPFSIL 
PAPVIPSAPA 
ARSEKAASFK 



31 
I 

RLEKLDSLLS 
SQNFKVVVVD
PLWRFVNDTK 
RVKRRADPDP 
GFKGYGIQIE 
KVCLAHLFTY 
GKTILTKEAD 
SNCSKQSIYK 
GVQCSDRNSP 
GKCKDGKCIP 
TVGFCDMNGK 
VHCVDKKLDK 
APKLDHQRMD 
LQRQNRVNSK 



TCTAGCAGAA 
TGTGAGTGGC 
TAAGACCATT 
GAACTCGAGG 
CACCTGCTGC 
TCCTTGCTGT 
TGCTACTTGC 
AAATGCTGAA 
CCCTTTCTGC 
CTGCAAGGTG 
AAAGAACTTA 
CAAATGTGAG 
GAGCATCAAT 
CTCCTTGATA 
TAAACAGTAT 
GGATTCTGCA 
GCAGCCTGCC 
GGACACCATC 
GGACCCCTTC 
GGTCGCCAGA 
CAAAGAAACA 
TTATAGATTT 
TAGCAGATGC 

TTTAGTTTTT 
GTTATGAATA 
GAAATGATCA 
TTATTTGTGA 
AAACAAAGGA 
TACCCAGAGT 
AATATGGCTC 
GGGCTATACA 
TGCTGGGCAG 
TACATGTCTT 
ACTTGGAGAG 
CTC 



41 
I 

DYDILSLSNI
GKNESEYTAK 
DKRMLVYKSE 
MKNTCKLLVV
QIRILKSPQE 
QDFDMGTLGL 
LVTTHELGHN 
TIESKAQECF 
CCKNCQFETA 
FCEREQQLES 
CEKRVQDVIE 
QYESLSLFHP 
TIQEDPSTDS 
ETEC 



TGTGCCCCGA 
GATCACGAGA 
GAAAGTAAGG 
GTGGATGAAG 
AACAGCGACT 
AAAAACTGTC 
AAAGGCGTGT 
AATGACACTG 
GAGAGGGAAC 
TGCTGCAGGG 
TTTTTGAGGA 
AAACGAGTAC 
ACTTTTGGAA 
TTTTGGATTC 
GAATCTCTGT 
TCGGTTCGCA 
CCTGTGATCC 
CAGGAAGACC 
CCAAATAGCA 
AGTGAAAAGG 
GAGTGCTAAT 
GACCTACAAA 
TGGTCATGTG 
TTTGAAAAGG 
AAAATATCTT 
TTTACGTTTT 
GTTTTTTTTT 
GAAAAGTGGA
GATAAATTTA 
TTTTATGTAG 
TTCATAATTC 
TGGTAGCCAG 
TTTTTCTGTA 
AGAAAATTCA 
GCTGAGGTTG 



51 
I 

QQHSVRKRDL 
WQDFFTGHVV
DIKNVSRLQS 
ADHRFYRYMG 
VKPGEKHYNM 
AYVGSPRANS 
FGAEHDPDGL 
QERSNKVCGN 
QKKCQEAINA 
CACNETDNSC 
RFWDFIDQLS 
SNVEMLSSMD 
HMDEDGFEKD 



Seq ID NO: 556 DNA sequence 

Nucleic Acid Accession #: NM_021832.

Coding sequence: 164..2248



I 

TCGAGCCTGG 
CTAGGCCGGG 
GAAGGCTGCC 
ATTCCTGACC 
CGGCCCCCAC 
TTTATCTAAT 
AGAAACACTA 
TACTGAACGT 
GTACACTGTA 
GGTTCTAGCC 
ATATAACATA 
TTATAAATCT 
TTTAAAAGTG 
AGAGCTTGTT 
ATTATTGGTG 
AACTACAAAT 
ATGGGATAAT 
GTCTCCACAA 
TGAAGAAAAG 
TGAGGAAGCA 
AACTCTTGGA 
AAAGGCTTAT 
CACAAAGAAT 
ATTGGGACAT 



11 
I 

CGGTAGAATC 
CGGATCCCGT 
CAGAGAGGTG 
AGCGTGGTTC 
CAGAGACTCG 
ATCCAGCAGC 
CTAACTTTTT 
TTTTCACAAA 
AAATGGCAGG 
CACATAAGAG 
GAGCCACTTT 
GAAGATATCA 
GATAATGAAG 
CATCGAGTGA 
GTAGCAGATC 
TACTTAATAG 
GCAGGTTTTA 
GAGGTAAAAC 
GATGCTTGGG 
TCTAAAGTTT 
TTAGCTTATG 
TATAGCCCAG 
TATGGTAAAA 
AATTTTGGAG 



21 
I 

TTCCCAGTAG 
CCTCCCCCGA 
GAGTCGGTAG 
CTTTCGTGCT 
AGAAGCTTGA 
ATTCGGTAAG 
CAGCTTTGAA 
ATTTCAAGGT 
ACTTCTTCAC 
ATGATGATGT 
GGAGATTTGT 
AGAATGTTTC 
AGTTGCTCCC 
AAAGAAGAGC 
ATCGCTTCTA 
AGCTAATTGA 
AAGGCTATGG 
CTGGTGAAAA 
ATGTGAAGAT 
GCTTGGCACA 
TTGGCTCTCC 
TTGGGAAGAA 
CCATCCTTAC 
CAGAACATGA 



31 
I 

GCGGCGCGGG 
TGTGAGCAGT 
CGGGGCCGGG 
GGCGCCGCGA 
TTCTTTGCTC 
AAAAAGAGAT 
AAGGCATTTT 
CGTGGTGGTG 
TGGACACGTG 
TATAATCAGA 
TAATGATACC 
ACGTTTGCAG 
AAAAGGGTTA 
TGACCCAGAT 
CAGATACATG 
CAGAGTTGAT 
AATACAGATA 
GCACTACAAC 
GTTGCTAGAG 
CCTTTTCACA 
CAGAGCAAAC 
AAATATCTAT 
AAAGGAAGCT 
TCCGGATGGT 



41 

I 

AGGAAAAGAG 
TTTCCGAAAC 
AACATGAGGC 
CCTCCGGATG 
TCAGACTACG 
CTACAGACTT 
AAATTATACC 
GATGGTAAAA 
GTTGGTGAGC 
ATCAACACAG 
AAAGACAAAA 
TCTCCAAAAG 
GTAGACAGAG 
CCCATGAAGA 
GGCAGAGGGG 
GACATCTATC 
GAGCAGATTC 
ATGGCAAAAA 
CAATTTAGCT 
TACCAAGATT 
AGCCATGGAG 
TTGAATAGTG 
GACCTGGTTA 
CTAGCAGAAT 



1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 



51 
I 

GATTGAGGGG 
CCCGTCAGGC 
AGTCTCTCCT 
ACCCGGGCTT 
ATATTCTCTC 
CAACACATGT 
TGACATCAAG 
ACGAAAGCGA 
CTGACTCTAG 
ATGGGGCCGA 
GAATGTTAGT 
TGTGTGGTTA 
AACCACCTGA 
ACACGTGTAA 
AAGAGAGTAC 
GGAACACTTC 
GCATTCTCAA 
GTTACCCAAA 
TTGATATAGC 
TTGATATGGG 
GTGTTTGTCC 
GTTTGACGAG 
CAACTCATGA 
GTGCCCCGAA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
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tgaggaccag ggagggaaat atgtcatgta tcccatagct gtgagtggcg atcacgagaa 

CAATAAGATG TTTTCAAACT GCAGTAAACA ATCAATCTAT AAGACCATTG AAAGTAAGGC 
CCAGGAGTGT TTTCAAGAAC GCAGCAATAA AGTTTGTGGG AACTCGAGGG TGGATGAAGG 
AGAAGAGTGT GATCCTGGCA TCATGTATCT GAACAACGAC ACCTGCTGCA ACAGCGACTG 
CACGTTGAAG GAAGGTGTCC AGTGCAGTGA CAGGAACAGT CCTTGCTGTA AAAACTGTCA 
GTTTGAGACT GCCCAGAAGA AGTGCCAGGA GGCGATTAAT GCTACTTGCA AAGGCGTGTC 
CTACTGCACA GGTAATAGCA GTGAGTGCCC GCCTCCAGGA AATGCTGAAG ATGACACTGT 
TTGCTTGGAT CTTGGCAAGT GTAAGGATGG GAAATGCATC CCTTTCTGCG AGAGGGAACA 
GCAGCTGGAG TCCTGTGCAT GTAATGAAAC TGACAACTCC TGCAAGGTGT GCTGCAGGGA 
CCTTTCCGGC CGCTGTGTGC CCTATGTCGA TGCTGAACAA AAGAACTTAT TTTTGAGGAA 
AGGAAAGCCC TGTACAGTAG GATTTTGTGA CATGAATGGC AAATGTGAGA AACGAGTACA 
GGATGTAATT GAACGATTTT GGGATTTCAT TGACCAGCTG AGCATCAATA CTTTTGGAAA 
GTTTTTAGCA GACAACATCG TTGGGTCTGT CCTGGTTTTC TCCTTGATAT TTTGGATTCC 
TTTCAGCATT CTTGTCCATT GTGTGTAACG TCGAAATGCT GAGCAGCATG GATTCTGCAT 
CGGTTCGCAT TATCAAACCC TTTCCTGCGC CCCAGACTCC AGGCCGCCTG CAGCCTGCCC 
CTGTGATCCC TTCGGCGCCA GCAGCTCCAA AACTGGACCA CCAGAGAATG GACACCATCC 
AGGAAGACCC CAGCACAGAC TCACATATGG ACGAGGATGG GTTTGAGAAG GACCCCTTCC 
CAAATAGCAG CACAGCTGCC AAGTCATTTG AGGATCTCAC GGACCATCCG GTCACCAGAA 
GTGAAAAGGC TGCCTCCTTT AAACTGCAGC GTCAGAATCG TGTTGACAGC AAAGAAACAG 
AGTGCTAATT TAGTTCTCAG CTCTTCTGAC TTAAGTGTGC AAAATATTTT TATAGATTTG 
ACCTACAATC AATCACAGCT TATATTTTGT GAAGACTGGG AAGTGACTTA GCAGATGCTG 
GTCATGTGTT TGAACTTCCT GCAGGTAAAC AGTTCTTGTG TGGTTTGGCC CTTCTCCTTT 
TGAAAAGGTA AGGTGAAGGT GAATCTAGCT TATTTTGAGG CTTTCAGGTT TTAGTTTTTA 
AAATATCTTT TGACCTGTGG TGCAAAAGCA GAAAATACAG CTGGATTGGG TTATGAGTAT 
TTACGTTTTT GTAAATTAAT CTTTTATATT GATAACAGGC ACTGACTAGG GAAATGATCA 
GTTTTTTTTT ATACACTGTA ATGAACCGCT GAATATGAAG CATTTGGCAT TTATTTGTGA 
GAAAAGTGGA ATAGTTTTTT TTTTTTTTTT TTTTTTTTGC CTTCAACTAA AAACAAAGGA
GATAAATTTA GTATACATTG TATCTAAATT GTGGGTCTAT TTCTAGTTAT TACCCAGAGT 
TTTTATGTAG CAGGGAAAAT ATATATCTAA ATTTAGAAAT CATTTGGGTT AATATGGCTC 
TTCATAATTC TAAGACTAAT GCTCAGAACC TAACCACTAC CTTACAGTGA GGGCTATACA
TGGTAGCCAG TTGAATTTAT GGAATCTACC AACTGTTTAG GGCCCTGATT TGCTGGGCAG 
TTTTTCTGTA TTTTATAAGT ATCTTCATGT ATCCCTGTTA CTGATAGGGA TACATGTCTT 
AGAAAATTCA CTATTGGCTG GGAGTGGTGG CTCATGCCTG TAATCCCAGC ACTTGGAGAG 
GCTGAGGTTG CGCCACTACA CTCCAGCCTG GGTGACAGAG TGAGATCTGC CTC



PCT/US02/12476 



Seq ID NO: 557 Protein sequence 
Protein Accession #: NP_068604.1 



MRQSLLFLTS 
QTSTHVETLL 
GEPDSRVLAH 
PKVCGYLKVD 
RGEESTTTNY 
AKSYPNEEKD 
HGGVCPKAYY 
AECAPNEDQG 
SRVDEGEECD 
TCKGVSYCTG 
KVCCRDLSGR 
INTFGKFLAD 



11 
I 

VVPFVLAPRP
TFSALKRHFK 
IRDDDVIIRI 
NEELLPKGLV 
LIELIDRVDD 
AWDVKMLLEQ 
SPVGKKNIYL 
GKYVMYPIAV 
PGIMYLNNDT 
NSSECPPPGN 
CVPYVDAEQK 
NIVGSVLVFS 



21 
I 

PDDPGFGPHQ 
LYLTSSTERF 
NTDGAEYNIE 
DREPPEELVH 
IYRNTSWDNA 
FSFDIAEEAS 
NSGLTSTKNY 



CCNSDCTLKE 
AEDDTVCLDL 
NLFLRKGKPC
LIFWIPFSIL 



31 
I 

RLEKLDSLLS 
SQNFKVVVVD
PLWRFVNDTK 
RVKRRADPDP 
GFKGYGIQIE 
KVCLAHLFTY 
GKTILTKEAD 
SNCSKQSIYK 
GVQCSDRNSP 
GKCKDGKCIP 
TVGFCDMNGK 
VHCV 



41 

I 

DYDILSLSNI 
GKNESEYTVK 
DKRMLVYKSE 
MKNTCKLLW 
QIRILKSPOE 
QDFDMGTLGL 
LVTTHELGHN 
TIESKAQECF 
CCKNCQFETA 
FCEREQQLES 
CEKRVQDVIB 



51 
I 

QQHSVRKRDL 
WQDFFTGHW 
DIKNVSRLQS 
ADHRFYRYMG 
VKPGEKHYNM 
AYVGSPRANS 
FGAEHDPDGL 
QERSNKVCGN 
QKKCQEAINA 
CACNETDNSC 
RFWDFIDQLS 



Seq ID NO: 558 DNA sequence 

Nucleic Acid Accession #: NM_004994.1

Coding sequence: 20..2143

1 11 21 31 41 51

I I I I I I

AGACACCTCT GCCCTCACCA TGAGCCTCTG GCAGCCCCTG GTCCTGGTGC TCCTGGTGCT 60

GGGCTGCTGC TTTGCTGCCC CCAGACAGCG CCAGTCCACC CTTGTGCTCT TCCCTGGAGA 120

CCTGAGAACC AATCTCACCG ACAGGCAGCT GGCAGAGGAA TACCTGTACC GCTATGGTTA 180

CACTCGGGTG GCAGAGATGC GTGGAGAGTC GAAATCTCTG GGGCCTGCGC TGCTGCTTCT 240

CCAGAAGCAA CTGTCCCTGC CCGAGACCGG TGAGCTGGAT AGCGCCACGC TGAAGGCCAT 300

GCGAACCCCA CGGTGCGGGG TCCCAGACCT GGGCAGATTC CAAACCTTTG AGGGCGACCT 360

CAAGTGGCAC CACCACAACA TCACCTATTG GATCCAAAAC TACTCGGAAG ACTTGCCGCG 420

GGCGGTGATT GACGACGCCT TTGCCCGCGC CTTCGCACTG TGGAGCGCGG TGACGCCGCT 480

CACCTTCACT CGCGTGTACA GCCGGGACGC AGACATCGTC ATCCAGTTTG GTGTCGCGGA 540

GCACGGAGAC GGGTATCCCT TCGACGGGAA GGACGGGCTC CTGGCACACG CCTTTCCTCC 600

TGGCCCCGGC ATTCAGGGAG ACGCCCATTT CGACGATGAC GAGTTGTGGT CCCTGGGCAA 660

GGGCGTCGTG GTTCCAACTC GGTTTGGAAA CGCAGATGGC GCGGCCTGCC ACTTCCCCTT 720

CATCTTCGAG GGCCGCTCCT ACTCTGCCTG CACCACCGAC GGTCGCTCCG ACGGCTTGCC 780

CTGGTGCAGT ACCACGGCCA ACTACGACAC CGACGACCGG TTTGGCTTCT GCCCCAGCGA 840

GAGACTCTAC ACCCGGGACG GCAATGCTGA TGGGAAACCC TGCCAGTTTC CATTCATCTT 900

CCAAGGCCAA TCCTACTCCG CCTGCACCAC GGACGGTCGC TCCGACGGCT ACCGCTGGTG 960

CGCCACCACC GCCAACTACG ACCGGGACAA GCTCTTCGGC TTCTGCCCGA CCCGAGCTGA 1020

CTCGACGGTG ATGGGGGGCA ACTCGGCGGG GGAGCTGTGC GTCTTCCCCT TCACTTTCCT 1080

GGGTAAGGAG TACTCGACCT GTACCAGCGA GGGCCGCGGA GATGGGCGCC TCTGGTGCGC 1140

TACCACCTCG AACTTTGACA GCGACAAGAA GTGGGGCTTC TGCCCGGACC AAGGATACAG 1200

TTTGTTCCTC GTGGCGGCGC ATGAGTTCGG CCACGCGCTG GGCTTAGATC ATTCCTCAGT 1260

GCCGGAGGCG CTCATGTACC CTATGTACCG CTTCACTGAG GGGCCCCCCT TGCATAAGGA 1320

CGACGTGAAT GGCATCCGGC ACCTCTATGG TCCTCGCCCT GAACCTGAGC CACGGCCTCC 1380

AACCACCACC ACACCGCAGC CCACGGCTCC CCCGACGGTC TGCCCCACCG GACCCCCCAC 1440

TGTCCACCCC TCAGAGCGCC CCACAGCTGG CCCCACAGGT CCCCCCTCAG CTGGCCCCAC 1500

AGGTCCCCCC ACTGCTGGCC CTTCTACGGC CACTACTGTG CCTTTGAGTC CGGTGGACGA 1560

TGCCTGCAAC GTGAACATCT TCGACGCCAT CGCGGAGATT GGGAACCAGC TGTATTTGTT 1620

CAAGGATGGG AAGTACTGGC GATTCTCTGA GGGCAGGGGG AGCCGGCCGC AGGGCCCCTT 1680

CCTTATCGCC GACAAGTGGC CCGCGCTGCC CCGCAAGCTG GACTCGGTCT TTGAGGAGCC 1740

GCTCTCCAAG AAGCTTTTCT TCTTCTCTGG GCGCCAGGTG TGGGTGTACA CAGGCGCGTC 1800 

GGTGCTGGGC CCGAGGCGTC TGGACAAGCT GGGCCTGGGA GCOGACGTGG CCCAGGTGAC 1860 

CGGGGCCCTC CGGAGTGGCA GGGGGAAGAT GCTGCTGTTC AGCGGGCGGC GCCTCTGGAG 1920 

GTTCGACGTG AAGGCGCAGA TGGTGGATCC CCGGAGCGCC AGCGAGGTGG ACCGGATGTT 1980 

CCCCGGGGTG CCTTTGGACA CGCACGACGT CTTCCAGTAC CGAGAGAAAG CCTATTTCTG 2040 

CCAGGACCGC TTCTACTGGC GCGTGAGTTC CCGGAGTGAG TTGAACCAGG TGGACCAAGT 2100 

GGGCTACGTG ACCTATGACA TCCTGCAGTG CCCTGAGGAC TAGGGCTCCC GTCCTGCTTT 2160 

GCAGTGCCAT GTAAATCCCC ACTGGGACCA ACCCTGGGGA AGGAGCCAGT TTGCCGGATA 2220 

CAAACTGGTA TTCTGTTCTG GAGGAAAGGG AGGAGTGGAG GTGGGCTGGG CCCTCTCTTC 2280 
TCACCTTTGT TTTTTGTTGG AGTGTTTCTA ATAAACTTGG ATTCTCTAAC CTTT 

Seq ID NO: 559 Protein sequence 
Protein Accession #: NP_004985.1 

1 11 21 31 41 51 

I I I I I I 

MSLWQPLVLV LLVLGCCFAA PRQRQSTLVL FPGDLRTNLT DRQLAEEYLY RYGYTRVAEM 60 

RGESKSLGPA LLLLQKQLSL PETGELDSAT LKAMRTPRCG VPDLGRFQTF EGDLKWHHHN 120 

ITYWIQNYSE DLPRAVIDDA FARAFALWSA VTPLTFTRVY SRDADIVIQF GVAEHGDGYP 180 

FDGKDGLLAH AFPPGPGIQG DAHFDDDELW SLGKGVVVPT RFGNADGAAC HFPFIFEGRS 240

YSACTTDGRS DGLPWCSTTA NYDTDDRFGF CPSERLYTRD GNADGKPCQF PFIFQGQSYS 300

ACTTDGRSDG YRWCATTANY DRDKLFGFCP TRADSTVMGG NSAGELCVFP FTFLGKEYST 360 

CTSEGRGDGR LWCATTSNFD SDKKWGFCPD QGYSLFLVAA HEFGHALGLD HSSVPEALMY 420 

PMYRFTEGPP LHKDDVNGIR HLYGPRPEPE PRPPTTTTPQ PTAPPTVCPT GPPTVHPSER 480 

PTAGPTGPPS AGPTGPPTAG PSTATTVPLS PVDDACNVNI FDAIAEIGNQ LYLFKDGKYW 540

RFSEGRGSRP QGPFLIADKW PALPRKLDSV FEEPLSKKLF FFSGRQVWVY TGASVLGPRR 600 

LDKLGLGADV AQVTGALRSG RGKMLLFSGR RLWRFDVKAQ MVDPRSASEV DRMFPGVPLD 660 
THDVFQYREK AYFCQDRFYW RVSSRSELNQ VDQVGYVTYD ILQCPED 

Seq ID NO: 560 DNA sequence 

Nucleic Acid Accession #: NM_000213.1

Coding sequence: 127..5385

1 11 21 31 41 51

I I I I I I

CGCCCGCGCG CTGCAGCCCC ATCTCCTAGC GGCAGCCCAG GCGCGGAGGG AGCGAGTCCG 60 

CCCCGAGGTA GGTCCAGGAC GGGCGCACAG CAGCAGCCGA GGCTGGCCGG GAGAGGGAGG 120 

AAGAGGATGG CAGGGCCACG CCCCAGCCCA TGGGCCAGGC TGCTCCTGGC AGCCTTGATC 180 

AGCGTCAGCC TCTCTGGGAC CTTGGCAAAC CGCTGCAAGA AGGCCCCAGT GAAGAGCTGC 240 

ACGGAGTGTG TCCGTGTGGA TAAGGACTGC GCCTACTGCA CAGACGAGAT GTTCAGGGAC 300 

CGGCGCTGCA ACACCCAGGC GGAGCTGCTG GCCGCGGGCT GCCAGCGGGA GAGCATCGTG 360 

GTCATGGAGA GCAGCTTCCA AATCACAGAG GAGACCCAGA TTGACACCAC CCTGCGGCGC 420 

AGCCAGATGT CCCCCCAAGG CCTGCGGGTC CGTCTGCGGC CCGGTGAGGA GCGGCATTTT 480 

GAGCTGGAGG TGTTTGAGCC ACTGGAGAGC CCCGTGGACC TGTACATCCT CATGGACTTC 540 

TCCAACTCCA TGTCCGATGA TCTGGACAAC CTCAAGAAGA TGGGGCAGAA CCTGGCTCGG 600 

GTCCTGAGCC AGCTCACCAG CGACTACACT ATTGGATTTG GCAAGTTTGT GGACAAAGTC 660 

AGCGTCCCGC AGACGGACAT GAGGCCTGAG AAGCTGAAGG AGCCCTGGCC CAACAGTGAC 720 

CCCCCCTTCT CCTTCAAGAA CGTCATCAGC CTGACAGAAG ATGTGGATGA GTTCCGGAAT 780 

AAACTGCAGG GAGAGCGGAT CTCAGGCAAC CTGGATGCTC CTGAGGGCGG CTTCGATGCC 840 

ATCCTGCAGA CAGCTGTGTG CACGAGGGAC ATTGGCTGGC GCCCGGACAG CACCCACCTG 900 

CTGGTCTTCT CCACCGAGTC AGCCTTCCAC TATGAGGCTG ATGGCGCCAA CGTGCTGGCT 960 

GGCATCATGA GCCGCAACGA TGAACGGTGC CACCTGGACA CCACGGGCAC CTACACCCAG 1020 

TACAGGACAC AGGACTACCC GTCGGTGCCC ACCCTGGTGC GCCTGCTCGC CAAGCACAAC 1080

ATCATCCCCA TCTTTGCTGT CACCAACTAC TCCTATAGCT ACTACGAGAA GCTTCACACC 1140 

TATTTCCCTG TCTCCTCACT GGGGGTGCTG CAGGAGGACT CGTCCAACAT CGTGGAGCTG 1200 

CTGGAGGAGG CCTTCAATCG GATCCGCTCC AACCTGGACA TCCGGGCCCT AGACAGCCCC 1260 

CGAGGCCTTC GGACAGAGGT CACCTCCAAG ATGTTCCAGA AGACGAGGAC TGGGTCCTTT 1320 

CACATCCGGC GGGGGGAAGT GGGTATATAC CAGGTGCAGC TGCGGGCCCT TGAGCACGTG 1380 

GATGGGACGC ACGTGTGCCA GCTGCCGGAG GACCAGAAGG GCAACATCCA TCTGAAACCT 1440 

TCCTTCTCCG ACGGCCTCAA GATGGACGCG GGCATCATCT GTGATGTGTG CACCTGCGAG 1500 

CTGCAAAAAG AGGTGCGGTC AGCTCGCTGC AGCTTCAACG GAGACTTCGT GTGCGGACAG 1560 

TGTGTGTGCA GCGAGGGCTG GAGTGGCCAG ACCTGCAACT GCTCCACCGG CTCTCTGAGT 1620 

GACATTCAGC CCTGCCTGCG GGAGGGCGAG GACAAGCCGT GCTCCGGCCG TGGGGAGTGC 1680 

CAGTGCGGGC ACTGTGTGTG CTACGGCGAA GGCCGCTACG AGGGTCAGTT CTGCGAGTAT 1740 

GACAACTTCC AGTGTCCCCG CACTTCCGGG TTCCTCTGCA ATGACCGAGG ACGCTGCTCC 1800 

ATGGGCCAGT GTGTGTGTGA GCCTGGTTGG ACAGGCCCAA GCTGTGACTG TCCCCTCAGC 1860 

AATGCCACCT GCATCGACAG CAATGGGGGC ATCTGTAATG GACGTGGCCA CTGTGAGTGT 1920 

GGCCGCTGCC ACTGCCACCA GCAGTCGCTC TACACGGACA CCATCTGCGA GATCAACTAC 1980 

TCGGCGATCC ACCCGGGCCT CTGCGAGGAC CTACGCTCCT GCGTGCAGTG CCAGGCGTGG 2040 

GGCACCGGCG AGAAGAAGGG GCGCACGTGT GAGGAATGCA ACTTCAAGGT CAAGATGGTG 2100

GACGAGCTTA AGAGAGCCGA GGAGGTGGTG GTGCGCTGCT CCTTCCGGGA CGAGGATGAC 2160 

GACTGCACCT ACAGCTACAC CATGGAAGGT GACGGCGCCC CTGGGCCCAA CAGCACTGTC 2220 

CTGGTGCACA AGAAGAAGGA CTGCCCTCCG GGCTCCTTCT GGTGGCTCAT CCCCCTGCTC 2280 

CTCCTCCTCC TGCCGCTCCT GGCCCTGCTA CTGCTGCTAT GCTGGAAGTA CTGTGCCTGC 2340

TGCAAGGCCT GCCTGGCACT TCTCCCGTGC TGCAACCGAG GTCACATGGT GGGCTTTAAG 2400 

GAAGACCACT ACATGCTGCG GGAGAACCTG ATGGCCTCTG ACCACTTGGA CACGCCCATG 2460 

CTGCGCAGCG GGAACCTCAA GGGCCGTGAC GTGGTCCGCT GGAAGGTCAC CAACAACATG 2520 

CAGCGGCCTG GCTTTGCCAC TCATGCCGCC AGCATCAACC CCACAGAGCT GGTGCCCTAC 2580 

GGGCTGTCCT TGCGCCTGGC CCGCCTTTGC ACCGAGAACC TGCTGAAGCC TGACACTCGG 2640 

GAGTGCGCCC AGCTGCGCCA GGAGGTGGAG GAGAACCTGA ACGAGGTCTA CAGGCAGATC 2700 

TCCGGTGTAC ACAAGCTCCA GCAGACCAAG TTCCGGCAGC AGCCCAATGC CGGGAAAAAG 2760 

CAAGACCACA CCATTGTGGA CACAGTGCTG ATGGCGCCCC GCTCGGCCAA GCCGGCCCTG 2820 

CTGAAGCTTA CAGAGAAGCA GGTGGAACAG AGGGCCTTCC ACGACCTCAA GGTGGCCCCC 2880 

GGCTACTACA CCCTCACTGC AGACGAGGAC GCCCGGGGCA TGGTGGAGTT CCAGGAGGGC 2940 

GTGGAGCTGG TGGACGTACG GGTGCCCCTC TTTATCCGGC CTGAGGATGA CGACGAGAAG 3000 

CAGCTGCTGG TGGAGGCCAT CGACGTGCCC GCAGGCACTG CCACCCTCGG CCGCCGCCTG 3060

GTAAACATCA CCATCATCAA GGAGCAAGCC AGAGACGTGG TGTCCTTTGA GCAGCCTGAG 3120

TTCTCGGTCA GCCGCGGGGA CCAGGTGGCC CGCATCCCTG TCATCCGGCG TGTCCTGGAC 3180

GGCGGGAAGT CCCAGGTCTC CTACCGCACA CAGGATGGCA CCGCGCAGGG CAACCGGGAC 3240

TACATCCCCG TGGAGGGTGA GCTGCTGTTC CAGCCTGGGG AGGCCTGGAA AGAGCTGCAG 3300

GTGAAGCTCC TGGAGCTGCA AGAAGTTGAC TCCCTCCTGC GGGGCCGCCA GGTCCGCCGT 3360

TTCCACGTCC AGCTCAGCAA CCCTAAGTTT GGGGCCCACC TGGGCCAGCC CCACTCCACC 3420

ACCATCATCA TCAGGGACCC AGATGAACTG GACCGGAGCT TCACGAGTCA GATGTTGTCA 3480

TCACAGCCAC CCCCTCACGG CGACCTGGGC GCCCCGCAGA ACCCCAATGC TAAGGCCGCT 3540

GGGTCCAGGA AGATCCATTT CAACTGGCTG CCCCCTTCTG GCAAGCCAAT GGGGTACAGG 3600

GTAAAGTACT GGATTCAGGG TGACTCCGAA TCCGAAGCCC ACCTGCTCGA CAGCAAGGTG 3660

CCCTCAGTGG AGCTCACCAA CCTGTACCCG TATTGCGACT ATGAGATGAA GGTGTGCGCC 3720

TACGGGGCTC AGGGCGAGGG ACCCTACAGC TCCCTGGTGT CCTGCCGCAC CCACCAGGAA 3780

GTGCCCAGCG AGCCAGGGCG TCTGGCCTTC AATGTCGTCT CCTCCACGGT GACCCAGCTG 3840

AGCTGGGCTG AGCCGGCTGA GACCAACGGT GAGATCACAG CCTACGAGGT CTGCTATGGC 3900

CTGGTCAACG ATGACAACCG ACCTATTGGG CCCATGAAGA AAGTGCTGGT TGACAACCCT 3960

AAGAACCGGA TGCTGCTTAT TGAGAACCTT CGGGAGTCCC AGCCCTACCG CTACACGGTG 4020

AAGGCGCGCA ACGGGGCCGG CTGGGGGCCT GAGCGGGAGG CCATCATCAA CCTGGCCACC 4080

CAGCCCAAGA GGCCCATGTC CATCCCCATC ATCCCTGACA TCCCTATCGT GGACGCCCAG 4140

AGCGGGGAGG ACTACGACAG CTTCCTTATG TACAGCGATG ACGTTCTACG CTCTCCATCG 4200

GGCAGCCAGA GGCCCAGCGT CTCCGATGAC ACTGAGCACC TGGTGAATGG CCGGATGGAC 4260

TTTGCCTTCC CGGGCAGCAC CAACTCCCTG CACAGGATGA CCACGACCAG TGCTGCTGCC 4320

TATGGCACCC ACCTGAGCCC ACACGTGCCC CACCGCGTGC TAAGCACATC CTCCACCCTC 4380

ACACGGGACT ACAACTCACT GACCCGCTCA GAACACTCAC ACTCGACCAC ACTGCCGAGG 4440

GACTACTCCA CCCTCACCTC CGTCTCCTCC CACGACTCTC GCCTGACTGC TGGTGTGCCC 4500

GACACGCCCA CCCGCCTGGT GTTCTCTGCC CTGGGGCCCA CATCTCTCAG AGTGAGCTGG 4560

CAGGAGCCGC GGTGCGAGCG GCCGCTGCAG GGCTACAGTG TGGAGTACCA GCTGCTGAAC 4620

GGCGGTGAGC TGCATCGGCT CAACATCCCC AACCCTGCCC AGACCTCGGT GGTGGTGGAA 4680

GACCTCCTGC CCAACCACTC CTACGTGTTC CGCGTGCGGG CCCAGAGCCA GGAAGGCTGG 4740

GGCCGAGAGC GTGAGGGTGT CATCACCATT GAATCCCAGG TGCACCCGCA GAGCCCACTG 4800

TGTCCCCTGC CAGGCTCCGC CTTCACTTTG AGCACTCCCA GTGCCCCAGG CCCGCTGGTG 4860

TTCACTGCCC TGAGCCCAGA CTCGCTGCAG CTGAGCTGGG AGCGGCCACG GAGGCCCAAT 4920

GGGGATATCG TCGGCTACCT GGTGACCTGT GAGATGGCCC AAGGAGGAGG GCCAGCCACC 4980

GCATTCCGGG TGGATGGAGA CAGCCCCGAG AGCCGGCTGA CCGTGCCGGG CCTCAGCGAG 5040

AACGTGCCCT ACAAGTTCAA GGTGCAGGCC AGGACCACTG AGGGCTTCGG GCCAGAGCGC 5100

GAGGGCATCA TCACCATAGA GTCCCAGGAT GGAGGACCCT TCCCGCAGCT GGGCAGCCGT 5160

GCCGGGCTCT TCCAGCACCC GCTGCAAAGC GAGTACAGCA GCATCACCAC CACCCACACC 5220

AGCGCCACCG AGCCCTTCCT AGTGGATGGG CCGACCCTGG GGGCCCAGCA CCTGGAGGCA 5280

GGCGGCTCCC TCACCCGGCA TGTGACCCAG GAGTTTGTGA GCCGGACACT GACCACCAGC 5340

GGAACCCTTA GCACCCACAT GGACCAACAG TTCTTCCAAA CTTGACCGCA CCCTGCCCCA 5400

CCCCCGCCAT GTCCCACTAG GCGTCCTCCC GACTCCTCTC CCGGAGCCTC CTCAGCTACT 5460

CCATCCTTGC ACCCCTGGGG GCCCAGCCCA CCCGCATGCA CAGAGCAGGG GCTAGGTGTC 5520

TCCTGGGAGG CATGAAGGGG GCAAGGTCCG TCCTCTGTGG GCCCAAACCT ATTTGTAACC 5580

AAAGAGCTGG GAGCAGCACA AGGACCCAGC CTTTGTTCTG CACTTAATAA ATGGTTTTGC 5640

TACTG

Seq ID NO: 561 Protein sequence
Protein Accession #: NP_000204.1

1 11 21 31 41 51

I I I I I I

MAGPRPSPWA RLLLAALISV SLSGTLANRC KKAPVKSCTE CVRVDKDCAY CTDEMFRDRR 60

CNTQAELLAA GCQRESIVVM ESSFQITEET QIDTTLRRSQ MSPQGLRVRL RPGEERHFEL 120

EVFEPLESPV DLYILMDFSN SMSDDLDNLK KMGQNLARVL SQLTSDYTIG FGKFVDKVSV 180

PQTDMRPEKL KEPWPNSDPP FSFKNVISLT EDVDEFRNKL QGERISGNLD APEGGFDAIL 240

QTAVCTRDIG WRPDSTHLLV FSTESAFHYE ADGANVLAGI MSRNDERCHL DTTGTYTQYR 300

TQDYPSVPTL VRLLAKHNII PIFAVTNYSY SYYEKLHTYF PVSSLGVLQE DSSNIVELLE 360

EAFNRIRSNL DIRALDSPRG LRTEVTSKMF QKTRTGSFHI RRGEVGIYQV QLRALEHVDG 420

THVCQLPEDQ KGNIHLKPSF SDGLKMDAGI ICDVCTCELQ KEVRSARCSF NGDFVCGQCV 480

CSEGWSGQTC NCSTGSLSDI QPCLREGEDK PCSGRGECQC GHCVCYGEGR YEGQFCEYDN 540

FQCPRTSGFL CNDRGRCSMG QCVCEPGWTG PSCDCPLSNA TCIDSNGGIC NGRGHCECGR 600

CHCHQQSLYT DTICEINYSA IHPGLCEDLR SCVQCQAWGT GEKKGRTCEE CNFKVKMVDE 660

LKRAEEVVVR CSFRDEDDDC TYSYTMEGDG APGPNSTVLV HKKKDCPPGS FWWLIPLLLL 720

LLPLLALLLL LCWKYCACCK ACLALLPCCN RGHMVGFKED HYMLRENLMA SDHLDTPMLR 780

SGNLKGRDVV RWKVTNNMQR PGFATHAASI NPTELVPYGL SLRLARLCTE NLLKPDTREC 840

AQLRQEVEEN LNEVYRQISG VHKLQQTKFR QQPNAGKKQD HTIVDTVLMA PRSAKPALLK 900

LTEKQVEQRA FHDLKVAPGY YTLTADQDAR GMVEFQEGVE LVDVRVPLFI RPEDDDEKQL 960

LVEAIDVPAG TATLGRRLVN ITIIKEQARD VVSFEQPEFS VSRGDQVARI PVIRRVLDGG 1020

KSQVSYRTQD GTAQGNRDYI PVEGELLFQP GEAWKELQVK LLELQEVDSL LRGRQVRRFH 1080

VQLSNPKFGA HLGQPHSTTI IIRDPDELDR SFTSQMLSSQ PPPHGDLGAP QNPNAKAAGS 1140

RKIHFNWLPP SGKPMGYRVK YWIQGDSESE AHLLDSKVPS VELTNLYPYC DYEMKVCAYG 1200

AQGEGPYSSL VSCRTHQEVP SEPGRLAFNV VSSTVTQLSW AEPAETNGEI TAYEVCYGLV 1260

NDDNRPIGPM KKVLVDNPKN RMLLIENLRE SQPYRYTVKA RNGAGWGPER EAIINLATQP 1320

KRPMSIPIIP DIPIVDAQSG EDYDSFLMYS DDVLRSPSGS QRPSVSDDTE HLVNGRMDFA 1380

FPGSTNSLHR MTTTSAAAYG THLSPHVPHR VLSTSSTLTR DYNSLTRSEH SHSTTLPRDY 1440

STLTSVSSHD SRLTAGVPDT PTRLVFSALG PTSLRVSWQE PRCERPLQGY SVEYQLLNGG 1500

ELHRLNIPNP AQTSVVVEDL LPNHSYVFRV RAQSQEGWGR EREGVITIES QVHPQSPLCP 1560

LPGSAFTLST PSAPGPLVFT ALSPDSLQLS WERPRRPNGD IVGYLVTCEM AQGGGPATAF 1620

RVDGDSPESR LTVPGLSENV PYKFKVQART TEGFGPEREG IITIESQDGG PFPQLGSRAG 1680

LFQHPLQSEY SSITTTHTSA TEPFLVDGPT LGAQHLEAGG SLTRHVTQEF VSRTLTTSGT 1740

LSTHMDQQFF QT



Seq ID NO: 562 DNA sequence 

Nucleic Acid Accession #: NM_013332.1 

Coding sequence: 1..63

1 11 21 31 41 51

I I I I I I

WO 02/086443 

GCACGAGGGC GCTTTTGTCT CCGGTGAGTT TTGTGGCGGG AAGCTTCTGC GCTGGTGCTT 60 

AGTAACCGAC TTTCCTCCGG ACTCCTGCAC GACCTGCTCC TACAGCCGGC GATCCACTCC 120 

CGGCTGTTCC CCCGGAGGGT CCAGAGGCCT TTCAGAAGGA GAAGGCAGCT CTGTTTCTCT 180 

GCAGAGGAGT AGGGTCCTTT CAGCCATGAA GCATGTGTTG AACCTCTACC TGTTAGGTGT 240 

GGTACTGACC CTACTCTCCA TCTTCGTTAG AGTGATGGAG TCCCTAGAAG GCTTACTAGA 300 

GAGCCCATCG CCTGGGACCT CCTGGACCAC CAGAAGCCAA CTAGCCAACA CAGAGCCCAC 360 

CAAGGGCCTT CCAGACCATC CATCCAGAAG CATGTGATAA GACCTCCTTC CATACTGGCC 420 

ATATTTTGGA ACACTGACCT AGACATGTCC AGATGGGAGT CCCATTCCTA GCAGACAAGC 480 

TGAGCACCGT TGTAACCAGA GAACTATTAC TAGGCCTTGA AGAACCTGTC TAACTGGATG 540 

CTCATTGCCT GGGCAAGGCC TGTTTAGGCC GGTTGCGGTG GCTCATGCCT GTAATCCTAG 600 

CACTTTGGGA GGCTGAGGTG GGTGGATCAC CTGAGGTCAG GAGTTCGAGA CCAGCCTCGC 660 

CAACATGGCG AAACCCCATC TCTACTAAAA ATACAAAAGT TAGCTGGGTG TGGTGGCAGA 720 

GGCCTGTAAT CCCAGTTCCT TGGGAGGCTG AGGCGGGAGA ATTGCTTGAA CCCGGGGACG 780 

GAGGTTGCAG TGAACCGAGA TCGCACTGCT GTACCCAGCC TGGGCCACAG TGCAAGACTC 840 

CATCTCAAAA AAAAAAAGAA AAGAAAAAGC CTGTTTAATG CACAGGTGTG AGTGGATTGC 900 

TTATGGCTAT GAGATAGGTT GATCTCGCCC TTACCCCGGG GTCTGGTGTA TGCTGTGCTT 960 

TCCTCAGCAG TATGGCTCTG ACATCTCTTA GATGTCCCAA CTTCAGCTGT TGGGAGATGG 1020 

TGATATTTTC AACCCTACTT CCTAAACATC TGTCTGGGGT TCCTTTAGTC TTGAATGTCT 1080 

TATGCTCAAT TATTTGGTGT TGAGCCTCTC TTCCACAAGA GCTCCTCCAT GTTTGGATAG 1140 

CAGTTGAAGA GGTTGTGTGG GTGGGCTGTT GGGAGTGAGG ATGGAGTGTT CAGTGCCCAT 1200 

TTCTCATTTT ACATTTTAAA GTCGTTCCTC CAACATAGTG TGTATTGGTC TGAAGGGGGT 1260 

GGTGGGATGC CAAAGCCTGC TCAAGTTATG GACATTGTGG CCACCATGTG GCTTAAATGA 1320 
TTTTTTCTAA CTAATAAAGT GGAATATATA TTTCAAAAAA AAAAAAAAAA AA 

Seq ID NO: 563 Protein sequence 
Protein Accession #: NP_037464.1

1 11 21 31 41 51

I I I I I I

MKHVLNLYLL GVVLTLLSIF VRVMESLEGL LESPSPGTSW TTRSQLANTE PTKGLPDHPS 60

RSM



Seq ID NO: 564 DNA sequence 

Nucleic Acid Accession #: NM_023915.1 

Coding sequence: 250..1326

1 11 21 31 41 51

I I I I I I

GGCACGAGGG TTTCGTTTTC ATGCTTTACC AGAAAATCCA CTTCCCTGCC GACCTTAGTT 60

TCAAAGCTTA TTCTTAATTA GAGACAAGAA ACCTGTTTCA ACTTGAAGAC ACCGTATGAG 120

GTGAATGGAC AGCCAGCCAC CACAATGAAA GAAATCAAAC CAGGAATAAC CTATGCTGAA 180

CCCACGCCTC AATCGTCCCC AAGTGTTTCC TGACACGCAT CTTTGCTTAC AGTGCATCAC 240

AACTGAAGAA TGGGGTTCAA CTTGACGCTT GCAAAATTAC CAAATAACGA GCTGCACGGC 300

CAAGAGAGTC ACAATTCAGG CAACAGGAGC GACGGGCCAG GAAAGAACAC CACCCTTCAC 360

AATGAATTTG ACACAATTGT CTTGCCGGTG CTTTATCTCA TTATATTTGT GGCAAGCATC 420

TTGCTGAATG GTTTAGCAGT GTGGATCTTC TTCCACATTA GGAATAAAAC CAGCTTCATA 480

TTCTATCTCA AAAACATAGT GGTTGCAGAC CTCATAATGA CGCTGACATT TCCATTTCGA 540

ATAGTCCATG ATGCAGGATT TGGACCTTGG TACTTCAAGT TTATTCTCTG CAGATACACT 600

TCAGTTTTGT TTTATGCAAA CATGTATACT TCCATCGTGT TCCTTGGGCT GATAAGCATT 660

GATCGCTATC TGAAGGTGGT CAAGCCATTT GGGGACTCTC GGATGTACAG CATAACCTTC 720

ACGAAGGTTT TATCTGTTTG TGTTTGGGTG ATCATGGCTG TTTTGTCTTT GCCAAACATC 780

ATCCTGACAA ATGGTCAGCC AACAGAGGAC AATATCCATG ACTGCTCAAA ACTTAAAAGT 840

CCTTTGGGGG TCAAATGGCA TACGGCAGTC ACCTATGTGA ACAGCTGCTT GTTTGTGGCC 900

GTGCTGGTGA TTCTGATCGG ATGTTACATA GCCATATCCA GGTACATCCA CAAATCCAGC 960

AGGCAATTCA TAAGTCAGTC AAGCCGAAAG CGAAAACATA ACCAGAGCAT CAGGGTTGTT 1020

GTGGCTGTGT TTTTTACCTG CTTTCTACCA TATCACTTGT GCAGAATTCC TTTTACTTTT 1080

AGTCACTTAG ACAGGCTTTT AGATGAATCT GCACAAAAAA TCCTATATTA CTGCAAAGAA 1140

ATTACACTTT TCTTGTCTGC GTGTAATGTT TGCCTGGATC CAATAATTTA CTTTTTCATG 1200

TGTAGGTCAT TTTCAAGAAG GCTGTTCAAA AAATCAAATA TCAGAACCAG GAGTGAAAGC 1260

ATCAGATCAC TGCAAAGTGT GAGAAGATCG GAAGTTCGCA TATATTATGA TTACACTGAT 1320

GTGTAGGCCT TTTATTGTTT GTTGGAATCG ATATGTACAA AGTGTAAATA AATGTTTCTT 1380

TTCATTATCC TTAAAAAAAA AA



Seq ID NO: 565 Protein sequence 
Protein Accession #: NP_076404

1 11 21 31 41 51

I I I I I I

MGFNLTLAKL PNNELHGQES HNSGNRSDGP GKNTTLHNEF DTIVLPVLYL IIFVASILLN 60

GLAVWIFFHI RNKTSFIFYL KNIVVADLIM TLTFPFRIVH DAGFGPWYFK FILCRYTSVL 120

FYANMYTSIV FLGLISIDRY LKVVKPFGDS RMYSITFTKV LSVCVWVIMA VLSLPNIILT 180

NGQPTEDNIH DCSKLKSPLG VKWHTAVTYV NSCLFVAVLV ILIGCYIAIS RYIHKSSRQF 240

ISQSSRKRKH NQSIRVVVAV FFTCFLPYHL CRIPFTFSHL DRLLDESAQK ILYYCKEITL 300

FLSACNVCLD PIIYFFMCRS FSRRLFKKSN IRTRSESIRS LQSVRRSEVR IYYDYTDV



Seq ID NO: 566 DNA sequence 
Nucleic Acid Accession #: NM_005365. 
Coding sequence: 1..948

1 11 21 31 41 51

I I I I I I

ATGTCTCTCG AGCAGAGGAG TCCGCACTGC AAGCCTGATG AAGACCTTGA AGCCCAAGGA 60

GAGGACTTGG GCCTGATGGG TGCACAGGAA CCCACAGGCG AGGAGGAGGA GACTACCTCC 120

TCCTCTGACA GCAAGGAGGA GGAGGTGTCT GCTGCTGGGT CATCAAGTCC TCCCCAGAGT 180

CCTCAGGGAG GCGCTTCCTC CTCCATTTCC GTCTACTACA CTTTATGGAG CCAATTCGAT 240

GAGGGCTCCA GCAGTCAAGA AGAGGAAGAG CCAAGCTCCT CGGTCGACCC AGCTCAGCTG 300

GAGTTCATGT TCCAAGAAGC ACTGAAATTG AAGGTGGCTG AGTTGGTTCA TTTCCTGCTC 360

CACAAATATC GAGTCAAGGA GCCGGTCACA AAGGCAGAAA TGCTGGAGAG CGTCATCAAA 420

AATTACAAGC GCTACTTTCC TGTGATCTTC GGCAAAGCCT CCGAGTTCAT GCAGGTGATC 480

TTTGGCACTG ATGTGAAGGA GGTGGACCCC GCCGGCCACT CCTACATCCT TGTCACTGCT 540

CTTGGCCTCT CGTGCGATAG CATGCTGGGT GATGGTCATA GCATGCCCAA GGCCGCCCTC 600

CTGATCATTG TCCTGGGTGT GATCCTAACC AAAGACAACT GCGCCCCTGA AGAGGTTATC 660

TGGGAAGCGT TGAGTGTGAT GGGGGTGTAT GTTGGGAAGG AGCACATGTT CTACGGGGAG 720

CCCAGGAAGC TGCTCACCCA AGATTGGGTG CAGGAAAACT ACCTGGAGTA CCGGCAGGTG 780

CCCGGCAGTG ATCCTGCGCA CTACGAGTTC CTGTGGGGTT CCAAGGCCCA CGCTGAAACC 840

AGCTATGAGA AGGTCATAAA TTATTTGGTC ATGCTCAATG CAAGAGAGCC CATCTGCTAC 900

CCATCCCTTT ATGAAGAGGT TTTGGGAGAG GAGCAAGAGG GAGTCTGA

Seq ID NO: 567 Protein sequence
Protein Accession #: NP_005356.1

1 11 21 31 41 51

I I I I I I

MSLEQRSPHC KPDEDLEAQG EDLGLMGAQE PTGEEEETTS SSDSKEEEVS AAGSSSPPQS 60

PQGGASSSIS VYYTLWSQFD EGSSSQEEEE PSSSVDPAQL EFMFQEALKL KVAELVHFLL 120

HKYRVKEPVT KAEMLESVIK NYKRYFPVIF GKASEFMQVI FGTDVKEVDP AGHSYILVTA 180

LGLSCDSMLG DGHSMPKAAL LIIVLGVILT KDNCAPEEVI WEALSVMGVY VGKEHMFYGE 240

PRKLLTQDWV QENYLEYRQV PGSDPAHYEF LWGSKAHAET SYEKVINYLV MLNAREPICY 300

PSLYEEVLGE EQEGV

Seq ID NO: 568 DNA sequence
Nucleic Acid Accession #: NM_014400
Coding sequence: 86..1126

1 11 21 31 41 51

I I I I I I

GGTTACTCAT CCTGGGCTCA GGTAAGAGGG CCCGAGCTCG GAGGCGGCAC ACCCAGGGGG 60

GACGCCAAGG GAGCAGGACG GAGCCATGGA CCCCGCCAGG AAAGCAGGTG CCCAGGCCAT 120

GATCTGGACT GCAGGCTGGC TGCTGCTGCT GCTGCTTCGC GGAGGAGCGC AGGCCCTGGA 180

GTGCTACAGC TGCGTGCAGA AAGCAGATGA CGGATGCTCC CCGAACAAGA TGAAGACAGT 240

GAAGTGCGCG CCGGGCGTGG ACGTCTGCAC CGAGGCCGTG GGGGCGGTGG AGACCATCCA 300

CGGACAATTC TCGCTGGCAG TGCSGGGTTG CGGTTCGGGA CTCCCCGGCA AGAATGACCG 360

CGGCCTGGAT CTTCACGGGC TTCTGGCGTT CATCCAGCTG CAGCAATGCG CTCAGGATCG 420

CTGCAACGCC AAGCTCAACC TCACCTCGCG GGCGCTCGAC CCGGCAGGTA ATGAGAGTGC 480

ATACCCGCCC AACGGCGTGG AGTGCTACAG CTGTGTGGGC CTGAGCCGGG AGGCGTGCCA 540

GGGTACATCG CCGCCGGTCG TGAGCTGCTA CAACGCCAGC GATCATGTCT ACAAGGGCTG 600

CTTCGACGGC AACGTCACCT TGACGGCAGC TAATGTGACT GTGTCCTTGC CTGTCCGGGG 660

CTGTGTCCAG GATGAATTCT GCACTCGGGA TGGAGTAACA GGCCCAGGGT TCACGCTCAG 720

TGGCTCCTGT TGCCAGGGGT CCCGCTGTAA CTCTGACCTC CGCAACAAGA CCTACTTCTC 780

CCCTCGAATC CCACCCCTTG TCCGGCTGCC CCCTCCAGAG CCCACGACTG TGGCCTCAAC 840

CACATCTGTC ACCACTTCTA CCTCGGCCCC AGTGAGACCC ACATCCACCA CCAAACCCAT 900

GCCAGCGCCA ACCAGTCAGA CTCCGAGACA GGGAGTAGAA CACGAGGCCT CCCGGGATGA 960

GGAGCCCAGG TTGACTGGAG GCGCCGCTGG CCACCAGGAC CGCAGCAATT CAGGGCAGTA 1020

TCCTGCAAAA GGGGGGCCCC AGCAGCCCCA TAATAAAGGC TGTGTGGCTC CCACAGCTGG 1080

ATTGGCAGCC [..........] CCGTGGCTGC TGGTGTCCTA CTGTGAGCTT CTCCACCTGG 1140

AAATTTCCCT CTCACCTACT TCTCTGGCCC TGGGTACCCC TCTTCTCATC ACTTCCTGTT 1200

CCCACCACTG GACTGGGCTG GCCCAGCCCC TGTTTTTCCA ACATTCCCCA GTATCCCCAG 1260

CTTCTGCTGC GCTGGTTTGC GGCTTTGGGA AATAAAATAC CGTTGTATAT ATTCTGGCAG 1320

GGGTGTTCTA GCTTTTTGAG GACAGCTCCT GTATCCTTCT CATCCTTGTC TCTCCGCTTG 1380

TCCTCTTGTG ATGTTAGGAC AGAGTGAGAG AAGTCAGCTG TCACGGGGAA GGTGAGAGAG 1440

AGGATGCTAA GCTTCCTACT CACTTTCTCC TAGCCAGCCT GGACTTTGGA GCGTGGGGTG 1500

GGTGGGACAA TGGCTCCCCA CTCTAAGCAC TGCCTCCCCT ACTCCCCGCA TCTTTGGGGA 1560

ATCGGTTCCC CATATGTCTT CCTTACTAGA CTGTGAGCTC CTCGAGGGCA GGGACCGTGC 1620

CTTATGTCTG TGTGTGATCA GTTTCTGGCA CATAAATGCC TCAATAAAGA TTTAATTACT 1680

TTGTATAGTG AAAAAAAA



Seq ID NO: 569 Protein sequence 
Protein Accession #: NP_055215

1 11 21 31 41 51

I I I I I I

MDPARKAGAQ AMIWTAGWLL LLLLRGGAQA LECYSCVQKA DDGCSPNKMK TVKCAPGVDV 60

CTEAVGAVET IHGQFSLAVX GCGSGLPGKN DRGLDLHGLL AFIQLQQCAQ DRCNAKLNLT 120

SRALDPAGNE SAYPPNGVEC YSCVGLSREA CQGTSPPVVS CYNASDHVYK GCFDGNVTLT 180

AANVTVSLPV RGCVQDEFCT RDGVTGPGFT LSGSCCQGSR CNSDLRNKTY FSPRIPPLVR 240

LPPPEPTTVA STTSVTTSTS APVRPTSTTK PMPAPTSQTP RQGVEHEASR DEEPRLTGGA 300

AGHQDRSNSG QYPAKGGPQQ PHNKGCVAPT AGLAALLLAV AAGVLL



Seq ID NO: 570 DNA sequence 

Nucleic Acid Accession #: NM_005329.1

Coding sequence: 1..1662

1 11 21 31 41 51

I I I I I I

ATGCCGGTGC AGCTGACGAC AGCCCTGCGT GTGGTGGGCA CCAGCCTGTT TGCCCTGGCA 60

GTGCTGGGTG GCATCCTGGC AGCCTATGTG ACGGGCTACC AGTTCATCCA CACGGAAAAG 120

CACTACCTGT CCTTCGGCCT GTACGGCGCC ATCCTGGGCC TGCACCTGCT CATTCAGAGC 180

CTTTTTGCCT TCCTGGAGCA CCGGCGCATG CGACGTGCCG GCCAGGCCCT GAAGCTGCCC 240

TCCCCGCGGC GGGGCTCGGT GGCACTGTGC ATTGCCGCGT ACCAGGAGGA CCCTGACTAC 300

TTGCGCAAGT GCCTGCGCTC GGCCCAGCGC ATCTCCTTCC CTGACCTCAA GGTGGTCATG 360

GTGGTGGATG GCAACCGCCA GGAGGACGCC TACATGCTGG ACATCTTCCA CGAGGTGCTG 420

GGCGGCACCG AGCAGGCCGG CTTCTTTGTG TGGCGCAGCA ACTTCCATGA GGCAGGCGAG 480

GGTGAGACGG AGGCCAGCCT GCAGGAGGGC ATGGACCGTG TGCGGGATGT GGTGCGGGCC 540

AGCACCTTCT CGTGCATCAT GCAGAAGTGG GGAGGCAAGC GCGAGGTCAT GTACACGGCC 600

TTCAAGGCCC TCGGCGATTC GGTGGACTAC ATCCAGGTGT GCGACTCTGA CACTGTGCTG 660 

GATCCAGCCT GCACCATCGA GATGCTTCGA GTCCTGGAGG AGGATCCCCA AGTAGGGGGA 720 

GTCGGGGGAG ATGTCCAGAT CCTCAACAAG TACGACTCAT GGATTTCCTT CCTGAGCAGC 780

GTGCGGTACT GGATGGCCTT CAACGTGGAG CGGGCCTGCC AGTCCTACTT TGGCTGTGTG 840 

CAGTGTATTA GTGGGCCCTT GGGCATGTAC CGCAACAGCC TCCTCCAGCA GTTCCTGGAG 900 

GACTGGTACC ATCAGAAGTT CCTAGGCAGC AAGTGCAGCT TCGGGGATGA CCGGCACCTC 960 

ACCAACCGAG TCCTGAGCCT TGGCTACCGA ACTAAGTATA CCGCGCGCTC CAAGTGCCTC 1020 

ACAGAGACCC CCACTAAGTA CCTCCGGTGG CTCAACCAGC AAACCCGCTG GAGCAAGTCT 1080 

TACTTCCGGG AGTGGCTCTA CAACTCTCTG TGGTTCCATA AGCACCACCT CTGGATGACC 1140 

TACGAGTCAG TGGTCACGGG XTTCTTCCCC TTCTTCCTCA TTGCCACGGT TATACAGCTT 1200 

TTCTACCGGG GCCGCATCTG GAACATTCTC CTCTTCCTGC TGACGGTGCA GCTGGTGGGC 1260 

ATTATCAAGG CCACCTACGC CTGCTTCCTT CGGGGCAATG CAGAGATGAT CTTCATGTCC 1320 

CTCTACTCCC TCCTCTATAT GTCCAGCCTT CTGCCGGCCA AGATCTTTGC CATTGCTACC 1380 

ATCAACAAAT CTGGCTGGGG CACCTCTGGC CGAAAAACCA TTGTGGTGAA CTTCATTGGC 1440 

CTCATTCCTG TGTCCATCTG GGTGGCAGTT CTCCTGGGAG GGCTGGCCTA CACAGCTTAT 1500 

TGCCAGGACC TGTTCAGTGA GACAGAGCTA GCCTTCCTTG TCTCTGGGGC TATACTGTAT 1560 

GGCTGCTACT GGGTGGCCCT CCTCATGCTA TATCTGGCCA TCATCGCCCG GCGATGTGGG 1620 
AAGAAGCCGG AGCAGTACAG CTTGGCTTTT GCTGAGGTGT GA 

Seq ID NO: 571 Protein sequence 
Protein Accession #: NP_005320.1



1 11 21 31 41 51 

I I I I I I 

MPVQLTTALR VVGTSLFALA VLGGILAAYV TGYQFIHTEK HYLSFGLYGA ILGLHLLIQS 60

LFAFLEHRRM RRAGQALKLP SPRRGSVALC IAAYQEDPDY LRKCLRSAQR ISFPDLKVVM 120

VVDGNRQEDA YMLDIFHEVL GGTEQAGFFV WRSNFHEAGE GETEASLQEG MDRVRDVVRA 180

STFSCIMQKW GGKREVMYTA FKALGDSVDY IQVCDSDTVL DPACTIEMLR VLEEDPQVGG 240 

VGGDVQILNK YDSWISFLSS VRYWMAFNVE RACQSYFGCV QCISGPLGMY RNSLLQQFLE 300 

DWYHQKFLGS KCSFGDDRHL TNRVLSLGYR TKYTARSKCL TETPTKYLRW LNQQTRWSKS 360 

YFREWLYNSL WFHKHHLWMT YESVVTGFFP FFLIATVIQL FYRGRIWNIL LFLLTVQLVG 420

IIKATYACFL RGNAEMIFMS LYSLLYMSSL LPAKIFAIAT INKSGWGTSG RKTIVVNFIG 480

LIPVSIWVAV LLGGLAYTAY CQDLFSETEL AFLVSGAILY GCYWVALLML YLAIIARRCG 540
KKPEQYSLAF AEV 



Seq ID NO: 572 DNA sequence 

Nucleic Acid Accession #: Eos sequence

Coding sequence: 148-7095 



1 11 21 31 41 51 

I I I I I I 

CACACATACG CACGCACGAT CTCACTTCGA TCTATACACT GGAGGATTAA AACAAACAAA 60 

CAAAAAAAAC ATTTCCTTCG CTCCCCCTCC CTCTCCACTC TGAGAAGCAG AGGAGCCGCA 120 

CGGCGAGGGG CCGCAGACCG TCTGGAAATG CGAATCCTAA AGCGTTTCCT CGCTTGCATT 180 

CAGCTCCTCT GTGTTTGCCG CCTGGATTGG GCTAATGGAT ACTACAGACA ACAGAGAAAA 240 

CTTGTTGAAG AGATTGGCTG GTCCTATACA GGAGCACTGA ATCAAAAAAA TTGGGGAAAG 300 

AAATATCCAA CATGTAATAG CCCAAAACAA TCTCCTATCA ATATTGATGA AGATCTTACA 360 

CAAGTAAATG TGAATCTTAA GAAACTTAAA TTTCAGGGTT GGGATAAAAC ATCATTGGAA 420 

AACACATTCA TTCATAACAC TGGGAAAACA GTGGAAATTA ATCTCACTAA TGACTACCGT 480 

GTCAGCGGAG GAGTTTCAGA AATGGTGTTT AAAGCAAGCA AGATAACTTT TCACTGGGGA 540 

AAATGCAATA TGTCATCTGA TGGATCAGAG CATAGTTTAG AAGGACAAAA ATTTCCACTT 600 

GAGATGCAAA TCTACTGCTT TGATGCGGAC CGATTTTCAA GTTTTGAGGA AGCAGTCAAA 660 

GGAAAAGGGA AGTTAAGAGC TTTATCCATT TTGTTTGAGG TTGGGACAGA AGAAAATTTG 720 

GATTTCAAAG CGATTATTGA TGGAGTCGAA AGTGTTAGTC GTTTTGGGAA GCAGGCTGCT 780 

TTAGATCCAT TCATACTGTT GAACCTTCTG CCAAACTCAA CTGACAAGTA TTACATTTAC 840 

AATGGCTCAT TGACATCTCC TCCCTGCACA GACACAGTTG ACTGGATTGT TTTTAAAGAT 900 

ACAGTTAGCA TCTCTGAAAG CCAGTTGGCT GTTTTTTGTG AAGTTCTTAC AATGCAACAA 960 

TCTGGTTATG TCATGCTGAT GGACTACTTA CAAAACAATT TTCGAGAGCA ACAGTACAAG 1020 

TTCTCTAGAC AGGTGTTTTC CTCATACACT GGAAAGGAAG AGATTCATGA AGCAGTTTGT 1080 

AGTTCAGAAC CAGAAAATGT TCAGGCTGAC CCAGAGAATT ATACCAGCCT TCTTGTTACA 1140 

TGGGAAAGAC CTCGAGTCGT TTATGATACC ATGATTGAGA AGTTTGCAGT TTTGTACCAG 1200 

CAGTTGGATG GAGAGGACCA AACCAAGCAT GAATTTTTGA CAGATGGCTA TCAAGACTTG 1260 

GGTGCTATTC TCAATAATTT GCTACCCAAT ATGAGTTATG TTCTTCAGAT AGTAGCCATA 1320 

TGCACTAATG GCTTATATGG AAAATACAGC GACCAACTGA TTGTCGACAT GCCTACTGAT 1380 

AATCCTGAAC TTGATCTTTT CCCTGAATTA ATTGGAACTG AAGAAATAAT CAAGGAGGAG 1440 

GAAGAGGGAA AAGACATTGA AGAAGGCGCT ATTGTGAATC CTGGTAGAGA CAGTGCTACA 1500 

AACCAAATCA GGAAAAAGGA ACCCCAGATT TCTACCACAA CACACTACAA TCGCATAGGG 1560 

ACGAAATACA ATGAAGCCAA GACTAACCGA TCCCCAACAA GAGGAAGTGA ATTCTCTGGA 1620 

AAGGGTGATG TTCCCAATAC ATCTTTAAAT TCCACTTCCC AACCAGTCAC TAAATTAGCC 1680 

ACAGAAAAAG ATATTTCCTT GACTTCTCAG ACTGTGACTG AACTGCCACC TCACACTGTG 1740 

GAAGGTACTT CAGCCTCTTT AAATGATGGC TCTAAAACTG TTCTTAGATC TCCACATATG 1800 

AACTTGTCGG GGACTGCAGA ATCCTTAAAT ACAGTTTCTA TAACAGAATA TGAGGAGGAG 1860 

AGTTTATTGA CCAGTTTCAA GCTTGATACT GGAGCTGAAG ATTCTTCAGG CTCCAGTCCC 1920 

GCAACTTCTG CTATCCCATT CATCTCTGAG AACATATCCC AAGGGTATAT ATTTTCCTCC 1980 

GAAAACCCAG AGACAATAAC ATATGATGTC CTTATACCAG AATCTGCTAG AAATGCTTCC 2040 

GAAGATTCAA CTTCATCAGG TTCAGAAGAA TCACTAAAGG ATCCTTCTAT GGAGGGAAAT 2100 

GTGTGGTTTC CTAGCTCTAC AGACATAACA GCACAGCCCG ATGTTGGATC AGGCAGAGAG 2160 

AGCTTTCTCC AGACTAATTA CACTGAGATA CGTGTTGATG AATCTGAGAA GACAACCAAG 2220 

TCCTTTTCTG CAGGCCCAGT GATGTCACAG GGTCCCTCAG TTACAGATCT GGAAATGCCA 2280 

CATTATTCTA CCTTTGCCTA CTTCCCAACT GAGGTAACAC CTCATGCTTT TACCCCATCC 2340 

TCCAGACAAC AGGATTTGGT CTCCACGGTC AACGTGGTAT ACTCGCAGAC AACCCAACCG 2400 

GTATACAATG GTGAGACACC TCTTCAACCT TCCTACAGTA GTGAAGTCTT TCCTCTAGTC 2460 

ACCCCTTTGT TGCTTGACAA TCAGATCCTC AACACTACCC CTGCTGCTTC AAGTAGTGAT 2520 

TCGGCCTTGC ATGCTACGCC TGTATTTCCC AGTGTCGATG TGTCATTTGA ATCCATCCTG 2580 

TCTTCCTATG ATGGTGCACC TTTGCTTCCA TTTTCCTCTG CTTCCTTCAG TAGTGAATTG 2640 

TTTCGCCATC TGCATACAGT TTCTCAAATC CTTCCACAAG TTACTTCAGC TACCGAGAGT 2700 

GATAAGGTGC CCTTGCATGC TTCTCTGCCA GTGGCTGGGG GTGATTTGCT ATTAGAGCCC 2760 




AGCCTTGCTC AGTATTCTGA TGTGCTGTCC ACTACTCATG CTGCTTCAGA GACGCTGGAA 2820 

TTTGGTAGTG AATCTGGTGT TCTTTATAAA ACGCTTATGT TTTCTCAAGT TGAACCACCC 2880 

AGCAGTGATQ CCATGATGCA TGCACGTTCT TCAGGGCCTG AACCTTCTTA TGCCTTGTCT 2940 

GATAATGAGG GCTCCCAACA CATCTTCACT GTTTCTTACA GTTCTGCAAT ACCTGTGCAT 3000 

GATTCTGTGG GTGTAACTTA TCAGGGTTCC TTATTTAGCG GCCCTAGCCA TATACCAATA 3060 

CCTAAGTCTT CGTTAATAAC CCCAACTGCA TCATTACTGC AGCCTACTCA TGCCCTCTCT 3120

GGTGATGGGG AATGGTCTGG AGCCTCTTCT GATAGTGAAT TTCTTTTACC TGACACAGAT 3180 

GGGCTGACAG CCCTTAACAT TTCTTCACCT GTTTCPGTAG CTGAATTTAC ATATACAACA 3240

TCTGTGTTTG GTGATGATAA TAAGGCGCTT TCTAAAAGTG AAATAATATA TGGAAATGAG 3300 

ACTGAACTGC AAATTCCTTC TTTCAATGAG ATGGTTTACC CTTCTGAAAG CACAGTCATG 3360 

CCCAACATGT ATGATAATGT AAATAAGTTG AATGCGTCTT TACAAGAAAC CTCTGTTTCC 3420 

ATTTCTAGCA CCAAGGGCAT GTTTCCAGGG TCCCTTGCTC ATACCACCAC TAAGGTTTTT 3480 

GATCATGAGA TTAGTCAAGT TCCAGAAAAT AACTTTTCAG TTCAACCTAC ACATACTGTC 3540 

TCTCAAGCAT CTGGTGACAC TTCGCTTAAA CCTGTGCTTA GTGCAAACTC AGAGCCAGCA 3600 

TCCTCTGACC CTGCTTCTAG TGAAATGTTA TCTCCTTCAA CTCAGCTCTT ATTTTATGAG 3660 

ACCTCAGCTT CTTTTAGTAC TGAAGTATTG CTACAACCTT CCTTTCAGGC TTCTGATGTT 3720 

GACACCTTGC TTAAAACTGT TCTTCCAGCT GTGCCCAGTG ATCCAATATT GGTTGAAACC 3780 

CCCAAAGTTG ATAAAATTAG TTCTACAATG TTGCATCTCA TTGTATCAAA TTCTGCTTCA 3840 

AGTGAAAACA TGCTGCACTC TACATCTGTA CCAGTTTTTG ATGTGTCGCC TACTTCTCAT 3900 

ATGCACTCTG CTTCACTTCA AGGTTTGACC ATTTCCTATG CAAGTGAGAA ATATGAACCA 3960 

GTTTTGTTAA AAAGTGAAAG TTCCCACCAA GTGGTACCTT CTTTGTACAG TAATGATGAG 4020 

TTGTTCCAAA CGGCCAATTT GGAGATTAAC CAGGCCCATC CCCCAAAAGG AAGGCATGTA 4080 

TTTGCTACAC CTGTTTTATC AATTGATGAA CCATTAAATA CACTAATAAA TAAGCTTATA 4140 

CATTCCGATG AAATTTTAAC CTCCACCAAA AGTTCTGTTA CTGGTAAGGT ATTTGCTGGT 4200 

ATTCCAACAG TTGCTTCTGA TACATTTGTA TCTACTGATC ATTCTGTTCC TATAGGAAAT 4260 

GGGCATGTTG CCATTACAGC TGTTTCTCCC CACAGAGATG GTTCTGTAAC CTCAACAAAG 4320 

TTGCTGTTTC CTTCTAAGGC AACTTCTGAG CTGAGTCATA GTGCCAAATC TGATGCCGGT 4380 

TTAGTGGGTG GTGGTGAAGA TGGTGACACT GATGATGATG GTGATGATGA TGATGATGAC 4440 

AGAGGTAGTG ATGGCTTATC CATTCATAAG TGTATGTCAT GCTCATCCTA TAGAGAATCA 4500 

CAGGAAAAGG TAATGAATGA TTCAGACACC CACGAAAACA GTCTTATGGA TCAGAATAAT 4560 

CCAATCTCAT ACTCACTATC TGAGAATTCT GAAGAAGATA ATAGAGTCAC AAGTGTATCC 4620 

TCAGACAGTC AAACTGGTAT GGACAGAAGT CCTGGTAAAT CACCATCAGC AAATGGGCTA 4680 

TCCCAAAAGC ACAATGATGG AAAAGAGGAA AATGACATTC AGACTGGTAG TGCTCTGCTT 4740 

CCTCTCAGCC CTGAATCTAA AGCATGGGCA GTTCTGACAA GTGATGAAGA AAGTGGATCA 4800 

GGGCAAGGTA CCTCAGATAG CCTTAATGAG AATGAGACTT CCACAGATTT CAGTTTTGCA 4860 

GACACTAATG AAAAAGATGC TGATGGGATC CTGGCAGCAG GTGACTCAGA AATAACTCCT 4920 

GGATTCCCAC AGTCCCCAAC ATCATCTGTT ACTAGCGAGA ACTCAGAAGT GTTCCACGTT 4980 

TCAGAGGCAG AGGCCAGTAA TAGTAGCCAT GAGTCTCGTA TTGGTCTAGC TGAGGGGTTG 5040 

GAATCCGAGA AGAAGGCAGT TATACCCCTT GTGATCGTGT CAGCCCTGAC TTTTATCTGT 5100 

CTAGTGGTTC TTGTGGGTAT TCTCATCTAC TGGAGGAAAT GCTTCCAGAC TGCACACTTT 5160 

TACTTAGAGG ACAGTACATC CCCTAGAGTT ATATCCACAC CTCCAACACC TATCTTTCCA 5220 

ATTTCAGATG ATGTCGGAGC AATTCCAATA AAGCACTTTC CAAAGCATGT TGCAGATTTA 5280 

CATGCAAGTA GTGGGTTTAC TGAAGAATTT GAGACACTGA AAGAGTTTTA CCAGGAAGTG 5340 

CAGAGCTGTA CTGTTGACTT AGGTATTACA GCAGACAGCT CCAACCACCC AGACAACAAG 5400 

CACAAGAATC GATACATAAA TATCGTTGCC TATGATCATA GCAGGGTTAA GCTAGCACAG 5460 

CTTGCTGAAA AGGATGGCAA ACTGACTGAT TATATCAATG CCAATTATGT TGATGGCTAC 5520 

AACAGACCAA AAGCTTATAT TGCTGCCCAA GGCCCACTGA AATCCACAGC TGAAGATTTC 5580 

TGGAGAATGA TATGGGAACA TAATGTGGAA GTTATTGTCA TGATAACAAA CCTCGTGGAG 5640 

AAAGGAAGGA GAAAATGTGA TCAGTACTGG CCTGCCGATG GGAGTGAGGA GTACGGGAAC 5700 

TTTCTGGTCA CTCAGAAGAG TGTGCAAGTG CTTGCCTATT ATACTGTGAG GAATTTTACT 5760 

CTAAGAAACA CAAAAATAAA AAAGGGCTCC CAGAAAGGAA GACCCAGTGG ACGTGTGGTC 5820 

ACACAGTATC ACTACACGCA GTGGCCTGAC ATGGGAGTAC CAGAGTACTC CCTGCCAGTG 5880 

CTGACCTTTG TGAGAAAGGC AGCCTATGCC AAGCGCCATG CAGTGGGGCC TGTTGTCGTC 5940 

CACTGCAGTG CTGGAGTTGG AAGAACAGGC ACATATATTG TGCTAGACAG TATGTTGCAG 6000 

CAGATTCAAC ACGAAGGAAC TGTCAACATA TTTGGCTTCT TAAAACACAT CCGTTCACAA 6060 

AGAAATTATT TGGTACAAAC TGAGGAGCAA TATGTCTTCA TTCATGATAC ACTGGTTGAG 6120 

GCCATACTTA GTAAAGAAAC TGAGGTGCTG GACAGTCATA TTCATGCCTA TGTTAATGCA 6180 

CTCCTCATTC CTGGACCAGC AGGCAAAACA AAGCTAGAGA AACAATTCCA GCTCCTGAGC 6240 

CAGTCAAATA TACAGCAGAG TGACTATTCT GCAGCCCTAA AGCAATGCAA CAGGGAAAAG 6300 

AATCGAACTT CTTCTATCAT CCCTGTGGAA AGATCAAGGG TTGGCATTTC ATCCCTGAGT 6360 

GGAGAAGGCA CAGACTACAT CAATGCCTCC TATATCATGG GCTATTACCA GAGCAATGAA 6420 

TTCATCATTA CCCAGCACCC TCTCCTTCAT ACCATCAAGG ATTTCTGGAG GATGATATGG 6480 

GACCATAATG CCCAACTGGT GGTTATGATT CCTGATGGCC AAAACATGGC AGAAGATGAA 6540 

TTTGTTTACT GGCCAAATAA AGATGAGCCT ATAAATTGTG AGAGCTTTAA GGTCACTCTT 6600 

ATGGCTGAAG AACACAAATG TCTATCTAAT GAGGAAAAAC TTATAATTCA GGACTTTATC 6660 

TTAGAAGCTA CACAGGATGA TTATGTACTT GAAGTGAGGC ACTTTCAGTG TCCTAAATGG 6720 

CCAAATCCAG ATAGCCCCAT TAGTAAAACT TTTGAACTTA TAAGTGTTAT AAAAGAAGAA 6780 

GCTGCCAATA GGGATGGGCC TATGATTGTT CATGATGAGC ATGGAGGAGT GACGGCAGGA 6840 

ACTTTCTGTG CTCTGACAAC CCTTATGCAC CAACTAGAAA AAGAAAATTC CGTGGATGTT 6900 

TACCAGGTAG CCAAGATGAT CAATCTGATG AGGCCAGGAG TCTTTGCTGA CATTGAGCAG 6960 

TATCAGTTTC TCTACAAAGT GATCCTCAGC CTTGTGAGCA CAAGGCAGGA AGAGAATCCA 7020 

TCCACCTCTC TGGACAGTAA TGGTGCAGCA TTGCCTGATG GAAATATAGC TGAGAGCTTA 7080 

GAGTCTTTAG TTTAACACAG AAAGGGGTGG GGGGACTCAC ATCTGAGCAT TGTTTTCCTC 7140 

TTCCTAAAAT TAGGCAGGAA AATCAGTCTA GTTCTGTTAT CTGTTGATTT CCCATCACCT 7200 

GACAGTAACT TTCATGACAT AGGATTCTGC CGCCAAATTT ATATCATTAA CAATGTGTGC 7260 

CTTTTTGCAA GACTTGTAAT TTACTTATTA TGTTTGAACT AAAATGATTG AATTTTACAG 7320 

TATTTCTAAG AATGGAATTG TGGTATTTTT TTCTGTATTG ATTTTAACAG AAAATTTCAA 7380 

TTTATAGAGG TTAGGAATTC CAAACTACAG AAAATGTTTG TTTTTAGTGT CAAATTTTTA 7440 

GCTGTATTTG TAGCAATTAT CAGGTTTGCT AGAAATATAA CTTTTAATAC AGTAGCCTGT 7500 

AAATAAAACA CTCTTCCATA TGATATTCAA CATTTTACAA CTGCAGTATT CACCTAAAGT 7560 

AGAAATAATC TGTTACTTAT TGTAAATACT GCCCTAGTGT CTCCATGGAC CAAATTTATA 7620 

TTTATAATTG TAGATTTTTA TATTTTACTA CTGAGTCAAG TTTTCTAGTT CTGTGTAATT 7680 

GTTTAGTTTA ATGACGTAGT TCATTAGCTG GTCTTACTCT ACCAGTTTTC TGACATTGTA 7740 

TTGTGTTACC TAAGTCATTA ACTTTGTTTC AGCATGTAAT TTTAACTTTT GTGGAAAATA 7800 

GAAATACCTT CATTTTGAAA GAAGTTTTTA TGAGAATAAC ACCTTACCAA ACATTGTTCA 7860 

AATGGTTTTT ATCCAAGGAA TTGCAAAAAT AAATATAAAT ATTGCCATTA AAAAAAAAAA 7920 
AAAAAAAAAA AAAAAAAAAA AAAA 

Seq ID NO: 573 Protein sequence: 
Protein Accession #: Eos sequence 

1 11 21 31 41 51 

I I I I I I 

MRILKRFLAC IQLLCVCRLD WANGYYRQQR KLVEEIGWSY TGALNQKNWG KKYPTCNSPK 60 

QSPINIDEDL TQVNVNLKKL KFQGWDKTSL ENTFIHNTGK TVEINLTNDY RVSGGVSEMV 120 

FKASKITFHW GKCNMSSDGS EHSLEGQKFP LEMQIYCFDA DRFSSFEEAV KGKGKLRALS 180 

ILFEVGTEEN LDFKAIIDGV ESVSRFGKQA ALDPFILLNL LPNSTDKYYI YNGSLTSPPC 240 

TDTVDWIVFK DTVSISESQL AVFCEVLTMQ QSGYVMLMDY LQNNFREQQY KFSRQVFSSY 300 

TGKEEIHEAV CSSEPENVQA DPENYTSLLV TWERPRVVYD TMIEKFAVLY QQLDGEDQTK 360 

HEFLTDGYQD LGAILNNLLP NMSYVLQIVA ICTNGLYGKY SDQLIVDMPT DNPELDLFPE 420 

LIGTEEIIKE EEEGKDIEEG AIVNPGRDSA TNQIRKKEPQ ISTTTHYNRI GTKYNEAKTN 480 

RSPTRGSEFS GKGDVPNTSL NSTSQPVTKL ATEKDISLTS QTVTELPPHT VEGTSASLND 540 

GSKTVLRSPH MNLSGTAESL NTVSITEYEE ESLLTSFKLD TGAEDSSGSS PATSAIPFIS 600 

ENISQGYIFS SENPETITYD VLIPESARNA SEDSTSSGSE ESLKDPSMEG NVWFPSSTDI 660 

TAQPDVGSGR ESFLQTNYTE IRVDESEKTT KSFSAGPVMS QGPSVTDLEM PHYSTFAYFP 720 

TEVTPHAFTP SSRQQDLVST VNVVYSQTTQ PVYNGETPLQ PSYSSEVFPL VTPLLLDNQI 780 

LNTTPAASSS DSALHATPVF PSVDVSFESI LSSYDGAPLL PFSSASFSSE LFRHLHTVSQ 840 

ILPQVTSATE SDKVPLHASL PVAGGDLLLE PSLAQYSDVL STTHAASETL EFGSESGVLY 900 

KTLMFSQVEP PSSDAMMHAR SSGPEPSYAL SDNEGSQHIF TVSYSSAIPV HDSVGVTYQG 960 

SLFSGPSHIP IPKSSLITPT ASLLQPTHAL SGDGEWSGAS SDSEFLLPDT DGLTALNISS 1020 

PVSVAEFTYT TSVFGDDNKA LSKSEIIYGN ETELQIPSFN EMVYPSESTV MPNMYDNVNK 1080 

LNASLQETSV SISSTKGMFP GSLAHTTTKV FDHEISQVPE NNFSVQPTHT VSQASGDTSL 1140 

KPVLSANSEP ASSDPASSEM LSPSTQLLFY ETSASFSTEV LLQPSFQASD VDTLLKTVLP 1200 

AVPSDPILVE TPKVDKISST MLHLIVSNSA SSENMLHSTS VPVFDVSPTS HMHSASLQGL 1260 

TISYASEKYE PVLLKSESSH QVVPSLYSND ELFQTANLEI NQAHPPKGRH VFATPVLSID 1320 

EPLNTLINKL IHSDEILTST KSSVTGKVFA GIPTVASDTF VSTDHSVPIG NGHVAITAVS 1380 

PHRDGSVTST KLLFPSKATS ELSHSAKSDA GLVGGGEDGD TDDDGDDDDD DRGSDGLSIH 1440 

KCMSCSSYRE SQEKVMNDSD THENSLMDQN NPISYSLSEN SEEDNRVTSV SSDSQTGMDR 1500 

SPGKSPSANG LSQKHNDGKE ENDIQTGSAL LPLSPESKAW AVLTSDEESG SGQGTSDSLN 1560 

ENETSTDFSF ADTNEKDADG ILAAGDSEIT PGFPQSPTSS VTSENSEVFH VSEAEASNSS 1620 

HESRIGLAEG LESEKKAVIP LVIVSALTFI CLVVLVGILI YWRKCFQTAH FYLEDSTSPR 1680 

VISTPPTPIF PISDDVGAIP IKHFPKHVAD LHASSGFTEE FETLKEFYQE VQSCTVDLGI 1740 

TADSSNHPDN KHKNRYINIV AYDHSRVKLA QLAEKDGKLT DYINANYVDG YNRPKAYIAA 1800 

QGPLKSTAED FWRMIWEHNV EVIVMITNLV EKGRRKCDQY WPADGSEEYG NFLVTQKSVQ 1860 

VLAYYTVRNF TLRNTKIKKG SQKGRPSGRV VTQYHYTQWP DMGVPEYSLP VLTFVRKAAY 1920 

AKRHAVGPVV VHCSAGVGRT GTYIVLDSML QQIQHEGTVN IFGFLKHIRS QRNYLVQTEE 1980 

QYVFIHDTLV EAILSKETEV LDSHIHAYVN ALLIPGPAGK TKLEKQFQLL SQSNIQQSDY 2040 

SAALKQCNRE KNRTSSIIPV ERSRVGISSL SGEGTDYINA SYIMGYYQSN EFIITQHPLL 2100 

HTIKDFWRMI WDHNAQLVVM IPDGQNMAED EFVYWPNKDE PINCESFKVT LMAEEHKCLS 2160 

NEEKLIIQDF ILEATQDDYV LEVRHFQCPK WPNPDSPISK TFELISVIKE EAANRDGPMI 2220 

VHDEHGGVTA GTFCALTTLM HQLEKENSVD VYQVAKMINL MRPGVFADIE QYQFLYKVIL 2280 
SLVSTRQEEN PSTSLDSNGA ALPDGNIAES LESLV 

Seq ID NO: 574 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 148-4518 

1 11 21 31 41 51 

I I I I I I 

CACACATACG CACGCACGAT CTCACTTCGA TCTATACACT GGAGGATTAA AACAAACAAA 60 

CAAAAAAAAC ATTTCCTTCG CTCCCCCTCC CTCTCCACTC TGAGAAGCAG AGGAGCCGCA 120 

CGGCGAGGGG CCGCAGACCG TCTGGAAATG CGAATCCTAA AGCGTTTCCT CGCTTGCATT 180 

CAGCTCCTCT GTGTTTGCCG CCTGGATTGG GCTAATGGAT ACTACAGACA ACAGAGAAAA 240 

CTTGTTGAAG AGATTGGCTG GTCCTATACA GGAGCACTGA ATCAAAAAAA TTGGGGAAAG 300 

AAATATCCAA CATGTAATAG CCCAAAACAA TCTCCTATCA ATATTGATGA AGATCTTACA 360 

CAAGTAAATG TGAATCTTAA GAAACTTAAA TTTCAGGGTT GGGATAAAAC ATCATTGGAA 420 

AACACATTCA TTCATAACAC TGGGAAAACA GTGGAAATTA ATCTCACTAA TGACTACCGT 480 

GTCAGCGGAG GAGTTTCAGA AATGGTGTTT AAAGCAAGCA AGATAACTTT TCACTGGGGA 540 

AAATGCAATA TGTCATCTGA TGGATCAGAG CATAGTTTAG AAGGACAAAA ATTTCCACTT 600 

GAGATGCAAA TCTACTGCTT TGATGCGGAC CGATTTTCAA GTTTTGAGGA AGCAGTCAAA 660 

GGAAAAGGGA AGTTAAGAGC TTTATCCATT TTGTTTGAGG TTGGGACAGA AGAAAATTTG 720 

GATTTCAAAG CGATTATTGA TGGAGTCGAA AGTGTTAGTC GTTTTGGGAA GCAGGCTGCT 780 

TTAGATCCAT TCATACTGTT GAACCTTCTG CCAAACTCAA CTGACAAGTA TTACATTTAC 840 

AATGGCTCAT TGACATCTCC TCCCTGCACA GACACAGTTG ACTGGATTGT TTTTAAAGAT 900 

ACAGTTAGCA TCTCTGAAAG CCAGTTGGCT GTTTTTTGTG AAGTTCTTAC AATGCAACAA 960 

TCTGGTTATG TCATGCTGAT GGACTACTTA CAAAACAATT TTCGAGAGCA ACAGTACAAG 1020 

TTCTCTAGAC AGGTGTTTTC CTCATACACT GGAAAGGAAG AGATTCATGA AGCAGTTTGT 1080 

AGTTCAGAAC CAGAAAATGT TCAGGCTGAC CCAGAGAATT ATACCAGCCT TCTTGTTACA 1140 

TGGGAAAGAC CTCGAGTCGT TTATGATACC ATGATTGAGA AGTTTGCAGT TTTGTACCAG 1200 

CAGTTGGATG GAGAGGACCA AACCAAGCAT GAATTTTTGA CAGATGGCTA TCAAGACTTG 1260 

GGTGCTATTC TCAATAATTT GCTACCCAAT ATGAGTTATG TTCTTCAGAT AGTAGCCATA 1320 

TGCACTAATG GCTTATATGG AAAATACAGC GACCAACTGA TTGTCGACAT GCCTACTGAT 1380 

AATCCTGAAC TTGATCTTTT CCCTGAATTA ATTGGAACTG AAGAAATAAT CAAGGAGGAG 1440 

GAAGAGGGAA AAGACATTGA AGAAGGCGCT ATTGTGAATC CTGGTAGAGA CAGTGCTACA 1500 

AACCAAATCA GGAAAAAGGA ACCCCAGATT TCTACCACAA CACACTACAA TCGCATAGGG 1560 

ACGAAATACA ATGAAGCCAA GACTAACCGA TCCCCAACAA GAGGAAGTGA ATTCTCTGGA 1620 

AAGGGTGATG TTCCCAATAC ATCTTTAAAT TCCACTTCCC AACCAGTCAC TAAATTAGCC 1680 

ACAGAAAAAG ATATTTCCTT GACTTCTCAG ACTGTGACTG AACTGCCACC TCACACTGTG 1740 

GAAGGTACTT CAGCCTCTTT AAATGATGGC TCTAAAACTG TTCTTAGATC TCCACATATG 1800 

AACTTGTCGG GGACTGCAGA ATCCTTAAAT ACAGTTTCTA TAACAGAATA TGAGGAGGAG 1860 

AGTTTATTGA CCAGTTTCAA GCTTGATACT GGAGCTGAAG ATTCTTCAGG CTCCAGTCCC 1920 

GCAACTTCTG CTATCCCATT CATCTCTGAG AACATATCCC AAGGGTATAT ATTTTCCTCC 1980 

GAAAACCCAG AGACAATAAC ATATGATGTC CTTATACCAG AATCTGCTAG AAATGCTTCC 2040 

GAAGATTCAA CTTCATCAGG TTCAGAAGAA TCACTAAAGG ATCCTTCTAT GGAGGGAAAT 2100 

GTGTGGTTTC CTAGCTCTAC AGACATAACA GCACAGCCCG ATGTTGGATC AGGCAGAGAG 2160 

AGCTTTCTCC AGACTAATTA CACTGAGATA CGTGTTGATG AATCTGAGAA GACAACCAAG 2220 

TCCTTTTCTG CAGGCCCAGT GATGTCACAG GGTCCCTCAG TTACAGATCT GGAAATGCCA 2280 

CATTATTCTA CCTTTGCCTA CTTCCCAACT GAGGTAACAC CTCATGCTTT TACCCCATCC 2340 

TCCAGACAAC AGGATTTGGT CTCCACGGTC AACGTGGTAT ACTCGCAGAC AACCCAACCG 2400 

GTATACAATG CAGAGGCCAG TAATAGTAGC CATGAGTCTC GTATTGGTCT AGCTGAGGGG 2460 

TTGGAATCCG AGAAGAAGGC AGTTATACCC CTTGTGATCG TGTCAGCCCT GACTTTTATC 2520 

TGTCTAGTGG TTCTTGTGGG TATTCTCATC TACTGGAGGA AATGCTTCCA GACTGCACAC 2580 

TTTTACTTAG AGGACAGTAC ATCCCCTAGA GTTATATCCA CACCTCCAAC ACCTATCTTT 2640 

CCAATTTCAG ATGATGTCGG AGCAATTCCA ATAAAGCACT TTCCAAAGCA TGTTGCAGAT 2700 

TTACATGCAA GTAGTGGGTT TACTGAAGAA TTTGAGACAC TGAAAGAGTT TTACCAGGAA 2760 

GTGCAGAGCT GTACTGTTGA CTTAGGTATT ACAGCAGACA GCTCCAACCA CCCAGACAAC 2820 

AAGCACAAGA ATCGATACAT AAATATCGTT GCCTATGATC ATAGCAGGGT TAAGCTAGCA 2880 

CAGCTTGCTG AAAAGGATGG CAAACTGACT GATTATATCA ATGCCAATTA TGTTGATGGC 2940 

TACAACAGAC CAAAAGCTTA TATTGCTGCC CAAGGCCCAC TGAAATCCAC AGCTGAAGAT 3000 

TTCTGGAGAA TGATATGGGA ACATAATGTG GAAGTTATTG TCATGATAAC AAACCTCGTG 3060 

GAGAAAGGAA GGAGAAAATG TGATCAGTAC TGGCCTGCCG ATGGGAGTGA GGAGTACGGG 3120 

AACTTTCTGG TCACTCAGAA GAGTGTGCAA GTGCTTGCCT ATTATACTGT GAGGAATTTT 3180 

ACTCTAAGAA ACACAAAAAT AAAAAAGGGC TCCCAGAAAG GAAGACCCAG TGGACGTGTG 3240 

GTCACACAGT ATCACTACAC GCAGTGGCCT GACATGGGAG TACCAGAGTA CTCCCTGCCA 3300 

GTGCTGACCT TTGTGAGAAA GGCAGCCTAT GCCAAGCGCC ATGCAGTGGG GCCTGTTGTC 3360 

GTCCACTGCA GTGCTGGAGT TGGAAGAACA GGCACATATA TTGTGCTAGA CAGTATGTTG 3420 

CAGCAGATTC AACACGAAGG AACTGTCAAC ATATTTGGCT TCTTAAAACA CATCCGTTCA 3480 

CAAAGAAATT ATTTGGTACA AACTGAGGAG CAATATGTCT TCATTCATGA TACACTGGTT 3540 

GAGGCCATAC TTAGTAAAGA AACTGAGGTG CTGGACAGTC ATATTCATGC CTATGTTAAT 3600 

GCACTCCTCA TTCCTGGACC AGCAGGCAAA ACAAAGCTAG AGAAACAATT CCAGCTCCTG 3660 

AGCCAGTCAA ATATACAGCA GAGTGACTAT TCTGCAGCCC TAAAGCAATG CAACAGGGAA 3720 

AAGAATCGAA CTTCTTCTAT CATCCCTGTG GAAAGATCAA GGGTTGGCAT TTCATCCCTG 3780 

AGTGGAGAAG GCACAGACTA CATCAATGCC TCCTATATCA TGGGCTATTA CCAGAGCAAT 3840 

GAATTCATCA TTACCCAGCA CCCTCTCCTT CATACCATCA AGGATTTCTG GAGGATGATA 3900 

TGGGACCATA ATGCCCAACT GGTGGTTATG ATTCCTGATG GCCAAAACAT GGCAGAAGAT 3960 

GAATTTGTTT ACTGGCCAAA TAAAGATGAG CCTATAAATT GTGAGAGCTT TAAGGTCACT 4020 

CTTATGGCTG AAGAACACAA ATGTCTATCT AATGAGGAAA AACTTATAAT TCAGGACTTT 4080 

ATCTTAGAAG CTACACAGGA TGATTATGTA CTTGAAGTGA GGCACTTTCA GTGTCCTAAA 4140 

TGGCCAAATC CAGATAGCCC CATTAGTAAA ACTTTTGAAC TTATAAGTGT TATAAAAGAA 4200 

GAAGCTGCCA ATAGGGATGG GCCTATGATT GTTCATGATG AGCATGGAGG AGTGACGGCA 4260 

GGAACTTTCT GTGCTCTGAC AACCCTTATG CACCAACTAG AAAAAGAAAA TTCCGTGGAT 4320 

GTTTACCAGG TAGCCAAGAT GATCAATCTG ATGAGGCCAG GAGTCTTTGC TGACATTGAG 4380 

CAGTATCAGT TTCTCTACAA AGTGATCCTC AGCCTTGTGA GCACAAGGCA GGAAGAGAAT 4440 

CCATCCACCT CTCTGGACAG TAATGGTGCA GCATTGCCTG ATGGAAATAT AGCTGAGAGC 4500 

TTAGAGTCTT TAGTTTAACA CAGAAAGGGG TGGGGGGACT CACATCTGAG CATTGTTTTC 4560 

CTCTTCCTAA AATTAGGCAG GAAAATCAGT CTAGTTCTGT TATCTGTTGA TTTCCCATCA 4620 

CCTGACAGTA ACTTTCATGA CATAGGATTC TGCCGCCAAA TTTATATCAT TAACAATGTG 4680 

TGCCTTTTTG CAAGACTTGT AATTTACTTA TTATGTTTGA ACTAAAATGA TTGAATTTTA 4740 

CAGTATTTCT AAGAATGGAA TTGTGGTATT TTTTTCTGTA TTGATTTTAA CAGAAAATTT 4800 

CAATTTATAG AGGTTAGGAA TTCCAAACTA CAGAAAATGT TTGTTTTTAG TGTCAAATTT 4860 

TTAGCTGTAT TTGTAGCAAT TATCAGGTTT GCTAGAAATA TAACTTTTAA TACAGTAGCC 4920 

TGTAAATAAA ACACTCTTCC ATATGATATT CAACATTTTA CAACTGCAGT ATTCACCTAA 4980 

AGTAGAAATA ATCTGTTACT TATTGTAAAT ACTGCCCTAG TGTCTCCATG GACCAAATTT 5040 

ATATTTATAA TTGTAGATTT TTATATTTTA CTACTGAGTC AAGTTTTCTA GTTCTGTGTA 5100 

ATTGTTTAGT TTAATGACGT AGTTCATTAG CTGGTCTTAC TCTACCAGTT TTCTGACATT 5160 

GTATTGTGTT ACCTAAGTCA TTAACTTTGT TTCAGCATGT AATTTTAACT TTTGTGGAAA 5220 

ATAGAAATAC CTTCATTTTG AAAGAAGTTT TTATGAGAAT AACACCTTAC CAAACATTGT 5280 

TCAAATGGTT TTTATCCAAG GAATTGCAAA AATAAATATA AATATTGCCA TTAAAAAAAA 5340 
AAAAAAAAAA AAAAAAAAAA AAAAAAA 



Seq ID NO: 575 Protein sequence: 
Protein Accession #: Eos sequence 

1 11 21 31 41 51 

I I I I I I 

MRILKRFLAC IQLLCVCRLD WANGYYRQQR KLVEEIGWSY TGALNQKNWG KKYPTCNSPK 60 

QSPINIDEDL TQVNVNLKKL KFQGWDKTSL ENTFIHNTGK TVEINLTNDY RVSGGVSEMV 120 

FKASKITFHW GKCNMSSDGS EHSLEGQKFP LEMQIYCFDA DRFSSFEEAV KGKGKLRALS 180 

ILFEVGTEEN LDFKAIIDGV ESVSRFGKQA ALDPFILLNL LPNSTDKYYI YNGSLTSPPC 240 

TDTVDWIVFK DTVSISESQL AVFCEVLTMQ QSGYVMLMDY LQNNFREQQY KFSRQVFSSY 300 

TGKEEIHEAV CSSEPENVQA DPENYTSLLV TWERPRVVYD TMIEKFAVLY QQLDGEDQTK 360 

HEFLTDGYQD LGAILNNLLP NMSYVLQIVA ICTNGLYGKY SDQLIVDMPT DNPELDLFPE 420 

LIGTEEIIKE EEEGKDIEEG AIVNPGRDSA TNQIRKKEPQ ISTTTHYNRI GTKYNEAKTN 480 

RSPTRGSEFS GKGDVPNTSL NSTSQPVTKL ATEKDISLTS QTVTELPPHT VEGTSASLND 540 

GSKTVLRSPH MNLSGTAESL NTVSITEYEE ESLLTSFKLD TGAEDSSGSS PATSAIPFIS 600 

ENISQGYIFS SENPETITYD VLIPESARNA SEDSTSSGSE ESLKDPSMEG NVWFPSSTDI 660 

TAQPDVGSGR ESFLQTNYTE IRVDESEKTT KSFSAGPVMS QGPSVTDLEM PHYSTFAYFP 720 

TEVTPHAFTP SSRQQDLVST VNVVYSQTTQ PVYNAEASNS SHESRIGLAE GLESEKKAVI 780 

PLVIVSALTF ICLVVLVGIL IYWRKCFQTA HFYLEDSTSP RVISTPPTPI FPISDDVGAI 840 

PIKHFPKHVA DLHASSGFTE EFETLKEFYQ EVQSCTVDLG ITADSSNHPD NKHKNRYINI 900 

VAYDHSRVKL AQLAEKDGKL TDYINANYVD GYNRPKAYIA AQGPLKSTAE DFWRMIWEHN 960 

VEVIVMITNL VEKGRRKCDQ YWPADGSEEY GNFLVTQKSV QVLAYYTVRN FTLRNTKIKK 1020 

GSQKGRPSGR VVTQYHYTQW PDMGVPEYSL PVLTFVRKAA YAKRHAVGPV VVHCSAGVGR 1080 

TGTYIVLDSM LQQIQHEGTV NIFGFLKHIR SQRNYLVQTE EQYVFIHDTL VEAILSKETE 1140 

VLDSHIHAYV NALLIPGPAG KTKLEKQFQL LSQSNIQQSD YSAALKQCNR EKNRTSSIIP 1200 

VERSRVGISS LSGEGTDYIN ASYIMGYYQS NEFIITQHPL LHTIKDFWRM IWDHNAQLVV 1260 

MIPDGQNMAE DEFVYWPNKD EPINCESFKV TLMAEEHKCL SNEEKLIIQD FILEATQDDY 1320 

VLEVRHFQCP KWPNPDSPIS KTFELISVIK EEAANRDGPM IVHDEHGGVT AGTFCALTTL 1380 

MHQLEKENSV DVYQVAKMIN LMRPGVFADI EQYQFLYKVI LSLVSTRQEE NPSTSLDSNG 1440 
AALPDGNIAE SLESLV 

Seq ID NO: 576 DNA sequence 

Nucleic Acid Accession #: EOS sequence 

Coding sequence: 148-4494 

1 11 21 31 41 51 

I I I I I I 

CACACATACG CACGCACGAT CTCACTTCGA TCTATACACT GGAGGATTAA AACAAACAAA 60 

CAAAAAAAAC ATTTCCTTCG CTCCCCCTCC CTCTCCACTC TGAGAAGCAG AGGAGCCGCA 120 

CGGCGAGGGG CCGCAGACCG TCTGGAAATG CGAATCCTAA AGCGTTTCCT CGCTTGCATT 180 

CAGCTCCTCT GTGTTTGCCG CCTGGATTGG GCTAATGGAT ACTACAGACA ACAGAGAAAA 240 

CTTGTTGAAG AGATTGGCTG GTCCTATACA GGAGCACTGA ATCAAAAAAA TTGGGGAAAG 300 

AAATATCCAA CATGTAATAG CCCAAAACAA TCTCCTATCA ATATTGATGA AGATCTTACA 360 

CAAGTAAATG TGAATCTTAA GAAACTTAAA TTTCAGGGTT GGGATAAAAC ATCATTGGAA 420 

AACACATTCA TTCATAACAC TGGGAAAACA GTGGAAATTA ATCTCACTAA TGACTACCGT 480 

GTCAGCGGAG GAGTTTCAGA AATGGTGTTT AAAGCAAGCA AGATAACTTT TCACTGGGGA 540 

AAATGCAATA TGTCATCTGA TGGATCAGAG CATAGTTTAG AAGGACAAAA ATTTCCACTT 600 

GAGATGCAAA TCTACTGCTT TGATGCAGAC CGATTTTCAA GTTTTGAGGA AGCAGTCAAA 660 

GGAAAAGGGA AGTTAAGAGC TTTATCCATT TTGTTTGAGG TTGGGACAGA AGAAAATTTG 720 

GATTTCAAAG CGATTATTGA TGGAGTCGAA AGTGTTAGTC GTTTTGGGAA GCAGGCTGCT 780 

TTAGATCCAT TCATACTGTT GAACCTTCTG CCAAACTCAA CTGACAAGTA TTACATTTAC 840 

AATGGCTCAT TGACATCTCC TCCCTGCACA GACACAGTTG ACTGGATTGT TTTTAAAGAT 900 

ACAGTTAGCA TCTCTGAAAG CCAGTTGGCT GTTTTTTGTG AAGTTCTTAC AATGCAACAA 960 

TCTGGTTATG TCATGCTGAT GGACTACTTA CAAAACAATT TTCGAGAGCA ACAGTACAAG 1020 

TTCTCTAGAC AGGTGTTTTC CTCATACACT GGAAAGGAAG AGATTCATGA AGCAGTTTGT 1080 

AGTTCAGAAC CAGAAAATGT TCAGGCTGAC CCAGAGAATT ATACCAGCCT TCTTGTTACA 1140 

TGGGAAAGAC CTCGAGTCGT TTATGATACC ATGATTGAGA AGTTTGCAGT TTTGTACCAG 1200 

CAGTTGGATG GAGAGGACCA AACCAAGCAT GAATTTTTGA CAGATGGCTA TCAAGACTTG 1260 

GGTGCTATTC TCAATAATTT GCTACCCAAT ATGAGTTATG TTCTTCAGAT AGTAGCCATA 1320 

TGCACTAATG GCTTATATGG AAAATACAGC GACCAACTGA TTGTCGACAT GCCTACTGAT 1380 

AATCCTGAAC TTGATCTTTT CCCTGAATTA ATTGGAACTG AAGAAATAAT CAAGGAGGAG 1440 

GAAGAGGGAA AAGACATTGA AGAAGGCGCT ATTGTGAATC CTGGTAGAGA CAGTGCTACA 1500 

AACCAAATCA GGAAAAAGGA ACCCCAGATT TCTACCACAA CACACTACAA TCGCATAGGG 1560 

ACGAAATACA ATGAAGCCAA GACTAACCGA TCCCCAACAA GAGGAAGTGA ATTCTCTGGA 1620 

AAGGGTGATG TTCCCAATAC ATCTTTAAAT TCCACTTCCC AACCAGTCAC TAAATTAGCC 1680 

ACAGAAAAAG ATATTTCCTT GACTTCTCAG ACTGTGACTG AACTGCCACC TCACACTGTG 1740 

GAAGGTACTT CAGCCTCTTT AAATGATGGC TCTAAAACTG TTCTTAGATC TCCACATATG 1800 

AACTTGTCGG GGACTGCAGA ATCCTTAAAT ACAGTTTCTA TAACAGAATA TGAGGAGGAG 1860 

AGTTTATTGA CCAGTTTCAA GCTTGATACT GGAGCTGAAG ATTCTTCAGG CTCCAGTCCC 1920 

GCAACTTCTG CTATCCCATT CATCTCTGAG AACATATCCC AAGGGTATAT ATTTTCCTCC 1980 

GAAAACCCAG AGACAATAAC ATATGATGTC CTTATACCAG AATCTGCTAG AAATGCTTCC 2040 

GAAGATTCAA CTTCATCAGG TTCAGAAGAA TCACTAAAGG ATCCTTCTAT GGAGGGAAAT 2100 

GTGTGGTTTC CTAGCTCTAC AGACATAACA GCACAGCCCG ATGTTGGATC AGGCAGAGAG 2160 

AGCTTTCTCC AGACTAATTA CACTGAGATA CGTGTTGATG AATCTGAGAA GACAACCAAG 2220 

TCCTTTTCTG CAGGCCCAGT GATGTCACAG GGTCCCTCAG TTACAGATCT GGAAATGCCA 2280 

CATTATTCTA CCTTTGCCTA CTTCCCAACT GAGGTAACAC CTCATGCTTT TACCCCATCC 2340 

TCCAGACAAC AGGATTTGGT CTCCACGGTC AACGTGGTAT ACTCGCAGAC AACCCAACCG 2400 

GTATACAATG AGGCCAGTAA TAGTAGCCAT GAGTCTCGTA TTGGTCTAGC TGAGGGGTTG 2460 

GAATCCGAGA AGAAGGCAGT TATACCCCTT GTGATCGTGT CAGCCCTGAC TTTTATCTGT 2520 

CTAGTGGTTC TTGTGGGTAT TCTCATCTAC TGGAGGAAAT GCTTCCAGAC TGCACACTTT 2580 

TACTTAGAGG ACAGTACATC CCCTAGAGTT ATATCCACAC CTCCAACACC TATCTTTCCA 2640 

ATTTCAGATG ATGTCGGAGC AATTCCAATA AAGCACTTTC CAAAGCATGT TGCAGATTTA 2700 

CATGCAAGTA GTGGGTTTAC TGAAGAATTT GAGGAAGTGC AGAGCTGTAC TGTTGACTTA 2760 

GGTATTACAG CAGACAGCTC CAACCACCCA GACAACAAGC ACAAGAATCG ATACATAAAT 2820 

ATCGTTGCCT ATGATCATAG CAGGGTTAAG CTAGCACAGC TTGCTGAAAA GGATGGCAAA 2880 

CTGACTGATT ATATCAATGC CAATTATGTT GATGGCTACA ACAGACCAAA AGCTTATATT 2940 

GCTGCCCAAG GCCCACTGAA ATCCACAGCT GAAGATTTCT GGAGAATGAT ATGGGAACAT 3000 

AATGTGGAAG TTATTGTCAT GATAACAAAC CTCGTGGAGA AAGGAAGGAG AAAATGTGAT 3060 

CAGTACTGGC CTGCCGATGG GAGTGAGGAG TACGGGAACT TTCTGGTCAC TCAGAAGAGT 3120 

GTGCAAGTGC TTGCCTATTA TACTGTGAGG AATTTTACTC TAAGAAACAC AAAAATAAAA 3180 

AAGGGCTCCC AGAAAGGAAG ACCCAGTGGA CGTGTGGTCA CACAGTATCA CTACACGCAG 3240 

TGGCCTGACA TGGGAGTACC AGAGTACTCC CTGCCAGTGC TGACCTTTGT GAGAAAGGCA 3300 

GCCTATGCCA AGCGCCATGC AGTGGGGCCT GTTGTCGTCC ACTGCAGTGC TGGAGTTGGA 3360 

AGAACAGGCA CATATATTGT GCTAGACAGT ATGTTGCAGC AGATTCAACA CGAAGGAACT 3420 

GTCAACATAT TTGGCTTCTT AAAACACATC CGTTCACAAA GAAATTATTT GGTACAAACT 3480 

GAGGAGCAAT ATGTCTTCAT TCATGATACA CTGGTTGAGG CCATACTTAG TAAAGAAACT 3540 

GAGGTGCTGG ACAGTCATAT TCATGCCTAT GTTAATGCAC TCCTCATTCC TGGACCAGCA 3600 

GGCAAAACAA AGCTAGAGAA ACAATTCCAG CTCCTGAGCC AGTCAAATAT ACAGCAGAGT 3660 

GACTATTCTG CAGCCCTAAA GCAATGCAAC AGGGAAAAGA ATCGAACTTC TTCTATCATC 3720 

CCTGTGGAAA GATCAAGGGT TGGCATTTCA TCCCTGAGTG GAGAAGGCAC AGACTACATC 3780 

AATGCCTCCT ATATCATGGG CTATTACCAG AGCAATGAAT TCATCATTAC CCAGCACCCT 3840 

CTCCTTCATA CCATCAAGGA TTTCTGGAGG ATGATATGGG ACCATAATGC CCAACTGGTG 3900 

GTTATGATTC CTGATGGCCA AAACATGGCA GAAGATGAAT TTGTTTACTG GCCAAATAAA 3960 

GATGAGCCTA TAAATTGTGA GAGCTTTAAG GTCACTCTTA TGGCTGAAGA ACACAAATGT 4020 

CTATCTAATG AGGAAAAACT TATAATTCAG GACTTTATCT TAGAAGCTAC ACAGGATGAT 4080 

TATGTACTTG AAGTGAGGCA CTTTCAGTGT CCTAAATGGC CAAATCCAGA TAGCCCCATT 4140 

AGTAAAACTT TTGAACTTAT AAGTGTTATA AAAGAAGAAG CTGCCAATAG GGATGGGCCT 4200 

ATGATTGTTC ATGATGAGCA TGGAGGAGTG ACGGCAGGAA CTTTCTGTGC TCTGACAACC 4260 

CTTATGCACC AACTAGAAAA AGAAAATTCC GTGGATGTTT ACCAGGTAGC CAAGATGATC 4320 

AATCTGATGA GGCCAGGAGT CTTTGCTGAC ATTGAGCAGT ATCAGTTTCT CTACAAAGTG 4380 

ATCCTCAGCC TTGTGAGCAC AAGGCAGGAA GAGAATCCAT CCACCTCTCT GGACAGTAAT 4440 

GGTGCAGCAT TGCCTGATGG AAATATAGCT GAGAGCTTAG AGTCTTTAGT TTAACACAGA 4500 

AAGGGGTGGG GGGACTCACA TCTGAGCATT GTTTTCCTCT TCCTAAAATT AGGCAGGAAA 4560 

ATCAGTCTAG TTCTGTTATC TGTTGATTTC CCATCACCTG ACAGTAACTT TCATGACATA 4620 

GGATTCTGCC GCCAAATTTA TATCATTAAC AATGTGTGCC TTTTTGCAAG ACTTGTAATT 4680 

TACTTATTAT GTTTGAACTA AAATGATTGA ATTTTACAGT ATTTCTAAGA ATGGAATTGT 4740 

GGTATTTTTT TCTGTATTGA TTTTAACAGA AAATTTCAAT TTATAGAGGT TAGGAATTCC 4800 

AAACTACAGA AAATGTTTGT TTTTAGTGTC AAATTTTTAG CTGTATTTGT AGCAATTATC 4860 

AGGTTTGCTA GAAATATAAC TTTTAATACA GTAGCCTGTA AATAAAACAC TCTTCCATAT 4920 

GATATTCAAC ATTTTACAAC TGCAGTATTC ACCTAAAGTA GAAATAATCT GTTACTTATT 4980 

GTAAATACTG CCCTAGTGTC TCCATGGACC AAATTTATAT TTATAATTGT AGATTTTTAT 5040 

ATTTTACTAC TGAGTCAAGT TTTCTAGTTC TGTGTAATTG TTTAGTTTAA TGACGTAGTT 5100 

CATTAGCTGG TCTTACTCTA CCAGTTTTCT GACATTGTAT TGTGTTACCT AAGTCATTAA 5160 

CTTTGTTTCA GCATGTAATT TTAACTTTTG TGGAAAATAG AAATACCTTC ATTTTGAAAG 5220 

AAGTTTTTAT GAGAATAACA CCTTACCAAA CATTGTTCAA ATGGTTTTTA TCCAAGGAAT 5280 

TGCAAAAATA AATATAAATA TTGCCATTAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 5340 
AAA 



Seq ID NO: 577 Protein sequence: 
Protein Accession #: EOS sequence 

1 11 21 31 41 51 

I I I I I I 

MRILKRFLAC IQLLCVCRLD WANGYYRQQR KLVEEIGWSY TGALNQKNWG KKYPTCNSPK 60 

QSPINIDEDL TQVNVNLKKL KFQGWDKTSL ENTFIHNTGK TVEINLTNDY RVSGGVSEMV 120 

FKASKITFHW GKCNMSSDGS EHSLEGQKFP LEMQIYCFDA DRFSSFEEAV KGKGKLRALS 180 

ILFEVGTEEN LDFKAIIDGV ESVSRFGKQA ALDPFILLNL LPNSTDKYYI YNGSLTSPPC 240 

TDTVDWIVFK DTVSISESQL AVFCEVLTMQ QSGYVMLMDY LQNNFREQQY KFSRQVFSSY 300 

TGKEEIHEAV CSSEPENVQA DPENYTSLLV TWERPRVVYD TMIEKFAVLY QQLDGEDQTK 360 

HEFLTDGYQD LGAILNNLLP NMSYVLQIVA ICTNGLYGKY SDQLIVDMPT DNPELDLFPE 420 

LIGTEEIIKE EEEGKDIEEG AIVNPGRDSA TNQIRKKEPQ ISTTTHYNRI GTKYNEAKTN 480 

RSPTRGSEFS GKGDVPNTSL NSTSQPVTKL ATEKDISLTS QTVTELPPHT VEGTSASLND 540 

GSKTVLRSPH MNLSGTAESL NTVSITEYEE ESLLTSFKLD TGAEDSSGSS PATSAIPFIS 600 

ENISQGYIFS SENPETITYD VLIPESARNA SEDSTSSGSE ESLKDPSMEG NVWFPSSTDI 660 

TAQPDVGSGR ESFLQTNYTE IRVDESEKTT KSFSAGPVMS QGPSVTDLEM PHYSTFAYFP 720 

TEVTPHAFTP SSRQQDLVST VNVVYSQTTQ PVYNEASNSS HESRIGLAEG LESEKKAVIP 780 

LVIVSALTFI CLVVLVGILI YWRKCFQTAH FYLEDSTSPR VISTPPTPIF PISDDVGAIP 840 

IKHFPKHVAD LHASSGFTEE FEEVQSCTVD LGITADSSNH PDNKHKNRYI NIVAYDHSRV 900 

KLAQLAEKDG KLTDYINANY VDGYNRPKAY IAAQGPLKST AEDFWRMIWE HNVEVIVMIT 960 

NLVEKGRRKC DQYWPADGSE EYGNFLVTQK SVQVLAYYTV RNFTLRNTKI KKGSQKGRPS 1020 

GRVVTQYHYT QWPDMGVPEY SLPVLTFVRK AAYAKRHAVG PVVVHCSAGV GRTGTYIVLD 1080 

SMLQQIQHEG TVNIFGFLKH IRSQRNYLVQ TEEQYVFIHD TLVEAILSKE TEVLDSHIHA 1140 

YVNALLIPGP AGKTKLEKQF QLLSQSNIQQ SDYSAALKQC NREKNRTSSI IPVERSRVGI 1200 

SSLSGEGTDY INASYIMGYY QSNEFIITQH PLLHTIKDFW RMIWDHNAQL VVMIPDGQNM 1260 

AEDEFVYWPN KDEPINCESF KVTLMAEEHK CLSNEEKLII QDFILEATQD DYVLEVRHFQ 1320 

CPKWPNPDSP ISKTFELISV IKEEAANRDG PMIVHDEHGG VTAGTFCALT TLMHQLEKEN 1380 

SVDVYQVAKM INLMRPGVFA DIEQYQFLYK VILSLVSTRQ EENPSTSLDS NGAALPDGNI 1440 
AESLESLV 



Seq ID NO: 578 DNA sequence 

Nucleic Acid Accession #: EOS sequence 

Coding sequence: 501-4514 

1 11 21 31 41 51 

I I I I I I 

CACACATACG CACGCACGAT CTCACTTCGA TCTATACACT GGAGGATTAA AACAAACAAA 60 

CAAAAAAAAC ATTTCCTTCG CTCCCCCTCC CTCTCCACTC TGAGAAGCAG AGGAGCCGCA 120 

CGGCGAGGGG CCGCAGACCG TCTGGAAATG CGAATCCTAA AGCGTTTCCT CGCTTGCATT 180 

CAGCTCCTCT GTGTTTGCCG CCTGGATTGG GCTAATGGAT ACTACAGACA ACAGAGAAAA 240 

CTTGTTGAAG AGATTGGCTG GTCCTATACA GGAGCACTGA ATCAAAAAAT TGGGGAAAGA 300 

AATATCCAAC ATGTAATAGC CCAAAACAAT CTCCTATCAA TATTGATGAA GATCTTACAC 360 

AAGTAAATGT GAATCTTAAG AAACTTAAAT TTCAGGGTTG GGATAAAACA TCATTGGAAA 420 

ACACATTCAT TCATAACACT GGGAAAACAG TGGAAATTAA TCTCACTAAT GACTACCGTG 480 

TCAGCGGAGG AGTTTCAGAA ATGGTGTTTA AAGCAAGCAA GATAACTTTT CACTGGGGAA 540 

AATGCAATAT GTCATCTGAT GGATCAGAGC ATAGTTTAGA AGGACAAAAA TTTCCACTTG 600 

AGATGCAAAT CTACTGCTTT GATGCGGACC GATTTTCAAG TTTTGAGGAA GCAGTCAAAG 660 

GAAAAGGGAA GTTAAGAGCT TTATCCATTT TGTTTGAGGT TGGGACAGAA GAAAATTTGG 720 

ATTTCAAAGC GATTATTGAT GGAGTCGAAA GTGTTAGTCG TTTTGGGAAG CAGGCTGCTT 780 

TAGATCCATT CATACTGTTG AACCTTCTGC CAAACTCAAC TGACAAGTAT TACATTTACA 840 

ATGGCTCATT GACATCTCCT CCCTGCACAG ACACAGTTGA CTGGATTGTT TTTAAAGATA 900 

CAGTTAGCAT CTCTGAAAGC CAGTTGGCTG TTTTTTGTGA AGTTCTTACA ATGCAACAAT 960 

CTGGTTATGT CATGCTGATG GACTACTTAC AAAACAATTT TCGAGAGCAA CAGTACAAGT 1020 

TCTCTAGACA GGTGTTTTCC TCATACACTG GAAAGGAAGA GATTCATGAA GCAGTTTGTA 1080 

GTTCAGAACC AGAAAATGTT CAGGCTGACC CAGAGAATTA TACCAGCCTT CTTGTTACAT 1140 

GGGAAAGACC TCGAGTCGTT TATGATACCA TGATTGAGAA GTTTGCAGTT TTGTACCAGC 1200 

AGTTGGATGG AGAGGACCAA ACCAAGCATG AATTTTTGAC AGATGGCTAT CAAGACTTGG 1260 

GTGCTATTCT CAATAATTTG CTACCCAATA TGAGTTATGT TCTTCAGATA GTAGCCATAT 1320 

GCACTAATGG CTTATATGGA AAATACAGCG ACCAACTGAT TGTCGACATG CCTACTGATA 1380 

ATCCTGAACT TGATCTTTTC CCTGAATTAA TTGGAACTGA AGAAATAATC AAGGAGGAGG 1440 

AAGAGGGAAA AGACATTGAA GAAGGCGCTA TTGTGAATCC TGGTAGAGAC AGTGCTACAA 1500 

ACCAAATCAG GAAAAAGGAA CCCCAGATTT CTACCACAAC ACACTACAAT CGCATAGGGA 1560 

CGAAATACAA TGAAGCCAAG ACTAACCGAT CCCCAACAAG AGGAAGTGAA TTCTCTGGAA 1620 

AGGGTGATGT TCCCAATACA TCTTTAAATT CCACTTCCCA ACCAGTCACT AAATTAGCCA 1680 

CAGAAAAAGA TATTTCCTTG ACTTCTCAGA CTGTGACTGA ACTGCCACCT CACACTGTGG 1740 

AAGGTACTTC AGCCTCTTTA AATGATGGCT CTAAAACTGT TCTTAGATCT CCACATATGA 1800 

ACTTGTCGGG GACTGCAGAA TCCTTAAATA CAGTTTCTAT AACAGAATAT GAGGAGGAGA 1860 

GTTTATTGAC CAGTTTCAAG CTTGATACTG GAGCTGAAGA TTCTTCAGGC TCCAGTCCCG 1920 

CAACTTCTGC TATCCCATTC ATCTCTGAGA ACATATCCCA AGGGTATATA TTTTCCTCCG 1980 

AAAACCCAGA GACAATAACA TATGATGTCC TTATACCAGA ATCTGCTAGA AATGCTTCCG 2040 

AAGATTCAAC TTCATCAGGT TCAGAAGAAT CACTAAAGGA TCCTTCTATG GAGGGAAATG 2100 

TGTGGTTTCC TAGCTCTACA GACATAACAG CACAGCCCGA TGTTGGATCA GGCAGAGAGA 2160 

GCTTTCTCCA GACTAATTAC ACTGAGATAC GTGTTGATGA ATCTGAGAAG ACAACCAAGT 2220 

CCTTTTCTGC AGGCCCAGTG ATGTCACAGG GTCCCTCAGT TACAGATCTG GAAATGCCAC 2280 

ATTATTCTAC CTTTGCCTAC TTCCCAACTG AGGTAACACC TCATGCTTTT ACCCCATCCT 2340 

CCAGACAACA GGATTTGGTC TCCACGGTCA ACGTGGTATA CTCGCAGACA ACCCAACCGG 2400 

TATACAATGA GGCCAGTAAT AGTAGCCATG AGTCTCGTAT TGGTCTAGCT GAGGGGTTGG 2460 

AATCCGAGAA GAAGGCAGTT ATACCCCTTG TGATCGTGTC AGCCCTGACT TTTATCTGTC 2520 

TAGTGGTTCT TGTGGGTATT CTCATCTACT GGAGGAAATG CTTCCAGACT GCACACTTTT 2580 

ACTTAGAGGA CAGTACATCC CCTAGAGTTA TATCCACACC TCCAACACCT ATCTTTCCAA 2640 

TTTCAGATGA TGTCGGAGCA ATTCCAATAA AGCACTTTCC AAAGCATGTT GCAGATTTAC 2700 

ATGCAAGTAG TGGGTTTACT GAAGAATTTG AGACACTGAA AGAGTTTTAC CAGGAAGTGC 2760 

AGAGCTGTAC TGTTGACTTA GGTATTACAG CAGACAGCTC CAACCACCCA GACAACAAGC 2820 

ACAAGAATCG ATACATAAAT ATCGTTGCCT ATGATCATAG CAGGGTTAAG CTAGCACAGC 2880 

TTGCTGAAAA GGATGGCAAA CTGACTGATT ATATCAATGC CAATTATGTT GATGGCTACA 2940 

ACAGACCAAA AGCTTATATT GCTGCCCAAG GCCCACTGAA ATCCACAGCT GAAGATTTCT 3000 

GGAGAATGAT ATGGGAACAT AATGTGGAAG TTATTGTCAT GATAACAAAC CTCGTGGAGA 3060 

AAGGAAGGAG AAAATGTGAT CAGTACTGGC CTGCCGATGG GAGTGAGGAG TACGGGAACT 3120 

TTCTGGTCAC TCAGAAGAGT GTGCAAGTGC TTGCCTATTA TACTGTGAGG AATTTTACTC 3180 

TAAGAAACAC AAAAATAAAA AAGGGCTCCC AGAAAGGAAG ACCCAGTGGA CGTGTGGTCA 3240 

CACAGTATCA CTACACGCAG TGGCCTGACA TGGGAGTACC AGAGTACTCC CTGCCAGTGC 3300 

TGACCTTTGT GAGAAAGGCA GCCTATGCCA AGCGCCATGC AGTGGGGCCT GTTGTCGTCC 3360 

ACTGCAGTGC TGGAGTTGGA AGAACAGGCA CATATATTGT GCTAGACAGT ATGTTGCAGC 3420 

AGATTCAACA CGAAGGAACT GTCAACATAT TTGGCTTCTT AAAACACATC CGTTCACAAA 3480 

GAAATTATTT GGTACAAACT GAGGAGCAAT ATGTCTTCAT TCATGATACA CTGGTTGAGG 3540 

CCATACTTAG TAAAGAAACT GAGGTGCTGG ACAGTCATAT TCATGCCTAT GTTAATGCAC 3600 

TCCTCATTCC TGGACCAGCA GGCAAAACAA AGCTAGAGAA ACAATTCCAG CTCCTGAGCC 3660 

AGTCAAATAT ACAGCAGAGT GACTATTCTG CAGCCCTAAA GCAATGCAAC AGGGAAAAGA 3720 

ATCGAACTTC TTCTATCATC CCTGTGGAAA GATCAAGGGT TGGCATTTCA TCCCTGAGTG 3780 

GAGAAGGCAC AGACTACATC AATGCCTCCT ATATCATGGG CTATTACCAG AGCAATGAAT 3840 

TCATCATTAC CCAGCACCCT CTCCTTCATA CCATCAAGGA TTTCTGGAGG ATGATATGGG 3900 

ACCATAATGC CCAACTGGTG GTTATGATTC CTGATGGCCA AAACATGGCA GAAGATGAAT 3960 

TTGTTTACTG GCCAAATAAA GATGAGCCTA TAAATTGTGA GAGCTTTAAG GTCACTCTTA 4020 

TGGCTGAAGA ACACAAATGT CTATCTAATG AGGAAAAACT TATAATTCAG GACTTTATCT 4080 

TAGAAGCTAC ACAGGATGAT TATGTACTTG AAGTGAGGCA CTTTCAGTGT CCTAAATGGC 4140 

CAAATCCAGA TAGCCCCATT AGTAAAACTT TTGAACTTAT AAGTGTTATA AAAGAAGAAG 4200 

CTGCCAATAG GGATGGGCCT ATGATTGTTC ATGATGAGCA TGGAGGAGTG ACGGCAGGAA 4260 

CTTTCTGTGC TCTGACAACC CTTATGCACC AACTAGAAAA AGAAAATTCC GTGGATGTTT 4320 

ACCAGGTAGC CAAGATGATC AATCTGATGA GGCCAGGAGT CTTTGCTGAC ATTGAGCAGT 4380 

ATCAGTTTCT CTACAAAGTG ATCCTCAGCC TTGTGAGCAC AAGGCAGGAA GAGAATCCAT 4440 

CCACCTCTCT GGACAGTAAT GGTGCAGCAT TGCCTGATGG AAATATAGCT GAGAGCTTAG 4500 

AGTCTTTAGT TTAACACAGA AAGGGGTGGG GGGACTCACA TCTGAGCATT GTTTTCCTCT 4560 

TCCTAAAATT AGGCAGGAAA ATCAGTCTAG TTCTGTTATC TGTTGATTTC CCATCACCTG 4620 

ACAGTAACTT TCATGACATA GGATTCTGCC GCCAAATTTA TATCATTAAC AATGTGTGCC 4680 

TTTTTGCAAG ACTTGTAATT TACTTATTAT GTTTGAACTA AAATGATTGA ATTTTACAGT 4740 

ATTTCTAAGA ATGGAATTGT GGTATTTTTT TCTGTATTGA TTTTAACAGA AAATTTCAAT 4800 

TTATAGAGGT TAGGAATTCC AAACTACAGA AAATGTTTGT TTTTAGTGTC AAATTTTTAG 4860 

CTGTATTTGT AGCAATTATC AGGTTTGCTA GAAATATAAC TTTTAATACA GTAGCCTGTA 4920 

AATAAAACAC TCTTCCATAT GATATTCAAC ATTTTACAAC TGCAGTATTC ACCTAAAGTA 4980 

GAAATAATCT GTTACTTATT GTAAATACTG CCCTAGTGTC TCCATGGACC AAATTTATAT 5040 

TTATAATTGT AGATTTTTAT ATTTTACTAC TGAGTCAAGT TTTCTAGTTC TGTGTAATTG 5100 

TTTAGTTTAA TGACGTAGTT CATTAGCTGG TCTTACTCTA CCAGTTTTCT GACATTGTAT 5160 

TGTGTTACCT AAGTCATTAA CTTTGTTTCA GCATGTAATT TTAACTTTTG TGGAAAATAG 5220 

AAATACCTTC ATTTTGAAAG AAGTTTTTAT GAGAATAACA CCTTACCAAA CATTGTTCAA 5280 

ATGGTTTTTA TCCAAGGAAT TGCAAAAATA AATATAAATA TTGCCATTAA AAAAAAAAAA 5340 
AAAAAAAAAA AAAAAAAAAA AAA 

Seq ID NO: 579 Protein sequence: 
Protein Accession #: EOS sequence 

1 11 21 31 41 51 

I I I I I I 

MVFKASKITF HWGKCNMSSD GSEHSLEGQK FPLEMQIYCF DADRFSSFEE AVKGKGKLRA 60 

LSILFEVGTE ENLDFKAIID GVESVSRFGK QAALDPFILL NLLPNSTDKY YIYNGSLTSP 120 

PCTDTVDWIV FKDTVSISES QLAVFCEVLT MQQSGYVMLM DYLQNNFREQ QYKFSRQVFS 180 

SYTGKEEIHE AVCSSEPENV QADPENYTSL LVTWERPRVV YDTMIEKFAV LYQQLDGEDQ 240 

TKHEFLTDGY QDLGAILNNL LPNMSYVLQI VAICTNGLYG KYSDQLIVDM PTDNPELDLF 300 

PELIGTEEII KEEEEGKDIE EGAIVNPGRD SATNQIRKKE PQISTTTHYN RIGTKYNEAK 360

TNRSPTRGSE FSGKGDVPNT SLNSTSQPVT KLATEKDISL TSQTVTELPP HTVEGTSASL 420 

NDGSKTVLRS PHMNLSGTAE SLNTVSITEY EEESLLTSFK LDTGAEDSSG SSPATSAIPF 480 

ISENISQGYI FSSENPETIT YDVLIPESAR NASEDSTSSG SEESLKDPSM EGNVWFPSST 540 

DITAQPDVGS GRESFLQTNY TEIRVDESEK TTKSFSAGPV MSQGPSVTDL EMPHYSTFAY 600 

FPTEVTPHAF TPSSRQQDLV STVNVVYSQT TQPVYNEASN SSHESRIGLA EGLESEKKAV 660

IPLVIVSALT FICLVVLVGI LIYWRKCFQT AHFYLEDSTS PRVISTPPTP IFPISDDVGA 720

IPIKHFPKHV ADLHASSGFT EEFETLKEFY QEVQSCTVDL GITADSSNHP DNKHKNRYIN 780

IVAYDHSRVK LAQLAEKDGK LTDYINANYV DGYNRPKAYI AAQGPLKSTA EDFWRMIWEH 840 

NVEVIVMITN LVEKGRRKCD QYWPADGSEE YGNFLVTQKS VQVLAYYTVR NFTLRNTKIK 900 

KGSQKGRPSG RVVTQYHYTQ WPDMGVPEYS LPVLTFVRKA AYAKRHAVGP VVVHCSAGVG 960

RTGTYIVLDS MLQQIQHEGT VNIFGFLKHI RSQRNYLVQT EEQYVFIHDT LVEAILSKET 1020

EVLDSHIHAY VNALLIPGPA GKTKLEKQFQ LLSQSNIQQS DYSAALKQCN REKNRTSSII 1080 

PVERSRVGIS SLSGEGTDYI NASYIMGYYQ SNEFIITQHP LLHTIKDFWR MIWDHNAQLV 1140 

VMIPDGQNMA EDEFVYWPNK DEPINCESFK VTLMAEEHKC LSNEEKLIIQ DFILEATQDD 1200

YVLEVRHFQC PKWPNPDSPI SKTFELISVI KEEAANRDGP MIVHDEHGGV TAGTFCALTT 1260 

LMHQLEKENS VDVYQVAKMI NLMRPGVFAD IEQYQFLYKV ILSLVSTRQE ENPSTSLDSN 1320 
GAALPDGNIA ESLESLV 

Seq ID NO: 580 DNA sequence 

Nucleic Acid Accession #: EOS sequence

Coding sequence: 148..4632

1 11 21 31 41 51

CACACATACG CACGCACGAT CTCACTTCGA TCTATACACT GGAGGATTAA AACAAACAAA 60

CAAAAAAAAC ATTTCCTTCG CTCCCCCTCC CTCTCCACTC TGAGAAGCAG AGGAGCCGCA 120 

CGGCGAGGGG CCGCAGACCG TCTGGAAATG CGAATCCTAA AACGTTTCCT CGCTTGCATT 180 

CAGCTCCTCT GTGTTTGCCG CCTGGATTGG GCTAATGGAT ACTACAGACA ACAGAGAAAA 240 

CTTGTTGAAG AGATTGGCTG GTCCTATACA GGAGCACTGA ATCAAAAAAA TTGGGGAAAG 300 

AAATATCCAA CATGTAATAG CCCAAAACAA TCTCCTATCA ATATTGATGA AGATCTTACA 360 

CAAGTAAATG TGAATCTTAA GAAACTTAAA TTTCAGGGTT GGGATAAAAC ATCATTGGAA 420 

AACACATTCA TTCATAACAC TGGGAAAACA GTGGAAATTA ATCTCACTAA TGACTACCGT 480 

GTCAGCGGAG GAGTTTCAGA AATGGTGTTT AAAGCAAGCA AGATAACTTT TCACTGGGGA 540 

AAATGCAATA TGTCATCTGA TGGATCAGAG CATAGTTTAG AAGGACAAAA ATTTCCACTT 600 

GAGATGCAAA TCTACTGCTT TGATGCGGAC CGATTTTCAA GTTTTGAGGA AGCAGTCAAA 660

GGAAAAGGGA AGTTAAGAGC TTTATCCATT TTGTTTGAGG TTGGGACAGA AGAAAATTTG 720 

GATTTCAAAG CGATTATTGA TGGAGTCGAA AGTGTTAGTC GTTTTGGGAA GCAGGCTGCT 780 

TTAGATCCAT TCATACTGTT GAACCTTCTG CCAAACTCAA CTGACAAGTA TTACATTTAC 840 

AATGGCTCAT TGACATCTCC TCCCTGCACA GACACAGTTG ACTGGATTGT TTTTAAAGAT 900 

ACAGTTAGCA TCTCTGAAAG CCAGTTGGCT GTTTTTTGTG AAGTTCTTAC AATGCAACAA 960 

TCTGGTTATG TCATGCTGAT GGACTACTTA CAAAACAATT TTCGAGAGCA ACAGTACAAG 1020 

TTCTCTAGAC AGGTGTTTTC CTCATACACT GGAAAGGAAG AGATTCATGA AGCAGTTTGT 1080 

AGTTCAGAAC CAGAAAATGT TCAGGCTGAC CCAGAGAATT ATACCAGCCT TCTTGTTACA 1140 

TGGGAAAGAC CTCGAGTCGT TTATGATACC ATGATTGAGA AGTTTGCAGT TTTGTACCAG 1200

CAGTTGGATG GAGAGGACCA AACCAAGCAT GAATTTTTGA CAGATGGCTA TCAAGACTTG 1260 

GGTGCTATTC TCAATAATTT GCTACCCAAT ATGAGTTATG TTCTTCAGAT AGTAGCCATA 1320 

TGCACTAATG GCTTATATGG AAAATACAGC GACCAACTGA TTGTCGACAT GCCTACTGAT 1380

AATCCTGAAC TTGATCTTTT CCCTGAATTA ATTGGAACTG AAGAAATAAT CAAGGAGGAG 1440 

GAAGAGGGAA AAGACATTGA AGAAGGCGCT ATTGTGAATC CTGGTAGAGA CAGTGCTACA 1500 

AACCAAATCA GGAAAAAGGA ACCCCAGATT TCTACCACAA CACACTACAA TCGCATAGGG 1560 

ACGAAATACA ATGAAGCCAA GACTAACCGA TCCCCAACAA GAGGAAGTGA ATTCTCTGGA 1620 

AAGGGTGATG TTCCCAATAC ATCTTTAAAT TCCACTTCCC AACCAGTCAC TAAATTAGCC 1680 

ACAGAAAAAG ATATTTCCTT GACTTCTCAG ACTGTGACTG AACTGCCACC TCACACTGTG 1740 

GAAGGTACTT CAGCCTCTTT AAATGATGGC TCTAAAACTG TTCTTAGATC TCCACATATG 1800 

AACTTGTCGG GGACTGCAGA ATCCTTAAAT ACAGTTTCTA TAACAGAATA TGAGGAGGAG 1860 

AGTTTATTGA CCAGTTTCAA GCTTGATACT GGAGCTGAAG ATTCTTCAGG CTCCAGTCCC 1920 

GCAACTTCTG CTATCCCATT CATCTCTGAG AACATATCCC AAGGGTATAT ATTTTCCTCC 1980 

GAAAACCCAG AGACAATAAC ATATGATGTC CTTATACCAG AATCTGCTAG AAATGCTTCC 2040 

GAAGATTCAA CTTCATCAGG TTCAGAAGAA TCACTAAAGG ATCCTTCTAT GGAGGGAAAT 2100 

GTGTGGTTTC CTAGCTCTAC AGACATAACA GCACAGCCCG ATGTTGGATC AGGCAGAGAG 2160 

AGCTTTCTCC AGACTAATTA CACTGAGATA CGTGTTGATG AATCTGAGAA GACAACCAAG 2220 

TCCTTTTCTG CAGGCCCAGT GATGTCACAG GGTCCCTCAG TTACAGATCT GGAAATGCCA 2280 

CATTATTCTA CCTTTGCCTA CTTCCCAACT GAGGTAACAC CTCATGCTTT TACCCCATCC 2340 

TCCAGACAAC AGGATTTGGT CTCCACGGTC AACGTGGTAT ACTCGCAGAC AACCCAACCG 2400 

GTATACAATG AGGCCAGTAA TAGTAGCCAT GAGTCTCGTA TTGGTCTAGC TGAGGGGTTG 2460 

GAATCCGAGA AGAAGGCAGT TATACCCCTT GTGATCGTGT CAGCCCTGAC TTTTATCTGT 2520 

CTAGTGGTTC TTGTGGGTAT TCTCATCTAC TGGAGGAAAT GCTTCCAGAC TGCACACTTT 2580 

TACTTAGAGG ACAGTACATC CCCTAGAGTT ATATCCACAC CTCCAACACC TATCTTTCCA 2640 

ATTTCAGATG ATGTCGGAGC AATTCCAATA AAGCACTTTC CAAAGCATGT TGCAGATTTA 2700 

CATGCAAGTA GTGGGTTTAC TGAAGAATTT GAGACACTGA AAGAGTTTTA CCAGGAAGTG 2760 

CAGAGCTGTA CTGTTGACTT AGGTATTACA GCAGACAGCT CCAACCACCC AGACAACAAG 2820 

CACAAGAATC GATACATAAA TATCGTTGCC TATGATCATA GCAGGGTTAA GCTAGCACAG 2880 

CTTGCTGAAA AGGATGGCAA ACTGACTGAT TATATCAATG CCAATTATGT TGATGGCTAC 2940 

AACAGACCAA AAGCTTATAT TGCTGCCCAA GGCCCACTGA AATCCACAGC TGAAGATTTC 3000 

TGGAGAATGA TATGGGAACA TAATGTGGAA GTTATTGTCA TGATAACAAA CCTCGTGGAG 3060 

AAAGGAAGGA GAAAATGTGA TCAGTACTGG CCTGCCGATG GGAGTGAGGA GTACGGGAAC 3120 

TTTCTGGTCA CTCAGAAGAG TGTGCAAGTG CTTGCCTATT ATACTGTGAG GAATTTTACT 3180 

CTAAGAAACA CAAAAATAAA AAAGGGCTCC CAGAAAGGAA GACCCAGTGG ACGTGTGGTC 3240 

ACACAGTATC ACTACACGCA GTGGCCTGAC ATGGGAGTAC CAGAGTACTC CCTGCCAGTG 3300 

CTGACCTTTG TGAGAAAGGC AGCCTATGCC AAGCGCCATG CAGTGGGGCC TGTTGTCGTC 3360 

CACTGCAGTG CTGGAGTTGG AAGAACAGGC ACATATATTG TGCTAGACAG TATGTTGCAG 3420 

CAGATTCAAC ACGAAGGAAC TGTCAACATA TTTGGCTTCT TAAAACACAT CCGTTCACAA 3480 

AGAAATTATT TGGTACAAAC TGAGGAGCAA TATGTCTTCA TTCATGATAC ACTGGTTGAG 3540 

GCCATACTTA GTAAAGAAAC TGAGGTGCTG GACAGTCATA TTCATGCCTA TGTTAATGCA 3600 

CTCCTCATTC CTGGACCAGC AGGCAAAACA AAGCTAGAGA AACAATTCCA GGGTCTCACT 3660 

CTGTCACCCA GGCTGGAGTG CAGAGGCACA ATCTCGGCTC ACTGCAACCT TCCTCTCCCT 3720 

GGCTTAACTG ATCCTCCTAC CTCAGCCTCC CGAGTGGCTG GGACTATACT CCTGAGCCAG 3780 

TCAAATATAC AGCAGAGTGA CTATTCTGCA GCCCTAAAGC AATGCAACAG GGAAAAGAAT 3840 

CGAACTTCTT CTATCATCCC TGTGGAAAGA TCAAGGGTTG GCATTTCATC CCTGAGTGGA 3900 

GAAGGCACAG ACTACATCAA TGCCTCCTAT ATCATGGGCT ATTACCAGAG CAATGAATTC 3960 

ATCATTACCC AGCACCCTCT CCTTCATACC ATCAAGGATT TCTGGAGGAT GATATGGGAC 4020 

CATAATGCCC AACTGGTGGT TATGATTCCT GATGGCCAAA ACATGGCAGA AGATGAATTT 4080 

GTTTACTGGC CAAATAAAGA TGAGCCTATA AATTGTGAGA GCTTTAAGGT CACTCTTATG 4140 

GCTGAAGAAC ACAAATGTCT ATCTAATGAG GAAAAACTTA TAATTCAGGA CTTTATCTTA 4200 

GAAGCTACAC AGGATGATTA TGTACTTGAA GTGAGGCACT TTCAGTGTCC TAAATGGCCA 4260 

AATCCAGATA GCCCCATTAG TAAAACTTTT GAACTTATAA GTGTTATAAA AGAAGAAGCT 4320 

GCCAATAGGG ATGGGCCTAT GATTGTTCAT GATGAGCATG GAGGAGTGAC GGCAGGAACT 4380 

TTCTGTGCTC TGACAACCCT TATGCACCAA CTAGAAAAAG AAAATTCCGT GGATGTTTAC 4440 

CAGGTAGCCA AGATGATCAA TCTGATGAGG CCAGGAGTCT TTGCTGACAT TGAGCAGTAT 4500 

CAGTTTCTCT ACAAAGTGAT CCTCAGCCTT GTGGGCACAA GGCAGGAAGA GAATCCATCC 4560 

ACCTCTCTGG ACAGTAATGG TGCAGCATTG CCTGATGGAA ATATAGCTGA GAGCTTAGAG 4620 

TCTTTAGTTT AACACAGAAA GGGGTGGGGG GACTCACATC TGAGCATTGT TTTCCTCTTC 4680 

CTAAAATTAG GCAGGAAAAT CAGTCTAGTT CTGTTATCTG TTGATTTCCC ATCACCTGAC 4740 

AGTAACTTTC ATGACATAGG ATTCTGCCGC CAAATTTATA TCATTAACAA TGTGTGCCTT 4800 

TTTGCAAGAC TTGTAATTTA CTTATTATGT TTGAACTAAA ATGATTGAAT TTTACAGTAT 4860 

TTCTAAGAAT GGAATTGTGG TATTTTTTTC TGTATTGATT TTAACAGAAA ATTTCAATTT 4920 

ATAGAGGTTA GGAATTCCAA ACTACAGAAA ATGTTTGTTT TTAGTGTCAA ATTTTTAGCT 4980 

GTATTTGTAG CAATTATCAG GTTTGCTAGA AATATAACTT TTAATACAGT AGCCTGTAAA 5040 

TAAAACACTC TTCCATATGA TATTCAACAT TTTACAACTG CAGTATTCAC CTAAAGTAGA 5100 

AATAATCTGT TACTTATTGT AAATACTGCC CTAGTGTCTC CATGGACCAA ATTTATATTT 5160

ATAATTGTAG ATTTTTATAT TTTACTACTG AGTCAAGTTT TCTAGTTCTG TGTAATTGTT 5220

TAGTTTAATG ACGTAGTTCA TTAGCTGGTC TTACTCTACC AGTTTTCTGA CATTGTATTG 5280

TGTTACCTAA GTCATTAACT TTGTTTCAGC ATGTAATTTT AACTTTTGTG GAAAATAGAA 5340 

ATACCTTCAT TTTGAAAGAA GTTTTTATGA GAATAACACC TTACCAAACA TTGTTCAAAT 5400 

GGTTTTTATC CAAGGAATTG CAAAAATAAA TATAAATATT GCCATTAAAA AAAAAAAAAA 5460
AAAAAAAAAA AAAAAAAAAA A 

Seq ID NO: 581 Protein sequence
Protein Accession #: EOS sequence

1 11 21 31 41 51 

I I I I I I 

MRILKRFLAC IQLLCVCRLD WANGYYRQQR KLVEEIGWSY TGALNQKNWG KKYPTCNSPK 60 

QSPINIDEDL TQVNVNLKKL KFQGWDKTSL ENTFIHNTGK TVEINLTNDY RVSGGVSEMV 120 

FKASKITFHW GKCNMSSDGS EHSLEGQKFP LEMQIYCFDA DRFSSFEEAV KGKGKLRALS 180

ILFEVGTEEN LDFKAIIDGV ESVSRFGKQA ALDPFILLNL LPNSTDKYYI YNGSLTSPPC 240

TDTVDWIVFK DTVSISESQL AVFCEVLTMQ QSGYVMLMDY LQNNFREQQY KFSRQVFSSY 300

TGKEEIHEAV CSSEPENVQA DPENYTSLLV TWERPRVVYD TMIEKFAVLY QQLDGEDQTK 360

HEFLTDGYQD LGAILNNLLP NMSYVLQIVA ICTNGLYGKY SDQLIVDMPT DNPELDLFPE 420

LIGTEEIIKE EEEGKDIEEG AIVNPGRDSA TNQIRKKEPQ ISTTTHYNRI GTKYNEAKTN 480 

RSPTRGSEFS GKGDVPNTSL NSTSQPVTKL ATEKDISLTS QTVTELPPHT VEGTSASLND 540

GSKTVLRSPH MNLSGTAESL NTVSITEYEE ESLLTSFKLD TGAEDSSGSS PATSAIPFIS 600 

ENISQGYIFS SENPETITYD VLIPESARNA SEDSTSSGSE ESLKDPSMEG NVWFPSSTDI 660 

TAQPDVGSGR ESFLQTNYTE IRVDESEKTT KSFSAGPVMS QGPSVTDLEM PHYSTFAYFP 720 

TEVTPHAFTP SSRQQDLVST VNVVYSQTTQ PVYNEASNSS HESRIGLAEG LESEKKAVIP 780

LVIVSALTFI CLVVLVGILI YWRKCFQTAH FYLEDSTSPR VISTPPTPIF PISDDVGAIP 840

IKHFPKHVAD LHASSGFTEE FETLKEFYQE VQSCTVDLGI TADSSNHPDN KHKNRYINIV 900

AYDHSRVKLA QLAEKDGKLT DYINANYVDG YNRPKAYIAA QGPLKSTAED FWRMIWEHNV 960

EVIVMITNLV EKGRRKCDQY WPADGSEEYG NFLVTQKSVQ VLAYYTVRNF TLRNTKIKKG 1020 

SQKGRPSGRV VTQYHYTQWP DMGVPEYSLP VLTFVRKAAY AKRHAVGPVV VHCSAGVGRT 1080

GTYIVLDSML QQIQHEGTVN IFGFLKHIRS QRNYLVQTEE QYVFIHDTLV EAILSKETEV 1140

LDSHIHAYVN ALLIPGPAGK TKLEKQFQGL TLSPRLECRG TISAHCNLPL PGLTDPPTSA 1200

SRVAGTILLS QSNIQQSDYS AALKQCNREK NRTSSIIPVE RSRVGISSLS GEGTDYINAS 1260

YIMGYYQSNE FIITQHPLLH TIKDFWRMIW DHNAQLVVMI PDGQNMAEDE FVYWPNKDEP 1320

INCESFKVTL MAEEHKCLSN EEKLIIQDFI LEATQDDYVL EVRHFQCPKW PNPDSPISKT 1380 

FELISVIKEE AANRDGPMIV HDEHGGVTAG TFCALTTLMH QLEKENSVDV YQVAKMINLM 1440 
RPGVFADIEQ YQFLYKVILS LVGTRQEENP STSLDSNGAA LPDGNIAESL ESLV 



Seq ID NO: 582 DNA sequence 

Nucleic Acid Accession #: NM_002851.1

Coding sequence: 148..7092

1 11 21 31 41 51 

I I I I I I 

CACACATACG CACGCACGAT CTCACTTCGA TCTATACACT GGAGGATTAA AACAAACAAA 60 

CAAAAAAAAC ATTTCCTTCG CTCCCCCTCC CTCTCCACTC TGAGAAGCAG AGGAGCCGCA 120 

CGGCGAGGGG CCGCAGACCG TCTGGAAATG CGAATCCTAA AGCGTTTCCT CGCTTGCATT 180 

CAGCTCCTCT GTGTTTGCCG CCTGGATTGG GCTAATGGAT ACTACAGACA ACAGAGAAAA 240 

CTTGTTGAAG AGATTGGCTG GTCCTATACA GGAGCACTGA ATCAAAAAAA TTGGGGAAAG 300 

AAATATCCAA CATGTAATAG CCCAAAACAA TCTCCTATCA ATATTGATGA AGATCTTACA 360 

CAAGTAAATG TGAATCTTAA GAAACTTAAA TTTCAGGGTT GGGATAAAAC ATCATTGGAA 420 

AACACATTCA TTCATAACAC TGGGAAAACA GTGGAAATTA ATCTCACTAA TGACTACCGT 480 

GTCAGCGGAG GAGTTTCAGA AATGGTGTTT AAAGCAAGCA AGATAACTTT TCACTGGGGA 540 

AAATGCAATA TGTCATCTGA TGGATCAGAG CATAGTTTAG AAGGACAAAA ATTTCCACTT 600 

GAGATGCAAA TCTACTGCTT TGATGCGGAC CGATTTTCAA GTTTTGAGGA AGCAGTCAAA 660 

GGAAAAGGGA AGTTAAGAGC TTTATCCATT TTGTTTGAGG TTGGGACAGA AGAAAATTTG 720 

GATTTCAAAG CGATTATTGA TGGAGTCGAA AGTGTTAGTC GTTTTGGGAA GCAGGCTGCT 780 

TTAGATCCAT TCATACTGTT GAACCTTCTG CCAAACTCAA CTGACAAGTA TTACATTTAC 840 

AATGGCTCAT TGACATCTCC TCCCTGCACA GACACAGTTG ACTGGATTGT TTTTAAAGAT 900 

ACAGTTAGCA TCTCTGAAAG CCAGTTGGCT GTTTTTTGTG AAGTTCTTAC AATGCAACAA 960 

TCTGGTTATG TCATGCTGAT GGACTACTTA CAAAACAATT TTCGAGAGCA ACAGTACAAG 1020 

TTCTCTAGAC AGGTGTTTTC CTCATACACT GGAAAGGAAG AGATTCATGA AGCAGTTTGT 1080 

AGTTCAGAAC CAGAAAATGT TCAGGCTGAC CCAGAGAATT ATACCAGCCT TCTTGTTACA 1140 

TGGGAAAGAC CTCGAGTCGT TTATGATACC ATGATTGAGA AGTTTGCAGT TTTGTACCAG 1200 

CAGTTGGATG GAGAGGACCA AACCAAGCAT GAATTTTTGA CAGATGGCTA TCAAGACTTG 1260 

GGTGCTATTC TCAATAATTT GCTACCCAAT ATGAGTTATG TTCTTCAGAT AGTAGCCATA 1320 

TGCACTAATG GCTTATATGG AAAATACAGC GACCAACTGA TTGTCGACAT GCCTACTGAT 1380 

AATCCTGAAC TTGATCTTTT CCCTGAATTA ATTGGAACTG AAGAAATAAT CAAGGAGGAG 1440 

GAAGAGGGAA AAGACATTGA AGAAGGCGCT ATTGTGAATC CTGGTAGAGA CAGTGCTACA 1500 

AACCAAATCA GGAAAAAGGA ACCCCAGATT TCTACCACAA CACACTACAA TCGCATAGGG 1560 

ACGAAATACA ATGAAGCCAA GACTAACCGA TCCCCAACAA GAGGAAGTGA ATTCTCTGGA 1620 

AAGGGTGATG TTCCCAATAC ATCTTTAAAT TCCACTTCCC AACCAGTCAC TAAATTAGCC 1680 

ACAGAAAAAG ATATTTCCTT GACTTCTCAG ACTGTGACTG AACTGCCACC TCACACTGTG 1740 

GAAGGTACTT CAGCCTCTTT AAATGATGGC TCTAAAACTG TTCTTAGATC TCCACATATG 1800 

AACTTGTCGG GGACTGCAGA ATCCTTAAAT ACAGTTTCTA TAACAGAATA TGAGGAGGAG 1860 

AGTTTATTGA CCAGTTTCAA GCTTGATACT GGAGCTGAAG ATTCTTCAGG CTCCAGTCCC 1920 

GCAACTTCTG CTATCCCATT CATCTCTGAG AACATATCCC AAGGGTATAT ATTTTCCTCC 1980 

GAAAACCCAG AGACAATAAC ATATGATGTC CTTATACCAG AATCTGCTAG AAATGCTTCC 2040 

GAAGATTCAA CTTCATCAGG TTCAGAAGAA TCACTAAAGG ATCCTTCTAT GGAGGGAAAT 2100 

GTGTGGTTTC CTAGCTCTAC AGACATAACA GCACAGCCCG ATGTTGGATC AGGCAGAGAG 2160 

AGCTTTCTCC AGACTAATTA CACTGAGATA CGTGTTGATG AATCTGAGAA GACAACCAAG 2220 

TCCTTTTCTG CAGGCCCAGT GATGTCACAG GGTCCCTCAG TTACAGATCT GGAAATGCCA 2280 

CATTATTCTA CCTTTGCCTA CTTCCCAACT GAGGTAACAC CTCATGCTTT TACCCCATCC 2340 

TCCAGACAAC AGGATTTGGT CTCCACGGTC AACGTGGTAT ACTCGCAGAC AACCCAACCG 2400 

GTATACAATG GTGAGACACC TCTTCAACCT TCCTACAGTA GTGAAGTCTT TCCTCTAGTC 2460 

ACCCCTTTGT TGCTTGACAA TCAGATCCTC AACACTACCC CTGCTGCTTC AAGTAGTGAT 2520

TCGGCCTTGC ATGCTACGCC TGTATTTCCC AGTGTCGATG TGTCATTTGA ATCCATCCTG 2580

TCTTCCTATG ATGGTGCACC TTTGCTTCCA TTTTCCTCTG CTTCCTTCAG TAGTGAATTG 2640

TTTCGCCATC TGCATACAGT TTCTCAAATC CTTCCACAAG TTACTTCAGC TACCGAGAGT 2700

GATAAGGTGC CCTTGCATGC TTCTCTGCCA GTGGCTGGGG GTGATTTGCT ATTAGAGCCC 2760 

AGCCTTGCTC AGTATTCTGA TGTGCTGTCC ACTACTCATG CTGCTTCAGA GACGCTGGAA 2820 

TTTGGTAGTG AATCTGGTGT TCTTTATAAA ACGCTTATGT TTTCTCAAGT TGAACCACCC 2880 

AGCAGTGATG CCATGATGCA TGCACGTTCT TCAGGGCCTG AACCTTCTTA TGCCTTGTCT 2940 

GATAATGAGG GCTCCCAACA CATCTTCACT GTTTCTTACA GTTCTGCAAT ACCTGTGCAT 3000 

GATTCTGTGG GTGTAACTTA TCAGGGTTCC TTATTTAGGG GCCCTAGCCA TATACCAATA 3060 

CCTAAGTCTT CGTTAATAAC CCCAACTGCA TCATTACTGC AGCCTACTCA TGCCCTCTCT 3120 

GGTGATGGGG AATGGTCTGG AGCCTCTTCT GATAGTGAAT TTCTTTTACC TGACACAGAT 3180 

GGGCTGACAG CCCTTAACAT TTCTTCACCT GTTTCTGTAG CTGAATTTAC ATATACAACA 3240 

TCTGTGTTTG GTGATGATAA TAAGGCGCTT TCTAAAAGTG AAATAATATA TGGAAATGAG 3300 

ACTGAACTGC AAATTCCTTC TTTCAATGAG ATGGTTTACC CTTCTGAAAG CACAGTCATG 3360 

CCCAACATGT ATGATAATGT AAATAAGTTG AATGCGTCTT TACAAGAAAC CTCTGTTTCC 3420 

ATTTCTAGCA CCAAGGGCAT GTTTCCAGGG TCCCTTGCTC ATACCACCAC TAAGGTTTTT 3480 

GATCATGAGA TTAGTCAAGT TCCAGAAAAT AACTTTTCAG TTCAACCTAC ACATACTGTC 3540 

TCTCAAGCAT CTGGTGACAC TTCGCTTAAA CCTGTGCTTA GTGCAAACTC AGAGCCAGCA 3600 

TCCTCTGACC CTGCTTCTAG TGAAATGTTA TCTCCTTCAA CTCAGCTCTT ATTTTATGAG 3660 

ACCTCAGCTT CTTTTAGTAC TGAAGTATTG CTACAACCTT CCTTTCAGGC TTCTGATGTT 3720 

GACACCTTGC TTAAAACTGT TCTTCCAGCT GTGCCCAGTG ATCCAATATT GGTTGAAACC 3780 

CCCAAAGTTG ATAAAATTAG TTCTACAATG TTGCATCTCA TTGTATCAAA TTCTGCTTCA 3840 

AGTGAAAACA TGCTGCACTC TACATCTGTA CCAGTTTTTG ATGTGTCGCC TACTTCTCAT 3900 

ATGCACTCTG CTTCACTTCA AGGTTTGACC ATTTCCTATG CAAGTGAGAA ATATGAACCA 3960 

GTTTTGTTAA AAAGTGAAAG TTCCCACCAA GTGGTACCTT CTTTGTACAG TAATGATGAG 4020 

TTGTTCCAAA CGGCCAATTT GGAGATTAAC CAGGCCCATC CCCCAAAAGG AAGGCATGTA 4080 
TTTGCTACAC CTGTTTTATC AATTGATGAA CCATTAAATA CACTAATAAA TAAGCTTATA 4140

CATTCCGATG AAATTTTAAC CTCCACCAAA AGTTCTGTTA CTGGTAAGGT ATTTGCTGGT 4200 

ATTCCAACAG TTGCTTCTGA TACATTTGTA TCTACTGATC ATTCTGTTCC TATAGGAAAT 4260 

GGGCATGTTG CCATTACAGC TGTTTCTCCC CACAGAGATG GTTCTGTAAC CTCAACAAAG 4320 

TTGCTGTTTC CTTCTAAGGC AACTTCTGAG CTGAGTCATA GTGCCAAATC TGATGCCGGT 4380 

TTAGTGGGTG GTGGTGAAGA TGGTGACACT GATGATGATG GTGATGATGA TGATGACAGA 4440 

GATAGTGATG GCTTATCCAT TCATAAGTGT ATGTCATGCT CATCCTATAG AGAATCACAG 4500 

GAAAAGGTAA TGAATGATTC AGACACCCAC GAAAACAGTC TTATGGATCA GAATAATCCA 4560 

ATCTCATACT CACTATCTGA GAATTCTGAA GAAGATAATA GAGTCACAAG TGTATCCTCA 4620 

GACAGTCAAA CTGGTATGGA CAGAAGTCCT GGTAAATCAC CATCAGCAAA TGGGCTATCC 4680 

CAAAAGCACA ATGATGGAAA AGAGGAAAAT GACATTCAGA CTGGTAGTGC TCTGCTTCCT 4740 

CTCAGCCCTG AATCTAAAGC ATGGGCAGTT CTGACAAGTG ATGAAGAAAG TGGATCAGGG 4800 

CAAGGTACCT CAGATAGCCT TAATGAGAAT GAGACTTCCA CAGATTTCAG TTTTGCAGAC 4860 

ACTAATGAAA AAGATGCTGA TGGGATCCTG GCAGCAGGTG ACTCAGAAAT AACTCCTGGA 4920 

TTCCCACAGT CCCCAACATC ATCTGTTACT AGCGAGAACT CAGAAGTGTT CCACGTTTCA 4980 

GAGGCAGAGG CCAGTAATAG TAGCCATGAG TCTCGTATTG GTCTAGCTGA GGGGTTGGAA 5040 

TCCGAGAAGA AGGCAGTTAT ACCCCTTGTG ATCGTGTCAG CCCTGACTTT TATCTGTCTA 5100 

GTGGTTCTTG TGGGTATTCT CATCTACTGG AGGAAATGCT TCCAGACTGC ACACTTTTAC 5160 

TTAGAGGACA GTACATCCCC TAGAGTTATA TCCACACCTC CAACACCTAT CTTTCCAATT 5220 

TCAGATGATG TCGGAGCAAT TCCAATAAAG CACTTTCCAA AGCATGTTGC AGATTTACAT 5280 

GCAAGTAGTG GGTTTACTGA AGAATTTGAG ACACTGAAAG AGTTTTACCA GGAAGTGCAG 5340 

AGCTGTACTG TTGACTTAGG TATTACAGCA GACAGCTCCA ACCACCCAGA CAACAAGCAC 5400 

AAGAATCGAT ACATAAATAT CGTTGCCTAT GATCATAGCA GGGTTAAGCT AGCACAGCTT 5460 

GCTGAAAAGG ATGGCAAACT GACTGATTAT ATCAATGCCA ATTATGTTGA TGGCTACAAC 5520 

AGACCAAAAG CTTATATTGC TGCCCAAGGC CCACTGAAAT CCACAGCTGA AGATTTCTGG 5580 

AGAATGATAT GGGAACATAA TGTGGAAGTT ATTGTCATGA TAACAAACCT CGTGGAGAAA 5640 

GGAAGGAGAA AATGTGATCA GTACTGGCCT GCCGATGGGA GTGAGGAGTA CGGGAACTTT 5700 

CTGGTCACTC AGAAGAGTGT GCAAGTGCTT GCCTATTATA CTGTGAGGAA TTTTACTCTA 5760 

AGAAACACAA AAATAAAAAA GGGCTCCCAG AAAGGAAGAC CCAGTGGACG TGTGGTCACA 5820 

CAGTATCACT ACACGCAGTG GCCTGACATG GGAGTACCAG AGTACTCCCT GCCAGTGCTG 5880 

ACCTTTGTGA GAAAGGCAGC CTATGCCAAG CGCCATGCAG TGGGGCCTGT TGTCGTCCAC 5940 

TGCAGTGCTG GAGTTGGAAG AACAGGCACA TATATTGTGC TAGACAGTAT GTTGCAGCAG 6000 

ATTCAACACG AAGGAACTGT CAACATATTT GGCTTCTTAA AACACATCCG TTCACAAAGA 6060 

AATTATTTGG TACAAACTGA GGAGCAATAT GTCTTCATTC ATGATACACT GGTTGAGGCC 6120 

ATACTTAGTA AAGAAACTGA GGTGCTGGAC AGTCATATTC ATGCCTATGT TAATGCACTC 6180 

CTCATTCCTG GACCAGCAGG CAAAACAAAG CTAGAGAAAC AATTCCAGCT CCTGAGCCAG 6240 

TCAAATATAC AGCAGAGTGA CTATTCTGCA GCCCTAAAGC AATGCAACAG GGAAAAGAAT 6300 

CGAACTTCTT CTATCATCCC TGTGGAAAGA TCAAGGGTTG GCATTTCATC CCTGAGTGGA 6360

GAAGGCACAG ACTACATCAA TGCCTCCTAT ATCATGGGCT ATTACCAGAG CAATGAATTC 6420 

ATCATTACCC AGCACCCTCT CCTTCATACC ATCAAGGATT TCTGGAGGAT GATATGGGAC 6480 

CATAATGCCC AACTGGTGGT TATGATTCCT GATGGCCAAA ACATGGCAGA AGATGAATTT 6540 

GTTTACTGGC CAAATAAAGA TGAGCCTATA AATTGTGAGA GCTTTAAGGT CACTCTTATG 6600 

GCTGAAGAAC ACAAATGTCT ATCTAATGAG GAAAAACTTA TAATTCAGGA CTTTATCTTA 6660 

GAAGCTACAC AGGATGATTA TGTACTTGAA GTGAGGCACT TTCAGTGTCC TAAATGGCCA 6720 

AATCCAGATA GCCCCATTAG TAAAACTTTT GAACTTATAA GTGTTATAAA AGAAGAAGCT 6780 

GCCAATAGGG ATGGGCCTAT GATTGTTCAT GATGAGCATG GAGGAGTGAC GGCAGGAACT 6840 

TTCTGTGCTC TGACAACCCT TATGCACCAA CTAGAAAAAG AAAATTCCGT GGATGTTTAC 6900 

CAGGTAGCCA AGATGATCAA TCTGATGAGG CCAGGAGTCT TTGCTGACAT TGAGCAGTAT 6960 

CAGTTTCTCT ACAAAGTGAT CCTCAGCCTT GTGAGCACAA GGCAGGAAGA GAATCCATCC 7020 

ACCTCTCTGG ACAGTAATGG TGCAGCATTG CCTGATGGAA ATATAGCTGA GAGCTTAGAG 7080 

TCTTTAGTTT AACACAGAAA GGGGTGGGGG GACTCACATC TGAGCATTGT TTTCCTCTTC 7140 

CTAAAATTAG GCAGGAAAAT CAGTCTAGTT CTGTTATCTG TTGATTTCCC ATCACCTGAC 7200 

AGTAACTTTC ATGACATAGG ATTCTGCCGC CAAATTTATA TCATTAACAA TGTGTGCCTT 7260 

TTTGCAAGAC TTGTAATTTA CTTATTATGT TTGAACTAAA ATGATTGAAT TTTACAGTAT 7320 

TTCTAAGAAT GGAATTGTGG TATTTTTTTC TGTATTGATT TTAACAGAAA ATTTCAATTT 7380 

ATAGAGGTTA GGAATTCCAA ACTACAGAAA ATGTTTGTTT TTAGTGTCAA ATTTTTAGCT 7440 

GTATTTGTAG CAATTATCAG GTTTGCTAGA AATATAACTT TTAATACAGT AGCCTGTAAA 7500 

TAAAACACTC TTCCATATGA TATTCAACAT TTTACAACTG CAGTATTCAC CTAAAGTAGA 7560 

AATAATCTGT TACTTATTGT AAATACTGCC CTAGTGTCTC CATGGACCAA ATTTATATTT 7620 

ATAATTGTAG ATTTTTATAT TTTACTACTG AGTCAAGTTT TCTAGTTCTG TGTAATTGTT 7680 

TAGTTTAATG ACGTAGTTCA TTAGCTGGTC TTACTCTACC AGTTTTCTGA CATTGTATTG 7740

TGTTACCTAA GTCATTAACT TTGTTTCAGC ATGTAATTTT AACTTTTGTG GAAAATAGAA 7800

ATACCTTCAT TTTGAAAGAA GTTTTTATGA GAATAACACC TTACCAAACA TTGTTCAAAT 7860 

GGTTTTTATC CAAGGAATTG CAAAAATAAA TATAAATATT GCCATTAAAA AAAAAAAAAA 7920 
AAAAAAAAAA AAAAAAAAAA A 

Seq ID NO: 583 Protein sequence 
Protein Accession #: NP_002842.1

1 11 21 31 41 51 

MRILKRFLAC IQLLCVCRLD WANGYYRQQR KLVEEIGWSY TGALNQKNWG KKYPTCNSPK 60 

QSPINIDEDL TQVNVNLKKL KFQGWDKTSL ENTFIHNTGK TVEINLTNDY RVSGGVSEMV 120

FKASKITFHW GKCNMSSDGS EHSLEGQKFP LEMQIYCFDA DRFSSFEEAV KGKGKLRALS 180

ILFEVGTEEN LDFKAIIDGV ESVSRFGKQA ALDPFILLNL LPNSTDKYYI YNGSLTSPPC 240

TDTVDWIVFK DTVSISESQL AVFCEVLTMQ QSGYVMLMDY LQNNFREQQY KFSRQVFSSY 300

TGKEEIHEAV CSSEPENVQA DPENYTSLLV TWERPRVVYD TMIEKFAVLY QQLDGEDQTK 360

HEFLTDGYQD LGAILNNLLP NMSYVLQIVA ICTNGLYGKY SDQLIVDMPT DNPELDLFPE 420

LIGTEEIIKE EEEGKDIEEG AIVNPGRDSA TNQIRKKEPQ ISTTTHYNRI GTKYNEAKTN 480 

RSPTRGSEFS GKGDVPNTSL NSTSQPVTKL ATEKDISLTS QTVTELPPHT VEGTSASLND 540

GSKTVLRSPH MNLSGTAESL NTVSITEYEE ESLLTSFKLD TGAEDSSGSS PATSAIPFIS 600

ENISQGYIFS SENPETITYD VLIPESARNA SEDSTSSGSE ESLKDPSMEG NVWFPSSTDI 660

TAQPDVGSGR ESFLQTNYTE IRVDESEKTT KSFSAGPVMS QGPSVTDLEM PHYSTFAYFP 720

TEVTPHAFTP SSRQQDLVST VNVVYSQTTQ PVYNGETPLQ PSYSSEVFPL VTPLLLDNQI 780

LNTTPAASSS DSALHATPVF PSVDVSFESI LSSYDGAPLL PFSSASFSSE LFRHLHTVSQ 840 

ILPQVTSATE SDKVPLHASL PVAGGDLLLE PSLAQYSDVL STTHAASETL EFGSESGVLY 900 

KTLMFSQVEP PSSDAMMHAR SSGPEPSYAL SDNEGSQHIF TVSYSSAIPV HDSVGVTYQG 960 

SLFSGPSHIP IPKSSLITPT ASLLQPTHAL SGDGEWSGAS SDSEFLLPDT DGLTALNISS 1020

PVSVAEFTYT TSVFGDDNKA LSKSEIIYGN ETELQIPSFN EMVYPSESTV MPNMYDNVNK 1080

LNASLQETSV SISSTKGMFP GSLAHTTTKV FDHEISQVPE NNFSVQPTHT VSQASGDTSL 1140

KPVLSANSEP ASSDPASSEM LSPSTQLLFY ETSASFSTEV LLQPSFQASD VDTLLKTVLP 1200

AVPSDPILVE TPKVDKISST MLHLIVSNSA SSENMLHSTS VPVFDVSPTS HMHSASLQGL 1260

TISYASEKYE PVLLKSESSH QVVPSLYSND ELFQTANLEI NQAHPPKGRH VFATPVLSID 1320 

EPLNTLINKL IHSDEILTST KSSVTGKVFA GIPTVASDTF VSTDHSVPIG NGHVAITAVS 1380 

PHRDGSVTST KLLFPSKATS ELSHSAKSDA GLVGGGEDGD TDDDGDDDDD RDSDGLSIHK 1440 

CMSCSSYRES QEKVMNDSDT HENSLMDQNN PISYSLSENS EEDNRVTSVS SDSQTGMDRS 1500

PGKSPSANGL SQKHNDGKEE NDIQTGSALL PLSPESKAWA VLTSDEESGS GQGTSDSLNE 1560 

NETSTDFSFA DTNEKDADGI LAAGDSEITP GFPQSPTSSV TSENSEVFHV SEAEASNSSH 1620 

ESRIGLAEGL ESEKKAVIPL VIVSALTFIC LVVLVGILIY WRKCFQTAHF YLEDSTSPRV 1680

ISTPPTPIFP ISDDVGAIPI KHFPKHVADL HASSGFTEEF ETLKEFYQEV QSCTVDLGIT 1740 

ADSSNHPDNK HKNRYINIVA YDHSRVKLAQ LAEKDGKLTD YINANYVDGY NRPKAYIAAQ 1800 

GPLKSTAEDF WRMIWEHNVE VIVMITNLVE KGRRKCDQYW PADGSEEYGN FLVTQKSVQV 1860

LAYYTVRNFT LRNTKIKKGS QKGRPSGRVV TQYHYTQWPD MGVPEYSLPV LTFVRKAAYA 1920

KRHAVGPVVV HCSAGVGRTG TYIVLDSMLQ QIQHEGTVNI FGFLKHIRSQ RNYLVQTEEQ 1980

YVFIHDTLVE AILSKETEVL DSHIHAYVNA LLIPGPAGKT KLEKQFQLLS QSNIQQSDYS 2040 

AALKQCNREK NRTSSIIPVE RSRVGISSLS GEGTDYINAS YIMGYYQSNE FIITQHPLLH 2100 

TIKDFWRMIW DHNAQLVVMI PDGQNMAEDE FVYWPNKDEP INCESFKVTL MAEEHKCLSN 2160

EEKLIIQDFI LEATQDDYVL EVRHFQCPKW PNPDSPISKT FELISVIKEE AANRDGPMIV 2220

HDEHGGVTAG TFCALTTLMH QLEKENSVDV YQVAKMINLM RPGVFADIEQ YQFLYKVILS 2280 
LVSTRQEENP STSLDSNGAA LPDGNIAESL ESLV 

Seq ID NO: 584 DNA sequence 

Nucleic Acid Accession #: NM_005688.1

Coding sequence: 126..4439

1 11 21 31 41 51 

I I I I I I 

CCGGGCAGGT GGCTCATGCT CGGGAGCGTG GTTGAGCXSGC TGGCGCGGTT GTCCTGGAGC 60 

AGGGGCGCAG GAATTCTGAT GTGAAACTAA CAGTCTGTGA GCCCTGGAAC CTCCGCTCAG 120 

AGAAGATGAA GGATATCGAC ATAGGAAAAG AGTATATCAT CCCCAGTCCT GGGTATAGAA 180 

GTGTGAGGGA GAGAACCAGC ACTTCTGGGA CGCACAGAGA CCGTGAAGAT TCCAAGTTCA 240 

GGAGAACTCG ACCGTTGGAA TGCCAAGATG CCTTGGAAAC AGCAGCCCGA GCCGAGGGCC 300 

TCTCTCTTGA TGCCTCCATG CATTCTCAGC TCAGAATCCT GGATGAGGAG CATCCCAAGG 360 

GAAAGTACCA TCATGGCTTG AGTGCTCTGA AGCCCATCCG GACTACTTCC AAACACCAGC 420 

ACCCAGTGGA CAATGCTGGG CTTTTTTCCT GTATGACTTT TTCGTGGCTT TCTTCTCTGG 480 

CCCGTGTGGC CCACAAGAAG GGGGAGCTCT CAATGGAAGA CGTGTGGTCT CTGTCCAAGC 540 

ACGAGTCTTC TGACGTGAAC TGCAGAAGAC TAGAGAGACT GTGGCAAGAA GAGCTGAATG 600 

AAGTTGGGCC AGACGCTGCT TCCCTGCGAA GGGTTGTGTG GATCTTCTGC CGCACCAGGC 660 

TCATCCTGTC CATCGTGTGC CTGATGATCA CGCAGCTGGC TGGCTTCAGT GGACCAGCCT 720 

TCATGGTGAA ACACCTCTTG GAGTATACCC AGGCAACAGA GTCTAACCTG CAGTACAGCT 780

TGTTGTTAGT GCTGGGCCTC CTCCTGACGG AAATCGTGCG GTCTTGGTCG CTTGCACTGA 840 

CTTGGGCATT GAATTACCGA ACCGGTGTCC GCTTGCGGGG GGCCATCCTA ACCATGGCAT 900 

TTAAGAAGAT CCTTAAGTTA AAGAACATTA AAGAGAAATC CCTGGGTGAG CTCATCAACA 960 

TTTGCTCCAA CGATGGGCAG AGAATGTTTG AGGCAGCAGC CGTTGGCAGC CTGCTGGCTG 1020 

GAGGACCCGT TGTTGCCATC TTAGGCATGA TTTATAATGT AATTATTCTG GGACCAACAG 1080 

GCTTCCTGGG ATCAGCTGTT TTTATCCTCT TTTACCCAGC AATGATGTTT GCATCACGGC 1140 

TCACAGCATA TTTCAGGAGA AAATGCGTGG CCGCCACGGA TGAACGTGTC CAGAAGATGA 1200 

ATGAAGTTCT TACTTACATT AAATTTATCA AAATGTATGC CTGGGTCAAA GCATTTTCTC 1260 

AGAGTGTTCA AAAAATCCGC GAGGAGGAGC GTCGGATATT GGAAAAAGCC GGGTACTTCC 1320 

AGGGTATCAC TGTGGGTGTG GCTCCCATTG TGGTGGTGAT TGCCAGCGTG GTGACCTTCT 1380 

CTGTTCATAT GACCCTGGGC TTCGATCTGA CAGCAGCACA GGCTTTCACA GTGGTGACAG 1440 

TCTTCAATTC CATGACTTTT GCTTTGAAAG TAACACCGTT TTCAGTAAAG TCCCTCTCAG 1500 

AAGCCTCAGT GGCTGTTGAC AGATTTAAGA GTTTGTTTCT AATGGAAGAG GTTCACATGA 1560 

TAAAGAACAA ACCAGCCAGT CCTCACATCA AGATAGAGAT GAAAAATGCC ACCTTGGCAT 1620 

GGGACTCCTC CCACTCCAGT ATCCAGAACT CGCCCAAGCT GACCCCCAAA ATGAAAAAAG 1680 

ACAAGAGGGC TTCCAGGGGC AAGAAAGAGA AGGTGAGGCA GCTGCAGCGC ACTGAGCATC 1740 

AGGCGGTGCT GGCAGAGCAG AAAGGCCACC TCCTCCTGGA CAGTGACGAG CGGCCCAGTC 1800 

CCGAAGAGGA AGAAGGCAAG CACATCCACC TGGGCCACCT GCGCTTACAG AGGACACTGC 1860

ACAGCATCGA TCTGGAGATC CAAGAGGGTA AACTGGTTGG AATCTGCGGC AGTGTGGGAA 1920

GTGGAAAAAC CTCTCTCATT TCAGCCATTT TAGGCCAGAT GACGCTTCTA GAGGGCAGCA 1980 

TTGCAATCAG TGGAACCTTC GCTTATGTGG CCCAGCAGGC CTGGATCCTC AATGCTACTC 2040 

TGAGAGACAA CATCCTGTTT GGGAAGGAAT ATGATGAAGA AAGATACAAC TCTGTGCTGA 2100 

ACAGCTGCTG CCTGAGGCCT GACCTGGCCA TTCTTCCCAG CAGCGACCTG ACGGAGATTG 2160 

GAGAGCGAGG AGCCAACCTG AGCGGTGGGC AGCGCCAGAG GATCAGCCTT GCCCGGGCCT 2220

TGTATAGTGA CAGGAGCATC TACATCCTGG ACGACCCCCT CAGTGCCTTA GATGCCCATG 2280 

TGGGCAACCA CATCTTCAAT AGTGCTATCC GGAAACATCT CAAGTCCAAG ACAGTTCTGT 2340 

TTGTTACCCA CCAGTTACAG TACCTGGTTG ACTGTGATGA AGTGATCTTC ATGAAAGAGG 2400 

GCTGTATTAC GGAAAGAGGC ACCCATGAGG AACTGATGAA TTTAAATGGT GACTATGCTA 2460 

CCATTTTTAA TAACCTGTTG CTGGGAGAGA CACCGCCAGT TGAGATCAAT TCAAAAAAGG 2520 

AAACCAGTGG TTCACAGAAG AAGTCACAAG ACAAGGGTCC TAAAACAGGA TCAGTAAAGA 2580

AGGAAAAAGC AGTAAAGCCA GAGGAAGGGC AGCTTGTGCA GCTGGAAGAG AAAGGGCAGG 2640 

GTTCAGTGCC CTGGTCAGTA TATGGTGTCT ACATCCAGGC TGCTGGGGGC CCCTTGGCAT 2700 

TCCTGGTTAT TATGGCCCTT TTCATGCTGA ATGTAGGCAG CACCGCCTTC AGCACCTGGT 2760

GGTTGAGTTA CTGGATCAAG CAAGGAAGCG GGAACACCAC TGTGACTCGA GGGAACGAGA 2820 

CCTCGGTGAG TGACAGCATG AAGGACAATC CTCATATGCA GTACTATGCC AGCATCTACG 2880 

CCCTCTCCAT GGCAGTCATG CTGATCCTGA AAGCCATTCG AGGAGTTGTC TTTGTCAAGG 2940 

GCACGCTGCG AGCTTCCTCC CGGCTGCATG ACGAGCTTTT CCGAAGGATC CTTCGAAGCC 3000 

CTATGAAGTT TTTTGACACG ACCCCCACAG GGAGGATTCT CAACAGGTTT TCCAAAGACA 3060 

TGGATGAAGT TGACGTGCGG CTGCCGTTCC AGGCCGAGAT GTTCATCCAG AACGTTATCC 3120 

TGGTGTTCTT CTGTGTGGGA ATGATCGCAG GAGTCTTCCC GTGGTTCCTT GTGGCAGTGG 3180 

GGCCCCTTGT CATCCTCTTT TCAGTCCTGC ACATTGTCTC CAGGGTCCTG ATTCGGGAGC 3240 

TGAAGCGTCT GGACAATATC ACGCAGTCAC CTTTCCTCTC CCACATCACG TCCAGCATAC 3300 

AGGGCCTTGC CACCATCCAC GCCTACAATA AAGGGCAGGA GTTTCTGCAC AGATACCAGG 3360 

AGCTGCTGGA TGACAACCAA GCTCCTTTTT TTTTGTTTAC GTGTGCGATG CGGTGGCTGG 3420 

CTGTGCGGCT GGACCTCATC AGCATCGCCC TCATCACCAC CACGGGGCTG ATGATCGTTC 3480 

TTATGCACGG GCAGATTCCC CCAGCCTATG CGGGTCTCGC CATCTCTTAT GCTGTCCAGT 3540 

TAACGGGGCT GTTCCAGTTT ACGGTCAGAC TGGCATCTGA GACAGAAGCT CGATTCACCT 3600 

CGGTGGAGAG GATCAATCAC TACATTAAGA CTCTGTCCTT GGAAGCACCT GCCAGAATTA 3660 

AGAACAAGGC TCCCTCCCCT GACTGGCCCC AGGAGGGAGA GGTGACCTTT GAGAACGCAG 3720 

AGATGAGGTA CCGAGAAAAC CTCCCTCTTG TCCTAAAGAA AGTATCCTTC ACGATCAAAC 3780 

CTAAAGAGAA GATTGGCATT GTGGGGCGGA CAGGATCAGG GAAGTCCTCG CTGGGGATGG 3840 

CCCTCTTCCG TCTGGTGGAG TTATCTGGAG GCTGCATCAA GATTGATGGA GTGAGAATCA 3900 

GTGATATTGG CCTTGCCGAC CTCCGAAGCA AACTCTCTAT CATTCCTCAA GAGCCGGTGC 3960 

TGTTCAGTGG CACTGTCAGA TCAAATTTGG ACCCCTTCAA CCAGTACACT GAAGACCAGA 4020 

TTTGGGATGC CCTGGAGAGG ACACACATGA AAGAATGTAT TGCTCAGCTA CCTCTGAAAC 4080 

TTGAATCTGA AGTGATGGAG AATGGGGATA ACTTCTCAGT GGGGGAACGG CAGCTCTTGT 4140 

GCATAGCTAG AGCCCTGCTC CGCCACTGTA AGATTCTGAT TTTAGATGAA GCCACAGCTG 4200 

CCATGGACAC AGAGACAGAC TTATTGATTC AAGAGACCAT CCGAGAAGCA TTTGCAGACT 4260 

GTACCATGCT GACCATTGCC CATCGCCTGC ACACGGTTCT AGGCTCCGAT AGGATTATGG 4320 

TGCTGGCCCA GGGACAGGTG GTGGAGTTTG ACACCCCATC GGTCCTTCTG TCCAACGACA 4380 

GTTCCCGATT CTATGCCATG TTTGCTGCTG CAGAGAACAA GGTCGCTGTC AAGGGCTGAC 4440 

TCCTCCCTGT TGACGAAGTC TCTTTTCTTT AGAGCATTGC CATTCCCTGC CTGGGGCGGG 4500 

CCCCTCATCG CGTCCTCCTA CCGAAACCTT GCCTTTCTCG ATTTTATCTT TCGCACAGCA 4560 

GTTCCGGATT GGCTTGTGTG TTTCACTTTT AGGGAGAGTC ATATTTTGAT TATTGTATTT 4620 

ATTCCATATT CATGTAAACA AAATTTAGTT TTTGTTCTTA ATTGCACTCT AAAAGGTTCA 4680 

GGGAACCGTT ATTATAATTG TATCAGAGGC CTATAATGAA GCTTTATACG TGTAGCTATA 4740 

TCTATATATA ATTCTGTACA TAGCCTATAT TTACAGTGAA AATGTAAGCT GTTTATTTTA 4800 

TATTAAAATA AGCACTGTGC TAATAACAGT GCATATTCCT TTCTATCATT TTTGTACAGT 4860 

TTGCTGTACT AGAGATCTGG TTTTGCTATT AGACTGTAGG AAGAGTAGCA TTTCATTCTT 4920 

CTCTAGCTGG TGGTTTCACG GTGCCAGGTT TTCTGGGTGT CCAAAGGAAG ACGTGTGGCA 4980 

ATAGTGGGCC CTCCGACAGC CCCCTCTGCC GCCTCCCCAC AGCCGCTCCA GGGGTGGCTG 5040 

GAGACGGGTG GGCGGCTGGA GACCATGCAG AGCGCCGTGA GTTCTCAGGG CTCCTGCCTT 5100 

CTGTCCTGGT GTCACTTACT GTTTCTGTCA GGAGAGCAGC GGGGCGAAGC CCAGGCCCCT 5160 

TTTCACTCCC TCCATCAAGA ATGGGGATCA CAGAGACATT CCTCCGAGCC GGGGAGTTTC 5220 

TTTCCTGCCT TCTTCTTTTT GCTGTTGTTT CTAAACAAGA ATCAGTCTAT CCACAGAGAG 5280 

TCCCACTGCC TCAGGTTCCT ATGGCTGGCC ACTGCACAGA GCTCTCCAGC TCCAAGACCT 5340 

GTTGGTTCCA AGCCCTGGAG CCAACTGCTG CTTTTTGAGG TGGCACTTTT TCATTTGCCT 5400

ATTCCCACAC CTCCACAGTT CAGTGGCAGG GCTCAGGATT TCGTGGGTCT GTTTTCCTTT 5460 

CTCACCGCAG TCGTCGCACA GTCTCTCTCT CTCTCTCCCC TCAAAGTCTG CAACTTTAAG 5520 

CAGCTCTTGC TAATCAGTGT CTCACACTGG CGTAGAAGTT TTTGTACTGT AAAGAGACCT 5580 

ACCTCAGGTT GCTGGTTGCT GTGTGGTTTG GTGTGTTCCC GCAAACCCCC TTTGTGCTGT 5640 

GGGGCTGGTA GCTCAGGTGG GCGTGGTCAC TGCTGTCATC AGTTGAATGG TCAGCGTTGC 5700 

ATGTCGTGAC CAACTAGACA TTCTGTCGCC TTAGCATGTT TGCTGAACAC CTTGTGGAAG 5760 

CAAAAATCTG AAAATGTGAA TAAAATTATT TTGGATTTTG TAAAAAAAAA AAAAAAAAAA 5820 
AAAAAAAAAA AAAAAAAA 
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MKDIDIGKEY IIPSPGYRSV RERTSTSGTH RDREDSKFRR TRPLECQDAL BTAARAEGLS 60 
LDASMHSQLR ILDEEHPKGK YHHGLSALKP IRTTSKHQHP VDNAGLFSCM TFSWLSSLAR 120 
VAHKKGELSM EDVWSLSKHE SSDVNCRRLE RLWQEELNEV GPDAASLRRV VWIFCRTRLI 180 
LSIVCLMITQ LAGFSGPAFM VKHLLEYTQA TESNLQYSLL LVLGLLLTEI VRSWSLALTW 240 
ALNYRTGVRL RGAILTMAFK KILKLKNIKE KSLGELINIC SNDGQRMFEA AAVGSLLAGG 300 
PWAILGMIY NVIILGPTGF LGSAVFILFY PAMMFASRLT AYFRRKCVAA TDERVQKMNE 360 
VLTYIKFIKM YAWVKAFSQS VQKIREEERR ILEKAGYFQG ITVGVAPIW VIASWTFSV 420 
HMTLGFDLTA AQAFTWTVF NSMTFALKVT PFSVKSLSEA SVAVDRFKSL FLMEEVHMIK 480 
NKPASPHIKI EMKNATLAWD SSHSSIQNSP KLTPKMKKDK RASRGKKEKV RQLQRTEHQA 540 
VLAEQKGHLIi LDSDERPSPE EEEGKHIHLG HLRLQRTLHS IDLEIQEGKL VGICGSVGSG 600 
KTSLISAILG QMTLLEGSIA ISGTFAYVAQ QAWILNATLR DNILFGKEYD EERYNSVLNS 660 
CCLRPDLAIL PSSDLTEIGE RGANLSGGQR QRISLARALY SDRSIYILDD PLSALDAHVG 720 
NHIFNSAIRK HLKSKTVLFV THQLQYLVDC DEVIFMKEGC ITERGTHEEL MNLNGDYATI 780 
FMNIiLLGBTP PVEINSKKET SGSQKKSQDK GPKTGSVKKE KAVKPEEGQL VQhEEKGQGS 840 
VPWSVYGVYI QAAGGPLAFL VIMALFMLNV GSTAFSTWWL SYWIKQGSGN TTVTRGNETS 900
VSDSMKDNPH MQYYASIYAL SMAVMLILKA IRGWFVK6T LRASSRLHDE LPRRILRSPM 960
KFFDTTPTGR ILNRFSKDMD BVDVRLPFQA EMFIQNVILV FPCVGMIAGV PPWFLVAVGP 1020
LVILFSVLHI VSRVLIRELK RLDNITQSPF LSHITSSIQG LATIHAYNKG QEPLHRYQEL 1080
LDDNQAPFFL PTCAMRWLAV RLDLISZALI TTTGLMIVLM HGQIPPAYAG LAISYAVQLT 1140
GLFQPTVRLA SETEARFTSV ERINHYIKTL SLEAPARIKN KAFSPDWPQB GEVTFENAEM 1200
RYRENLPLVL KKVSFTIKPK EKIGIVGRTG SGKSSLGMAL FRLVELSGGC IKIDGVRISD 1260
IGLADLRSKL SIIPQEPVLF SGTVRSNLDP FNQYTEDQIW DALERTHMKE CIAQLPLKLE 1320
SEVMENGDNF SVGERQLLCI ARAUiRHCKI LILDEATAAM DTETDLLIQE TIREAFADCT 1380
MLTIAHRLHT VLGSDRIMVL RQGQWEFDT PSVLLSNDSS RFYAMFAAAE NKVAVKG

Seq ID NO: 586 DNA sequence

Nucleic Acid Accession #: NM_001327.1

Coding sequence: 89..631

1 11 21 31 41 51

| | | | | |

AGCAGGGGGC GCTGTGTGTA CCGAGAATAC GAGAATACCT CGTGGGCCCT GACCTTCTCT 60
CTGAGAGCCG GGCAGAGGCT CCGGAGCCAT GCAGGCCGAA GGCCGGGGCA CAGGGGGTTC 120
GACGGGCGAT GCTGATGGCC CAGGAGGCCC TGGCATTCCT GATGGCCCAG GGGGCAATGC 180
TGGCGGCCCA GGAGAGGCGG GTGCCACGGG CGGCAGAGGT CCCCGGGGCG CAGGGGCAGC 240
AAGGGCCTCG GGGCCGGGAG GAGGCGCCCC GCGGGGTCCG CATGGCGGCG CGGCTTCAGG 300
GCTGAATGGA TGCTGCAGAT GCGGGGCCAG GGGGCCGGAG AGCCGCCTGC TTGAGTTCTA 360
CCTCGCCATG CCTTTCGCGA CACCCATGGA AGCAGAGCTG GCCCGCAGGA GCCTGGCCCA 420
GGATGCCCCA CCGCTTCCCG TGCCAGGGGT GCTTCTGAAG GAGTTCACTG TGTCCGGCAA 480
CATACTGACT ATCCGACTGA CTGCTGCAGA CCACCGCCAA CTGCAGCTCT CCATCAGCTC 540
CTGTCTCCAG CAGCTTTCCC TGTTGATGTG GATCACGCAG TGCTTTCTGC CCGTGTTTTT 600
GGCTCAGCCT CCCTCAGGGC AGAGGCGCTA AGCCCAGCCT GGCGCCCCTT CCTAGGTCAT 660
GCCTCCTCCC CTAGGGAATG GTCCCAGCAC GAGTGGCCAG TTCATTGTGG GGGCCTGATT 720
GTTTGTCGCT GGAGGAGGAC GGCTTACATG TTTGTTTCTG TAGAAAATAA AACTGAGCTA 780



Seq ID NO: 587 Protein sequence
Protein Accession #: NP_001318.1

1 11 21 31 41 51

| | | | | |

MQAEGRGTGG STGDADGPGG PGIPDGPGGN AGGPGEAGAT GGRGPRGAGA ARASGPGGGA 60
PRGPHGGAAS GLNGCCRCGA RGPESRLLEF YLAMPFATPM EAELARRSLA QDAPPLPVPG 120
VLLKEFTVSG NILTIRLTAA DHRQLQLSIS SCLQQLSLLM WITQCFLPVF LAQPPSGQRR 180

Seq ID NO: 588 DNA sequence

Nucleic Acid Accession #: Eos sequence

Coding sequence: 52..459

1 11 21 31 41 51

| | | | | |

CCTCGTGGGC CCTGACCTTC TCTCTGAGAG CCGGGCAGAG GCTCCGGAGC CATGCAGGCC 60
GAAGGCCAGG GCACAGGGGG TTCGACGGGC GATGCTGATG GCCCAGGAGG CCCTGGCATT 120
CCTGATGGCC CAGGGGGCAA TGCTGGCGGC CCAGGAGAGG CGGGTGCCAC GGGCGGCAGA 180
GGTCCCCGGG GCGCAGGGGC AGCAAGGGCC TCGGGGCCGA GAGGAGGCGC CCCGCGGGGT 240
CCGCATGGCG GTGCCGCTTC TGCGCAGGAT GGAAGGTGCC CCTGCGGGGC CAGGAGGCCG 300
GACAGCCGCC TGCTTCAGTT CCGACTGACT GCTGCAGACC ACCGCCAACT GCAGCTCTCC 360
ATCAGCTCCT GTCTCCAGCA GCTTTCCCTG TTGATGTGGA TCACGCAGTG CTTTCTGCCC 420
GTGTTTTTGG CTCAGGCTCC CTCAGGGCAG AGGCGCTAAG CCCAGCCTGG CGCCCCTTCC 480
TAGGTCATGC CTCCTCCCCT AGGGAATGGT CCCAGCACGA GTGGCCAGTT CATTGTGGGG 540
GCCTGATTGT TTGTCGCTGG AGGAGGACGG CTTACATGTT TGTTTCTGTA GAAAATAAAG 600
CTGAGCTA



Seq ID NO: 589 Protein sequence
Protein Accession #: Eos sequence

1 11 21 31 41 51

| | | | | |

MQAEGQGTGG STGDADGPGG PGIPDGPGGN AGGPGEAGAT GGRGPRGAGA ARASGPRGGA 60
PRGPHGGAAS AQDGRCPCGA RRPDSRLLQF RLTAADHRQL QLSISSCLQQ LSLLMWITQC 120
FLPVFLAQAP SGQRR 

Seq ID NO: 590 DNA sequence

Nucleic Acid Accession #: NM_005562.1

Coding sequence: 90..3671

1 11 21 31 41 51

| | | | | |

ACAGCGGAGC GCAGAGTGAG AACCACCAAC CGAGGCGCCG GGCAGCGACC CCTGCAGCGG 60

AGACAGAGAC TGAGCGGCCC GGCACCGCCA TGCCTGCGCT CTGGCTGGGC TGCTGCCTCT 120

GCTTCTCGCT CCTCCTGCCC GCAGCCCGGG CCACCTCCAG GAGGGAAGTC TGTGATTGCA 180

ATGGGAAGTC CAGGCAGTGT ATCTTTGATC GGGAACTTCA CAGACAAACT GGTAATGGAT 240

TCCGCTGCCT CAACTGCAAT GACAACACTG ATGGCATTCA CTGCGAGAAG TGCAAGAATG 300

GCTTTTACCG GCACAGAGAA AGGGACCGCT GTTTGCCCTG CAATTGTAAC TCCAAAGGTT 360

CTCTTAGTGC TCGATGTGAC AACTCTGGAC GGTGCAGCTG TAAACCAGGT GTGACAGGAG 420

CCAGATGCGA CCGATGTCTG CCAGGCTTCC ACATGCTCAC GGATGCGGGG TGCACCCAAG 480

ACCAGAGACT GCTAGACTCC AAGTGTGACT GTGACCCAGC TGGCATCGCA GGGCCCTGTG 540

ACGCGGGCCG CTGTGTCTGC AAGCCAGCTG TTACTGGAGA ACGCTGTGAT AGGTGTCGAT 600

CAGGTTACTA TAATCTGGAT GGGGGGAACC CTGAGGGCTG TACCCAGTGT TTCTGCTATG 660

GGCATTCAGC CAGCTGCCGC AGCTCTGCAG AATACAGTGT CCATAAGATC ACCTCTACCT 720

TTCATCAAGA TGTTGATGGC TGGAAGGCTG TCCAACGAAA TGGGTCTCCT GCAAAGCTCC 780

AATGGTCACA GCGCCATCAA GATGTGTTTA GCTCAGCCCA ACGACTAGAC CCTGTCTATT 840

TTGTGGCTCC TGCCAAATTT CTTGGGAATC AACAGGTGAG CTATGGGCAA AGCCTGTCCT 900 

TTGACTACCG TGTGGACAGA GGAGGCAGAC ACCCATCTGC CCATGATGTG ATTCTGGAAG 960 

GTGCTGGTCT ACGGATCACA GCTCCCTTGA TGCCACTTGG CAAGACACTG CCTTGTGGGC 1020 

TCACCAAGAC TTACACATTC AGGTTAAATG AGCATCCAAG CAATAATTGG AGCCCCCAGC 1080 

TGAGTTACTT TGAGTATCGA AGGTTACTGC GGAATCTCAC AGCCCTCCGC ATCCGAGCTA 1140 

CATATGGAGA ATACAGTACT GGGTACATTG ACAATGTGAC CCTGATTTCA GCCCGCCCTG 1200 

TCTCTGGAGC CCCAGCACCC TGGGTTGAAC AGTGTATATG TCCTGTTGGG TACAAGGGGC 1260 

AATTCTGCCA GGATTGTGCT TCTGGCTACA AGAGAGATTC AGCGAGACTG GGGCCTTTTG 1320 

GCACCTGTAT TCCTTGTAAC TGTCAAGGGG GAGGGGCCTG TGATCCAGAC ACAGGAGATT 1380 

GTTATTCAGG GGATGAGAAT CCTGACATTG AGTGTGCTGA CTGCCCAATT GGTTTCTACA 1440 

ACGATCCGCA CGACCCCCGC AGCTGCAAGC CATGTCCCTG TCATAACGGG TTCAGCTGCT 1500 

CAGTGATGCC GGAGACGGAG GAGGTGGTGT GCAATAACTG CCCTCCCGGG GTCACCGGTG 1560 

CCCGCTGTGA GCTCTGTGCT GATGGCTACT TTGGGGACCC CTTTGGTGAA CATGGCCCAG 1620 

TGAGGCCTTG TCAGCCCTGT CAATGCAACA ACAATGTGGA CCCCAGTGCC TCTGGGAATT 1680 

GTGACCGGCT GACAGGCAGG TGTTTGAAGT GTATCCACAA CACAGCCGGC ATCTACTGCG 1740 

ACCAGTGCAA AGCAGGCTAC TTCGGGGACC CATTGGCTCC CAACCCAGCA GACAAGTGTC 1800 

GAGCTTGCAA CTGTAACCCC ATGGGCTCAG AGCCTGTAGG ATGTCGAAGT GATGGCACCT 1860 

GTGTTTGCAA GCCAGGATTT GGTGGCCCCA ACTGTGAGCA TGGAGCATTC AGCTGTCCAG 1920 

CTTGCTATAA TCAAGTGAAG ATTCAGATGG ATCAGTTTAT GCAGCAGCTT CAGAGAATGG 1980 

AGGCCCTGAT TTCAAAGGCT CAGGGTGGTG ATGGAGTAGT ACCTGATACA GAGCTGGAAG 2040 

GCAGGATGCA GCAGGCTGAG CAGGCCCTTC AGGACATTCT GAGAGATGCC CAGATTTCAG 2100 

AAGGTGCTAG CAGATCCCTT GGTCTCCAGT TGGCCAAGGT GAGGAGCCAA GAGAACAGCT 2160 

ACCAGAGCCG CCTGGATGAC CTCAAGATGA CTGTGGAAAG AGTTCGGGCT CTGGGAAGTC 2220 

AGTACCAGAA CCGAGTTCGG GATACTCACA GGCTCATCAC TCAGATGCAG CTGAGCCTGG 2280 

CAGAAAGTGA AGCTTCCTTG GGAAACACTA ACATTCCTGC CTCAGACCAC TACGTGGGGC 2340 

CAAATGGCTT TAAAAGTCTG GCTCAGGAGG CCACAAGATT AGCAGAAAGC CACGTTGAGT 2400 

CAGCCAGTAA CATGGAGCAA CTGACAAGGG AAACTGAGGA CTATTCCAAA CAAGCCCTCT 2460 

CACTGGTGCG CAAGGCCCTG CATGAAGGAG TCGGAAGCGG AAGCGGTAGC CCGGACGGTG 2520 

CTGTGGTGCA AGGGCTTGTG GAAAAATTGG AGAAAACCAA GTCCCTGGCC CAGCAGTTGA 2580 

CAAGGGAGGC CACTCAAGCG GAAATTGAAG CAGATAGGTC TTATCAGCAC AGTCTCCGCC 2640 

TCCTGGATTC AGTGTCTCGG CTTCAGGGAG TCAGTGATCA GTCCTTTCAG GTGGAAGAAG 2700 

CAAAGAGGAT CAAACAAAAA GCGGATTCAC TCTCAACGCT GGTAACCAGG CATATGGATG 2760 

AGTTCAAGCG TACACAAAAG AATCTGGGAA ACTGGAAAGA AGAAGCACAG CAGCTCTTAC 2820 

AGAATGGAAA AAGTGGGAGA GAGAAATCAG ATCAGCTGCT TTCCCGTGCC AATCTTGCTA 2880 

AAAGCAGAGC ACAAGAAGCA CTGAGTATGG GCAATGCCAC TTTTTATGAA GTTGAGAGCA 2940 

TCCTTAAAAA CCTCAGAGAG TTTGACCTGC AGGTGGACAA CAGAAAAGCA GAAGCTGAAG 3000 

AAGCCATGAA GAGACTCTCC TACATCAGCC AGAAGGTTTC AGATGCCAGT GACAAGACCC 3060 

AGCAAGCAGA AAGAGCCCTG GGGAGCGCTG CTGCTGATGC ACAGAGGGCA AAGAATGGGG 3120 

CCGGGGAGGC CCTGGAAATC TCCAGTGAGA TTGAACAGGA GATTGGGAGT CTGAACTTGG 3180 

AAGCCAATGT GACAGCAGAT GGAGCCTTGG CCATGGAAAA GGGACTGGCC TCTCTGAAGA 3240 

GTGAGATGAG GGAAGTGGAA GGAGAGCTGG AAAGGAAGGA GCTGGAGTTT GACACGAATA 3300 

TGGATGCAGT ACAGATGGTG ATTACAGAAG CCCAGAAGGT TGATACCAGA GCCAAGAACG 3360 

CTGGGGTTAC AATCCAAGAC ACACTCAACA CATTAGACGG CCTCCTGCAT CTGATGGACC 3420 

AGCCTCTCAG TGTAGATGAA GAGGGGCTGG TCTTACTGGA GCAGAAGCTT TCCCGAGCCA 3480 

AGACCCAGAT CAACAGCCAA CTGCGGCCCA TGATGTCAGA GCTGGAAGAG AGGGCACGTC 3540 

AGCAGAGGGG CCACCTCCAT TTGCTGGAGA CAAGCATAGA TGGGATTCTG GCTGATGTGA 3600 

AGAACTTGGA GAACATTAGG GACAACCTGC CCCCAGGCTG CTACAATACC CAGGCTCTTG 3660 

AGCAACAGTG AAGCTGCCAT AAATATTTCT CAACTGAGGT TCTTGGGATA CAGATCTCAG 3720 

GGCTCGGGAG CCATGTCATG TGAGTGGGTG GGATGGGGAC ATTTGAACAT GTTTAATGGG 3780 

TATGCTCAGG TCAACTGACC TGACCCCATT CCTGATCCCA TGGCCAGGTG GTTGTCTTAT 3840 

TGCACCATAC TCCTTGCTTC CTGATGCTGG GCAATGAGGC AGATAGCACT GGGTGTGAGA 3900 

ATGATCAAGG ATCTGGACCC CAAAGAATAG ACTGGATGGA AAGACAAACT GCACAGGCAG 3960 

ATGTTTGCCT CATAATAGTC GTAAGTGGAG TCCTGGAATT TGGACAAGTG CTGTTGGGAT 4020 

ATAGTCAACT TATTCTTTGA GTAATGTGAC TAAAGGAAAA AACTTTGACT TTGCCCAGGC 4080 

ATGAAATTCT TCCTAATGTC AGAACAGAGT GCAACCCAGT CACACTGTGG CCAGTAAAAT 4140 

ACTATTGCCT CATATTGTCC TCTGCAAGCT TCTTGCTGAT CAGAGTTCCT CCTACTTACA 4200 

ACCCAGGGTG TGAACATGTT CTCCATTTTC AAGCTGGAAG AAGTGAGCAG TGTTGGAGTG 4260 

AGGACCTGTA AGGCAGGCCC ATTCAGAGCT ATGGTGCTTG CTGGTGCCTG CCACCTTCAA 4320 

GTTCTGGACC TGGGCATGAC ATCCTTTCTT TTAATGATGC CATGGCAACT TAGAGATTGC 4380 

ATTTTTATTA AAGCATTTCC TACCAGCAAA GCAAATGTTG GGAAAGTATT TACTTTTTCG 4440 

GTTTCAAAGT GATAGAAAAG TGTGGCTTGG GCATTGAAAG AGGTAAAATT CTCTAGATTT 4500 

ATTAGTCCTA ATTCAATCCT ACTTTTCGAA CACCAAAAAT GATGCGCATC AATGTATTTT 4560 

ATCTTATTTT CTCAATCTCC TCTCTCTTTC CTCCACCCAT AATAAGAGAA TGTTCCTACT 4620 

CACACTTCAG CTGGGTCACA TCCATCCCTC CATTCATCCT TCCATCCATC TTTCCATCCA 4680 

TTACCTCCAT CCATCCTTCC AACATATATT TATTGAGTAC CTACTGTGTG CCAGGGGCTG 4740 

GTGGGACAGT GGTGACATAG TCTCTGCCCT CATAGAGTTG ATTGTCTAGT GAGGAAGACA 4800 

AGCATTTTTA AAAAATAAAT TTAAACTTAC AAACTTTGTT TGTCACAAGT GGTGTTTATT 4860 

GCAATAACCG CTTGGTTTGC AACCTCTTTG CTCAACAGAA CATATGTTGC AAGACCCTCC 4920 

CATGGGGGCA CTTGAGTTTT GGCAAGGCTG ACAGAGCTCT GGGTTGTGCA CATTTCTTTG 4980 

CATTCCAGCT GTCACTCTGT GCCTTTCTAC AACTGATTGC AACAGACTGT TGAGTTATGA 5040 

TAACACCAGT GGGAATTGCT GGAGGAACCA GAGGCACTTC CACCTTGGCT GGGAAGACTA 5100 

TGGTGCTGCC TTGCTTCTGT ATTTCCTTGG ATTTTCCTGA AAGTGTTTTT AAATAAAGAA 5160 
CAATTGTTAG ATGCC 

Seq ID NO: 591 Protein sequence 
Protein Accession #: NP_005553.1 

1 11 21 31 41 51 

| | | | | | 

MPALWLGCCL CFSLLLPAAR ATSRREVCDC NGKSRQCIFD RELHRQTGNG FRCLNCNDNT 60 

DGIHCEKCKN GFYRHRERDR CLPCNCNSKG SLSARCDNSG RCSCKPGVTG ARCDRCLPGF 120 

HMLTDAGCTQ DQRLLDSKCD CDPAGIAGPC DAGRCVCKPA VTGERCDRCR SGYYNLDGGN 180 

PEGCTQCFCY GHSASCRSSA EYSVHKITST FHQDVDGWKA VQRNGSPAKL QWSQRHQDVF 240 

SSAQRLDPVY FVAPAKFLGN QQVSYGQSLS FDYRVDRGGR HPSAHDVILE GAGLRITAPL 300 

MPLGKTLPCG LTKTYTFRLN EHPSNNWSPQ LSYFEYRRLL RNLTALRIRA TYGEYSTGYI 360 

DNVTLISARP VSGAPAPWVE QCICPVGYKG QFCQDCASGY KRDSARLGPF GTCIPCNCQG 420 

GGACDPDTGD CYSGDENPDI ECADCPIGFY NDPHDPRSCK PCPCHNGFSC SVMPETEEVV 480 

CNNCPPGVTG ARCELCADGY FGDPFGEHGP VRPCQPCQCN NNVDPSASGN CDRLTGRCLK 540 

CIHNTAGIYC DQCKAGYFGD PLAPNPADKC RACNCNPMGS EPVGCRSDGT CVCKPGFGGP 600 

NCEHGAFSCP ACYNQVKIQM DQFMQQLQRM EALISKAQGG DGVVPDTELE GRMQQAEQAL 660 

QDILRDAQIS EGASRSLGLQ LAKVRSQENS YQSRLDDLKM TVERVRALGS QYQNRVRDTH 720 

RLITQMQLSL AESEASLGNT NIPASDHYVG PNGFKSLAQE ATRLAESHVE SASNMEQLTR 780 

ETEDYSKQAL SLVRKALHEG VGSGSGSPDG AVVQGLVEKL EKTKSLAQQL TREATQAEIE 840 

ADRSYQHSLR LLDSVSRLQG VSDQSFQVEE AKRIKQKADS LSTLVTRHMD EFKRTQKNLG 900 

NWKEEAQQLL QNGKSGREKS DQLLSRANLA KSRAQEALSM GNATFYEVES ILKNLREFDL 960 

QVDNRKAEAE EAMKRLSYIS QKVSDASDKT QQAERALGSA AADAQRAKNG AGEALEISSE 1020 

IEQEIGSLNL EANVTADGAL AMEKGLASLK SEMREVEGEL ERKELEFDTN MDAVQMVITE 1080 

AQKVDTRAKN AGVTIQDTLN TLDGLLHLMD QPLSVDEEGL VLLEQKLSRA KTQINSQLRP 1140 
MMSELEERAR QQRGHLHLLE TSIDGILADV KNLENIRDNL PPGCYNTQAL EQQ 

Seq ID NO: 592 DNA sequence 

Nucleic Acid Accession #: AF101051.1 

Coding sequence: 221..856 

1 11 21 31 41 51 

| | | | | | 

GAGCAACCTC AGCTTCTAGT ATCCAGACTC CAGCGCCGCC CCGGGCGCGG ACCCCAACCC 60 

CGACCCAGAG CTTCTCCAGC GGCGGCGCAG CGAGCAGGGC TCCCCGCCTT AACTTCCTCC 120 

GCGGGGCCCA GCCACCTTCG GGAGTCCGGG TTGCCCACCT GCAAACTCTC CGCCTTCTGC 180 

ACCTGCCACC CCTGAGCCAG CGCGGGCGCC CGAGCGAGTC ATGGCCAACG CGGGGCTGCA 240 

GCTGTTGGGC TTCATTCTCG CCTTCCTGGG ATGGATCGGC GCCATCGTCA GCACTGCCCT 300 

GCCCCAGTGG AGGATTTACT CCTATGCCGG CGACAACATC GTGACCGCCC AGGCCATGTA 360 

CGAGGGGCTG TGGATGTCCT GCGTGTCGCA GAGCACCGGG CAGATCCAGT GCAAAGTCTT 420 

TGACTCCTTG CTGAATCTGA GCAGCACATT GCAAGCAACC CGTGCCTTGA TGGTGGTTGG 480 

CATCCTCCTG GGAGTGATAG CAATCTTTGT GGCCACCGTT GGCATGAAGT GTATGAAGTG 540 

CTTGGAAGAC GATGAGGTGC AGAAGATGAG GATGGCTGTC ATTGGGGGTG CGATATTTCT 600 

TCTTGCAGGT CTGGCTATTT TAGTTGCCAC AGCATGGTAT GGCAATAGAA TCGTTCAAGA 660 

ATTCTATGAC CCTATGACCC CAGTCAATGC CAGGTACGAA TTTGGTCAGG CTCTCTTCAC 720 

TGGCTGGGCT GCTGCTTCTC TCTGCCTTCT GGGAGGTGCC CTACTTTGCT GTTCCTGTCC 780 

CCGAAAAACA ACCTCTTACC CAACACCAAG GCCCTATCCA AAACCTGCAC CTTCCAGCGG 840 

GAAAGACTAC GTGTGACACA GAGGCAAAAG GAGAAAATCA TGTTGAAACA AACCGAAAAT 900 

GGACATTGAG ATACTATCAT TAACATTAGG ACCTTAGAAT TTTGGGTATT GTAATCTGAA 960 

GTATGGTATT ACAAAACAAA CAAACAAACA AAAAACCCAT GTGTTAAAAT ACTCAGTGCT 1020 

AAACATGGCT TAATCTTATT TTATCTTCTT TCCTCAATAT AGGAGGGAAG ATTTTACCAT 1080 

TTGTATTACT GCTTCCCATT GAGTAATCAT ACTCAAATGG GGGAAGGGGT GCTCCTTAAA 1140 

TATATATAGA TATGTATATA TACATGTTTT TCTATTAAAA ATAGACAGTA AAATACTATT 1200 

CTCATTATGT TGATACTAGC ATACTTAAAA TATCTCTAAA ATAGGTAAAT GTATTTAATT 1260 

CCATATTGAT GAAGATGTTT ATTGGTATAT TTTCTTTTTC GTCCTTATAT ACATATGTAA 1320 

CAGTCAAATA TCATTTACTC TTCTTCATTA GCTTTGGGTG CCTTTGCCAC AAGACCTAGC 1380 

CTAATTTACC AAGGATGAAT TCTTTCAATT CTTCATGCGT GCCCTTTTCA TATACTTATT 1440 

TTATTTTTTA CCATAATCTT ATAGCACTTG CATCGTTATT AAGCCCTTAT TTGTTTTGTG 1500 

TTTCATTGGT CTCTATCTCC TGAATCTAAC ACATTTCATA GCCTACATTT TAGTTTCTAA 1560 

AGCCAAGAAG AATTTATTAC AAATCAGAAC TTTGGAGGCA AATCTTTCTG CATGACCAAA 1620 

GTGATAAATT CCTGTTGACC TTCCCACACA ATCCCTGTAC TCTGACCCAT AGCACTCTTG 1680 

TTTGCTTTGA AAATATTTGT CCAATTGAGT AGCTGCATGC TGTTCCCCCA GGTGTTGTAA 1740 

CACAACTTTA TTGATTGAAT TTTTAAGCTA CTTATTCATA GTTTTATATC CCCCTAAACT 1800 

ACCTTTTTGT TCCCCATTCC TTAATTGTAT TGTTTTCCCA AGTGTAATTA TCATGCGTTT 1860 

TATATCTTCC TAATAAGGTG TGGTCTGTTT GTCTGAACAA AGTGCTAGAC TTTCTGGAGT 1920 

GATAATCTGG TGACAAATAT TCTCTCTGTA GCTGTAAGCA AGTCACTTAA TCTTTCTACC 1980 

TCTTTTTTCT ATCTGCCAAA TTGAGATAAT GATACTTAAC CAGTTAGAAG AGGTAGTGTG 2040 

AATATTAATT AGTTTATATT ACTCTCATTC TTTGAACATG AACTATGCCT ATGTAGTGTC 2100 

TTTATTTGCT CAGCTGGCTG AGACACTGAA GAAGTCACTG AACAAAACCT ACACACGTAC 2160 

CTTCATGTGA TTCACTGCCT TCCTCTCTCT ACCAGTCTAT TTCCACTGAA CAAAACCTAC 2220 

ACACATACCT TCATGTGGTT CAGTGCCTTC CTCTCTCTAC CAGTCTATTT CCACTGAACA 2280 

AAACCTACGC ACATACCTTC ATGTGGCTCA GTGCCTTCCT CTCTCTACCA GTCTATTTCC 2340 

ATTCTTTCAG CTGTGTCTGA CATGTTTGTG CTCTGTTCCA TTTTAACAAC TGCTCTTACT 2400 

TTTCCAGTCT GTACAGAATG CTATTTCACT TGAGCAAGAT GATGTATGGA AAGGGTGTTG 2460 

GCACTGGTGT CTGGAGACCT GGATTTGAGT CTTGGTGCTA TCAATCACCG TCTGTGTTTG 2520 

AGCAAGGCAT TTGGCTGCTG TAAGCTTATT GCTTCATCTG TAAGCGGTGG TTTGTAATTC 2580 

CTGATCTTCC CACCTCACAG TGATGTTGTG GGGATCCAGT GAGATAGAAT ACATGTAAGT 2640 

GTGGTTTTGT AATTTGAAAA GTGCTATACT AAGGGAAAGA ATTGAGGAAT TAACTGCATA 2700 

CGTTTTGGTG TTGCTTTTCA AATGTTTGAA AATAAAAAAA TGTTAAGAAA TGGGTTTCTT 2760 

GCCTTAACCA GTCTCTCAAG TGATGAGACA GTGAAGTAAA ATTGAGTGCA CTAAACGAAT 2820 

AAGATTCTGA GGAAGTCTTA TCTTCTGCAG TGAGTATGGC CCAATGCTTT CTGTGGCTAA 2880 

ACAGATGTAA TGGGAAGAAA TAAAAGCCTA CGTGTTGGTA AATCCAACAG CAAGGGAGAT 2940 

TTTTGAATCA TAATAACTCA TAAGGTGCTA TCTGTTCAGT GATGCCCTCA GAGCTCTTGC 3000 

TGTTAGCTGG CAGCTGACGC TGCTAGGATA GTTAGTTTGG AAATGGTACT TCATAATAAA 3060 

CTACACAAGG AAAGTCAGCC ACCGTGTCTT ATGAGGAATT GGACCTAATA AATTTTAGTG 3120 

TGCCTTCCAA ACCTGAGAAT ATATGCTTTT GGAAGTTAAA ATTTAAATGG CTTTTGCCAC 3180 

ATACATAGAT CTTCATGATG TGTGAGTGTA ATTCCATGTG GATATCAGTT ACCAAACATT 3240 

ACAAAAAAAT TTTATGGCCC AAAATGACCA ACGAAATTGT TACAATAGAA TTTATCCAAT 3300 

TTTGATCTTT TTATATTCTT CTACCACACC TGGAAACAGA CCAATAGACA TTTTGGGGTT 3360 

TTATAATGGG AATTTGTATA AAGCATTACT CTTTTTCAAT AAATTGTTTT TTAATTTAAA 3420 
AAAAGGAAAA AAAAAAAAAA AAA 

Seq ID NO: 593 Protein sequence 
Protein Accession #: AAD16433.1 

1 11 21 31 41 51 

| | | | | | 

MANAGLQLLG FILAFLGWIG AIVSTALPQW RIYSYAGDNI VTAQAMYEGL WMSCVSQSTG 60 

QIQCKVFDSL LNLSSTLQAT RALMVVGILL GVIAIFVATV GMKCMKCLED DEVQKMRMAV 120 

IGGAIFLLAG LAILVATAWY GNRIVQEFYD PMTPVNARYE FGQALFTGWA AASLCLLGGA 180 
LLCCSCPRKT TSYPTPRPYP KPAPSSGKDY V 

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Seq ID NO: 594 DNA sequence 

Nucleic Acid Accession #: NM_006180.1 

Coding sequence: 352.. 2820 

1 11 21 31 41 51 

| | | | | | 

CCCCCATTCG CATCTAACAA GGAATCTGCG CCCCAGAGAG TCCCGGACGC CGCCGGTCGG 60 

TGCCCGGCGC GCCGGGCCAT GCAGCGACGG CCGCCGCGGA GCTCCGAGCA GCGGTAGCGC 120 

CCCCCTGTAA AGCGGTTCGC TATGCCGGGA CCACTGTGAA CCCTGCCGCC TGCCGGAACA 180 

CTCTTCGCTC CGGACCAGCT CAGCCTCTGA TAAGCTGGAC TCGGCACGCC CGCAACAAGC 240 

ACCGAGGAGT TAAGAGAGCC GCAAGCGCAG GGAAGGCCTC CCCGCACGGG TGGGGGAAAG 300 

CGGCCGGTGC AGCGCGGGGA CAGGCACTCG GGCTGGCACT GGCTGCTAGG GATGTCGTCC 360 

TGGATAAGGT GGCATGGACC CGCCATGGCG CGGCTCTGGG GCTTCTGCTG GCTGGTTGTG 420 

GGCTTCTGGA GGGCCGCTTT CGCCTGTCCC ACGTCCTGCA AATGCAGTGC CTCTCGGATC 480 

TGGTGCAGCG ACCCTTCTCC TGGCATCGTG GCATTTCCGA GATTGGAGCC TAACAGTGTA 540 

GATCCTGAGA ACATCACCGA AATTTTCATC GCAAACCAGA AAAGGTTAGA AATCATCAAC 600 

GAAGATGATG TTGAAGCTTA TGTGGGACTG AGAAATCTGA CAATTGTGGA TTCTGGATTA 660 

AAATTTGTGG CTCATAAAGC ATTTCTGAAA AACAGCAACC TGCAGCACAT CAATTTTACC 720 

CGAAACAAAC TGACGAGTTT GTCTAGGAAA CATTTCCGTC ACCTTGACTT GTCTGAACTG 780 

ATCCTGGTGG GCAATCCATT TACATGCTCC TGTGACATTA TGTGGATCAA GACTCTCCAA 840 

GAGGCTAAAT CCAGTCCAGA CACTCAGGAT TTGTACTGCC TGAATGAAAG CAGCAAGAAT 900 

ATTCCCCTGG CAAACCTGCA GATACCCAAT TGTGGTTTGC CATCTGCAAA TCTGGCCGCA 960 

CCTAACCTCA CTGTGGAGGA AGGAAAGTCT ATCACATTAT CCTGTAGTGT GGCAGGTGAT 1020 

CCGGTTCCTA ATATGTATTG GGATGTTGGT AACCTGGTTT CCAAACATAT GAATGAAACA 1080 

AGCCACACAC AGGGCTCCTT AAGGATAACT AACATTTCAT CCGATGACAG TGGGAAGCAG 1140 

ATCTCTTGTG TGGCGGAAAA TCTTGTAGGA GAAGATCAAG ATTCTGTCAA CCTCACTGTG 1200 

CATTTTGCAC CAACTATCAC ATTTCTCGAA TCTCCAACCT CAGACCACCA CTGGTGCATT 1260 

CCATTCACTG TGAAAGGCAA CCCCAAACCA GCGCTTCAGT GGTTCTATAA CGGGGCAATA 1320 

TTGAATGAGT CCAAATACAT CTGTACTAAA ATACATGTTA CCAATCACAC GGAGTACCAC 1380 

GGCTGCCTCC AGCTGGATAA TCCCACTCAC ATGAACAATG GGGACTACAC TCTAATAGCC 1440 

AAGAATGAGT ATGGGAAGGA TGAGAAACAG ATTTCTGCTC ACTTCATGGG CTGGCCTGGA 1500 

ATTGACGATG GTGCAAACCC AAATTATCCT GATGTAATTT ATGAAGATTA TGGAACTGCA 1560 

GCGAATGACA TCGGGGACAC CACGAACAGA AGTAATGAAA TCCCTTCCAC AGACGTCACT 1620 

GATAAAACCG GTCGGGAACA TCTCTCGGTC TATGCTGTGG TGGTGATTGC GTCTGTGGTG 1680 

GGATTTTGCC TTTTGGTAAT GCTGTTTCTG CTTAAGTTGG CAAGACACTC CAAGTTTGGC 1740 

ATGAAAGGCC CAGCCTCCGT TATCAGCAAT GATGATGACT CTGCCAGCCC ACTCCATCAC 1800 

ATCTCCAATG GGAGTAACAC TCCATCTTCT TCGGAAGGTG GCCCAGATGC TGTCATTATT 1860 

GGAATGACCA AGATCCCTGT CATTGAAAAT CCCCAGTACT TTGGCATCAC CAACAGTCAG 1920 

CTCAAGCCAG ACACATTTGT TCAGCACATC AAGCGACATA ACATTGTTCT GAAAAGGGAG 1980 

CTAGGCGAAG GAGCCTTTGG AAAAGTGTTC CTAGCTGAAT GCTATAACCT CTGTCCTGAG 2040 

CAGGACAAGA TCTTGGTGGC AGTGAAGACC CTGAAGGATG CCAGTGACAA TGCACGCAAG 2100 

GACTTCCACC GTGAGGCCGA GCTCCTGACC AACCTCCAGC ATGAGCACAT CGTCAAGTTC 2160 

TATGGCGTCT GCGTGGAGGG CGACCCCCTC ATCATGGTCT TTGAGTACAT GAAGCATGGG 2220 

GACCTCAACA AGTTCCTCAG GGCACACGGC CCTGATGCCG TGCTGATGGC TGAGGGCAAC 2280 

CCGCCCACGG AACTGACGCA GTCGCAGATG CTGCATATAG CCCAGCAGAT CGCCGCGGGC 2340 

ATGGTCTACC TGGCGTCCCA GCACTTCGTG CACCGCGATT TGGCCACCAG GAACTGCCTG 2400 

GTCGGGGAGA ACTTGCTGGT GAAAATCGGG GACTTTGGGA TGTCCCGGGA CGTGTACAGC 2460 

ACTGACTACT ACAGGGTCGG TGGCCACACA ATGCTGCCCA TTCGCTGGAT GCCTCCAGAG 2520 

AGCATCATGT ACAGGAAATT CACGACGGAA AGCGACGTCT GGAGCCTGGG GGTCGTGTTG 2580 

TGGGAGATTT TCACCTATGG CAAACAGCCC TGGTACCAGC TGTCAAACAA TGAGGTGATA 2640 

GAGTGTATCA CTCAGGGCCG AGTCCTGCAG CGACCCCGCA CGTGCCCCCA GGAGGTGTAT 2700 

GAGCTGATGC TGGGGTGCTG GCAGCGAGAG CCCCACATGA GGAAGAACAT CAAGGGCATC 2760 

CATACCCTCC TTCAGAACTT GGCCAAGGCA TCTCCGGTCT ACCTGGACAT TCTAGGCTAG 2820 

GGCCCTTTTC CCCAGACCGA TCCTTCCCAA CGTACTCCTC AGACGGGCTG AGAGGATGAA 2880 

CATCTTTTAA CTGCCGCTGG AGGCCACCAA GCTGCTCTCC TTCACTCTGA CAGTATTAAC 2940 

ATCAAAGACT CCGAGAAGCT CTCGAGGGAA GCAGTGTGTA CTTCTTCATC CATAGACACA 3000 

GTATTGACTT CTTTTTGGCA TTATCTCTTT CTCTCTTTCC ATCTCCCTTG GTTGTTCCTT 3060 

TTTCTTTTTT TAAATTTTCT TTTTCTTCTT TTTTTTCGTC TTCCCTGCTT CACGATTCTT 3120 

ACCCTTTCTT TTGAATCAAT CTGGCTTCTG CATTACTATT AACTCTGCAT AGACAAAGGC 3180 

CTTAACAAAC GTAATTTGTT ATATCAGCAG ACACTCCAGT TTGCCCACCA CAACTAACAA 3240 

TGCCTTGTTG TATTCCTGCC TTTGATGTGG ATGAAAAAAA GGGAAAACAA ATATTTCACT 3300 

TAAACTTTGT CACTTCTGCT GTACAGATAT CGAGAGTTTC TATGGATTCA CTTCTATTTA 3360 

TTTATTATTA TTACTGTTCT TATTGTTTTT GGATGGCTTA AGCCTGTGTA TAAAAAAGAA 3420 

AACTTGTGTT CAATCTGTGA AGCCTTTATC TATGGGAGAT TAAAACCAGA GAGAAAGAAG 3480 

ATTTATTATG AACCGCAATA TGGGAGGAAC AAAGACAACC ACTGGGATCA GCTGGTGTCA 3540 

GTCCCTACTT AGGAAATACT CAGCAACTGT TAGCTGGGAA GAATGTATTC GGCACCTTCC 3600 

CCTGAGGACC TTTCTGAGGA GTAAAAAGAC TACTGGCCTC TGTGCCATGG ATGATTCTTT 3660 
TCCCATCACC AGAAATGATA GCGTGCAGTA GAGAGCAAAG ATGGCTT 

Seq ID NO: 595 Protein sequence 
Protein Accession #: NP_006171.1 

1 11 21 31 41 51 

| | | | | | 

MSSWIRWHGP AMARLWGFCW LVVGFWRAAF ACPTSCKCSA SRIWCSDPSP GIVAFPRLEP 60 

NSVDPENITE IFIANQKRLE IINEDDVEAY VGLRNLTIVD SGLKFVAHKA FLKNSNLQHI 120 

NFTRNKLTSL SRKHFRHLDL SELILVGNPF TCSCDIMWIK TLQEAKSSPD TQDLYCLNES 180 

SKNIPLANLQ IPNCGLPSAN LAAPNLTVEE GKSITLSCSV AGDPVPNMYW DVGNLVSKHM 240 

NETSHTQGSL RITNISSDDS GKQISCVAEN LVGEDQDSVN LTVHFAPTIT FLESPTSDHH 300 

WCIPFTVKGN PKPALQWFYN GAILNESKYI CTKIHVTNHT EYHGCLQLDN PTHMNNGDYT 360 

LIAKNEYGKD EKQISAHFMG WPGIDDGANP NYPDVIYEDY GTAANDIGDT TNRSNEIPST 420 

DVTDKTGREH LSVYAVVVIA SVVGFCLLVM LFLLKLARHS KFGMKGPASV ISNDDDSASP 480 

LHHISNGSNT PSSSEGGPDA VIIGMTKIPV IENPQYFGIT NSQLKPDTFV QHIKRHNIVL 540 

KRELGEGAFG KVFLAECYNL CPEQDKILVA VKTLKDASDN ARKDFHREAE LLTNLQHEHI 600 

VKFYGVCVEG DPLIMVFEYM KHGDLNKFLR AHGPDAVLMA EGNPPTELTQ SQMLHIAQQI 660 

AAGMVYLASQ HFVHRDLATR NCLVGENLLV KIGDFGMSRD VYSTDYYRVG GHTMLPIRWM 720 

PPESIMYRKF TTESDVWSLG VVLWEIFTYG KQPWYQLSNN EVIECITQGR VLQRPRTCPQ 780 
EVYELMLGCW QREPHMRKNI KGIHTLLQNL AKASPVYLDI LG 

Seq ID NO: 596 DNA sequence 

Nucleic Acid Accession #: AF410899 
Coding sequence: 483..2999 

1 11 21 31 41 51 

| | | | | | 

GGGAGCAGGA GCCTCGCTGG CTGCTTCGCT CGCGCTCTAC GCGCTCAGTC CCCGGCGGTA 60 

GCAGGAGCCT GGACCCAGGC GCCGGCGGCG GGCGTGAGGC GCCGGAGCCC GGCCTCGAGG 120 

TGCATACCGG ACCCCCATTC GCATCTAACA AGGAATCTGC GCCCCAGAGA GTCCCGGACG 180 

CCGCCGGTCG GTGCCCGGCG CGCCGGGCCA TGCAGCGACG GCCGCCGCGG AGCTCCGAGC 240 

AGCGGTAGCG CCCCCCTGTA AAGCGGTTCG CTATGCCGGG ACCACTGTGA ACCCTGCCGC 300 

CTGCCGGAAC ACTCTTCGCT CCGGACCAGC TCAGCCTCTG ATAAGCTGGA CTCGGCACGC 360 

CCGCAACAAG CACCGAGGAG TTAAGAGAGC CGCAAGCGCA GGGAAGGCCT CCCCGCACGG 420 

GTGGGGGAAA GCGGCCGGTG CAGCGCGGGG ACAGGCACTC GGGCTGGCAC TGGCTGCTAG 480 

GGATGTCGTC CTGGATAAGG TGGCATGGAC CCGCCATGGC GCGGCTCTGG GGCTTCTGCT 540 

GGCTGGTTGT GGGCTTCTGG AGGGCCGCTT TCGCCTGTCC CACGTCCTGC AAATGCAGTG 600 

CCTCTCGGAT CTGGTGCAGC GACCCTTCTC CTGGCATCGT GGCATTTCCG AGATTGGAGC 660 

CTAACAGTGT AGATCCTGAG AACATCACCG AAATTTTCAT CGCAAACCAG AAAAGGTTAG 720 

AAATCATCAA CGAAGATGAT GTTGAAGCTT ATGTGGGACT GAGAAATCTG ACAATTGTGG 780 

ATTCTGGATT AAAATTTGTG GCTCATAAAG CATTTCTGAA AAACAGCAAC CTGCAGCACA 840 

TCAATTTTAC CCGAAACAAA CTGACGAGTT TGTCTAGGAA ACATTTCCGT CACCTTGACT 900 

TGTCTGAACT GATCCTGGTG GGCAATCCAT TTACATGCTC CTGTGACATT ATGTGGATCA 960 

AGACTCTCCA AGAGGCTAAA TCCAGTCCAG ACACTCAGGA TTTGTACTGC CTGAATGAAA 1020 

GCAGCAAGAA TATTCCCCTG GCAAACCTGC AGATACCCAA TTGTGGTTTG CCATCTGCAA 1080 

ATCTGGCCGC ACCTAACCTC ACTGTGGAGG AAGGAAAGTC TATCACATTA TCCTGTAGTG 1140 

TGGCAGGTGA TCCGGTTCCT AATATGTATT GGGATGTTGG TAACCTGGTT TCCAAACATA 1200 

TGAATGAAAC AAGCCACACA CAGGGCTCCT TAAGGATAAC TAACATTTCA TCCGATGACA 1260 

GTGGGAAGCA GATCTCTTGT GTGGCGGAAA ATCTTGTAGG AGAAGATCAA GATTCTGTCA 1320 

ACCTCACTGT GCATTTTGCA CCAACTATCA CATTTCTCGA ATCTCCAACC TCAGACCACC 1380 

ACTGGTGCAT TCCATTCACT GTGAAAGGCA ACCCCAAACC AGCGCTTCAG TGGTTCTATA 1440 

ACGGGGCAAT ATTGAATGAG TCCAAATACA TCTGTACTAA AATACATGTT ACCAATCACA 1500 

CGGAGTACCA CGGCTGCCTC CAGCTGGATA ATCCCACTCA CATGAACAAT GGGGACTACA 1560 

CTCTAATAGC CAAGAATGAG TATGGGAAGG ATGAGAAACA GATTTCTGCT CACTTCATGG 1620 

GCTGGCCTGG AATTGACGAT GGTGCAAACC CAAATTATCC TGATGTAATT TATGAAGATT 1680 

ATGGAACTGC AGCGAATGAC ATCGGGGACA CCACGAACAG AAGTAATGAA ATCCCTTCCA 1740 

CAGACGTCAC TGATAAAACC GGTCGGGAAC ATCTCTCGGT CTATGCTGTG GTGGTGATTG 1800 

CGTCTGTGGT GGGATTTTGC CTTTTGGTAA TGCTGTTTCT GCTTAAGTTG GCAAGACACT 1860 

CCAAGTTTGG CATGAAAGAT TTCTCATGGT TTGGATTTGG GAAAGTAAAA TCAAGACAAG 1920 

GTGTTGGCCC AGCCTCCGTT ATCAGCAATG ATGATGACTC TGCCAGCCCA CTCCATCACA 1980 

TCTCCAATGG GAGTAACACT CCATCTTCTT CGGAAGGTGG CCCAGATGCT GTCATTATTG 2040 

GAATGACCAA GATCCCTGTC ATTGAAAATC CCCAGTACTT TGGCATCACC AACAGTCAGC 2100 

TCAAGCCAGA CACATTTGTT CAGCACATCA AGCGACATAA CATTGTTCTG AAAAGGGAGC 2160 

TAGGCGAAGG AGCCTTTGGA AAAGTGTTCC TAGCTGAATG CTATAACCTC TGTCCTGAGC 2220 

AGGACAAGAT CTTGGTGGCA GTGAAGACCC TGAAGGATGC CAGTGACAAT GCACGCAAGG 2280 

ACTTCCACCG TGAGGCCGAG CTCCTGACCA ACCTCCAGCA TGAGCACATC GTCAAGTTCT 2340 

ATGGCGTCTG CGTGGAGGGC GACCCCCTCA TCATGGTCTT TGAGTACATG AAGCATGGGG 2400 

ACCTCAACAA GTTCCTCAGG GCACACGGCC CTGATGCCGT GCTGATGGCT GAGGGCAACC 2460 

CGCCCACGGA ACTGACGCAG TCGCAGATGC TGCATATAGC CCAGCAGATC GCCGCGGGCA 2520 

TGGTCTACCT GGCGTCCCAG CACTTCGTGC ACCGCGATTT GGCCACCAGG AACTGCCTGG 2580 

TCGGGGAGAA CTTGCTGGTG AAAATCGGGG ACTTTGGGAT GTCCCGGGAC GTGTACAGCA 2640 

CTGACTACTA CAGGGTCGGT GGCCACACAA TGCTGCCCAT TCGCTGGATG CCTCCAGAGA 2700 

GCATCATGTA CAGGAAATTC ACGACGGAAA GCGACGTCTG GAGCCTGGGG GTCGTGTTGT 2760 

GGGAGATTTT CACCTATGGC AAACAGCCCT GGTACCAGCT GTCAAACAAT GAGGTGATAG 2820 

AGTGTATCAC TCAGGGCCGA GTCCTGCAGC GACCCCGCAC GTGCCCCCAG GAGGTGTATG 2880 

AGCTGATGCT GGGGTGCTGG CAGCGAGAGC CCCACATGAG GAAGAACATC AAGGGCATCC 2940 

ATACCCTCCT TCAGAACTTG GCCAAGGCAT CTCCGGTCTA CCTGGACATT CTAGGCTAGG 3000 

GCCCTTTTCC CCAGACCGAT CCTTCCCAAC GTACTCCTCA GACGGGCTGA GAGGATGAAC 3060 

ATCTTTTAAC TGCCGCTGGA GGCCACCAAG CTGCTCTCCT TCACTCTGAC AGTATTAACA 3120 

TCAAAGACTC CGAGAAGCTC TCGAGGGAAG CAGTGTGTAC TTCTTCATCC ATAGACACAG 3180 

TATTGACTTC TTTTTGGCAT TATCTCTTTC TCTCTTTCCA TCTCCCTTGG TTGTTCCTTT 3240 

TTCTTTTTTT AAATTTTCTT TTTCTTCTTT TTTTTCGTCT TCCCTGCTTC ACGATTCTTA 3300 

CCCTTTCTTT TGAATCAATC TGGCTTCTGC ATTACTATTA ACTCTGCATA GACAAAGGCC 3360 

TTAACAAACG TAATTTGTTA TATCAGCAGA CACTCCAGTT TGCCCACCAC AACTAACAAT 3420 

GCCTTGTTGT ATTCCTGCCT TTGATGTGGA TGAAAAAAAG GGAAAACAAA TATTTCACTT 3480 

AAACTTTGTC ACTTCTGCTG TACAGATATC GAGAGTTTCT ATGGATTCAC TTCTATTTAT 3540 

TTATTATTAT TACTGTTCTT ATTGTTTTTG GATGGCTTAA GCCTGTGTAT AAAAAAGAAA 3600 

ACTTGTGTTC AATCTGTGAA GCCTTTATCT ATGGGAGATT AAAACCAGAG AGAAAGAAGA 3660 

TTTATTATGA ACCGCAATAT GGGAGGAACA AAGACAACCA CTGGGATCAG CTGGTGTCAG 3720 

TCCCTACTTA GGAAATACTC AGCAACTGTT AGCTGGGAAG AATGTATTCG GCACCTTCCC 3780 

CTGAGGACCT TTCTGAGGAG TAAAAAGACT ACTGGCCTCT GTGCCATGGA TGATTCTTTT 3840 

CCCATCACCA GAAATGATAG CGTGCAGTAG AGAGCAAAGA TGGCTTCCGT GAGACACAAG 3900 

ATGGCGCATA GTGTGCTCGG ACACAGTTTT GTCTTCGTAG GTTGTGATGA TAGCACTGGT 3960 

TTGTTTCTCA AGCGCTATCC ACAGAACCTT TGTCAACTTC AGTTGAAAAG AGGTGGATTC 4020 
ATGTCCAGAG CTCATTTCGG GGTCAGGTGG GAAAGCC 

Seq ID NO: 597 Protein sequence 
Protein Accession #: AAL67965.1 

1 11 21 31 41 51 

| | | | | | 

MSSWIRWHGP AMARLWGFCW LVVGFWRAAF ACPTSCKCSA SRIWCSDPSP GIVAFPRLEP 60 

NSVDPENITE IFIANQKRLE IINEDDVEAY VGLRNLTIVD SGLKFVAHKA FLKNSNLQHI 120 

NFTRNKLTSL SRKHFRHLDL SELILVGNPF TCSCDIMWIK TLQEAKSSPD TQDLYCLNES 180 

SKNIPLANLQ IPNCGLPSAN LAAPNLTVEE GKSITLSCSV AGDPVPNMYW DVGNLVSKHM 240 

NETSHTQGSL RITNISSDDS GKQISCVAEN LVGEDQDSVN LTVHFAPTIT FLESPTSDHH 300 

WCIPFTVKGN PKPALQWFYN GAILNESKYI CTKIHVTNHT EYHGCLQLDN PTHMNNGDYT 360 

LIAKNEYGKD EKQISAHFMG WPGIDDGANP NYPDVIYEDY GTAANDIGDT TNRSNEIPST 420 

DVTDKTGREH LSVYAVVVIA SVVGFCLLVM LFLLKLARHS KFGMKDFSWF GFGKVKSRQG 480 

VGPASVISND DDSASPLHHI SNGSNTPSSS EGGPDAVIIG MTKIPVIENP QYFGITNSQL 540 

KPDTFVQHIK RHNIVLKREL GEGAFGKVFL AECYNLCPEQ DKILVAVKTL KDASDNARKD 600 

FHREAELLTN LQHEHIVKFY GVCVEGDPLI MVFEYMKHGD LNKFLRAHGP DAVLMAEGNP 660 

PTELTQSQML HIAQQIAAGM VYLASQHFVH RDLATRNCLV GENLLVKIGD FGMSRDVYST 720 

DYYRVGGHTM LPIRWMPPES IMYRKFTTES DVWSLGVVLW EIFTYGKQPW YQLSNNEVIE 780 
CITQGRVLQR PRTCPQEVYE LMLGCWQREP HMRKNIKGIH TLLQNLAKAS PVYLDILG 

Seq ID NO: 598 DNA sequence 
Nucleic Acid Accession #: AB052906 
Coding sequence: 74..814 

1 11 21 31 41 51 

| | | | | | 

AAAACCTTGA GGTGATTCAT CTTCCAGGCT CTCCTTCCAT CAAGTCTCTC CTCCCTAGCG 60
CTCTGGGTCC TTAATGGCAG CAGCCGCCGC TACCAAGATC CTTCTGTGCC TCCCGCTTCT 120
GCTCCTGCTG TCCGGCTGGT CCCGGGCTGG GCGAGCCGAC CCTCACTCTC TTTGCTATGA 180
CATCACCGTC ATCCCTAAGT TCAGACCTGG ACCACGGTGG TGTGCGGTTC AAGGCCAGGT 240
GGATGAAAAG ACTTTTCTTC ACTATGACTG TGGCAACAAG ACAGTCACAC CTGTCAGTCC 300
CCTGGGGAAG AAACTAAATG TCACAACGGC CTGGAAAGCA CAGAACCCAG TACTGAGAGA 360
GGTGGTGGAC ATACTTACAG AGCAACTGCG TGACATTCAG CTGGAGAATT ACACACCCAA 420
GGAACCCCTC ACCCTGCAGG CCAGGATGTC TTGTGAGCAG AAAGCTGAAG GACACAGCAG 480
TGGATCTTGG CAGTTCAGTT TCGATGGGCA GATCTTCCTC CTCTTTGACT CAGAGAAGAG 540
AATGTGGACA ACGGTTCATC CTGGAGCCAG AAAGATGAAA GAAAAGTGGG AGAATGACAA 600
GGTTGTGGCC ATGTCCTTCC ATTACTTCTC AATGGGAGAC TGTATAGGAT GGCTTGAGGA 660
CTTCTTGATG GGCATGGACA GCACCCTGGA GCCAAGTGCA GGAGCACCAC TCGCCATGTC 720
CTCAGGCACA ACCCAACTCA GGGCCACAGC CACCACCCTC ATCCTTTGCT GCCTCCTCAT 780
CATCCTCCCC TGCTTCATCC TCCCTGGCAT CTGAGGAGAG TCCTTTAGAG TGACAGGTTA 840
AAGCTGATAC CAAAAGGCTC CTGTGAGCAC GGTCTTGATC AAACTCGCCC TTCTGTCTGG 900
CCAGCTGCCC ACGACCTACG GTGTATGTCC AGTGGCCTCC AGCAGATCAT GATGACATCA 960
TGGACCCAAT AGCTCATTCA CTGCCTTGAT TCCTTTTGCC AACAATTTTA CCAGCAGTTA 1020
TACCTAACAT ATTATGCAAT TTTCTCTTGG TGCTACCTGA TGGAATTCCT GCACTTAAAG 1080
TTCTGGCTGA CTAAACAAGA TATATCATTT TCTTTCTTCT CTTTTTGTTT GGAAAATCAA 1140
GTACTTCTTT GAATGATGAT CTCTTTCTTG CAAATGATAT TGTCAGTAAA ATAATCACGT 1200
TAGACTTCAG ACCTCTGGGG ATTCTTTOOG TGTCCTGAAA GAGAATTTTT AAATTATTTA 1260
ATAAGAAAAA ATTTATATTA ATGATTGTTT CCTTTAGTAA TTTATTGTTC TGTACTGATA 1320
TTTAAATAAA GAGTTCTATT TCCCAAAAAA AAAAAAAAAA AA



Seq ID NO: 599 Protein sequence 
Protein Accession #: BAB61048.1 

1 11 21 31 41 51 

| | | | | | 

MAAAAATKIL LCLPLLLLLS GWSRAGRADP HSLCYDITVI PKFRPGPRWC AVQGQVDEKT 60
FLHYDCGNKT VTPVSPLGKK LNVTTAWKAQ NPVLREVVDI LTEQLRDIQL ENYTPKEPLT 120
LQARMSCEQK AEGHSSGSWQ FSFDGQIFLL FDSEKRMWTT VHPGARKMKE KWENDKVVAM 180
SFHYFSMGDC IGWLEDFLMG MDSTLEPSAG APLAMSSGTT QLRATATTLI LCCLLIILPC 240
FILPGI 

Seq ID NO: 600 DNA sequence 

Nucleic Acid Accession #: NM_001898.1 

Coding sequence: 57..482 

1 11 21 31 41 51 

| | | | | | 

GGCTCTCACC CTCCTCTCCT GCAGCTCCAG CTTTGTGCTC TGCCTCTGAG GAGACCATGG 60
CCCAGTATCT GAGTACCCTG CTGCTCCTGC TGGCCACCCT AGCTGTGGCC CTGGCCTGGA 120
GCCCCAAGGA GGAGGATAGG ATAATCCCGG GTGGCATCTA TAACGCAGAC CTCAATGATG 180
AGTGGGTACA GCGTGCCCTT CACTTCGCCA TCAGCGAGTA TAACAAGGCC ACCAAAGATG 240
ACTACTACAG ACGTCCGCTG CGGGTACTAA GAGCCAGGCA ACAGACCGTT GGGGGGGTGA 300
ATTACTTCTT CGACGTAGAG GTGGGCCGCA CCATATGTAC CAAGTCCCAG CCCAACTTGG 360
ACACCTGTGC CTTCCATGAA CAGCCAGAAC TGCAGAAGAA ACAGTTGTGC TCTTTCGAGA 420
TCTACGAAGT TCCCTGGGAG AACAGAAGGT CCCTGGTGAA ATCCAGGTGT CAAGAATCCT 480
AGGGATCTGT GCCAGGCCAT TCGCACCAGC CACCACCCAC TCCCACCCCC TGTAGTGCTC 540
CCACCCCTGG ACTGGTGGCC CCCACCCTGC GGGAGGCCTC CCCATGTGCC TGCGCCAAGA 600
GACAGACAGA GAAGGCTGCA GGAGTCCTTT GTTGCTCAGC AGGGCGCTCT GCCCTCCCTC 660
CTTCCTTCTT GCTTCTAATA GCCCTGGTAC ATGGTACACA CCCCCCCACC TCCTGCAATT 720
AAACAGTAGC ATCGCC

Seq ID NO: 601 Protein sequence 
Protein Accession #: NP_001889.1 

1 11 21 31 41 51 

| | | | | | 

MAQYLSTLLL LLATLAVALA WSPKEEDRII PGGIYNADLN DEWVQRALHF AISEYNKATK 60
DDYYRRPLRV LRARQQTVGG VNYFFDVEVG RTICTKSQPN LDTCAFHEQP ELQKKQLCSF 120
EIYEVPWENR RSLVKSRCQE S 

Seq ID NO: 602 DNA sequence 

Nucleic Acid Accession #: NM_003976.2 

Coding sequence: 299..961 

1 11 21 31 41 51 

| | | | | | 

CTCTGAGCTT CTCTGAGCCT TGTTTGCTCA TCTGGAAAAA GGGGATTAAA CCATTTACCT 60 

CATGGAGTTG TGAAAQAATA GCTGCAAAGC ACCTAACACA TAGTAAGGTT CCCAGTGCAG 120 

CTACTTCTGC TGGGTTGAGT CTAGCTGTGT AGGCCCCTTG TTCCTCACCT GGAGAAACTG 180 

GGGTGGCAGG CCGGTCCCCC ACAAAAGATA ACTCATCTCT TAATTTGCAA GCTGCCTCAA 240 

CAGGAGGGTG GGGGAACAGC TCAACAATGG CTGATGGGCG CTCCTGGTGT TGATAGAGAT 300 

GGAACTTGGA CTTGGAGGCC TCTCCACGCT GTCCCACTGC CCCTGGCCTA GGCGGCAGCC 360 

TGCCCTGTGG CCCACCCTGG CCGCTCTGGC TCTGCTGAGC AGCGTCGCAG AGGCCTCCCT 420 

GGGCTCCGCG CCCCGCAGCC CTGCCCCCCG CGAAGGCCCC CCGCCTGTCC TGGCGTCCCC 480 

CGCCGGCCAC CTGCCGGGGG GACGCACGGC CCGCTGGTGC AGTGGAAGAG CCCGGCGGCC 540 

GCCGCCGCAG CCTTCTCGGC CCGCGCCCCC GCCGCCTGCA CCCCCATCTG CTCTTCCCCG 600 

CGGGGGCCGC GCGGCGCGGG CTGGGGGCCC GGGCAGCCGC GCTCGGGCAG CGGGGGCGCG 660 

GGGCTGCCGC CTGCGCTCGC AGCTGGTGCC GGTGCGCGCG CTCGGCCTGG GCCACCGCTC 720 

CGACGAGCTG GTGCGTTTCC GCTTCTGCAG CGGCTCCTGC CGCCGCGCGC GCTCTCCACA 780 

CGACCTCAGC CTGGCCAGCC TACTGGGCGC CGGGGCCCTG CGACCGCCCC CGGGCTCCCG 840 

GCCCGTCAGC CAGCCCTGCT GCCGACCCAC GCGCTACGAA GCGGTCTCCT TCATGGACGT 900 

CAACAGCACC TGGAGAACCG TGGACCGCCT CTCCGCCACC GCCTGCGGCT GCCTGGGCTG 960 

AGGGCTCGCT CCAGGGCTTT GCAGACTGGA CCCTTACCGG TGGCTCTTCC TGCCTGGGAC 1020 

CCTCCCGCAG AGTCCCACTA GCCAGCGGCC TCAGCCAGGG ACGAAGGCCT CAAAGCTGAG 1080

AGGCCCCTAC CGGTGGGTGA TGGATATCAT CCCCGAACAG GTGAAGGGAC AACTGACTAG 1140 

CAGCCCCAGA GCCCTCACCC TGCGGATCCC AGCCTAAAAG ACACCAGAGA CCTCAGCTAT 1200 

GGAGCCCTTC GGACCCACTT CTCACAGACT CTGGCACTGG CCAGGCCTCG AACCTGGGAC 1260

CCCTCCTCTG ATGAACACTA CAGTGGCTGA GGCATCAGCC CCCGCCCAGG CCCTGTAGGG 1320 

ACAGCATTTG AAGGACACAT ATTGCAGTTG CTTGGTTGAA AGTGCCTGTG CTGGAACTGG 1380 
CCTGTACTCA CTCATGGGAG CTGGCCCC 

Seq ID NO: 603 Protein sequence
Protein Accession #: NP_003967.1 

1 11 21 31 41 51 

I I I I I I 

MELGLGGLST LSHCPWPRRQ PALWPTLAAL ALLSSVAEAS LGSAPRSPAP REGPPPVLAS 
PAGHLPGGRT ARWCSGRARR PPPQPSRPAP PPPAPPSALP RGGRAARAGG PGSRARAAGA 
RGCRLRSQLV PVRALGLGHR SDELVRFRFC SGSCRRARSP HDLSLASLLG AGALRPPPGS
RPVSQPCCRP TRYEAVSFMD VNSTWRTVDR LSATACGCLG 

Seq ID NO: 604 DNA sequence 
Nucleic Acid Accession #: NM_057091.1
Coding sequence: 783..1445






1 11 21 31 41 51

| | | | | |

ACTGGCCGCT GAGAGAAGAA TCGGGTGGAG CAGAGAGCAG CTGCTGCAGG GCAGACAGCC 60
GGACCCCCAA ATCTGCACGT ACCAGCAGTC AGCCGCCCCA CGCAGGGACC GGCTTACCCC 120
TCGCTCCCCG CCCTCACTCA CTTTCTCCCG CCCTCGGCCC GGCCTCCCAG CTCTCTACTT 180
CGCGTGTCTA CAAACTCAAC TCCCGGTTTC CGTGCCTCTC CACCGCTCGA GTTCTCTACT 240
CTCCATATCC GAGGGGCCCC TCCCAGCATC TACCCCCCTC CCAACCTCGG GGGACCTAGC 300
CAAGCTAGGG GGGACTGGAT CCGACGGGTG GAGCAGCCAG GTGAGCCCCG AAAGGTGGGG 360
CGGGGCAGGG GCGCTCCCAG CCCCACCCCG GGATCTGGTG ACGCTGGGGC TGGAATTTGA 420
CACCGGACGG CTGCGGCGGC GGGCAGGAGG CTGCTGAGGG ATGGAGTTGG GCCCGGCCCC 480
CAGACAAGGC CCGGGGGCTC CGCCAGCAGC AGGTCCCTCG GGCCCCAGCC CTCGCTGCCA 540
CCCGGGCCTG GAGCCCCACA CCCGAGGGTG CAGACTGGCT GCCAAGGCCA CACTTTTGGC 600
TAAAAGAGGC ACTGCCAGGT GTACAGTCCT GGGCATGCGC TGTTTGAGCT TCGGGGGAGA 660
GCCCAGCACT GGTCCCCGGA AAGGTGCCTA GAAGAACAAG GTGCAGGACC CCGTGCTGCC 720
TCAACAGGAG GGTGGGGGAA CAGCTCAACA ATGGCTGATG GGCGCTCCTG GTGTTGATAG 780
AGATGGAACT TGGACTTGGA GGCCTCTCCA CGCTGTCCCA CTGCCCCTGG CCTAGGCGGC 840
AGCCTGCCCT GTGGCCCACC CTGGCCGCTC TGGCTCTGCT GAGCAGCGTC GCAGAGGCCT 900
CCCTGGGCTC CGCGCCCCGC AGCCCTGCCC CCCGCGAAGG CCCCCCGCCT GTCCTGGCGT 960
CCCCCGCCGG CCACCTGCCG GGGGGACGCA CGGCCCGCTG GTGCAGTGGA AGAGCCCGGC 1020
GGCCGCCGCC GCAGCCTTCT CGGCCCGCGC CCCCGCCGCC TGCACCCCCA TCTGCTCTTC 1080
CCCGCGGGGG CCGCGCGGCG CGGGCTGGGG GCCCGGGCAG CCGCGCTCGG GCAGCGGGGG 1140
CGCGGGGCTG CCGCCTGCGC TCGCAGCTGG TGCCGGTGCG CGCGCTCGGC CTGGGCCACC 1200
GCTCCGACGA GCTGGTGCGT TTCCGCTTCT GCAGCGGCTC CTGCCGCCGC GCGCGCTCTC 1260
CACACGACCT CAGCCTGGCC AGCCTACTGG GCGCCGGGGC CCTGCGACCG CCCCCGGGCT 1320
CCCGGCCCGT CAGCCAGCCC TGCTGCCGAC CCACGCGCTA CGAAGCGGTC TCCTTCATGG 1380
ACGTCAACAG CACCTGGAGA ACCGTGGACC GCCTCTCCGC CACCGCCTGC GGCTGCCTGG 1440
GCTGAGGGCT CGCTCCAGGG CTTTGCAGAC TGGACCCTTA CCGGTGGCTC TTCCTGCCTG 1500
GGACCCTCCC GCAGAGTCCC ACTAGCCAGC GGCCTCAGCC AGGGACGAAG GCCTCAAAGC 1560
TGAGAGGCCC CTACCGGTGG GTGATGGATA TCATCCCCGA ACAGGTGAAG GGACAACTGA 1620
CTAGCAGCCC CAGAGCCCTC ACCCTGCGGA TCCCAGCCTA AAAGACACCA GAGACCTCAG 1680
CTATGGAGCC CTTCGGACCC ACTTCTCACA GACTCTGGCA CTGGCCAGGC CTCGAACCTG 1740
GGACCCCTCC TCTGATGAAC ACTACAGTGG CTGAGGCATC AGCCCCCGCC CAGGCCCTGT 1800
AGGGACAGCA TTTGAAGGAC ACATATTGCA GTTGCTTGGT TGAAAGTGCC TGTGCTGGAA 1860
CTGGCCTGTA CTCACTCATG GGAGCTGGCC CC

Seq ID NO: 605 Protein sequence
Protein Accession #: NP_003967.1






1 11 21 31 41 51 

I I I I I I 

MELGLGGLST LSHCPWPRRQ PALWPTLAAL ALLSSVAEAS LGSAPRSPAP REGPPPVLAS 
PAGHLPGGRT ARWCSGRARR PPPQPSRPAP PPPAPPSALP RGGRAARAGG PGSRARAAGA 
RGCRLRSQLV PVRALGLGHR SDELVRFRFC SGSCRRARSP HDLSLASLLG AGALRPPPGS 
RPVSQPCCRP TRYEAVSFMD VNSTWRTVDR LSATACGCLG 



Seq ID NO: 606 DNA sequence 

Nucleic Acid Accession #: NM_057160.1

Coding sequence: 1..714

1 11 21 31 41 51

| | | | | |

ATGCCCGGCC TGATCTCAGC CCGAGGACAG CCCCTCCTTG AGGTCCTTCC TCCCCAAGCC 60

CACCTGGGTG CCCTCTTTCT CCCTGAGGCT CCACTTGGTC TCTCCGCGCA GCCTGCCCTG 120 

TGGCCCACCC TGGCCGCTCT GGCTCTGCTG AGCAGCGTCG CAGAGGCCTC CCTGGGCTCC 180 

GCGCCCCGCA GCCCTGCCCC CCGCGAAGGC CCCCCGCCTG TCCTGGCGTC CCCCGCCGGC 240 

CACCTGCCGG GGGGACGCAC GGCCCGCTGG TGCAGTGGAA GAGCCCGGCG GCCGCCGCCG 300

CAGCCTTCTC GGCCCGCGCC CCCGCCGCCT GCACCCCCAT CTGCTCTTCC CCGCGGGGGC 360

CGCGCGGCGC GGGCTGGGGG CCCGGGCAGC CGCGCTCGGG CAGCGGGGGC GCGGGGCTGC 420 

CGCCTGCGCT CGCAGCTGGT GCCGGTGCGC GCGCTCGGCC TGGGCCACCG CTCCGACGAG 480 

CTGGTGCGTT TCCGCTTCTG CAGCGGCTCC TGCCGCCGCG CGCGCTCTCC ACACGACCTC 540 

AGCCTGGCCA GCCTACTGGG CGCCGGGGCC CTGCGACCGC CCCCGGGCTC CCGGCCCGTC 600 

AGCCAGCCCT GCTGCCGACC CACGCGCTAC GAAGCGGTCT CCTTCATGGA CGTCAACAGC 660

ACCTGGAGAA CCGTGGACCG CCTCTCCGCC ACCGCCTGCG GCTGCCTGGG CTGAGGGCTC 720 

GCTCCAGGGC TTTGCAGACT GGACCCTTAC CGGTGGCTCT TCCTGCCTGG GACCCTCCCG 780 

CAGAGTCCCA CTAGCCAGCG GCCTCAGCCA GGGACGAAGG CCTCAAAGCT GAGAGGCCCC 840 

TACCGGTGGG TGATGGATAT CATCCCCGAA CAGGTGAAGG GACAACTGAC TAGCAGCCCC 900

AGAGCCCTCA CCCTGCGGAT CCCAGCCTAA AAGACACCAG AGACCTCAGC TATGGAGCCC 960

TTCGGACCCA CTTCTCACAG ACTCTGGCAC TGGCCAGGCC TCGAACCTGG GACCCCTCCT 1020 

CTGATGAACA CTACAGTGGC TGAGGCATCA GCCCCCGCCC AGGCCCTGTA GGGACAGCAT 1080 

TTGAAGGACA CATATTGCAG TTGCTTGGTT GAAAGTGCCT GTGCTGGAAC TGGCCTGTAC 1140 
TCACTCATGG GAGCTGGCCC C 






Seq ID NO: 607 Protein sequence 
Protein Accession #: NP_476501.1



1 11 21 31 41 51 

| | | | | |

MPGLISARGQ PLLEVLPPQA HLGALFLPEA PLGLSAQPAL WPTLAALALL SSVAEASLGS 60
APRSPAPREG PPPVLASPAG HLPGGRTARW CSGRARRPPP QPSRPAPPPP APPSALPRGG 120
RAARAGGPGS RARAAGARGC RLRSQLVPVR ALGLGHRSDE LVRFRFCSGS CRRARSPHDL 180
SLASLLGAGA LRPPPGSRPV SQPCCRPTRY EAVSFMDVNS TWRTVDRLSA TACGCLG



Seq ID NO: 608 DNA sequence 

Nucleic Acid Accession #: NM_057090.1

Coding sequence: 29..715

1 11 21 31 41 51

I I I I I I 

CTGATGGGCG CTCCTGGTGT TGATAGAGAT GGAACTTGGA CTTGGAGGCC TCTCCACGCT 60 

GTCCCACTGC CCCTGGCCTA GGCGGCAGGC TCCACTTGGT CTCTCCGCGC AGCCTGCCCT 120 

GTGGCCCACC CTGGCCGCTC TGGCTCTGCT GAGCAGCGTC GCAGAGGCCT CCCTGGGCTC 180 

CGCGCCCCGC AGCCCTGCCC CCCGCGAAGG CCCCCCGCCT GTCCTGGCGT CCCCCGCCGG 240

CCACCTGCCG GGGGGACGCA CGGCCCGCTG GTGCAGTGGA AGAGCCCGGC GGCCGCCGCC 300 

GCAGCCTTCT CGGCCCGCGC CCCCGCCGCC TGCACCCCCA TCTGCTCTTC CCCGCGGGGG 360 

CCGCGCGGCG CGGGCTGGGG GCCCGGGCAG CCGCGCTCGG GCAGCGGGGG CGCGGGGCTG 420 

CCGCCTGCGC TCGCAGCTGG TGCCGGTGCG CGCGCTCGGC CTGGGCCACC GCTCCGACGA 480 

GCTGGTGCGT TTCCGCTTCT GCAGCGGCTC CTGCCGCCGC GCGCGCTCTC CACACGACCT 540

CAGCCTGGCC AGCCTACTGG GCGCCGGGGC CCTGCGACCG CCCCCGGGCT CCCGGCCCGT 600

CAGCCAGCCC TGCTGCCGAC CCACGCGCTA CGAAGCGGTC TCCTTCATGG ACGTCAACAG 660 

CACCTGGAGA ACCGTGGACC GCCTCTCCGC CACCGCCTGC GGCTGCCTGG GCTGAGGGCT 720 

CGCTCCAGGG CTTTGCAGAC TGGACCCTTA CCGGTGGCTC TTCCTGCCTG GGACCCTCCC 780 

GCAGAGTCCC ACTAGCCAGC GGCCTCAGCC AGGGACGAAG GCCTCAAAGC TGAGAGGCCC 840

CTACCGGTGG GTGATGGATA TCATCCCCGA ACAGGTGAAG GGACAACTGA CTAGCAGCCC 900 

CAGAGCCCTC ACCCTGCGGA TCCCAGCCTA AAAGACACCA GAGACCTCAG CTATGGAGCC 960 

CTTCGGACCC ACTTCTCACA GACTCTGGCA CTGGCCAGGC CTCGAACCTG GGACCCCTCC 1020 

TCTGATGAAC ACTACAGTGG CTGAGGCATC AGCCCCCGCC CAGGCCCTGT AGGGACAGCA 1080 

TTTGAAGGAC ACATATTGCA GTTGCTTGGT TGAAAGTGCC TGTGCTGGAA CTGGCCTGTA 1140
CTCACTCATG GGAGCTGGCC CC 



Seq ID NO: 609 Protein sequence 
Protein Accession #: NP_476431.1



1 11 21 31 41 51 

I I I I I I 

MELGLGGLST LSHCPWPRRQ APLGLSAQPA LWPTLAALAL LSSVAEASLG SAPRSPAPRE 60
GPPPVLASPA GHLPGGRTAR WCSGRARRPP PQPSRPAPPP PAPPSALPRG GRAARAGGPG 120
SRARAAGARG CRLRSQLVPV RALGLGHRSD ELVRFRFCSG SCRRARSPHD LSLASLLGAG 180
ALRPPPGSRP VSQPCCRPTR YEAVSFMDVN STWRTVDRLS ATACGCLG 

Seq ID NO: 610 DNA sequence 
Nucleic Acid Accession #: Eos sequence
Coding sequence: 1..1746

1 11 21 31 41 51 

I I I I I I 

ATGCCACTGA AGCATTATCT CCTTTTGCTG GTGGGCTGCC AAGCCTGGGG TGCAGGGTTG 60

GCCTACCATG GCTGCCCTAG CGAGTGTACC TGCTCCAGGG CCTCCCAGGT GGAGTGCACC 120

GGGGCACGCA TTGTGGCGGT GCCCACCCCT CTGCCCTGGA ACGCCATGAG CCTGCAGATC 180 

CTCAACACGC ACATCACTGA ACTCAATGAG TCCCCGTTCC TCAATATCTC AGCCCTCATC 240 

GCCCTGAGGA TTGAGAAGAA TGAGCTGTCG CGCATCACGC CTGGGGCCTT CCGAAACCTG 300 

GGCTCGCTGC GCTATCTCAG CCTCGCCAAC AACAAGCTGC AGGTTCTGCC CATCGGCCTC 360 

TTCCAGGGCC TGGACAGCCT TGAGTCTCTC CTTCTGTCCA GTAACCAGCT GTTGCAGATC 420

CAGCCGGCCC ACTTCTCCCA GTGCAGCAAC CTCAAGGAGC TGCAGTTGCA CGGCAACCAC 480 

CTGGAATACA TCCCTGACGG AGCCTTCGAC CACCTGGTAG GACTCACGAA GCTCAATCTG 540 




GGCAAGAATA GCCTCACCCA CATCTCACCC AGGGTCTTCC AGCACCTGGG CAATCTCCAG 600
GTCCTCCGGC TGTATGAGAA CAGGCTCACG GATATCCCCA TGGGCACTTT TGATGGGCTT 660
GTTAACCTGC AGGAACTGGC TCTACAGCAG AACCAGATTG GACTGCTCTC CCCTGGTCTC 720
TTCCACAACA ACCACAACCT CCAGAGACTC TACCTGTCCA ACAACCACAT CTCCCAGCTG 780
CCACCCAGCA TCTTCATGCA GCTGCCCCAG CTCAACCGTC TTACTCTCTT TGGGAATTCC 840
CTGAAGGAGC TCTCTCTGGG GATCTTCGGG CCCATGCCCA ACCTGCGGGA GCTTTGGCTC 900
TATGACAACC ACATCTCTTC TCTACCCGAC AATGTCTTCA GCAACCTCCG CCAGTTGCAG 960
GTCCTGATTC TTAGCCGCAA TCAGATCAGC TTCATCTCCC CGGGTGCCTT CAACGGGCTA 1020
ACGGAGCTTC GGGAGCTGTC CCTCCACACC AACGCACTGC AGGACCTGGA CGGGAATGTC 1080
TTCCGCATGT TGGCCAACCT GCAGAACATC TCCCTGCAGA ACAATCGCCT CAGACAGCTC 1140
CCAGGGAATA TCTTCGCCAA CGTCAATGGC CTCATGGCCA TCCAGCTGCA GAACAACCAG 1200
CTGGAGAACT TGCCCCTCGG CATCTTCGAT CACCTGGGGA AACTGTGTGA GCTGCGGCTG 1260
TATGACAATC CCTGGAGGTG TGACTCAGAC ATCCTTCCGC TCCGCAACTG GCTCCTGCTC 1320
AACCAGCCTA GGTTAGGGAC GGACACTGTA CCTGTGTGTT TCAGCCCAGC CAATGTCCGA 1380
GGCCAGTCCC TCATTATCAT CAATGTCAAC GTTGCTGTTC CAAGCGTCCA TGTCCCTGAG 1440
GTGCCTAGTT ACCCAGAAAC ACCATGGTAC CCAGACACAC CCAGTTACCC TGACACCACA 1500
TCCGTCTCTT CTACCACTGA GCTAACCAGC CCTGTGGAAG ACTACACTGA TCTGACTACC 1560
ATTCAGGTCA CTGATGACCG CAGCGTTTGG GGCATGACCC AGGCCCAGAG CGGGCTGGCC 1620
ATTGCCGCCA TTGTAATTGG CATTGTCGCC CTGGCCTGCT CCCTGGCTGC CTGCGTCGGC 1680
TGTTGCTGCT GCAAGAAGAG GAGCCAAGCT GTCCTGATGC AGATGAAGGC ACCCAATGAG 1740
TGTTAAAGAG GCAGGCTGGA GCAGGGCTGG GGAATGATGG GACTGGAGGA CCTGGGAATT 1800
TCATCTTTCT GCCTCCACCC CTGGGTCCAT GGAGCTTTCC CGTGATTGCT CTTTCTGGCC 1860
CTAGATAAAG GTGTGCCTAC CTCTTCCTGA CTTGCCTGAT TCTCCCGTAG AGAAGCAGGT 1920
CGTGCCGGAC CTTCCTACAA TCAGGAAGAT AGATCCAACT GGCCATGGCA AAAGCCCTGG 1980
GGATTTCCGA TTCATACCCC TGGGCTTCCT TCGAGAGGGC TCTTCCTCCA AATCCTCCCC 2040
ACCTGTCCTC CAAGAACAGC CTTCCCTGCG CCCAGGCCCC CTCCGGGCCT CTGTAGACTC 2100
AGTTAGTCCA CAGCCTGCTC ACTTCGTGGG AATAGTTCTC CGCTGAGATA GCCCCTCTCG 2160
CCTAAGTATT ATGTAAGTTG ATTTCCCTTC TTTTGTTTCT CTTGTTTGTG CTATGGCTTG 2220
ACCCAGCATG TCCCCTCAAA TGAAAGTTCT CCCCTTGATT TTCTGCTCCT GAAGGCAGGG 2280
TGAGTTCTCT CCTCAAAGAA GACTTCAAAC CATTTAACTG GTTTCTTAAG AGCCGTCAAT 2340
CAGCCTGGTT TTGGGGATGC TATGAAAGAG AGAAGGAAAA TCATGCCGCT CAGTTCCTGG 2400
AGACAGAAGA GCCGTCATCA GTGTCTCACT TGTGATTTTT ATCTGGAAAA GGAAGAAACA 2460
CCCCAGCACA GCAAGCTCAG CCTTTTAGAG AAGGATATTT CCAAACTGCA AACTTTGCTT 2520
TGAAAAGTTT AGCCCTTTAA GGAATGAAAT CATGTAGAAT TTTGGACTTC TAAAAACATT 2580
AAAATCAGCT TATTAATACG GGATAGAGAA AGAAATCTGG TGCCTGGGGG TCCCTGTGTT 2640
CACCCCTAGA GTTTGTTTTA AAATTTTTAA TTGAAGCATG TGAAGTGTAC STGCAGAAAA 2700
GTGGGAACAT GATAGTGTAT GGCTTGGTGG ATTTTCACAA ACTGAACATA CCTGTGTAAT 2760
CAGCATCTAG ACCCAGACCC AGAGCATCAC AAATATCCCC CATCCTGGGC TTTTCCCAGA 2820
GGAGATGGGG GCTTCTGAAG ATGGACTTAC CTGGGACCTG CCCCCCATGA GCCAGGACGG 2880
TCCCCCCACA GTCAGCCTGT GCAAAGGCCC CGTGGCCAGG GGTGGAGGAG AATATGTGGG 2940
TGTGGACAGG ATGGGAGACT GTGGCCTGAA CAGGAGATTT TATTATATCT GGAGACCCTG 3000
AGAGACCCTG AGACCTGGGG CACCATGGCT GGCCAGGTCA GAAGCATCCT GACTGCAGAG 3060
GTCCGTGCAG CCACACCCTC TTCCCTGCCA GCAAGTTGTC TGCGGCTCAT CGGAGGCCCC 3120
TCCGCCTGGA GCCTTCTATG GACGTGATAT GCCTGTATCT GTTTTTAATT TTCATTCTTC 3180
ACTTAGGGGA AGTGAAATCG CTCAGAGATG AGATCCTTTA ATTGAAAACG AAGTGTAACG 3240
GAATCTAGTG TCTTTCTAAT GTGGTAAAAT TCTCCATCAA CATCACAGTC AGCTGGCAGC 3300
TGAACTTCAG AATCTCACTT ACAGCAGGCG ACACGGGGGT ACACCGATGG GTCACACTGG 3360
GTCTGGGGGC TCCCTGGAGC TCCTCCTGCG TGTGGTCTGG TTAGGAGTTG AGTTGTTTGC 3420
TCCAGGGTTA TTCTCCTCCT CGAGTCACAG TCACACGAAT ACCTGCCTTC TCTGGCTTTC 3480
CTGCTATACA CATATTCACA TGGCGCTCAA GAAGTTAGGC TCATGGCAAC GTGTGTCTTT 3540
CTCTGGACAA CTGGCCCAGT TTACAGTGAA ATGGAGAATT TCAGGTCTCC ACGTCTGCCC 3600
AGGAAAGAAC TTCAGCTGAC TCCACGGGGA TCTGGAAATC CACGACCAAT CCCGATCGGC 3660
TCTTATTAGC TCCCCGCTCC ACAAGACACC TGTGCTTTGG AAATCCACCA CCAATCCCGA 3720
TCGGCTCTTA TTAGCTCCCC GCTCCACAAG ACACCTGTGA TCTGGAAATC TACCACCAAT 3780
CCCGATCGGC TCTTATTAGC TCCCCGCTCC ACAAGACACC TGTGACATCC TCCAGGGCCA 3840
CAGGAGCACG TGCTGACCAG TTTTCCCTTC CAGTTCCTGC ACAAAAAGTG TCGAGAGGGC 3900
TGTTTGCAAA CACTAGTGCA CTTTGTAGCT TTTCACCCTC TGTCCCAGGG AATCTAGGAG 3960
AGATGAGGCC CGTCAGAGTC AAGAGATGTC ATCCCCCCAG GGTCTCCAAG GCATTTCCAC 4020
ACTATTGGTG GCACCTGGAG GACATGCACC AAGGCTTGCC AGAGCCAACA GGAAGTGAGC 4080
CCAGAGCATG GCACATGAGC ATCACCCGCT GATGGTGGCC TGCTGTGCCT GGTGCCAACA 4140
GGGGCATCCC GGCCCGTACC CCTCCAGACA GGAAGCATGG GTTTGCCCAC AGACCTGTCG 4200
GGTGCTCCTG TGAGTGGCCT CCAGATGTCT TTGTGCATAG GCACAAGTGG GCCAGGGCTG 4260
GAGGGAGGTG GGAAACCTCA TCATCCGGTG GGCCCTGCCA ATCTTAACCC AGAACCCTTA 4320
GGTATTCCTG GCAGTAGCCA TGACATTGGA GCACCTTCCT CTCCAGCCAG AGGCTGACCT 4380
GAGGGCCACT GTCCTCAGAT GACACCACCC AGGAGCACCC TAGGTGAGGG GTGAGGGCCC 4440
CCTTATGTGA ACCTCTTGCC TCTTCCTTTC TCCCATCAGA GTGGTTGGAT GGAGCCATTG 4500
GCCTCCTTTT CTTCAGCGGG CCCTTCAACC TCTCTGCACC ATGTTGTCTG GCTGAGGAGC 4560
TACTAGAAAA GCTGAGTGGA GTCTCCTTTC CAACAGGATG ATGCATTTGC TCAATTCTCA 4620
GGGCTGGAAT GAGCCGGCTG GTCCCCCAGA AAGCTGGAGT GGGGTACAGA GTTCAGTTTT 4680
CCTCTCTGTT TACAGCTCCT TGACAGTCCC ACGCCCATCT GGAGTGGGAG CTGGGAGTTA 4740
GTGTTGGAGA AGAAACAACA AAAGCCAATT AGAACCACTA TTTTTAAAAA GTGCTTACTG 4800
TGCACAGATA CTCTTCAAGC ACTGGACGTG GATTCTCTCT CTAGCCCTCA GCACCCCTGC 4860
GGTAGGAGTG CCGCCTCTAC CCACTTGTGA TGGGGTACAG AGGCACTTGC TCTTCTGCAT 4920
GGTGTTCAAT AGGCTGGGAG TTTTATTTAT CTCTTCAAAC TTTGTACAAG AGCTCATGGC 4980
TTGTCTTGGG CTTTCGTCAT TAAACCAAAG GAAATGGAAG CCATTCCCCT GTTGCTCTCC 5040
TTAGTCTTGG TCATCAGAAC CTCACTTGGT ACCATATAGA TCAAAAGCTT TGTAACCACA 5100
GGAAAAAATA AACTCTTCCA TCCCTTAAAG AATAGAATAG TTTGTCCCTC TCATGGGAAT 5160
TGGGCTGTAT GTATATTGTT CTTCCTCCTT AGAATTTAGA GATACAAGAG TTCTACTTAG 5220
AACTTTTCAT GGACACAATT TCCACAACCT TTCAGATGCT GATGTAGAGC TATTGGGAAA 5280
GAACTTCCAA ACTCAGGAAG TTTGCAGAGA GCAGACAGCT AGAGATAACT CGGGACCCAG 5340
AGTTGGTCGA CAGATGTTAG ATGTATCCTA GCTTTTAGCC ATAAACCACT CAAAGATTCA 5400
GCCCCCAGAT CCCACAGTCA GAACTGAATC TGCGTTGTTG GGAAGCCAGC AGTGGCCTTG 5460
GGAAGGAAGC CATGGCTGTG GTTCAGAGAG GGTGGGCTGG CAAGCCACTT CCGGGGAAAA 5520
CTCCTTCCGC CCCAGGTTTC TTCTTCTCTT AAGGAGAGAT TGTTCTCACC AACCCGCTGC 5580
CTTCATGCTG CCTTCAAAGC TAGATCATGT TTGCCTTGCT TAGAGAATTA CTGCAAATCA 5640
GCCCCAGTGC TTGGCGATGC ATTTACAGAT TTCTAGGCCC TCAGGGTTTT GTAGAGTGTG 5700
AGCCCTGGTG GGCAGGGTTG GGGGGTCTGT CTTCTGCTGG ATGCTGCTTG TAATCCATTT 5760
GGTGTACAGA ATCAACAATA AATAATATAC ATGTAT

Seq ID NO: 611 Protein sequence 
Protein Accession #: BAB84587.1



1 11 21 31 41 51

| | | | | |

MPLKHYLLLL VGCQAWGAGL AYHGCPSECT CSRASQVECT GARIVAVPTP LPWNAMSLQI 60
LNTHITELNE SPFLNISALI ALRIEKNELS RITPGAFRNL GSLRYLSLAN NKLQVLPIGL 120
FQGLDSLESL LLSSNQLLQI QPAHFSQCSN LKELQLHGNH LEYIPDGAFD HLVGLTKLNL 180
GKNSLTHISP RVFQHLGNLQ VLRLYENRLT DIPMGTFDGL VNLQELALQQ NQIGLLSPGL 240
FHNNHNLQRL YLSNNHISQL PPSIFMQLPQ LNRLTLFGNS LKELSLGIFG PMPNLRELWL 300
YDNHISSLPD NVFSNLRQLQ VLILSRNQIS FISPGAFNGL TELRELSLHT NALQDLDGNV 360
FRMLANLQNI SLQNNRLRQL PGNIFANVNG LMAIQLQNNQ LENLPLGIFD HLGKLCELRL 420
YDNPWRCDSD ILPLRNWLLL NQPRLGTDTV PVCFSPANVR GQSLIIINVN VAVPSVHVPE 480
VPSYPETPWY PDTPSYPDTT SVSSTTELTS PVEDYTDLTT IQVTDDRSVW GMTQAQSGLA 540
IAAIVIGIVA LACSLAACVG CCCCKKRSQA VLMQMKAPNE C



Seq ID NO: 612 DNA sequence 
Nucleic Acid Accession #: XM_098151
Coding sequence: 1..447

1 11 21 31 41 51

| | | | | |

ATGATGCATT TGCTCAATTC TCAGGGCTGG AATGAGCCGG CTGGTCCCCC AGAAAGCTGG 60
AGTGGGGTAC AGAGTTCAGT TTTCCTCTCT GTTTACAGCT CCTTGACAGT CCCACGCCCA 120
TCTGGAGTGG GAGCTGGGAG TCAGTGTTGG AGAAGAAACA ACAAAAGCCA ATTAGAACCA 180
CTATTTTTAA AAAGTGCTTA CTGTGCACAG ATACTCTTCA AGCACTGGAC GTGGATTCTC 240
TCTCTAGCCC TCAGCACCCC TGCGGTAGGA GTGCCGCCTC TACCCACTTG TGATGGGGTA 300
CAGAGGCACT TGCTCTTCTG CATGGTGTTC AATAGGCTGG GAGTTTTATT TATCTCTTCA 360
AACTTTGTAC AAGAGCTCAT GGCTTGTCTT GGGCTTTCGT CATTAAACCA AAGGAAATGG 420
AAGCCATTCC CCTGTTGCTC TCCTTAG



Seq ID NO: 613 Protein sequence 
Protein Accession #: XP_098151



1 11 21 31 41 51 

I I I I I I 

MMHLLNSQGW NEPAGPPESW SGVQSSVFLS VYSSLTVPRP SGVGAGSQCW RRNNXSQLEP 
LFLKSAYCAQ ILFKHWTWIL SLALSTPAVG VPPLPTCDGV QRHLLFCMVF NRLGVLFISS
NFVQELMACL GLSSLNQRKW KPFPCCSP 

Seq ID NO: 614 DNA sequence 

Nucleic Acid Accession #: NM_002658.1 

Coding sequence: 77..1372

1 11 21 31 41 51

| | | | | |

GTCCCCGCAG CGCCGTCGCG CCCTCCTGCC GCAGGCCACC GAGGCCGCCG CCGTCTAGCG 60
CCCCGACCTC GCCACCATGA GAGCCCTGCT GGCGCGCCTG CTTCTCTGCG TCCTGGTCGT 120
GAGCGACTCC AAAGGCAGCA ATGAACTTCA TCAAGTTCCA TCGAACTGTG ACTGTCTAAA 180
TGGAGGAACA TGTGTGTCCA ACAAGTACTT CTCCAACATT CACTGGTGCA ACTGCCCAAA 240
GAAATTCGGA GGGCAGCACT GTGAAATAGA TAAGTCAAAA ACCTGCTATG AGGGGAATGG 300
TCACTTTTAC CGAGGAAAGG CCAGCACTGA CACCATGGGC CGGCCCTGCC TGCCCTGGAA 360
CTCTGCCACT GTCCTTCAGC AAACGTACCA TGCCCACAGA TCTGATGCTC TTCAGCTGGG 420
CCTGGGGAAA CATAATTACT GCAGGAACCC AGACAACCGG AGGCGACCCT GGTGCTATGT 480
GCAGGTGGGC CTAAAGCCGC TTGTCCAAGA GTGCATGGTG CATGACTGCG CAGATGGAAA 540
AAAGCCCTCC TCTCCTCCAG AAGAATTAAA ATTTCAGTGT GGCCAAAAGA CTCTGAGGCC 600
CCGCTTTAAG ATTATTGGGG GAGAATTCAC CACCATCGAG AACCAGCCCT GGTTTGCGGC 660
CATCTACAGG AGGCACCGGG GGGGCTCTGT CACCTACGTG TGTGGAGGCA GCCTCATCAG 720
CCCTTGCTGG GTGATCAGCG CCACACACTG CTTCATTGAT TACCCAAAGA AGGAGGACTA 780
CATCGTCTAC CTGGGTCGCT CAAGGCTTAA CTCCAACACG CAAGGGGAGA TGAAGTTTGA 840
GGTGGAAAAC CTCATCCTAC ACAAGGACTA CAGCGCTGAC ACGCTTGCTC ACCACAACGA 900
CATTGCCTTG CTGAAGATCC GTTCCAAGGA GGGCAGGTGT GCGCAGCCAT CCCGGACTAT 960
ACAGACCATC TGCCTGCCCT CGATGTATAA CGATCCCCAG TTTGGCACAA GCTGTGAGAT 1020
CACTGGCTTT GGAAAAGAGA ATTCTACCGA CTATCTCTAT CCGGAGCAGC TGAAAATGAC 1080
TGTTGTGAAG CTGATTTCCC ACCGGGAGTG TCAGCAGCCC CACTACTACG GCTCTGAAGT 1140
CACCACCAAA ATGCTATGTG CTGCTGACCC CCAATGGAAA ACAGATTCCT GCCAGGGAGA 1200
CTCAGGGGGA CCCCTCGTCT GTTCCCTCCA AGGCCGCATG ACTTTGACTG GAATTGTGAG 1260
CTGGGGCCGT GGATGTGCCC TGAAGGACAA GCCAGGCGTC TACACGAGAG TCTCACACTT 1320
CTTACCCTGG ATCCGCAGTC ACACCAAGGA AGAGAATGGC CTGGCCCTCT GAGGGTCCCC 1380
AGGGAGGAAA CGGGCACCAC CCGCTTTCTT GCTGGTTGTC ATTTTTGCAG TAGAGTCATC 1440
TCCATCAGCT GTAAGAAGAG ACTGGGAAGA TAGGCTCTGC ACAGATGGAT TTGCCTGTGG 1500
CACCACCAGG GTGAACGACA ATAGCTTTAC CCTCACGGAT AGGCCTGGGT GCTGGCTGCC 1560
CAGACCCTCT GGCCAGGATG GAGGGGTGGT CCTGACTCAA CATGTTACTG ACCAGCAACT 1620
TGTCTTTTTC TGGACTGAAG CCTGCAGGAG TTAAAAAGGG CAGGGCATCT CCTGTGCATG 1680
GGCTCGAAGG GAGAGCCAGC TCCCCCGACC GGTGGGCATT TGTGAGGCCC ATGGTTGAGA 1740
AATGAATAAT TTCCCAATTA GGAAGTGTAA GCAGCTGAGG TCTCTTGAGG GAGCTTAGCC 1800
AATGTGGGAG CAGCGGTTTG GGGAGCAGAG ACACTAACGA CTTCAGGGCA GGGCTCTGAT 1860
ATTCCATGAA TGTATCAGGA AATATATATG TGTGTGTATG TTTGCACACT TGTTGTGTGG 1920
GCTGTGAGTG TAAGTGTGAG TAAGAGCTGG TGTCTGATTG TTAAGTCTAA ATATTTCCTT 1980
AAACTGTGTG GACTGTGATG CCACACAGAG TGGTCTTTCT GGAGAGGTTA TAGGTCACTC 2040
CTGGGGCCTC TTGGGTCCCC CACGTGACAG TGCCTGGGAA TGTACTTATT CTGCAGCATG 2100
ACCTGTGACC AGCACTGTCT CAGTTTCACT TTCACATAGA TGTCCCTTTC TTGGCCAGTT 2160
ATCCCTTCCT TTTAGCCTAG TTCATCCAAT CCTCACTGGG TGGGGTGAGG ACCACTCCTT 2220
ACACTGAATA TTTATATTTC ACTATTTTTA TTTATATTTT TGTAATTTTA AATAAAAGTG 2280
ATCAATAAAA TGTGATTTTT CTGA

Seq ID NO: 615 Protein sequence
Protein Accession #: NP_002649.1

1 11 21 31 41 51 

| I I I I I 

MRALLARLLL CVLVVSDSKG SNELHQVPSN CDCLNGGTCV SNKYFSNIHW CNCPKKFGGQ 60

HCEIDKSKTC YEGNGHFYRG KASTDTMGRP CLPWNSATVL QQTYHAHRSD ALQLGLGKHN 120 

YCRNPDNRRR PWCYVQVGLK PLVQECMVHD CADGKKPSSP PEELKFQCGQ KTLRPRFKII 180

GGEFTTIENQ PWFAAIYRRH RGGSVTYVCG GSLISPCWVI SATHCFIDYP KKEDYIVYLG 240 

RSRLNSNTQG EMKFEVENLI LHKDYSADTL AHHNDIALLK IRSKEGRCAQ PSRTIQTICL 300 

PSMYNDPQFG TSCEITGFGK ENSTDYLYPE QLKMTVVKLI SHRECQQPHY YGSEVTTKML 360

CAADPQWKTD SCQGDSGGPL VCSLQGRMTL TGIVSWGRGC ALKDKPGVYT RVSHFLPWIR 420 
SHTKEENGLA L 

Seq ID NO: 616 DNA sequence 

Nucleic Acid Accession #: NM_024422.1

Coding sequence: 202..2907

1 11 21 31 41 51 

I I I I I I 

CGCCAAAGGA AAAGCCCCTT GGATGAGAGG CAGGCGCTTC AGAGAAGCTA AGAAAAGCAC 60 

CTCTCCGCGC GCCCCACCTC CTCCGCCTCG CGCTCCTCCT GAGCAGCGGG CCCAGACTGC 120

GCTCCGGCCG CGGCCCTCGC CCCGCGGAGC CCTCCTACCC CGGCCCGACG CTCGGCCCGC 180 

GACCTGCCCC GAGCCCTCTC CATGGAGGCA GCCCGCCCCT CCGGCTCCTG GAACGGAGCC 240 

CTCTGCCGGC TGCTCCTGCT GACCCTCGCG ATCTTAATAT TTGCCAGTGA TGCCTGCAAA 300 

AATGTGACAT TACATGTTCC CTCCAAACTA GATGCCGAGA AACTTGTTGG TAGAGTTAAC 360 

CTGAAAGAGT GCTTTACAGC TGCAAATCTA ATTCATTCAA GTGATCCTGA CTTCCAAATT 420 

TTGGAGGATG GTTCAGTCTA TACAACAAAT ACTATTCTAT TGTCCTCGGA GAAGAGAAGT 480 

TTTACCATAT TACTTTCCAA CACTGAGAAC CAAGAAAAGA AGAAAATATT TGTCTTTTTG 540 

GAGCATCAAA CAAAGGTCCT AAAGAAAAGA CATACTAAAG AAAAAGTTCT AAGGCGCGCC 600 

AAGAGAAGAT GGGCTCCAAT TCCTTGTTCG ATGCTAGAAA ACTCCTTGGG TCCTTTTCCA 660 

CTTTTCCTTC AACAGGTTCA ATCTGACACG GCCCAAAACT ATACCATATA CTATTCCATA 720 

AGAGGTCCTG GAGTTGACCA AGAACCTCGG AATTTATTTT ATGTGGAGAG AGACACTGGA 780 

AACTTGTATT GTACTCGTCC TGTAGATCGT GAGCAGTATG AATCTTTTGA GATAATTGCC 840 

TTTGCAACAA CTCCAGATGG GTATACTCCA GAACTTCCAC TGCCCCTAAT AATCAAAATA 900 

GAGGATGAAA ATGATAACTA CCCAATTTTT ACAGAAGAAA CTTATACTTT TACAATTTTT 960 

GAAAATTGCA GAGTGGGCAC TACTGTCGGA CAAGTGTGTG CTACTGACAA AGATGAGCCT 1020 

GACACGATGC ACACACGCCT GAAGTACTCC ATCATTGGGC AGGTGCCACC ATCACCCACC 1080 

CTATTTTCTA TGCATCCAAC TACAGGCGTG ATCACCACAA CATCATCTCA GCTAGACAGA 1140 

GAGTTAATTG ACAAGTACCA GTTGAAAATA AAAGTACAAG ACATGGATGG TCAGTATTTT 1200 

GGTCTACAGA CAACTTCAAC TTGTATCATT AACATTGATG ATGTAAATGA CCACTTGCCA 1260 

ACATTTACTC GTACTTCTTA TGTGACATCA GTGGAAGAAA ATACAGTTGA TGTGGAAATC 1320 

TTACGAGTTA CTGTTGAGGA TAAGGACTTA GTGAATACTG CTAACTGGAG AGCTAATTAT 1380 

ACCATTTTAA AGGGCAATGA AAATGGCAAT TTTAAAATTG TAACAGATGC CAAAACCAAT 1440 

GAAGGAGTTC TTTGTGTAGT TAAGCCTTTG AATTATGAAG AAAAGCAACA GATGATCTTG 1500 

CAAATTGGTG TAGTTAATGA AGCTCCATTT TCCAGAGAGG CTAGTCCAAG ATCAGCCATG 1560 

AGCACAGCAA CAGTTACTGT TAATGTAGAA GATCAGGATG AGGGCCCTGA GTGTAACCCT 1620 

CCAATACAGA CTGTTCGCAT GAAAGAAAAT GCAGAAGTGG GAACAACAAG CAATGGATAT 1680 

AAAGCATATG ACCCAGAAAC AAGAAGTAGC AGTGGCATAA GGTATAAGAA ATTAACTGAT 1740 

CCAACAGGGT GGGTCACCAT TGATGAAAAT ACAGGATCAA TCAAAGTTTT CAGAAGCCTG 1800 

GATAGAGAGG CAGAGACCAT CAAAAATGGC ATATATAATA TTACAGTCCT TGCATCAGAC 1860 

CAAGGAGGGA GAACATGTAC GGGGACACTG GGCATTATAC TTCAAGACGT GAATGATAAC 1920 

AGCCCATTCA TACCTAAAAA GACAGTGATC ATCTGCAAAC CCACCATGTC ATCTGCGGAG 1980 

ATTGTTGCGG TTGATCCTGA TGAGCCTATC CATGGCCCAC CCTTTGACTT TAGTCTGGAG 2040 

AGTTCTACTT CAGAAGTACA GAGAATGTGG AGACTGAAAG CAATTAATGA TACAGCAGCA 2100 

CGTCTTTCCT ATCAGAATGA TCCTCCATTT GGCTCATATG TAGTACCTAT AACAGTGAGA 2160

GATAGACTTG GCATGTCTAG TGTCACTTCA TTGGATGTTA CACTGTGTGA CTGCATTACC 2220 

GAAAATGACT GCACACATCG TGTAGATCCA AGGATTGGCG GTGGAGGAGT ACAACTTGGA 2280 

AAGTGGGCCA TCCTTGCAAT ATTGTTGGGC ATAGCATTGC TCTTTTGCAT CCTGTTTACG 2340 

CTGGTCTGTG GGGCTTCTGG GACGTCTAAA CAACCAAAAG TAATTCCTGA TGATTTAGCC 2400 

CAGCAGAACC TAATTGTATC AAACACAGAA GCTCCTGGAG ATGACAAAGT GTATTCTGCG 2460 

AATGGCTTCA CAACCCAAAC TGTGGGCGCT TCTGCTCAGG GAGTTTGTGG CACCGTGGGA 2520 

TCAGGAATCA AAAACGGAGG TCAGGAGACC ATCGAAATGG TGAAAGGAGG ACACCAGACC 2580 

TCGGAATCCT GCCGGGGGGC TGGCCACCAT CACACCCTGG ACTCCTGCAG GGGAGGACAC 2640 

ACGGAGGTGG ACAACTGCAG ATACACTTAC TCGGAGTGGC ACAGTTTTAC TCAGCCCCGT 2700 

CTTGGTGAAA AAGTGTATCT GTGTAATCAA GATGAAAATC ACAAGCATGC CCAAGACTAT 2760 

GTCCTGACAT ATAACTATGA AGGAAGAGGA TCGGTGGCTG GGTCTGTAGG TTGTTGCAGT 2820 

GAACGACAAG AAGAAGATGG GCTTGAATTT TTGGATAATT TGGAGCCCAA ATTTAGGACA 2880 

CTAGCAGAAG CATGCATGAA GAGATGAGTG TGTTCTAATA AGTCTCTGAA AGCCAGTGGC 2940 

TTTATGACTT TTAAAAAAAA TTACAAACCA AGAATTTTTT AAAGCAGAAG ATGCTATTTG 3000 

TGGGGGTTTT TCTCTCATTA TTTGGATGGA ATCTCTTTGG TCAAATGCAC ATTTACAGAG 3060 

AGACACTATA AACAAGTACA CAAATTTTTC AATTTTTACA TATTTTTAAA TTACTTATCT 3120 

TCTATCCAAG GAGGTCTACA GAGAAATTAA AGTCTGCCTT ATTTGTTACA TTTGGGTATA 3180 

ATGACAACAG CCAATTTATA GTGCAATAAA ATGTAATTAA TTCAAGTCCT TATTATAGAC 3240 

TATTTGAAGC ACAACCTAAT GGAAAATTGT AGAGACCTTG CTTTAACATT ATCTCCAGTT 3300 

AATTAAGTGT TCATGTGGTG CTTGGAAACT GTTGTTTTCC TGAACATCTA AAGTGTGTAG 3360 

ACTGCATTCT TGCTATTATT TTATTCTTGT AATGTGACCT TTTCACTGTG CAAAGGGAGA 3420 
TTTCTAGCCA GGCATTGACT ATTACAATTT CATT 

Seq ID NO: 617 Protein sequence 
Protein Accession #: NP_077740.1

1 11 21 31 41 51 

I I I I I I 

MEAARPSGSW NGALCRLLLL TLAILIFASD ACKNVTLHVP SKLDAEKLVG RVNLKECFTA 60 

ANLIHSSDPD FQILEDGSVY TTNTILLSSE KRSFTILLSN TENQEKKKIF VFLEHQTKVL 120 




KKRHTKEKVL RRAKRRWAPI PCSMLENSLG PFPLFLQQVQ SDTAQNYTIY YSIRGPGVDQ 180 

EPRNLFYVER DTGNLYCTRP VDREQYESFE IIAFATTPDG YTPELPLPLI IKIEDENDNY 240

PIFTEETYTF TIFENCRVGT TVGQVCATDK DEPDTMHTRL KYSIIGQVPP SPTLFSMHPT 300

TGVITTTSSQ LDRELIDKYQ LKIKVQDMDG QYFGLQTTST CIINIDDVND HLPTFTRTSY 360

VTSVEENTVD VEILRVTVED KDLVNTANWR ANYTILKGNE NGNFKIVTDA KTNEGVLCVV 420

KPLNYEEKQQ MILQIGVVNE APFSREASPR SAMSTATVTV NVEDQDEGPE CNPPIQTVRM 480

KENAEVGTTS NGYKAYDPET RSSSGIRYKK LTDPTGWVTI DENTGSIKVF RSLDREAETI 540

KNGIYNITVL ASDQGGRTCT GTLGIILQDV NDNSPFIPKK TVIICKPTMS SAEIVAVDPD 600

EPIHGPPFDF SLESSTSEVQ RMWRLKAIND TAARLSYQND PPFGSYVVPI TVRDRLGMSS 660

VTSLDVTLCD CITENDCTHR VDPRIGGGGV QLGKWAILAI LLGIALLFCI LFTLVCGASG 720 

TSKQPKVIPD DLAQQNLIVS NTEAPGDDKV YSANGFTTQT VGASAQGVCG TVGSGIKNGG 780 

QETIEMVKGG HQTSESCRGA GHHHTLDSCR GGHTEVDNCR YTYSEWHSFT QPRLGEKVYL 840 

CNQDENHKHA QDYVLTYNYE GRGSVAGSVG CCSERQEEDG LEFLDNLEPK FRTLAEACMK 900 
R 

Seq ID NO: 618 DNA sequence 

Nucleic Acid Accession #: NM_004949.1

Coding sequence: 202..2745

1 11 21 31 41 51 

I I I I I I 

CGCCAAAGGA AAAGCCCCTT GGATGAGAGG CAGGCGCTTC AGAGAAGCTA AGAAAAGCAC 60 

CTCTCCGCGC GCCCCACCTC CTCCGCCTCG CGCTCCTCCT GAGCAGCGGG CCCAGACTGC 120 

GCTCCGGCCG CGGCCCTCGC CCCGCGGAGC CCTCCTACCC CGGCCCGACG CTCGGCCCGC 180

GACCTGCCCC GAGCCCTCTC CATGGAGGCA GCCCGCCCCT CCGGCTCCTG GAACGGAGCC 240 

CTCTGCCGGC TGCTCCTGCT GACCCTCGCG ATCTTAATAT TTGCCAGTGA TGCCTGCAAA 300 

AATGTGACAT TACATGTTCC CTCCAAACTA GATGCCGAGA AACTTGTTGG TAGAGTTAAC 360

CTGAAAGAGT GCTTTACAGC TGCAAATCTA ATTCATTCAA GTGATCCTGA CTTCCAAATT 420 

TTGGAGGATG GTTCAGTCTA TACAACAAAT ACTATTCTAT TGTCCTCGGA GAAGAGAAGT 480 

TTTACCATAT TACTTTCCAA CACTGAGAAC CAAGAAAAGA AGAAAATATT TGTCTTTTTG 540 

GAGCATCAAA CAAAGGTCCT AAAGAAAAGA CATACTAAAG AAAAAGTTCT AAGGCGCGCC 600 

AAGAGAAGAT GGGCTCCAAT TCCTTGTTCG ATGCTAGAAA ACTCCTTGGG TCCTTTTCCA 660 

CTTTTCCTTC AACAGGTTCA ATCTGACACG GCCCAAAACT ATACCATATA CTATTCCATA 720 

AGAGGTCCTG GAGTTGACCA AGAACCTCGG AATTTATTTT ATGTGGAGAG AGACACTGGA 780 

AACTTGTATT GTACTCGTCC TGTAGATCGT GAGCAGTATG AATCTTTTGA GATAATTGCC 840 

TTTGCAACAA CTCCAGATGG GTATACTCCA GAACTTCCAC TGCCCCTAAT AATCAAAATA 900 

GAGGATGAAA ATGATAACTA CCCAATTTTT ACAGAAGAAA CTTATACTTT TACAATTTTT 960 

GAAAATTGCA GAGTGGGCAC TACTGTGGGA CAAGTGTGTG CTACTGACAA AGATGAGCCT 1020 

GACACGATGC ACACACGCCT GAAGTACTCC ATCATTGGGC AGGTGCCACC ATCACCCACC 1080 

CTATTTTCTA TGCATCCAAC TACAGGCGTG ATCACCACAA CATCATCTCA GCTAGACAGA 1140 

GAGTTAATTG ACAAGTACCA GTTGAAAATA AAAGTACAAG ACATGGATGG TCAGTATTTT 1200 

GGTCTACAGA CAACTTCAAC TTGTATCATT AACATTGATG ATGTAAATGA CCACTTGCCA 1260 

ACATTTACTC GTACTTCTTA TGTGACATCA GTGGAAGAAA ATACAGTTGA TGTGGAAATC 1320 

TTACGAGTTA CTGTTGAGGA TAAGGACTTA GTGAATACTG CTAACTGGAG AGCTAATTAT 1380 

ACCATTTTAA AGGGCAATGA AAATGGCAAT TTTAAAATTG TAACAGATGC CAAAACCAAT 1440 

GAAGGAGTTC TTTGTGTAGT TAAGCCTTTG AATTATGAAG AAAAGCAACA GATGATCTTG 1500 

CAAATTGGTG TAGTTAATGA AGCTCCATTT TCCAGAGAGG CTAGTCCAAG ATCAGCCATG 1560 

AGCACAGCAA CAGTTACTGT TAATGTAGAA GATCAGGATG AGGGCCCTGA GTGTAACCCT 1620 

CCAATACAGA CTGTTCGCAT GAAAGAAAAT GCAGAAGTGG GAACAACAAG CAATGGATAT 1680 

AAAGCATATG ACCCAGAAAC AAGAAGTAGC AGTGGCATAA GGTATAAGAA ATTAACTGAT 1740 

CCAACAGGGT GGGTCACCAT TGATGAAAAT ACAGGATCAA TCAAAGTTTT CAGAAGCCTG 1800 

GATAGAGAGG CAGAGACCAT CAAAAATGGC ATATATAATA TTACAGTCCT TGCATCAGAC 1860 

CAAGGAGGGA GAACATGTAC GGGGACACTG GGCATTATAC TTCAAGACGT GAATGATAAC 1920 

AGCCCATTCA TACCTAAAAA GACAGTGATC ATCTGCAAAC CCACCATGTC ATCTGCGGAG 1980 

ATTGTTGCGG TTGATCCTGA TGAGCCTATC CATGGCCCAC CCTTTGACTT TAGTCTGGAG 2040 

AGTTCTACTT CAGAAGTACA GAGAATGTGG AGACTGAAAG CAATTAATGA TACAGCAGCA 2100 

GGTCTTTCCT ATCAGAATGA TCCTCCATTT GGCTCATATG TAGTACCTAT AACAGTGAGA 2160 

GATAGACTTG GCATGTCTAG TGTCACTTCA TTGGATGTTA CACTGTGTGA CTGCATTACC 2220 

GAAAATGACT GCACACATCG TGTAGATCCA AGGATTGGCG GTGGAGGAGT ACAACTTGGA 2280 

AAGTGGGCCA TCCTTGCAAT ATTGTTGGGC ATAGCATTGC TCTTTTGCAT CCTGTTTACG 2340 

CTGGTCTGTG GGGCTTCTGG GACGTCTAAA CAACCAAAAG TAATTCCTGA TGATTTAGCC 2400 

CAGCAGAACC TAATTGTATC AAACACAGAA GCTCCTGGAG ATGACAAAGT GTATTCTGCG 2460 

AATGGCTTCA CAACCCAAAC TGTGGGCGCT TCTGCTCAGG GAGTTTGTGG CACCGTGGGA 2520 

TCAGGAATCA AAAACGGAGG TCAGGAGACC ATCGAAATGG TGAAAGGAGG ACACCAGACC 2580 

TCGGAATCCT GCCGGGGGGC TGGCCACCAT CACACCCTGG ACTCCTGCAG GGGAGGACAC 2640 

ACGGAGGTGG ACAACTGCAG ATACACTTAC TCGGAGTGGC ACAGTTTTAC TCAGCCCCGT 2700 

CTTGGTGAAG AATCCATTAG AGGACACACT CTGATTAAAA ATTAAACAAT GAAAGAAAGT 2760 

GTATCTGTGT AATCAAGATG AAAATCACAA GCATGCCCAA GACTATGTCC TGACATATAA 2820 

CTATGAAGGA AGAGGATCGG TGGCTGGGTC TGTAGGTTGT TGCAGTGAAC GACAAGAAGA 2880 

AGATGGGCTT GAATTTTTGG ATAATTTGGA GCCCAAATTT AGGACACTAG CAGAAGCATG 2940 

CATGAAGAGA TGAGTGTGTT CTAATAAGTC TCTGAAAGCC AGTGGCTTTA TGACTTTTAA 3000 

AAAAAATTAC AAACCAAGAA TTTTTTAAAG CAGAAGATGC TATTTGTGGG GGTTTTTCTC 3060 

TCATTATTTG GATGGAATCT CTTTGGTCAA ATGCACATTT ACAGAGAGAC ACTATAAACA 3120 

AGTACACAAA TTTTTCAATT TTTACATATT TTTAAATTAC TTATCTTCTA TCCAAGGAGG 3180 

TCTACAGAGA AATTAAAGTC TGCCTTATTT GTTACATTTG GGTATAATGA CAACAGCCAA 3240 

TTTATAGTGC AATAAAATGT AATTAATTCA AGTCCTTATT ATAGACTATT TGAAGCACAA 3300 

CCTAATGGAA AATTGTAGAG ACCTTGCTTT AACATTATCT CCAGTTAATT AAGTGTTCAT 3360 

GTGGTGCTTG GAAACTGTTG TTTTCCTGAA CATCTAAAGT GTGTAGACTG CATTCTTGCT 3420 

ATTATTTTAT TCTTGTAATG TGACCTTTTC ACTGTGCAAA GGGAGATTTC TAGCCAGGCA 3480 
TTGACTATTA CAATTTCATT 

Seq ID NO: 619 Protein sequence 
Protein Accession #: NP_004940.1

1 11 21 31 41 51 

I I I I I I 

MEAARPSGSW NGALCRLLLL TLAILIFASD ACKNVTLHVP SKLDAEKLVG RVNLKECPTA 60 
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WO 02/086443 PCT/US02/12476 

ANLIHSSDPD FQILEDGSVY TTNTILLSSE KRSPTILLSN TENQEKKKIP VFLEHQTKVL 120 

KKRHTKEKVL RRAKRRWAPI PCSMLENSLG PFPLFLQQVQ SDTAQNYTIY YSIRGPGVDQ 180 

EPRNLFYVER DTGNLYCTRP VDREQYESFB I I AFATTPDG YTPELPLPLI IKIEDENDNY 240 

PIFTEETYTF TIFENCRVGT TVGQVCATDK DEPDTMHTRL KYSIIGQVPP SPTLFSMHPT 300 

5 TGVITTTSSQ LDRELIDKYQ LKIKVQDMDG QYFGLQTTST CIINIDDVND HLPTFTRTSY 360 

VTSVEENTVD VEILRVTVED KDLVNTANWR ANYTILKGNE NGNFKIVTDA KTNEGVLCVV 420

KPLNYEEKQQ MILQIGVVNE APFSREASPR SAMSTATVTV NVEDQDEGPE CNPPIQTVRM 480

KENAEVGTTS NGYKAYDPET RSSSGIRYKK LTDPTGWVTI DENTGSIKVF RSLDREAETI 540 

KNGIYNITVL ASDQGGRTCT GTLGIILQDV NDNSPFIPKK TVIICKPTMS SAEIVAVDPD 600 

EPIHGPPFDF SLESSTSEVQ RMWRLKAIND TAARLSYQND PPFGSYVVPI TVRDRLGMSS 660

VTSLDVTLCD CITENDCTHR VDPRIGGGGV QLGKWAILAI LLGIALLFCI LFTLVCGASG 720 

TSKQPKVIPD DLAQQNLIVS NTEAPGDDKV YSANGFTTQT VGASAQGVCG TVGSGIKNGG 780 

QETIEMVKGG HQTSESCRGA GHHHTLDSCR GGHTEVDNCR YTYSEWHSFT QPRLGEESIR 840 
GHTLIKN 



15 



40 



60 



70 



Seq ID NO: 620 DNA sequence
Nucleic Acid Accession #: NM_032545.1
Coding sequence: 46..718

1 11 21 31 41 51

I I I I I I

AAACTGATCT TCAATGCACT AAGAGAAGGA GACTCTCAAA CCAAAAATGA CCTGGAGGCA 60 

CCATGTCAGG CTTCTGTTTA CGGTCAGTTT GGCATTACAG ATCATCAATT TGGGAAACAG 120 

CTATCAAAGA GAGAAACATA ACGGCGGTAG AGAGGAAGTC ACCAAGGTTG CCACTCAGAA 180 

GCACCGACAG TCACCGCTCA ACTGGACCTC CAGTCATTTC GGAGAGGTGA CTGGGAGCGC 240

CGAGGGCTGG GGGCCGGAGG AGCCGCTCCC CTACTCCCGG GCTTTCGGAG AGGGTGCGTC 300 

CGCGCGGCCG CGCTGCTGCA GGAACGGCGG TACCTGCGTG CTGGGCAGCT TCTGCGTGTG 360 

CCCGGCCCAC TTCACCGGCC GCTACTGCGA GCATGACCAG AGGCGCAGTG AATGCGGCGC 420 

CCTGGAGCAC GGAGCCTGGA CCCTCCGCGC CTGCCACCTC TGCAGGTGCA TCTTCGGGGC 480 

CCTGCACTGC CTCCCCCTCC AGACGCCTGA CCGCTGTGAC CCGAAAGACT TCCTGGCCTC 540

CCACGCTCAC GGGCCGAGCG CCGGGGGCGC GCCCAGCCTG CTACTCTTGC TGCCCTGCGC 600 

ACTCCTGCAC CGCCTCCTGC GCCCGGATGC GCCCGCGCAC CCTCGGTCCC TGGTCCCTTC 660 

CGTCCTCCAG CGGGAGCGGC GCCCCTGCGG AAGGCCGGGA CTTGGGCATC GCCTTTAATT 720 

TTCTATGTTG TAAATAATAG ATGTGTTTAG TTTACCGTAA GCTGAAGCAC TGGGTGAATA 780 

TTTTTATTGG GTAATAAATA TTTTCATGAA AGCGCCAAAA AAAAAAAAAA AAAAAAAAAA 840
AAAAAA 



Seq ID NO: 621 Protein sequence 
Protein Accession #: NP_115934.1



1 11 21 31 41 51 

I I I I I I 

MTWRHHVRLL FTVSLALQII NLGNSYQREK HNGGREEVTK VATQKHRQSP LNWTSSHFGE 60 
VTGSAEGWGP EEPLPYSRAF GEGASARPRC CRNGGTCVLG SFCVCPAHFT GRYCEHDQRR 120 
SECGALEHGA WTLRACHLCR CIFGALHCLP LQTPDRCDPK DFLASHAHGP SAGGAPSLLL 180
LLPCALLHRL LRPDAPAHPR SLVPSVLQRE RRPCGRPGLG HRL

Seq ID NO: 622 DNA sequence 
Nucleic Acid Accession #: FGENESH predicted 
Coding sequence: 1..390

1 11 21 31 41 51 

I I I I I I 

ATGAGGTTCA GTGTCTCAGG CATGAGGACC GACTACCCCA GGAGTGTGCT GGCTCCTGCT 60 

TATGTGTCAG TCTGTCTCCT CCTCTTGTGT CCAAGGGAAG TCATCGCTCC CGCTGGCTCA 120

GAACCATGGC TGTGCCAGCC GGCACCCAGG TGTGGAGACA AGATCTACAA CCCCTTGGAG 180 

CAGTGCTGTT ACAATGACGC CATCGTGTCC CTGAGCGAGA CCCGCCAATG TGGTCCCCCC 240 

TGCACCTTCT GGCCCTGCTT TGAGCTCTGC TGTCTTGATT CCTTTGGCCT CACAAACGAT 300 

TTTGTTGTGA AGCTGAAGGT TCAGGGTGTG AATTCCCAGT GCCACTCATC TCCCATCTCC 360
AGTAAATGTG AAAGAGGCCG GATATGTTAG 

Seq ID NO: 623 Protein sequence 
Protein Accession #: FGENESH predicted 

1 11 21 31 41 51

I I I I I I

MRFSVSGMRT DYPRSVLAPA YVSVCLLLLC PREVIAPAGS EPWLCQPAPR CGDKIYNPLE 60
QCCYNDAIVS LSETRQCGPP CTFWPCFELC CLDSFGLTND FVVKLKVQGV NSQCHSSPIS 120
SKCERGRIC 



Seq ID NO: 624 DNA sequence 
Nucleic Acid Accession #: M18728.1 
Coding sequence: 51.. 1085 



1 11 21 31 41 51

I I I I I I 

GGAGCTCAAG CTCCTCTACA AAGAGGTGGA CAGAGAAGAC AGCAGAGACC ATGGGACCCC 60 

CCTCAGCCCC TCCCTGCAGA TTGCATGTCC CCTGGAAGGA GGTCCTGCTC ACAGCCTCAC 120 

TTCTAACCTT CTGGAACCCA CCCACCACTG CCAAGCTCAC TATTGAATCC ACGCCATTCA 180 

ATGTCGCAGA GGGGAAGGAG GTTCTTCTAC TCGCCCACAA CCTGCCCCAG AATCGTATTG 240

GTTACAGCTG GTACAAAGGC GAAAGAGTGG ATGGCAACAG TCTAATTGTA GGATATGTAA 300 

TAGGAACTCA ACAAGCTACC CCAGGGCCCG CATACAGTGG TCGAGAGACA ATATACCCCA 360 

ATGCATCCCT GCTGATCCAG AACGTCACCC AGAATGACAC AGGATTCTAT ACCCTACAAG 420 

TCATAAAGTC AGATCTTGTG AATGAAGAAG CAACCGGACA GTTCCATGTA TACCCGGAGC 480 

TGCCCAAGCC CTCCATCTCC AGCAACAACT CCAACCCCGT GGAGGACAAG GATGCTGTGG 540

CCTTCACCTG TGAACCTGAG GTTCAGAACA CAACCTACCT GTGGTGGGTA AATGGTCAGA 600 

GCCTCCCGGT CAGTCCCAGG CTGCAGCTGT CCAATGGCAA CATGACCCTC ACTCTACTCA 660 



GCGTCAAAAG GAACGATGCA GGATCCTATG AATGTGAAAT ACAGAACCCA GCGAGTGCCA 720 

ACCGCAGTGA CCCAGTCACC CTGAATGTCC TCTATGGCCC AGATGTCCCC ACCATTTCCC 780 

CCTCAAAGGC CAATTACCGT CCAGGGGAAA ATCTGAACCT CTCCTGCCAC GCAGCCTCTA 840 

ACCCACCTGC ACAGTACTCT TGGTTTATCA ATGGGACGTT CCAGCAATCC ACACAAGAGC 900 

TCTTTATCCC CAACATCACT GTGAATAATA GCGGATCCTA TATGTGCCAA GCCCATAACT 960 

CAGCCACTGG CCTCAATAGG ACCACAGTCA CGATGATCAC AGTCTCTGGA AGTGCTCCTG 1020 

TCCTCTCAGC TGTGGCCACC GTCGGCATCA CGATTGGAGT GCTGGCCAGG GTGGCTCTGA 1080 

TATAGCAGCC CTGGTGTATT TTCGATATTT CAGGAAGACT GGCAGATTGG ACCAGACCCT 1140

GAATTCTTCT AGCTCCTCCA ATCCCATTTT ATCCCATGGA ACCACTAAAA ACAAGGTCTG 1200 

CTCTGCTCCT GAAGCCCTAT ATGCTGGAGA TGGACAACTC AATGAAAATT TAAAGGGAAA 1260 

ACCCTCAGGC CTGAGGTGTG TGCCACTCAG AGACTTCACC TAACTAGAGA CAGTCAAACT 1320 

GCAAACCATG GTGAGAAATT GACGACTTCA CACTATGGAC AGCTTTTCCC AAGATGTCAA 1380 

AACAAGACTC CTCATCATGA TAAGGCTCTT ACCCCCTTTT AATTTGTCCT TGCTTATGCC 1440 

TGCCTCTTTC GCTTGGCAGG ATGATGCTGT CATTAGTATT TCACAAGAAG TAGCTTCAGA 1500 

GGGTAACTTA ACAGAGTGTC AGATCTATCT TGTCAATCCC AACGTTTTAC ATAAAATAAG 1560 

AGATCCTTTA GTGCACCCAG TGACTGACAT TAGCAGCATC TTTAACACAG CCGTGTGTTC 1620 

AAATGTACAG TGGTCCTTTT CAGAGTTGGA CTTCTAGACT CACCTGTTCT CACTCCCTGT 1680 

TTTAATTCAA CCCAGCCATG CAATGCCAAA TAATAGAATT GCTCCCTACC AGCTGAACAG 1740 

GGAGGAGTCT GTGCAGTTTC TGACACTTGT TGTTGAACAT GGCTAAATAC AATGGGTATC 1800 

GCTGAGACTA AGTTGTAGAA ATTAACAAAT GTGCTGCTTG GTTAAAATGG CTACACTCAT 1860 

CTGACTCATT CTTTATTCTA TTTTAGTTGG TTTGTATCTT GCCTAAGGTG CGTAGTCCAA 1920 

CTCTTGGTAT TACCCTCCTA ATAGTCATAC TAGTAGTCAT ACTCCCTGGT GTAGTGTATT 1980 

CTCTAAAAGC TTTAAATGTC TGCATGCAGC CAGCCATCAA ATAGTGAATG GTCTCTCTTT 2040 

GGCTGGAATT ACAAAACTCA GAGAAATGTG TCATCAGGAG AACATCATAA CCCATGAAGG 2100 

ATAAAAGCCC CAAATGGTGG TAACTGATAA TAGCACTAAT GCTTTAAGAT TTGGTCACAC 2160 

TCTCACCTAG GTGAGCGCAT TGAGCCAGTG GTGCTAAATG CTACATACTC CAACTGAAAT 2220 

GTTAAGGAAG AAGATAGATC CAATTAAAAA AAATTAAAAC CAATTTAAAA AAAAAAAAGA 2280 

ACACAGGAGA TTCCAGTCTA CTTGAGTTAG CATAATACAG AAGTCCCCTC TACTTTAACT 2340 

TTTACAAAAA AGTAACCTGA ACTAATCTGA TGTTAACCAA TGTATTTATT TCTGTGGTTC 2400 

TGTTTCCTTG TTCCAATTTG ACAAAACCCA CTGTTCTTGT ATTGTATTGC CCAGGGGGAG 2460 

CTATCACTGT ACTTGTAGAG TGGTGCTGCT TTAATTCATA AATCACAAAT AAAAGCCAAT 2520 
TAGCTCTATA ACT 

Seq ID NO: 625 Protein sequence 
Protein Accession #: AAA59907.1

1 11 21 31 41 51

I I I I I I

MGPPSAPPCR LHVPWKEVLL TASLLTFWNP PTTAKLTIES TPFNVAEGKE VLLLAHNLPQ 60 

NRIGYSWYKG ERVDGNSLIV GYVIGTQQAT PGPAYSGRET IYPNASLLIQ NVTQNDTGFY 120 

TLQVIKSDLV NEEATGQFHV YPELPKPSIS SNNSNPVEDK DAVAFTCEPE VQNTTYLWWV 180

NGQSLPVSPR LQLSNGNMTL TLLSVKRNDA GSYECEIQNP ASANRSDPVT LNVLYGPDVP 240 

TISPSKANYR PGENLNLSCH AASNPPAQYS WFINGTFQQS TQELFIPNIT VNNSGSYMCQ 300 
AHNSATGLNR TTVTMITVSG SAPVLSAVAT VGITIGVLAR VALI 

Seq ID NO: 626 DNA sequence 
Nucleic Acid Accession #: M18728.1
Coding sequence: 1355.. 1657 

1 11 21 31 41 51 

I I I I I I 

GGAGCTCAAG CTCCTCTACA AAGAGGTGGA CAGAGAAGAC AGCAGAGACC ATGGGACCCC 60 

CCTCAGCCCC TCCCTGCAGA TTGCATGTCC CCTGGAAGGA GGTCCTGCTC ACAGCCTCAC 120 

TTCTAACCTT CTGGAACCCA CCCACCACTG CCAAGCTCAC TATTGAATCC ACGCCATTCA 180 

ATGTCGCAGA GGGGAAGGAG GTTCTTCTAC TCGCCCACAA CCTGCCCCAG AATCGTATTG 240 

GTTACAGCTG GTACAAAGGC GAAAGAGTGG ATGGCAACAG TCTAATTGTA GGATATGTAA 300 

TAGGAACTCA ACAAGCTACC CCAGGGCCCG CATACAGTGG TCGAGAGACA ATATACCCCA 360 

ATGCATCCCT GCTGATCCAG AACGTCACCC AGAATGACAC AGGATTCTAT ACCCTACAAG 420 

TCATAAAGTC AGATCTTGTG AATGAAGAAG CAACCGGACA GTTCCATGTA TACCCGGAGC 480 

TGCCCAAGCC CTCCATCTCC AGCAACAACT CCAACCCCGT GGAGGACAAG GATGCTGTGG 540 

CCTTCACCTG TGAACCTGAG GTTCAGAACA CAACCTACCT GTGGTGGGTA AATGGTCAGA 600 

GCCTCCCGGT CAGTCCCAGG CTGCAGCTGT CCAATGGCAA CATGACCCTC ACTCTACTCA 660 

GCGTCAAAAG GAACGATGCA GGATCCTATG AATGTGAAAT ACAGAACCCA GCGAGTGCCA 720 

ACCGCAGTGA CCCAGTCACC CTGAATGTCC TCTATGGCCC AGATGTCCCC ACCATTTCCC 780 

CCTCAAAGGC CAATTACCGT CCAGGGGAAA ATCTGAACCT CTCCTGCCAC GCAGCCTCTA 840 

ACCCACCTGC ACAGTACTCT TGGTTTATCA ATGGGACGTT CCAGCAATCC ACACAAGAGC 900 

TCTTTATCCC CAACATCACT GTGAATAATA GCGGATCCTA TATGTGCCAA GCCCATAACT 960 

CAGCCACTGG CCTCAATAGG ACCACAGTCA CGATGATCAC AGTCTCTGGA AGTGCTCCTG 1020 

TCCTCTCAGC TGTGGCCACC GTCGGCATCA CGATTGGAGT GCTGGCCAGG GTGGCTCTGA 1080 

TATAGCAGCC CTGGTGTATT TTCGATATTT CAGGAAGACT GGCAGATTGG ACCAGACCCT 1140 

GAATTCTTCT AGCTCCTCCA ATCCCATTTT ATCCCATGGA ACCACTAAAA ACAAGGTCTG 1200 

CTCTGCTCCT GAAGCCCTAT ATGCTGGAGA TGGACAACTC AATGAAAATT TAAAGGGAAA 1260 

ACCCTCAGGC CTGAGGTGTG TGCCACTCAG AGACTTCACC TAACTAGAGA CAGTCAAACT 1320 

GCAAACCATG GTGAGAAATT GACGACTTCA CACTATGGAC AGCTTTTCCC AAGATGTCAA 1380 

AACAAGACTC CTCATCATGA TAAGGCTCTT ACCCCCTTTT AATTTGTCCT TGCTTATGCC 1440 

TGCCTCTTTC GCTTGGCAGG ATGATGCTGT CATTAGTATT TCACAAGAAG TAGCTTCAGA 1500 

GGGTAACTTA ACAGAGTGTC AGATCTATCT TGTCAATCCC AACGTTTTAC ATAAAATAAG 1560 

AGATCCTTTA GTGCACCCAG TGACTGACAT TAGCAGCATC TTTAACACAG CCGTGTGTTC 1620 

AAATGTACAG TGGTCCTTTT CAGAGTTGGA CTTCTAGACT CACCTGTTCT CACTCCCTGT 1680 

TTTAATTCAA CCCAGCCATG CAATGCCAAA TAATAGAATT GCTCCCTACC AGCTGAACAG 1740 

GGAGGAGTCT GTGCAGTTTC TGACACTTGT TGTTGAACAT GGCTAAATAC AATGGGTATC 1800 

GCTGAGACTA AGTTGTAGAA ATTAACAAAT GTGCTGCTTG GTTAAAATGG CTACACTCAT 1860 

CTGACTCATT CTTTATTCTA TTTTAGTTGG TTTGTATCTT GCCTAAGGTG CGTAGTCCAA 1920 

CTCTTGGTAT TACCCTCCTA ATAGTCATAC TAGTAGTCAT ACTCCCTGGT GTAGTGTATT 1980 

CTCTAAAAGC TTTAAATGTC TGCATGCAGC CAGCCATCAA ATAGTGAATG GTCTCTCTTT 2040

GGCTGGAATT ACAAAACTCA GAGAAATGTG TCATCAGGAG AACATCATAA CCCATGAAGG 2100 

ATAAAAGCCC CAAATGGTGG TAACTGATAA TAGCACTAAT GCTTTAAGAT TTGGTCACAC 2160

TCTCACCTAG GTGAGCGCAT TGAGCCAGTG GTGCTAAATG CTACATACTC CAACTGAAAT 2220

GTTAAGGAAG AAGATAGATC CAATTAAAAA AAATTAAAAC CAATTTAAAA AAAAAAAAGA 2280

ACACAGGAGA TTCCAGTCTA CTTGAGTTAG CATAATACAG AAGTCCCCTC TACTTTAACT 2340

TTTACAAAAA AGTAACCTGA ACTAATCTGA TGTTAACCAA TGTATTTATT TCTGTGGTTC 2400

TGTTTCCTTG TTCCAATTTG ACAAAACCCA CTGTTCTTGT ATTGTATTGC CCAGGGGGAG 2460

CTATCACTGT ACTTGTAGAG TGGTGCTGCT TTAATTCATA AATCACAAAT AAAAGCCAAT 2520
TAGCTCTATA ACT

Seq ID NO: 627 Protein sequence
Protein Accession #: AAA59908.1

1 11 21 31 41 51

I I I I I I

MDSFSQDVKT RLLIMIRLLP PFNLSLLMPA SFAWQDDAVI SISQEVASEG NLTECQIYLV 60
NPNVLHKIRD PLVHPVTDIS SIFNTAVCSN VQWSFSELDF

Seq ID NO: 628 DNA sequence
Nucleic Acid Accession #: M18728.1
Coding sequence: 2370..2501

1 11 21 31 41 51

I I I I I I

GGAGCTCAAG CTCCTCTACA AAGAGGTGGA CAGAGAAGAC AGCAGAGACC ATGGGACCCC 60

CCTCAGCCCC TCCCTGCAGA TTGCATGTCC CCTGGAAGGA GGTCCTGCTC ACAGCCTCAC 120

TTCTAACCTT CTGGAACCCA CCCACCACTG CCAAGCTCAC TATTGAATCC ACGCCATTCA 180

ATGTCGCAGA GGGGAAGGAG GTTCTTCTAC TCGCCCACAA CCTGCCCCAG AATCGTATTG 240

GTTACAGCTG GTACAAAGGC GAAAGAGTGG ATGGCAACAG TCTAATTGTA GGATATGTAA 300

TAGGAACTCA ACAAGCTACC CCAGGGCCCG CATACAGTGG TCGAGAGACA ATATACCCCA 360

ATGCATCCCT GCTGATCCAG AACGTCACCC AGAATGACAC AGGATTCTAT ACCCTACAAG 420

TCATAAAGTC AGATCTTGTG AATGAAGAAG CAACCGGACA GTTCCATGTA TACCCGGAGC 480

TGCCCAAGCC CTCCATCTCC AGCAACAACT CCAACCCCGT GGAGGACAAG GATGCTGTGG 540

CCTTCACCTG TGAACCTGAG GTTCAGAACA CAACCTACCT GTGGTGGGTA AATGGTCAGA 600

GCCTCCCGGT CAGTCCCAGG CTGCAGCTGT CCAATGGCAA CATGACCCTC ACTCTACTCA 660

GCGTCAAAAG GAACGATGCA GGATCCTATG AATGTGAAAT ACAGAACCCA GCGAGTGCCA 720

ACCGCAGTGA CCCAGTCACC CTGAATGTCC TCTATGGCCC AGATGTCCCC ACCATTTCCC 780

CCTCAAAGGC CAATTACCGT CCAGGGGAAA ATCTGAACCT CTCCTGCCAC GCAGCCTCTA 840

ACCCACCTGC ACAGTACTCT TGGTTTATCA ATGGGACGTT CCAGCAATCC ACACAAGAGC 900

TCTTTATCCC CAACATCACT GTGAATAATA GCGGATCCTA TATGTGCCAA GCCCATAACT 960

CAGCCACTGG CCTCAATAGG ACCACAGTCA CGATGATCAC AGTCTCTGGA AGTGCTCCTG 1020

TCCTCTCAGC TGTGGCCACC GTCGGCATCA CGATTGGAGT GCTGGCCAGG GTGGCTCTGA 1080

TATAGCAGCC CTGGTGTATT TTCGATATTT CAGGAAGACT GGCAGATTGG ACCAGACCCT 1140

GAATTCTTCT AGCTCCTCCA ATCCCATTTT ATCCCATGGA ACCACTAAAA ACAAGGTCTG 1200

CTCTGCTCCT GAAGCCCTAT ATGCTGGAGA TGGACAACTC AATGAAAATT TAAAGGGAAA 1260

ACCCTCAGGC CTGAGGTGTG TGCCACTCAG AGACTTCACC TAACTAGAGA CAGTCAAACT 1320

GCAAACCATG GTGAGAAATT GACGACTTCA CACTATGGAC AGCTTTTCCC AAGATGTCAA 1380

AACAAGACTC CTCATCATGA TAAGGCTCTT ACCCCCTTTT AATTTGTCCT TGCTTATGCC 1440

TGCCTCTTTC GCTTGGCAGG ATGATGCTGT CATTAGTATT TCACAAGAAG TAGCTTCAGA 1500

GGGTAACTTA ACAGAGTGTC AGATCTATCT TGTCAATCCC AACGTTTTAC ATAAAATAAG 1560

AGATCCTTTA GTGCACCCAG TGACTGACAT TAGCAGCATC TTTAACACAG CCGTGTGTTC 1620

AAATGTACAG TGGTCCTTTT CAGAGTTGGA CTTCTAGACT CACCTGTTCT CACTCCCTGT 1680

TTTAATTCAA CCCAGCCATG CAATGCCAAA TAATAGAATT GCTCCCTACC AGCTGAACAG 1740

GGAGGAGTCT GTGCAGTTTC TGACACTTGT TGTTGAACAT GGCTAAATAC AATGGGTATC 1800

GCTGAGACTA AGTTGTAGAA ATTAACAAAT GTGCTGCTTG GTTAAAATGG CTACACTCAT 1860

CTGACTCATT CTTTATTCTA TTTTAGTTGG TTTGTATCTT GCCTAAGGTG CGTAGTCCAA 1920

CTCTTGGTAT TACCCTCCTA ATAGTCATAC TAGTAGTCAT ACTCCCTGGT GTAGTGTATT 1980

CTCTAAAAGC TTTAAATGTC TGCATGCAGC CAGCCATCAA ATAGTGAATG GTCTCTCTTT 2040

GGCTGGAATT ACAAAACTCA GAGAAATGTG TCATCAGGAG AACATCATAA CCCATGAAGG 2100

ATAAAAGCCC CAAATGGTGG TAACTGATAA TAGCACTAAT GCTTTAAGAT TTGGTCACAC 2160

TCTCACCTAG GTGAGCGCAT TGAGCCAGTG GTGCTAAATG CTACATACTC CAACTGAAAT 2220

GTTAAGGAAG AAGATAGATC CAATTAAAAA AAATTAAAAC CAATTTAAAA AAAAAAAAGA 2280

ACACAGGAGA TTCCAGTCTA CTTGAGTTAG CATAATACAG AAGTCCCCTC TACTTTAACT 2340

TTTACAAAAA AGTAACCTGA ACTAATCTGA TGTTAACCAA TGTATTTATT TCTGTGGTTC 2400

TGTTTCCTTG TTCCAATTTG ACAAAACCCA CTGTTCTTGT ATTGTATTGC CCAGGGGGAG 2460

CTATCACTGT ACTTGTAGAG TGGTGCTGCT TTAATTCATA AATCACAAAT AAAAGCCAAT 2520
TAGCTCTATA ACT


Seq ID NO: 629 Protein sequence
Protein Accession #: AAA59909.1

1 11 21 31 41 51

I I I I I I

MLTNVFISVV LFPCSNLTKP TVLVLYCPGG AITVLVEWCC FNS



Seq ID NO: 630 DNA sequence
Nucleic Acid Accession #: NM_016639.1
Coding sequence: 40..429

1 11 21 31 41 51

I I I I I I

GCGGCGGGCG CAGACAGCGG CGGGCGCAGG ACGTGCACTA TGGCTCGGGG CTCGCTGCGC 60

CGGTTGCTGC GGCTCCTCGT GCTGGGGCTC TGGCTGGCGT TGCTGCGCTC CGTGGCCGGG 120

GAGCAAGCGC CAGGCACCGC CCCCTGCTCC CGCGGCAGCT CCTGGAGCGC GGACCTGGAC 180

AAGTGCATGG ACTGCGCGTC TTGCAGGGCG CGACCGCACA GCGACTTCTG CCTGGGCTGC 240

GCTGCAGCAC CTCCTGCCCC CTTCCGGCTG CTTTGGCCCA TCCTTGGGGG CGCTCTGAGC 300

CTGACCTTCG TGCTGGGGCT GCTTTCTGGC TTTTTGGTCT GGAGACGATG CCGCAGGAGA 360

GAGAAGTTCA CCACCCCCAT AGAGGAGACC GGCGGAGAGG GCTGCCCAGC TGTGGCGCTG 420

ATCCAGTGAC AATGTGCCCC CTGCCAGCCG GGGCTCGCCC ACTCATCATT CATTCATCCA 480

TTCTAGAGCC AGTCTCTGCC TCCCAGACGC GGCGGGAGCC AAGCTCCTCC AACCACAAGG 540 

GGGGTGGGGG GCGGTGAATC ACCTCTGAGG CCTGGGCCCA GGGTTCAGGG GAACCTTCCA 600 

AGGTGTCTGG TTGCCCTGCC TCTGGCTCCA GAACAGAAAG GGAGCCTCAC GCTGGCTCAC 660 

ACAAAACAGC TGACACTGAC TAAGGAACTG CAGCATTTGC ACAGGGGAGG GGGGTGCCCT 720 

CCTTCCTTAG GACCTGGGGG CCAGGCTGAC TTGGGGGGCA GACTTGACAC TAGGCCCCAC 780 

TCACTCAGAT GTCCTGAAAT TCCACCACGG GGGTCACCCT GGGGGGTTAG GGACCTATTT 840 

TTAACACTAG GGGCTGGCCC ACTAGGAGGG CTGGCCCTAA GATACAGACC CCCCCAACTC 900 

CCCAAAGCGG GGAGGAGATA TTTATTTTGG GGAGAGTTTG GAGGGGAGGG AGAATTTATT 960 
AATAAAAGAA TCTTTAACTT TAAAAAAAAA AAAAAAAA 

Seq ID NO: 631 Protein sequence 
Protein Accession #: NP_057723.1

1 11 21 31 41 51

I I I I I I

MARGSLRRLL RLLVLGLWLA LLRSVAGEQA PGTAPCSRGS SWSADLDKCM DCASCRARPH 60

SDFCLGCAAA PPAPFRLLWP ILGGALSLTF VLGLLSGFLV WRRCRRREKF TTPIEETGGE 120 
GCPAVALIQ 

Seq ID NO: 632 DNA sequence 

Nucleic Acid Accession #: NM_003816.1

Coding sequence: 79..2538

1 11 21 31 41 51

I I I I I I

CGGCAGGGTT GGAAAATGAT GGAAGAGGCG GAGGTGGAGG CGACCGAGTG CTGAGAGGAA 60 

CCTGCGGAAT CGGCCGAGAT GGGGTCTGGC GCGCGCTTTC CCTCGGGGAC CCTTCGTGTC 120 

CGGTGGTTGC TGTTGCTTGG CCTGGTGGGC CCAGTCCTCG GTGCGGCGCG GCCAGGCTTT 180 

CAACAGACCT CACATCTTTC TTCTTATGAA ATTATAACTC CTTGGAGATT AACTAGAGAA 240 

AGAAGAGAAG CCCCTAGGCC CTATTCAAAA CAAGTATCTT ATGTTATTCA GGCTGAAGGA 300 

AAAGAGCATA TTATTCACTT GGAAAGGAAC AAAGACCTTT TGCCTGAAGA TTTTGTGGTT 360 

TATACTTACA ACAAGGAAGG GACTTTAATC ACTGACCATC CCAATATACA GAATCATTGT 420 

CATTATCGGG GCTATGTGGA GGGAGTTCAT AATTCATCCA TTGCTCTTAG CGACTGTTTT 480 

GGACTCAGAG GATTGCTGCA TTTAGAGAAT GCGAGTTATG GGATTGAACC CCTGCAGAAC 540 

AGCTCTCATT TTGAGCACAT CATTTATCGA ATGGATGATG TCTACAAAGA GCCTCTGAAA 600 

TGTGGAGTTT CCAACAAGGA TATAGAGAAA GAAACTGCAA AGGATGAAGA GGAAGAGCCT 660 

CCCAGCATGA CTCAGCTACT TCGAAGAAGA AGAGCTGTCT TGCCACAGAC CCGGTATGTG 720 

GAGCTGTTCA TTGTCGTAGA CAAGGAAAGG TATGACATGA TGGGAAGAAA TCAGACTGCT 780 

GTGAGAGAAG AGATGATTCT CCTGGCAAAC TACTTGGATA GTATGTATAT TATGTTAAAT 840 

ATTCGAATTG TGCTAGTTGG ACTGGAGATT TGGACCAATG GAAACCTGAT CAACATAGTT 900 

GGGGGTGCTG GTGATGTGCT GGGGAACTTC GTGCAGTGGC GGGAAAAGTT TCTTATCACA 960 

CGTCGGAGAC ATGACAGTGC ACAGCTAGTT CTAAAGAAAG GTTTTGGTGG AACTGCAGGA 1020 

ATGGCATTTG TGGGAACAGT GTGTTCAAGG AGCCACGCAG GCGGGATTAA TGTGTTTGGA 1080 

CAAATCACTG TGGAGACATT TGCTTCCATT GTTGCTCATG AATTGGGTCA TAATCTTGGA 1140 

ATGAATCACG ATGATGGGAG AGATTGTTCC TGTGGAGCAA AGAGCTGCAT CATGAATTCA 1200 

GGAGCATCGG GTTCCAGAAA CTTTAGCAGT TGCAGTGCAG AGGACTTTGA GAAGTTAACT 1260 

TTAAATAAAG GAGGAAACTG CCTTCTTAAT ATTCCAAAGC CTGATGAAGC CTATAGTGCT 1320 

CCCTCCTGTG GTAATAAGTT GGTGGACGCT GGGGAAGAGT GTGACTGTGG TACTCCAAAG 1380 

GAATGTGAAT TGGACCCTTG CTGCGAAGGA AGTACCTGTA AGCTTAAATC ATTTGCTGAG 1440 

TGTGCATATG GTGACTGTTG TAAAGACTGT CGGTTCCTTC CAGGAGGTAC TTTATGCCGA 1500

GGAAAAACCA GTGAGTGTGA TGTTCCAGAG TACTGCAATG GTTCTTCTCA GTTCTGTCAG 1560 

CCAGATGTTT TTATTCAGAA TGGATATCCT TGCCAGAATA ACAAAGCCTA TTGCTACAAC 1620 

GGCATGTGCC AGTATTATGA TGCTCAATGT CAAGTCATCT TTGGCTCAAA AGCCAAGGCT 1680 

GCCCCCAAAG ATTGTTTCAT TGAAGTGAAT TCTAAAGGTG ACAGATTTGG CAATTGTGGT 1740 

TTCTCTGGCA ATGAATACAA GAAGTGTGCC ACTGGGAATG CTTTGTGTGG AAAGCTTCAG 1800 

TGTGAGAATG TACAAGAGAT ACCTGTATTT GGAATTGTGC CTGCTATTAT TCAAACGCCT 1860 

AGTCGAGGCA CCAAATGTTG GGGTGTGGAT TTCCAGCTAG GATCAGATGT TCCAGATCCT 1920 

GGGATGGTTA ACGAAGGCAC AAAATGTGGT GCTGGAAAGA TCTGTAGAAA CTTCCAGTGT 1980 

GTAGATGCTT CTGTTCTGAA TTATGACTGT GATGTTCAGA AAAAGTGTCA TGGACATGGG 2040 

GTATGTAATA GCAATAAGAA TTGTCACTGT GAAAATGGCT GGGCTCCCCC AAATTGTGAG 2100

ACTAAAGGAT ACGGAGGAAG TGTGGACAGT GGACCTACAT ACAATGAAAT GAATACTGCA 2160 

TTGAGGGACG GACTTCTGGT CTTCTTCTTC CTAATTGTTC CCCTTATTGT CTGTGCTATT 2220 

TTTATCTTCA TCAAGAGGGA TCAACTGTGG AGAAGCTACT TCAGAAAGAA GAGATCACAA 2280 

ACATATGAGT CAGATGGCAA AAATCAAGCA AACCCTTCTA GACAGCCGGG GAGTGTTCCT 2340 

CGACATGTTT CTCCAGTGAC ACCTCCCAGA GAAGTTCCTA TATATGCAAA CAGATTTGCA 2400 

GTACCAACCT ATGCAGCCAA GCAACCTCAG CAGTTCCCAT CAAGGCCACC TCCACCACAA 2460 

CCGAAAGTAT CATCTCAGGG AAACTTAATT CCTGCCCGTC CTGCTCCTGC ACCTCCTTTA 2520 

TATAGTTCCC TCACTTGATT TTTTTAACCT TCTTTTTGCA AATGTCTTCA GGGAACTGAG 2580 

CTAATACTTT TTTTTTTTCT TGATGTTTTC TTGAAAAGCC TTTCTGTTGC AACTATGAAT 2640 

GAAAACAAAA CACCACAAAA CAGACTTCAC TAACACAGAA AAACAGAAAC TGAGTGTGAG 2700 

AGTTGTGAAA TACAAGGAAA TGCAGTAAAG CCAGGGAATT TACAATAACA TTTCCGTTTC 2760 

CATCATTGAA TAAGTCTTAT TCAGTCATCG GTGAGGTTAA TGCACTAATC ATGGATTTTT 2820 

TGAACATGTT ATTGCAGTGA TTCTCAAATT AACTGTATTG GTGTAAGATT TTTGTCATTA 2880 

AGTGTTTAAG TGTTATTCTG AATTTTCTAC CTTAGTTATC ATTAATGTAG TTCCTCATTG 2940 

AACATGTGAT AATCTAATAC CTGTGAAAAC TGACTAATCA GCTGCCAATA ATATCTAATA 3000 

TTTTTCATCA TGCACGAATT AATAATCATC ATACTCTAGA ATCTTGTCTG TCACTCACTA 3060 

CATGAATAAG CAAATATTGT CTTCAAAAGA ATGCACAAGA ACCACAATTA AGATGTCATA 3120 

TTATTTTGAA AGTACAAAAT ATACTAAAAG AGTGTGTGTG TATTCACGCA GTTACTCGCT 3180 

TCCATTTTTA TGACCTTTCA ACTATAGGTA ATAACTCTTA GAGAAATTAA TTTAATATTA 3240 

GAATTTCTAT TATGAATCAT GTGAAAGCAT GACATTCGTT CACAATAGCA CTATTTTAAA 3300 

TAAATTATAA GCTTTAAGGT ACGAAGTATT TAATAGATCT AATCAAATAT GTTGATTCAT 3360 

GGCTATAATA AAGCAGGAGC AATTATAAAA TCTTCAATCA ATTGAACTTT TACAAAACCA 3420 

CTTGAGAATT TCATGAGCAC TTTAAAATCT GAACTTTCAA AGCTTGCTAT TAAATCATTT 3480 

AGAATGTTTA CATTTACTAA GGTGTGCTGG GTCATGTAAA ATATTAGACA CTAATATTTT 3540 

CATAGAAATT AGGCTGGAGA AAGAAGGAAG AAATGGTTTT CTTAAATACC TACAAAAAAG 3600 

TTACTGTGGT ATCTATGAGT TATCATCTTA GCTGTGTTAA AAATGAATTT TTACTATGGC 3660

AGATATGGTA TGGATCGTAA AATTTTAAGC ACTAAAAATT TTTTCATAAC CTTTCATAAT 3720

AAAGTTTAAT AATAGGTTTA TTAACTGAAT TTCATTAGTT TTTTAAAAGT GTTTTTGGTT 3780

TGTGTATATA TACATATACA AATACAACAT TTACAATAAA TAAAATACTT GAAATTCTCA 3840
AAAAAAAAAA AAAAAAAAAA AAAAA



Seq ID NO: 633 Protein sequence
Protein Accession #: NP_003607.1

1 11 21 31 41 51

I I I I I I

MGSGARFPSG TLRVRWLLLL GLVGPVLGAA RPGFQQTSHL SSYEIITPWR LTRERREAPR 60

PYSKQVSYVI QAEGKEHIIH LERNKDLLPE DFVVYTYNKE GTLITDHPNI QNHCHYRGYV 120

EGVHNSSIAL SDCFGLRGLL HLENASYGIE PLQNSSHFEH IIYRMDDVYK EPLKCGVSNK 180

DIEKETAKDE EEEPPSMTQL LRRRRAVLPQ TRYVELFIVV DKERYDMMGR NQTAVREEMI 240

LLANYLDSMY IMLNIRIVLV GLEIWTNGNL INIVGGAGDV LGNFVQWREK FLITRRRHDS 300

AQLVLKKGFG GTAGMAFVGT VCSRSHAGGI NVFGQITVET FASIVAHELG HNLGMNHDDG 360

RDCSCGAKSC IMNSGASGSR NFSSCSAEDF EKLTLNKGGN CLLNIPKPDE AYSAPSCGNK 420

LVDAGEECDC GTPKECELDP CCEGSTCKLK SFAECAYGDC CKDCRFLPGG TLCRGKTSEC 480

DVPEYCNGSS QFCQPDVFIQ NGYPCQNNKA YCYNGMCQYY DAQCQVIFGS KAKAAPKDCF 540

IEVNSKGDRF GNCGFSGNEY KKCATGNALC GKLQCENVQE IPVFGIVPAI IQTPSRGTKC 600

WGVDFQLGSD VPDPGMVNEG TKCGAGKICR NFQCVDASVL NYDCDVQKKC HGHGVCNSNK 660

NCHCENGWAP PNCETKGYGG SVDSGPTYNE MNTALRDGLL VFFFLIVPLI VCAIFIFIKR 720

DQLWRSYFRK KRSQTYESDG KNQANPSRQP GSVPRHVSPV TPPREVPIYA NRFAVPTYAA 780
KQPQQFPSRP PPPQPKVSSQ GNLIPARPAP APPLYSSLT



Seq ID NO: 634 DNA sequence
Nucleic Acid Accession #: NM_002091.1
Coding sequence: 56..503

1 11 21 31 41 51

I I I I I I

AGTCTCTGCT CTTCCCAGCC TCTCCGGCGC GCTCCAAGGG CTTCCCGTCG GGACCATGCG 60

CGGCAGTGAG CTCCCGCTGG TCCTGCTGGC GCTGGTCCTC TGCCTAGCGC CCCGGGGGCG 120

AGCGGTCCCG CTGCCTGCGG GCGGAGGGAC CGTGCTGACC AAGATGTACC CGCGCGGCAA 180

CCACTGGGCG GTGGGGCACT TAATGGGGAA AAAGAGCACA GGGGAGTCTT CTTCTGTTTC 240

TGAGAGAGGG AGCCTGAAGC AGCAGCTGAG AGAGTACATC AGGTGGGAAG AAGCTGCAAG 300

GAATTTGCTG GGTCTCATAG AAGCAAAGGA GAACAGAAAC CACCAGCCAC CTCAACCCAA 360

GGCCTTGGGC AATCAGCAGC CTTCGTGGGA TTCAGAGGAT AGCAGCAACT TCAAAGATGT 420

AGGTTCAAAA GGCAAAGTTG GTAGACTCTC TGCTCCAGGT TCTCAACGTG AAGGAAGGAA 480

CCCCCAGCTG AACCAGCAAT GATAATGATG GCCTCTCTCA AAAGAGAAAA ACAAAACCCC 540

TAAGAGACTG AGTTCTGCAA GCATCAGTTC TACGGATCAT CAACAAGATT TCCTTGTGCA 600

AAATATTTGA CTATTCTGTA TCTTTCATCC TTGACTAAAT TCGTGATTTT CAAGCAGCAT 660

CTTCTGGTTT AAACTTGTTT GCTGTGAACA ATTGTCGAAA AGAGTCTTCC AATTAATGCT 720

TTTTTATATC TAGGCTACCT GTTGGTTAGA TTCAAGGCCC CGAGCTGTTA CCATTCACAA 780
TAAAAGCTTA AACACAT



Seq ID NO: 635 Protein sequence
Protein Accession #: NP_002082.1

1 11 21 31 41 51

I I I I I I

MRGSELPLVL LALVLCLAPR GRAVPLPAGG GTVLTKMYPR GNHWAVGHLM GKKSTGESSS 60
VSERGSLKQQ LREYIRWEEA ARNLLGLIEA KENRNHQPPQ PKALGNQQPS WDSEDSSNFK 120
DVGSKGKVGR LSAPGSQREG RNPQLNQQ

Seq ID NO: 636 DNA sequence
Nucleic Acid Accession #: NM_016522.1
Coding sequence: 265..1299

1 11 21 31 41 51

I I I I I I

GCGGAAGCAG CGAGGAGGGA GCCCCCTTTG GCCGTCCTCC GTGGAACCGG TTTTCCGAGG 60

CTGGCAAAAG CCGAGGCTGG ATTTGGGGGA GGAATATTAG ACTCGGAGGA GTCTGCGCGC 120

TTTTCTCCTC CCCGCGCCTC CCGGTCGCCG CGGGTTCACC GCTCAGTCCC CGCGCTCGCT 180

CCGCACCCCA CCCACTTCCT GTGCTCGCCC GGGGGGCGTG TGCCGTGCGG CTGCCGGAGT 240

TCGGGGAAGT TGTGGCTGTC GAGAATGGGG GTCTGTGGGT ACCTGTTCCT GCCCTGGAAG 300

TGCCTCGTGG TCGTGTCTCT CAGGCTGCTG TTCCTTGTAC CCACAGGAGT GCCCGTGCGC 360

AGCGGAGATG CCACCTTCCC CAAAGCTATG GACAACGTGA CGGTCCGGCA GGGGGAGAGC 420

GCCACCCTCA GGTGCACTAT TGACAACCGG GTCACCCGGG TGGCCTGGCT AAACCGCAGC 480

ACCATCCTCT ATGCTGGGAA TGACAAGTGG TGCCTGGATC CTCGCGTGGT CCTTCTGAGC 540

AACACCCAAA CGCAGTACAG CATCGAGATC CAGAACGTGG ATGTGTATGA CGAGGGCCCT 600

TACACCTGCT CGGTGCAGAC AGACAACCAC CCAAAGACCT CTAGGGTCCA CCTCATTGTG 660

CAAGTATCTC CCAAAATTGT AGAGATTTCT TCAGATATCT CCATTAATGA AGGGAACAAT 720

ATTAGCCTCA CCTGCATAGC AACTGGTAGA CCAGAGCCTA CGGTTACTTG GAGACACATC 780

TCTCCCAAAG CGGTTGGCTT TGTGAGTGAA GACGAATACT TGGAAATTCA GGGCATCACC 840

CGGGAACAGT CAGGGGACTA CGAGTGCAGT GCCTCCAATG ACGTGGCCGC GCCCGTGGTA 900

CGGAGAGTAA AGGTCACCGT GAACTATCCA CCATACATTT CAGAAGCCAA GGGTACAGGT 960

GTCCCCGTGG GACAAAAGGG GACACTGCAG TGTGAAGCCT CAGCAGTCCC CTCAGCAGAA 1020

TTCCAGTGGT ACAAGGATGA CAAAAGACTG ATTGAAGGAA AGAAAGGGGT GAAAGTGGAA 1080

AACAGACCTT TCCTCTCAAA ACTCATCTTC TTCAATGTCT CTGAACATGA CTATGGGAAC 1140

TACACTTGCG TGGCCTCCAA CAAGCTGGGC CACACCAATG CCAGCATCAT GCTATTTGGT 1200

CCAGGCGCCG TCAGCGAGGT GAGCAACGGC ACGTCGAGGA GGGCAGGCTG CGTCTGGCTG 1260

CTGCCTCTTC TGGTCTTGCA CCTGCTTCTC AAATTTTGAT GTGAGTGCCA CTTCCCCACC 1320

CGGGAAAGGC TGCCGCCACC ACCACCACCA ACACAACAGC AATGGCAACA CCGACAGCAA 1380

CCAATCAGAT ATATACAAAT GAAATTAGAA GAAACACAGC CTCATGGGAC AGAAATTTGA 1440

GGGAGGGGAA CAAAGAATAC TTTGGGGGGA AAAGAGTTTT AAAAAAGAAA TTGAAAATTG 1500

CCTTGCAGAT ATTTAGGTAC AATGGAGTTT TCTTTTCCCA AACGGGAAGA ACACAGCACA 1560

CCCGGCTTGG ACCCACTGCA AGCTGCATCG TGCAACCTCT TTGGTGCCAG TGTGGGCAAG 1620

GGCTCAGCCT CTCTGCCCAC AGACTGCCCC CACGTGGAAC ATTCTGGAGC TGGCCATCCC 1680

AAATTCAATC AGTCCATAGA GACGAACAGA ATGAGACCTT CCGGCCCAAG CGTGGCGCTT 1740

CCGGCCCAAG CGTGGCGCTG CGGGCACTTT GGTAGACTGT GCCACCACGG CGTGTGTTGT 1800
GAAACGTGAA ATAAAAAGAG CAAAAAAAAA AAAAAAAAA

Seq ID NO: 637 Protein sequence
Protein Accession #: NP_057606.1

1 11 21 31 41 51

I I I I I I

MGVCGYLFLP WKCLVVVSLR LLFLVPTGVP VRSGDATFPK AMDNVTVRQG ESATLRCTID 60

NRVTRVAWLN RSTILYAGND KWCLDPRVVL LSNTQTQYSI EIQNVDVYDE GPYTCSVQTD 120

NHPKTSRVHL IVQVSPKIVE ISSDISINEG NNISLTCIAT GRPEPTVTWR HISPKAVGFV 180

SEDEYLEIQG ITREQSGDYE CSASNDVAAP VVRRVKVTVN YPPYISEAKG TGVPVGQKGT 240

LQCEASAVPS AEFQWYKDDK RLIEGKKGVK VENRPFLSKL IFFNVSEHDY GNYTCVASNK 300
LGHTNASIML FGPGAVSEVS NGTSRRAGCV WLLPLLVLHL LLKF



Seq ID NO: 638 DNA sequence
Nucleic Acid Accession #: NM_012261.1
Coding sequence: 203..1045

1 11 21 31 41 51

I I I I I I

GATTTGCTCT GCCAGCAGCT GTCGGTGCCG CGCTCGACAC CGAGTCCTAG CTAGGCGCTC 60

ACAGAATACG CGCTCCCTCC CTCCCCCTTC TCTGTCCCCC GCCTCTCGCT CACCCCGGCC 120

CACTCCAGCG GCGACTTTGA GGGATTCCCT CTCTGGCGGC CTCTGCAGCA GCACAGCCGG 180

CCTCATTOGG GGCACTGCGA GTATGGATCT CCAAGGAAGA GGGGTCCCCA GCATCGACAG 240

ACTTCGAGTT CTCCTGATGT TGTTCCATAC AATGGCTCAA ATCATGGCAG AACAAGAAGT 300

GGAAAATCTC TCAGGCCTTT CCACTAACCC TGAAAAAGAT ATATTTGTGG TGCGGGAAAA 360

TGGGACGACG TGTCTCATGG CAGAGTTTGC AGCCAAATTT ATTGTACCTT ATGATGTGTG 420

GGCCAGCAAC TACGTAGATC TGATCACAGA ACAGGCCGAT ATCGCATTGA CCCGGGGAGC 480

TGAGGTGAAG GGCCGCTGTG GCCACAGCCA GTCGGAGCTG CAAGTGTTCT GGGTGGATCG 540

CGCATATGCA CTCAAAATGC TCTTTGTAAA GGAAAGCCAC AACATGTCCA AGGGACCTGA 600

GGCGACTTGG AGGCTGAGCA AAGTGCAGTT TGTCTACGAC TCCTCGGAGA AAACCCACTT 660

CAAAGACGCA GTCAGTGCTG GGAAGCACAC AGCCAACTCG CACCACCTCT CTGCCTTGGT 720

CACCCCCGCT GGGAAGTCCT ATGAGTGTCA AGCTCAACAA ACCATTTCAC TGGCCTCTAG 780

TGATCCGCAG AAGACGGTCA CCATGATCCT GTCTGCGGTC CACATCCAAC CTTTTGACAT 840

TATCTCAGAT TTTGTCTTCA GTGAAGAGCA TAAATGCCCA GTGGATGAGC GGGAGCAACT 900

GGAAGAAACC TTGCCCCTGA TTTTGGGGCT CATCTTGGGC CTCGTCATCA TGGTAACACT 960

CGCGATTTAC CACGTCCACC ACAAAATGAC TGCCAACCAG GTGCAGATCC CTCGGGACAG 1020

ATCCCAGTAT AAGCACATGG GCTAGAGGCC GTTAGGCAGG CACCCCCTAT TCCTGCTCCC 1080

CCAACTGGAT CAGGTAGAAC AACAAAAGCA CTTTTCCATC TTGTACACGA GATACACCAA 1140

CATAGCTACA ATCAAACAGG CCTGGGTATC TGAGGCTTGC TTGGCTTGTG TCCATGCTTA 1200

AACCCACGGA AGGGGGAGAC TCTTTCGGAT TTGTAGGGTG AAATGGCAAT TATTCTCTCC 1260

ATGCTGGGGA GGAGGGGAGG AGGGTCTCAG ACAGCTTTCG TGCTCATGGT GGCTTGGCTT 1320

TGACTCTCCA AAGAGCAATA AATGCCACTT GGAGCTGTAT CTGGCCCCAA AGTTTAGGGA 1380

TTGAAAACAT GCTTCTTTGA GGAGGAAACC CCTTTAGGTT CAGAAGAATA TGGGGTGCTT 1440

TGCTCCCTTG GACACAGCTG GCTTATCCTA TACAGTTGTC AATGCACACA GAATACAACC 1500

TCATGCTCCC TGCAGCAAGA CCCCTGAAAG TGATTCATGC TTCTGGCTGG CATTCTGCAT 1560

GTTTAGTGAT TGTCTTGGGA ATGTTTCACT GCTACCCGCA TCCAGCGACT GCAGCACCAG 1620

AAAACGACTA ATGTAACTAT GCAGAGTTGT TTGGACTTCT TCCTGTGCCA GGTCCAAGTC 1680

GGGGGACCTG AAGAATCAAT CTGTGTGAGT CTGTTTTTCA AAATGAAATA AAACACACTA 1740
TTCTCTGGC



Seq ID NO: 639 Protein sequence
Protein Accession #: NP_036393.1

1 11 21 31 41 51

I I I I I I

MDLQGRGVPS IDRLRVLLML FHTMAQIMAE QEVENLSGLS TNPEKDIFVV RENGTTCLMA 60

EFAAKFIVPY DVWASNYVDL ITEQADIALT RGAEVKGRCG HSQSELQVFW VDRAYALKML 120

FVKESHNMSK GPEATWRLSK VQFVYDSSEK THFKDAVSAG KHTANSHHLS ALVTPAGKSY 180

ECQAQQTISL ASSDPQKTVT MILSAVHIQP FDIISDFVFS EEHKCPVDER EQLEETLPLI 240
LGLILGLVIM VTLAIYHVHH KMTANQVQIP RDRSQYKHMG



Seq ID NO: 640 DNA sequence
Nucleic Acid Accession #: NM_002993.1
Coding sequence: 64..408

1 11 21 31 41 51

I I I I I I

GGCACGAGCC AGTCTCCGCG CCTCCACCCA GCTCAGGAAC CCGCGAACCC TCTCTTGACC 60

ACTATGAGCC TCCCGTCCAG CCGCGCGGCC CGTGTCCCGG GTCCTTCGGG CTCCTTGTGC 120

GCGCTGCTCG CGCTGCTGCT CCTGCTGACG CCGCCGGGGC CCCTCGCCAG CGCTGGTCCT 180

GTCTCTGCTG TGCTGACAGA GCTGCGTTGC ACTTGTTTAC GCGTTACGCT GAGAGTAAAC 240

CCCAAAACGA TTGGTAAACT GCAGGTGTTC CCCGCAGGCC CGCAGTGCTC CAAGGTGGAA 300

GTGGTAGCCT CCCTGAAGAA CGGGAAGCAA GTTTGTCTGG ACCCGGAAGC CCCTTTTCTA 360

AAGAAAGTCA TCCAGAAAAT TTTGGACAGT GGAAACAAGA AAAACTGAGT AACAAAAAAG 420

ACCATGCATC ATAAAATTGC CCAGTCTTCA GCGGAGCAGT TTTCTGGAGA TCCCTGGACC 480

CAGTAAGAAT AAGAAGGAAG GGTTGGTTTT TTTCCATTTT CTACATGGAT TCCCTACTTT 540

GAAGAGTGTG GGGGAAAGCC TACGCTTCTC CCTGAAGTTT ACAGCTCAGC TAATGAAGTA 600

CTAATATAGT ATTTCCACTA TTTACTGTTA TTTTACCTGA TAAGTTATTG AACCCTTTGG 660

CAATTGACCA TATTGTGAGC AAAGAATCAC TGGTTATTAG TCTTTCAATG AATATTGAAT 720

TGAAGATAAC TATTGTATTT CTATCATACA TTCCTTAAAG TCTTACCGAA AAGGCTGTGG 780

ATTTCGTATG GAAATAATGT TTTATTAGTG TGCTGTTGAG GGAGGTATCC TGTTGTTCTT 840

ACTCACTCTT CTCATAAAAT AGGAAATATT TTAGTTCTGT TTTCTTGGGG AATATGTTAC 900

ACACTGGGTG TGTCATACCG TCTTTACCCT AGGATGCTAT TTAAGTTGTA CTGTATTAGA 960

AAATTTAAGT TCTGTAAGGG TTATCTGTGC AGAATATATT TCCTTATTCA GAATTTCTAA 1020

TTCTTAGTAT GGCATAATGT CTAATATATT CTCTTCCTAT GGTTTTAGAT GTTTGATGTC 1080

TTCACTATAG GATGACTATA CATGATTTAC TCATTAAACT TTGATTTTGT ATGCTATTTT 1140

CCCAAAAACA GATAAATTCC ATTCTGGTCA CTAAATATAC ACTTTAGATA GATGAAGAAG 1200

TAAATAAAAG CAAAATTAAC TGATTGCTAA TTTACATAGA AATGTATTCT CTTGGTTTTT 1260

CAATTTGAAT ATAAATTCAT AATGATCTGT GCTCTGCAAA GTTTTGAAAA TATATTTGAA 1320

CAGATATCTA TTGTGGATCT CATTTAGTCC TCAAAATATA TACAGCATTG CTAAGATTTT 1380

GTATCACATT CACTATATTA TTTAAAGGTT TTGACCATTT TGTTATGAGG AATTATACAT 1440

TTGGTACTTG TATTGTCATT AAATTGCACT TTTATTTTTT CCTGTGTGTC ATGTTGGTTT 1500
TGGAGAAACA ATAAAAGATT TCTAAACCAA AAAAAAAAAA

Seq ID NO: 641 Protein sequence
Protein Accession #: NP_002984.1

1 11 21 31 41 51

I I I I I I

MSLPSSRAAR VPGPSGSLCA LLALLLLLTP PGPLASAGPV SAVLTELRCT CLRVTLRVNP 60
KTIGKLQVFP AGPQCSKVEV VASLKNGKQV CLDPEAPFLK KVIQKILDSG NKKN

Seq ID NO: 642 DNA sequence
Nucleic Acid Accession #: NM_013271.1
Coding sequence: 27..809

1 11 21 31 41 51

I I I I I I

TCCGGAGCCA GGCTCGCTGG GGCAGCATGG CGGGGTCGCC GCTGCTCTGG GGGCCGCGGG 60

CCGGGGGCGT CGGCCTTTTG GTGCTGCTGC TGCTCGGCCT GTTTCGGCCG CCCCCCGCGC 120

TCTGCGCGCG GCCGGTAAAG GAACCCCGCG GCCTAAGCGC AGCGTCTCCG CCCTTGGCTG 180

AGACTGGCGC TCCTCGCCGC TTCCGGCGGT CAGTGCCCCG AGGTGAGGCG GCGGGGGCGG 240

TGCAGGAGCT GGCGCGGGCG CTGGCGCATC TGCTGGAGGC CGAACGTCAG GAGCGGGCGC 300

GGGCOGAGGC GCAGGAGGCT GAGGATCAGC AGGCGCGCGT CCTGGCGCAG CTGCTGCGCG 360

TCTGGGGCGC CCCCCGCAAC TCTGATCCGG CTCTGGGCCT GGACGACGAC CCCGACGCGC 420

CTGCAGCGCA GCTCGCTCGC GCTCTGCTCC GCGCCCGCCT TGACCCTGCC GCCCTAGCAG 480

CCCAGCTTGT CCCCGCGCCC GTCCCCGCCG CGGCGCTCCG ACCCCGGCCC CCGGTCTACG 540

ACGACGGCCC CGCGGGCCCG GATGCTGAGG AGGCAGGCGA CGAGACACCC GACGTGGACC 600

CCGAGCTGTT GAGGTACTTG CTGGGACGGA TTCTTGCGGG AAGCGCGGAC TCCGAGGGGG 660

TGGCAGCCCC GCGCCGCCTC CGCCGTGCCG CCGACCACGA TGTGGGCTCT GAGCTGCCCC 720

CTGAGGGCGT GCTGGGGGCG CTGCTGCGTG TGAAACGCCT AGAGACCCCG GCGCCCCAGG 780

TGCCTGCACG CCGCCTCTTG CCACCCTGAG CACTGCCCGG ATCCCGTGCA CCCTGGGACC 840

CAGAAGTGCC CCCGCCATCC CGCCACCAGG ACTTCTCCCC GCCAGCACGT CCAGAGCAAC 900

TTACCCCGGC CAGCCAGCCC TCTCACCCGA GGATCCCTAC CCCCTGGCCC ACAATAACAT 960
GATCTGAGC



Seq ID NO: 643 Protein sequence 
Protein Accession #: NP_037403.1

1 11 21 31 41 51 

I I I I I I

MAGSPLLWGP RAGGVGLLVL LLLGLFRPPP ALCARPVKEP RGLSAASPPL AETGAPRRPR 
RSVPRGEAAG AVQELARALA HLLEAERQER ARAEAQEAED QQARVLAQLL RVWGAPRNSD 
PALGLDDDPD APAAQLARAL LRARLDPAAL AAQLVPAPVP AAALRPRPPV YDDGPAGPDA 
EEAGDETPDV DPELLRYLLG RILAGSADSE GVAAPRRLRR AADHDVGSEL PPEGVLGALL 
RVKRLETPAP QVPARRLLPP 



Seq ID NO: 644 DNA sequence 
Nucleic Acid Accession #: NM_002214
Coding sequence: 681.. 2990 



1 11 21 31 41 51

I I I I I I

CCCAGAGCCG CCTCCCCCTG TTGCTGGCAT CCOGAGCTTC CTCCCTTGCC AGCCAGGACG 60
CTGCCGACTT GTCTTTGCCC GCTGCTCCGC AGACGGGGCT GCAAAGCTGC AACTAATGGT 120
GTTGGCCTCC CTGCCCACCT GTGGAAGCAA CTGCGCTGAT TGATGCGCCA CAGACTTTTT 180
TCCCCTCGAC CTCGCCGGCG TACCCTCCCA CAGATCCAGC ATCACCCAGT GAATGTACAT 240
TAGGGTGGTT TCCCCCCCAG CTTCGGGCTT TGTTTGGGTT TGATTGTGTT TGGCTCTTCG 300
CTAAGCTGAT TTATGCAGCA GAAGCCCCAC CGGCTGGAGA GAAACAAAAG CTCTTTTCTT 360
TGTCCCGGAG CAGGCTGCGG AGCCCTTGCA GAGCCCTCTC TCCAGTCGCC GCCGGGCCCT 420
TGGCCGTCGA AGGAGGTGCT TCTCGCGGAG ACCGCGGGAC CCGCCGTGCC GAGCCGGGAG 480
GGCCGTAGGG GCCCTGAGAT GCCGAGCGGT GCCCGGGCCC GCTTACCTGC ACCGCTTGCT 540
CCGAGCCGCG GGGTCOGCCT GCTAGGCCTG CGGAAAACGT CCTAGOGACA CTCGCCCGCG 600
GGCCCCGAGG TCGCCCGGGA GGCCGAGCCC GCGTCCGGAA GGCAGCCAGG CGGCGGGCGC 660
GGGGCGGGCT GTTTTGCATT ATGTGCGGCT CGGCCCTGGC TTTTTTTACC GCTGCATTTG 720
TCTGCCTGCA AAACGACCGG CGAGGTCCCG CCTCGTTCCT CTGGGCAGCC NNNNNNNNNN 780
CACTTGTTCT TGGACTGGGC CAAGGTGAAG ACAATAGATG TGCATCTTCA AATGCAGCAT 840
CCTGTGCCAG GTGCCTTGCG CTGGGTCCAG AATGTGGATG GTGTGTTCAA GAGGATTTCA 900
TTTCAGGTGG ATCAAGAAGT GAACGTTGTG ATATTGTTTC CAATTTAATA AGCAAAGGCT 960
GCTCAGTTGA TTCAATAGAA TACCCATCTG TGCATGTTAT AATACCCACT GAAAATGAAA 1020
TTAATACCCA GGTGACACCA GGAGAAGTGT CTATCCAGCT GCGTCCAGGA GCCGAAGCTA 1080
ATTTTATGCT GAAAGTTCAT CCTCTGAAGA AATATCCTGT GGATCTTTAT TATCTTGTTG 1140
ATGTCTCAGC ATCAATGCAC AATAATATAG AAAAATTAAA TTCCGTTGGA AACGATTTAT 1200
CTAGAAAAAT GGCATTTTTC TCCCGTGACT TTCGTCTTGG ATTTGGCTCA TACGTTGATA 1260
AAACAGTTTC ACCATACATT AGCATCCACC CCGAAAGGAT TCATAATCAA TGCAGTGACT 1320
ACAATTTAGA CTGCATGCCT CCCCATGGAT ACATCCATGT GCTGTCTTTG ACAGAGAACA 1380
TCACTGAGTT TGAGAAAGCA GTTCATAGAC AGAAGATCTC TGGAAACATA GATACACCAG 1440
AAGGAGGTTT TGACGCCATG CTTCAGGCAG CTGTCTGTGA AAGTCATATC GGATGGCGAA 1500
AAGAGGCTAA AAGATTGCTG CTGGTGATGA CAGATCAGAC GTCTCATCTC GCTCTTGATA 1560



GCAAATTGGC AGGCATAGTG GTGCCCAATG ACGGAAACTG TCATCTGAAA AACAACGTCT 1620

ACGTCAAATC GACAACCATG GAACACCCCT CACTAGGCCA ACTTTCAGAG AAATTAATAG 1680 

ACAACAACAT TAATGTCATC TTTGCAGTTC AAGGAAAACA ATTTCATTGG TATAAGGATC 1740 
TTCTACCCCT CTTGCCAGGC ACCATTGCTG GTGAAATAGA ATCAAAGGCT GCAAACCTCA 1800

ATAATTTGGT AGTGGAAGCC TATCAGAAGC TCATTTCAGA AGTGAAAGTT CAGGTGGAAA 1860 

ACCAGGTACA AGGCATCTAT TTTAACATTA CCGCCATCTG TCCAGATGGG TCCAGAAAGC 1920 

CAGGCATGGA AGGATGCAGA AACGTGACGA GCAATGATGA AGTTCTTTTC AATGTAACAG 1980 

TTACAATGAA AAAATGTGAT GTCACAGGAG GAAAAAACTA TGCAATAATC AAACCTATTG 2040 

GTTTTAATGA AACCGCTAAA ATTCATATAC ACAGAAACTG CAGCTGTCAG TGTGAGGACA 2100 

ACAGAGGACC TAAAGGAAAG TGTGTAGATG AAACTTTTCT AGATTCCAAG TGTTTCCAGT 2160 

GTGATGAGAA TAAATGTCAT TTTGATGAAG ATCAGTTTTC TTCTGAGAGT TGCAAGTCAC 2220 

ACAAGGATCA GCCTGTTTGC AGTGGTCGAG GAGTTTGTGT TTGTGGGAAA TGTTCATGTC 2280 

ACAAAATTAA GCTTGGAAAA GTGTATGGAA AATACTGTGA AAAGGATGAC TTTTCTTGTC 2340 

CATATCACCA TGGAAATCTG TGTGCTGGGC ATGGAGAGTG TGAAGCAGGC AGATGCCAAT 2400 

GCTTCAGTGG CTGGGAAGGT GATCGATGCC AGTGCCCTTC AGCAGCAGCC CAGCACTGTG 2460 

TCAATTCAAA GGGCCAAGTG TGCAGTGGAA GAGGCACGTG TGTGTGTGGA AGGTGTGAGT 2520 

GCACCGATCC CAGGAGCATC GGCCGCTTCT GTGAACACTG CCCCACCTGT TATACAGCCT 2580 

GCAAGGAAAA CTGGAATTGT ATGCAATGCC TTCACCCTCA CAATTTGTCT CAGGCTATAC 2640 

TTGATCAGTG CAAAACCTCA TGTGCTCTCA TGGAACAACA GCATTATGTC GACCAAACTT 2700 

CAGAATGTTT CTCCAGCCCA AGCTACTTGA GAATATTTTT CATCATTTTC ATAGTTACAT 2760 

TCTTGATTGG GTTGCTTAAA GTCCTGATCA TTAGACAGGT GATACTACAA TGGAATAGTA 2820 

ATAAAATTAA GTCCTCATCA GATTACAGAG TGTCAGCCTC AAAAAAGGAT AAGTTGATTC 2880 

TGCAAAGTGT TTGCACAAGA GCAGTCACCT ACCGACGTGA GAAGCCTGAA GAAATAAAAA 2940 

TGGATATCAG CAAATTAAAT GCTCATGAAA CTTTCAGGTG CAACTTCTAA AAAAAGATTT 3000 

TTAAACACTT AATGGGAAAC TGGAATTGTT AATAATTGCT CCTAAAGATT ATAATTTTAA 3060 

AAGTCACAGG AGGAGACAAA TTGCTCACGG TCATGCCAGT TGCTGGTTGT ACACTCGAAC 3120 

GAAGACTGAC AAGTATCCTC ATCATGATGT GACTCACATA GCTGCTGACT TTTTCAGAGA 3180 

AAAATGTGTC TTACTACTGT TTGAGACTAG TGTCGTTGTA GCACTTTACT GTAATATATA 3240 

ACTTATTTAG ATCAGCATAG AATGTAGATC CTCTGAAGAG CACTGATTAC ACTTTACAGG 3300 

TACCTGTTAT CCCTACGCTT CCCAGAGAGA ACAATGCTGT GAGAGAGTTT AGCATTGTGT 3360 

CACTACAAGG GTACAGTAAT CCCTGCACTG GACATGTGAG GAAAAAAATA ATCTGGCAAG 3420 

TATATTCTAA GGTTGCCAAA CACTTCAACA GTTGGTGGTT GAATAGACAA GAACAGCTAG 3480 

ATGAATAAAT GATTCGTGTT TCACTCTTTC AAGAGGTGAA CAGATACAAC CTTAATCTTA 3540 

AAAGATTATT GCTTTTTAAA GTGTGTAGTT TTATGCATGT GTGTTTATGG TTTGCTTATT 3600 

TTTGCAAGAT GGATACTAAT TCCAGCATTC TCTCCTCTTT GCCTTTATGT TTTGTTTTCT 3660 

TTTTTACAGG ATAAGTTTAT GTATGTCACA GATGACTGGA TTAATTAAGT GCTAAGTTAC 3720 

TACTGCCATA AAAAACTAAT AATACAATGT CACTTTATCA GAATACTAGT TTTAAAAGCT 3780 
GAATGTTAA 

Seq ID NO: 645 Protein sequence 
Protein Accession #: NP_002205 

1 11 21 31 41 51 

I I I I I I 

MCGSALAFFT AAFVCLQNDR RGPASFLWAA WVFSLVLGLG QGEDNRCASS NAASCARCLA 60 

LGPECGWCVQ EDFISGGSRS ERCDIVSNLI SKGCSVDSIE YPSVHVIIPT ENEINTQVTP 120 

GEVSIQLRPG AEANFMLKVH PLKKYPVDLY YLVDVSASMH NNIEKLNSVG NDLSRKMAFF 180 

SRDFRLGFGS YVDKTVSPYI SIHPERIHNQ CSDYNLDCMP PHGYIHVLSL TENITEFEKA 240

VHRQKISGNI DTPEGGFDAM LQAAVCESHI GWRKEAKRLL LVMTDQTSHL ALDSKLAGIV 300

VPNDGNCHLK NNVYVKSTTM EHPSLGQLSE KLIDNNINVI FAVQGKQFHW YKDLLPLLPG 360

TIAGEIESKA ANLNNLVVEA YQKLISEVKV QVENQVQGIY FNITAICPDG SRKPGMEGCR 420

NVTSNDEVLF NVTVTMKKCD VTGGKNYAII KPIGFNETAK IHIHRNCSCQ CEDNRGPKGK 480

CVDETFLDSK CFQCDENKCH FDEDQFSSES CKSHKDQPVC SGRGVCVCGK CSCHKIKLGK 540 

VYGKYCEKDD FSCPYHHGNL CAGHGECEAG RCQCFSGWEG DRCQCPSAAA QHCVNSKGQV 600 

CSGRGTCVCG RCECTDPRSI GRFCEHCPTC YTACKENWNC MQCLHPHNLS QAILDQCKTS 660 

CALMEQQHYV DQTSECFSSP SYLRIFFIIF IVTFLIGLLK VLIIRQVILQ WNSNKIKSSS 720 
DYRVSASKKD KLILQSVCTR AVTYRREKPE EIKMDISKLN AHETFRCNF 

Seq ID NO: 646 DNA sequence 

Nucleic Acid Accession #: NM_003318.1 

Coding sequence: 1..2574 

1 11 21 31 41 51 

I I I I I I 

ATGGAATCCG AGGATTTAAG TGGCAGAGAA TTGACAATTG ATTCCATAAT GAACAAAGTG 60 

AGAGACATTA AAAATAAGTT TAAAAATGAA GACCTTACTG ATGAACTAAG CTTGAATAAA 120 

ATTTCTGCTG ATACTACAGA TAACTCGGGA ACTGTTAACC AAATTATGAT GATGGCAAAC 180 

AACCCAGAGG ACTGGTTGAG TTTGTTGCTC AAACTAGAGA AAAACAGTGT TCCGCTAAGT 240 

GATGCTCTTT TAAATAAATT GATTGGTCGT TACAGTCAAG CAATTGAAGC GCTTCCCCCA 300 

GATAAATATG GCCAAAATGA GAGTTTTGCT AGAATTCAAG TGAGATTTGC TGAATTAAAA 360 

GCTATTCAAG AGCCAGATGA TGCACGTGAC TACTTTCAAA TGGCCAGAGC AAACTGCAAG 420 

AAATTTGCTT TTGTTCATAT ATCTTTTGCA CAATTTGAAC TGTCACAAGG TAATGTCAAA 480 

AAAAGTAAAC AACTTCTTCA AAAAGCTGTA GAACGTGGAG CAGTACCACT AGAAATGCTG 540 

GAAATTGCCC TGCGGAATTT AAACCTCCAA AAAAAGCAGC TGCTTTCAGA GGAGGAAAAG 600 

AAGAATTTAT CAGCATCTAC GGTATTAACT GCCCAAGAAT CATTTTCCGG TTCACTTGGG 660 

CATTTACAGA ATAGGAACAA CAGTTGTGAT TCCAGAGGAC AGACTACTAA AGCCAGGTTT 720 

TTATATGGAG AGAACATGCC ACCACAAGAT GCAGAAATAG GTTACCGGAA TTCATTGAGA 780 

CAAACTAACA AAACTAAACA GTCATGCCCA TTTGGAAGAG TCCCAGTTAA CCTTCTAAAT 840 

AGCCCAGATT GTGATGTGAA GACAGATGAT TCAGTTGTAC CTTGTTTTAT GAAAAGACAA 900 

ACCTCTAGAT CAGAATGCCG AGATTTGGTT GTGCCTGGAT CTAAACCAAG TGGAAATGAT 960 

TCCTGTGAAT TAAGAAATTT AAAGTCTGTT CAAAATAGTC ATTTCAAGGA ACCTCTGGTG 1020 

TCAGATGAAA AGAGTTCTGA ACTTATTATT ACTGATTCAA TAACCCTGAA GAATAAAACG 1080 

GAATCAAGTC TTCTAGCTAA ATTAGAAGAA ACTAAAGAGT ATCAAGAACC AGAGGTTCCA 1140 

GAGAGTAACC AGAAACAGTG GCAATCTAAG AGAAAGTCAG AGTGTATTAA CCAGAATCCT 1200 

GCTGCATCTT CAAATCACTG GCAGATTCCG GAGTTAGCCC GAAAAGTTAA TACAGAGCAG 1260 

AAACATACCA CTTTTGAGCA ACCTGTCTTT TCAGTTTCAA AACAGTCACC ACCAATATCA 1320 

ACATCTAAAT GGTTTGACCC AAAATCTATT TGTAAGACAC CAAGCAGCAA TACCTTGGAT 1380 



GATTACATGA GCTGTTTTAG AACTCCAGTT GTAAAGAATG ACTTTCCACC TGCTTGTCAG 1440

TTGTCAACAC CTTATGGCCA ACCTGCCTGT TTCCAGCAGC AACAGCATCA AATACTTGCC 1500 

ACTCCACTTC AAAATTTACA GGTTTTAGCA TCTTCTTCAG CAAATGAATG CATTTCGGTT 1560 

AAAGGAAGAA TTTATTCCAT TTTAAAGCAG ATAGGAAGTG GAGGTTCAAG CAAGGTATTT 1620 

CAGGTGTTAA ATGAAAAGAA ACAGATATAT GCTATAAAAT ATGTGAACTT AGAAGAAGCA 1680 

GATAACCAAA CTCTTGATAG TTACCGGAAC GAAATAGCTT ATTTGAATAA ACTACAACAA 1740 

CACAGTGATA AGATCATCCG ACTTTATGAT TATGAAATCA CGGACCAGTA CATCTACATG 1800 

GTAATGGAGT GTGGAAATAT TGATCTTAAT AGTTGGCTTA AAAAGAAAAA ATCCATTGAT 1860 

CCATGGGAAC GCAAGAGTTA CTGGAAAAAT ATGTTAGAGG CAGTTCACAC AATCCATCAA 1920 

CATGGCATTG TTCACAGTGA TCTTAAACCA GCTAACTTTC TGATAGTTGA TGGAATGCTA 1980 

AAGCTAATTG ATTTTGGGAT TGCAAACCAA ATGCAACCAG ATACAACAAG TGTTGTTAAA 2040 

GATTCTCAGG TTGGCACAGT TAATTATATG CCACCAGAAG CAATCAAAGA TATGTCTTCC 2100 

TCCAGAGAGA ATGGGAAATC TAAGTCAAAG ATAAGCCCCA AAAGTGATGT TTGGTCCTTA 2160 

GGATGTATTT TGTACTATAT GACTTACGGG AAAACACCAT TTCAGCAGAT AATTAATCAG 2220 

ATTTCTAAAT TACATGCCAT AATTGATCCT AATCATGAAA TTGAATTTCC CGATATTCCA 2280 

GAGAAAGATC TTCAAGATGT GTTAAAGTGT TGTTTAAAAA GGGACCCAAA ACAGAGGATA 2340 

TCCATTCCTG AGCTCCTGGC TCATCCCTAT GTTCAAATTC AAACTCATCC AGTTAACCAA 2400 

ATGGCCAAGG GAACCACTGA AGAAATGAAA TATGTTCTGG GCCAACTTGT TGGTCTGAAT 2460 

TCTCCTAACT CCATTTTGAA AGCTGCTAAA ACTTTATATG AACACTATAG TGGTGGTGAA 2520 
AGTCATAATT CTTCATCCTC CAAGACTTTT GAAAAAAAAA GGGGAAAAAA ATGA 

Seq ID NO: 647 Protein sequence 
Protein Accession #: NP_003309.1

1 11 21 31 41 51 

I I I I I I 

MESEDLSGRE LTIDSIMNKV RDIKNKFKNE DLTDELSLNK ISADTTDNSG TVNQIMMMAN 60 

NPEDWLSLLL KLEKNSVPLS DALLNKLIGR YSQAIEALPP DKYGQNESFA RIQVRFAELK 120

AIQEPDDARD YFQMARANCK KFAFVHISFA QFELSQGNVK KSKQLLQKAV ERGAVPLEML 180

EIALRNLNLQ KKQLLSEEEK KNLSASTVLT AQESFSGSLG HLQNRNNSCD SRGQTTKARF 240 

LYGENMPPQD AEIGYRNSLR QTNKTKQSCP FGRVPVNLLN SPDCDVKTDD SVVPCFMKRQ 300

TSRSECRDLV VPGSKPSGND SCELRNLKSV QNSHFKEPLV SDEKSSELII TDSITLKNKT 360 

ESSLLAKLEE TKEYQEPEVP ESNQKQWQSK RKSECINQNP AASSNHWQIP ELARKVNTEQ 420

KHTTFEQPVF SVSKQSPPIS TSKWFDPKSI CKTPSSNTLD DYMSCFRTPV VKNDFPPACQ 480

LSTPYGQPAC FQQQQHQILA TPLQNLQVLA SSSANECISV KGRIYSILKQ IGSGGSSKVF 540 

QVLNEKKQIY AIKYVNLEEA ENQTLDSYRN EIAYLNKLQQ HSDKIIRLYD YEITDQYIYM 600 

VMECGNIDLN SWLKKKKSID PWERKSYWKN MLEAVHTIHQ HGIVHSDLKP ANFLIVDGML 660 

KLIDFGIANQ MQPDTTSVVK DSQVGTVNYM PPEAIKDMSS SRENGKSKSK ISPKSDVWSL 720

GCILYYMTYG KTPFQQIINQ ISKLHAIIDP NHEIEFPDIP EKDLQDVLKC CLKRDPKQRI 780

SIPELLAHPY VQIQTHPVNQ MAKGTTEEMK YVLGQLVGLN SPNSILKAAK TLYEHYSGGE 840 
SHNSSSSKTF EKKRGKK 

Seq ID NO: 648 DNA sequence 
Nucleic Acid Accession #: NM_015507 
Coding sequence: 241.. 1902 

1 11 21 31 41 51 

I I I I I I 

CCGCAGAGGA GCCTCGGCCA GGCTAGCCAG GGCGCCCCCA GCCCCTCCCC AGGCCGCGAG 60 

CGCCCCTGCC GCGGTGCCTG GCCTCCCCTC CCAGACTGCA GGGACAGCAC CCGGTAACTG 120 

CGAGTGGAGC GGAGGACCCG AGCGGCTGAG GAGAGAGGAG GCGGCGGCTT AGCTGCTACG 180 

GGGTCCGGCC GGCGCCCTCC CGAGGGGGGC TCAGGAGGAG GAAGGAGGAC CCGTGCGAGA 240 

ATGCCTCTGC CCTGGAGCCT TGCGCTCCCG CTGCTGCTCT CCTGGGTGGC AGGTGGTTTC 300 

GGGAACGCGG CCAGTGCAAG GCATCACGGG TTGTTAGCAT CGGCACGTCA GCCTGGGGTC 360 

TGTCACTATG GAACTAAACT GGCCTGCTGC TACGGCTGGA GAAGAAACAG CAAGGGAGTC 420 

TGTGAAGCTA CATGCGAACC TGGATGTAAG TTTGGTGAGT GCGTGGGACC AAACAAATGC 480 

AGATGCTTTC CAGGATACAC CGGGAAAACC TGCAGTCAAG ATGTGAATGA GTGTGGAATG 540 

AAACCCCGGC CATGCCAACA CAGATGTGTG AATACACACG GAAGCTACAA GTGCTTTTGC 600 

CTCAGTGGCC ACATGCTCAT GCCAGATGCT ACGTGTGTGA ACTCTAGGAC ATGTGCCATG 660 

ATAAACTGTC AGTACAGCTG TGAAGACACA GAAGAAGGGC CACAGTGCCT GTGTCCATCC 720 

TCAGGACTCC GCCTGGCCCC AAATGGAAGA GACTGTCTAG ATATTGATGA ATGTGCCTCT 780 

GGTAAAGTCA TCTGTCCCTA CAATCGAAGA TGTGTGAACA CATTTGGAAG CTACTACTGC 840 

AAATGTCACA TTGGTTTCGA ACTGCAATAT ATCAGTGGAC GATATGACTG TATAGATATA 900 

AATGAATGTA CTATGGATAG CCATACGTGC AGCCACCATG CCAATTGCTT CAATACCCAA 960 

GGGTCCTTCA AGTGTAAATG CAAGCAGGGA TATAAAGGCA ATGGACTTCG GTGTTCTGCT 1020 

ATCCCTGAAA ATTCTGTGAA GGAAGTCCTC AGAGCACCTG GTACCATCAA AGACAGAATC 1080 

AAGAAGTTGC TTGCTCACAA AAACAGCATG AAAAAGAAGG CAAAAATTAA AAATGTTACC 1140 

CCAGAACCCA CCAGGACTCC TACCCCTAAG GTGAACTTGC AGCCCTTCAA CTATGAAGAG 1200 

ATAGTTTCCA GAGGCGGGAA CTCTCATGGA GGTAAAAAAG GGAATGAAGA GAAAATGAAA 1260 

GAGGGGCTTG AGGATGAGAA AAGAGAAGAG AAAGCCCTGA AGAATGACAT AGAGGAGCGA 1320 

AGCCTGCGAG GAGATGTGTT TTTCCCTAAG GTGAATGAAG CAGGTGAATT CGGCCTGATT 1380 

CTGGTCCAAA GGAAAGCGCT AACTTCCAAA CTGGAACATA AAGATTTAAA TATCTCGGTT 1440 

GACTGCAGCT TCAATCATGG GATCTGTGAC TGGAAACAGG ATAGAGAAGA TGATTTTGAC 1500 

TGGAATCCTG CTGATCGAGA TAATGCTATT GGCTTCTATA TGGCAGTTCC GGCCTTGGCA 1560 

GGTCACAAGA AAGACATTGG CCGATTGAAA CTTCTCCTAC CTGACCTGCA ACCCCAAAGC 1620 

AACTTCTGTT TGCTCTTTGA TTACCGGCTG GCCGGAGACA AAGTCGGGAA ACTTCGAGTG 1680 

TTTGTGAAAA ACAGTAACAA TGCCCTGGCA TGGGAGAAGA CCACGAGTGA GGATGAAAAG 1740 

TGGAAGACAG GGAAAATTCA GTTGTATCAA GGAACTGATG CTACCAAAAG CATCATTTTT 1800 

GAAGCAGAAC GTGGCAAGGG CAAAACCGGC GAAATCGCAG TGGATGGCGT CTTGCTTGTT 1860 

TCAGGCTTAT GTCCAGATAG CCTTTTATCT GTGGATGACT GAATGTTACT ATCTTTATAT 1920 

TTGACTTTGT ATGTCAGTTC CCTGGTTTTT TTGATATTGC ATCATAGGAC CTCTGGCATT 1980 

TTAGAATTAC TAGCTGAAAA ATTGTAATGT ACCAACAGAA ATATTATTGT AAGATGCCTT 2040 

TCTTGTATAA GATATGCCAA TATTTGCTTT AAATATCATA TCACTGTATC TTCTCAGTCA 2100 

TTTCTGAATC TTTCCACATT ATATTATAAA ATATGGAAAT GTCAGTTTAT CTCCCCTCCT 2160 

CAGTATATCT GATTTGTATA AGTAAGTTGA TGAGCTTCTC TCTACAACAT TTCTAGAAAA 2220

TAGAAAAAAA AGCACAGAGA AATGTTTAAC TGTTTGACTC TTATGATACT TCTTGGAAAC 2280 

TATGACATCA AAGATAGACT TTTGCCTAAG TGGCTTAGCT GGGTCTTTCA TAGCCAAACT 2340 
TGTATATTTA AATTCTTTGT AATAATAATA TCCAAATCAT CAAAAAAAAA AAAAAAAA



Seq ID NO: 649 Protein sequence 
Protein Accession #: NP_056322 

1 11 21 31 41 51 

I I I I I I

MPLPWSLALP LLLSWVAGGF GNAASARHHG LLASARQPGV CHYGTKLACC YGWRRNSKGV 60 
CEATCEPGCK FGECVGPNKC RCFPGYTGKT CSQDVNECGM KPRPCQHRCV NTHGSYKCFC 120
LSGHMLMPDA TCVNSRTCAM INCQYSCEDT EEGPQCLCPS SGLRLAPNGR DCLDIDECAS 180
GKVICPYNRR CVNTFGSYYC KCHIGFELQY ISGRYDCIDI NECTMDSHTC SHHANCFNTQ 240
GSFKCKCKQG YKGNGLRCSA IPENSVKEVL RAPGTIKDRI KKLLAHKNSM KKKAKIKNVT 300
PEPTRTPTPK VNLQPFNYEE IVSRGGNSHG GKKGNEEKMK EGLEDEKREE KALKNDIEER 360
SLRGDVFFPK VNEAGEFGLI LVQRKALTSK LEHKDLNISV DCSFNHGICD WKQDREDDFD 420
WNPADRDNAI GFYMAVPALA GHKKDIGRLK LLLPDLQPQS NFCLLFDYRL AGDKVGKLRV 480
FVKNSNNALA WEKTTSEDEK WKTGKIQLYQ GTDATKSIIF EAERGKGKTG EIAVDGVLLV 540
SGLCPDSLLS VDD 

Seq ID NO: 650 DNA sequence 

Nucleic Acid Accession #: NM_003506.1

Coding sequence: 259..2379

1 11 21 31 41 51 

I I I I I I 

GCAGCTCCAG TCCCGGACGC AACCCCGGAG CCGTCTCAGG TCCCTGGGGG GAACGGTGGG 60 
TTAGACGGGG ACGGGAAGGG ACAGCGGCCT TCGACCGCCC CCCGAGTAAT TGACCCAGGA 120 
CTCATTTTCA GGAAAGCCTG AAAATGAGTA AAATAGTGAA ATGAGGAATT TGAACATTTT 180 
ATCTTTGGAT GGGGATCTTC TGAGGATGCA AAGAGTGATT CATCCAAGCC ATGTGGTAAA 240 
ATCAGGAATT TGAAGAAAAT GGAGATGTTT ACATTTTTGT TGACGTGTAT TTTTCTACCC 300 
CTCCTAAGAG GGCACAGTCT CTTCACCTGT GAACCAATTA CTGTTCCCAG ATGTATGAAA 360 
ATGGCCTACA ACATGACGTT TTTCCCTAAT CTGATGGGTC ATTATGACCA GAGTATTGCC 420 
GCGGTGGAAA TGGAGCATTT TCTTCCTCTC GCAAATCTGG AATGTTCACC AAACATTGAA 480 
ACTTTCCTCT GCAAAGCATT TGTACCAACC TGCATAGAAC AAATTCATGT GGTTCCACCT 540 
TGTCGTAAAC TTTGTGAGAA AGTATATTCT GATTGCAAAA AATTAATTGA CACTTTTGGG 600 
ATCCGATGGC CTGAGGAGCT TGAATGTGAC AGATTACAAT ACTGTGATGA GACTGTTCCT 660 
GTAACTTTTG ATCCACACAC AGAATTTCTT GGTCCTCAGA AGAAAACAGA ACAAGTCCAA 720 
AGAGACATTG GATTTTGGTG TCCAAGGCAT CTTAAGACTT CTGGGGGACA AGGATATAAG 780 
TTTCTGGGAA TTGACCAGTG TGCGCCTCCA TGCCCCAACA TGTATTTTAA AAGTGATGAG 840 
CTAGAGTTTG CAAAAAGTTT TATTGGAACA GTTTCAATAT TTTGTCTTTG TGCAACTCTG 900 
TTCACATTCC TTACTTTTTT AATTGATGTT AGAAGATTCA GATACCCAGA GAGACCAATT 960 

ATATATTACT CTGTCTGTTA CAGCATTGTA TCTCTTATGT ACTTCATTGG ATTTTTGCTG 1020 

GGCGATAGCA CAGCCTGCAA TAAGGCAGAT GAGAAGCTAG AACTTGGTGA CACTGTTGTC 1080 

CTAGGCTCTC AAAATAAGGC TTGCACCGTT TTGTTCATGC TTTTGTATTT TTTCACAATG 1140 

GCTGGCACTG TGTGGTGGGT GATTCTTACC ATTACTTGGT TCTTAGCTGC AGGAAGAAAA 1200 

TGGAGTTGTG AAGCCATCGA GCAAAAAGCA GTGTGGTTTC ATGCTGTTGC ATGGGGAACA 1260 

CCAGGTTTCC TGACTGTTAT GCTTCTTGCT CTGAACAAAG TTGAAGGAGA CAACATTAGT 1320 

GGAGTTTGCT TTGTTGGCCT TTATGACCTG GATGCTTCTC GCTACTTTGT ACTCTTGCCA 1380 

CTGTGCCTTT GTGTGTTTGT TGGGCTCTCT CTTCTTTTAG CTGGCATTAT TTCCTTAAAT 1440 

CATGTTCGAC AAGTCATACA ACATGATGGC CGGAACCAAG AAAAACTAAA GAAATTTATG 1500 

ATTCGAATTG GAGTCTTCAG CGGCTTGTAT CTTGTGCCAT TAGTGACACT TCTCGGATGT 1560 

TACGTCTATG AGCAAGTGAA CAGGATTACC TGGGAGATAA CTTGGGTCTC TGATCATTGT 1620 

CGTCAGTACC ATATCCCATG TCCTTATCAG GCAAAAGCAA AAGCTCGACC AGAATTGGCT 1680 

TTATTTATGA TAAAATACCT GATGACATTA ATTGTTGGCA TCTCTGCTGT CTTCTGGGTT 1740 

GGAAGCAAAA AGACATGCAC AGAATGGGCT GGGTTTTTTA AACGAAATCG CAAGAGAGAT 1800 

CCAATCAGTG AAAGTCGAAG AGTACTACAG GAATCATGTG AGTTTTTCTT AAAGCACAAT 1860 

TCTAAAGTTA AACACAAAAA GAAGCACTAT AAACCAAGTT CACACAAGCT GAAGGTCATT 1920 

TCCAAATCCA TGGGAACCAG CACAGGAGCT ACAGCAAATC ATGGCACTTC TGCAGTAGCA 1980 

ATTACTAGCC ATGATTACCT AGGACAAGAA ACTTTGACAG AAATCCAAAC CTCACCAGAA 2040 

ACATCAATGA GAGAGGTGAA AGCGGACGGA GCTAGCACCC CCAGGTTAAG AGAACAGGAC 2100 

TGTGGTGAAC CTGCCTCGCC AGCAGCATCC ATCTCCAGAC TCTCTGGGGA ACAGGTCGAC 2160 

GGGAAGGGCC AGGCAGGCAG TGTATCTGAA AGTGCGCGGA GTGAAGGAAG GATTAGTCCA 2220 

AAGAGTGATA TTACTGACAC TGGCCTGGCA CAGAGCAACA ATTTGCAGGT CCCCAGTTCT 2280 

TCAGAACCAA GCAGCCTCAA AGGTTCCACA TCTCTGCTTG TTCACCCAGT TTCAGGAGTG 2340 

AGAAAAGAGC AGGGAGGTGG TTGTCATTCA GATACTTGAA GAACATTTTC TCTCGTTACT 2400 

CAGAAGCAAA TTTGTGTTAC ACTGGAAGTG ACCTATGCAC TGTTTTGTAA GAATCACTGT 2460 

TACGTTCTTC TTTTGCACTT AAAGTTGCAT TGCCTACTGT TATACTGGAA AAAATAGAGT 2520 

TCAAGAATAA TATGACTCAT TTCACACAAA GGTTAATGAC AACAATATAC CTGAAAACAG 2580 

AAATGTGCAG GTTAATAATA TTTTTTTAAT AGTGTGGGAG GACAGAGTTA GAGGAATCTT 2640 

CCTTTTCTAT TTATGAAGAT TCTACTCTTG GTAAGAGTAT TTTAAGATGT ACTATGCTAT 2700 

TTTACCTTTT TGATATAAAA TCAAGATATT TCTTTGCTGA AGTATTTAAA TCTTATCCTT 2760 

GTATCTTTTT ATACATATTT GAAAATAAGC TTATATGTAT TTGAACTTTT TTGAAATCCT 2820 

ATTCAAGTAT TTTTATCATG CTATTGTGAT ATTTTAGCAC TTTGGTAGCT TTTACACTGA 2880 

ATTTCTAAGA AAATTGTAAA ATAGTCTTCT TTTATACTGT AAAAAAAGAT ATACCAAAAA 2940 

GTCTTATAAT AGGAATTTAA CTTTAAAAAC CCACTTATTG ATACCTTACC ATCTAAAATG 3000

TGTGATTTTT ATAGTCTCGT TTTAGGAATT TCACAGATCT AAATTATGTA ACTGAAATAA 3060

GGTGCTTACT CAAAGAGTGT CCACTATTGA TTGTATTATG CTGCTCACTG ATCCTTCTGC 3120 

ATATTTAAAA TAAAATGTCC TAAAGGGTTA GTAGACAAAA TGTTAGTCTT TTGTATATTA 3180 

GGCCAAGTGC AATTGACTTC CCTTTTTTAA TGTTTCATGA CCACCCATTG ATTGTATTAT 3240 

AACCACTTAC AGTTGCTTAT ATTTTTTGTT TTAACTTTTG TTTCTTAACA TTTAGAATAT 3300 
TACATTTTGT ATTATACAGT ACCTTTCTCA GACATTTTGT AG 

Seq ID NO: 651 Protein sequence 
Protein Accession #: NP_003497.1 

1 11 21 31 41 51

I I I I I I

MEMFTFLLTC IFLPLLRGHS LFTCEPITVP RCMKMAYNMT FFPNLMGHYD QSIAAVEMEH 60
FLPLANLECS PNIETFLCKA FVPTCIEQIH VVPPCRKLCE KVYSDCKKLI DTFGIRWPEE 120
LECDRLQYCD ETVPVTFDPH TEFLGPQKKT EQVQRDIGFW CPRHLKTSGG QGYKFLGIDQ 180
CAPPCPNMYF KSDELEFAKS FIGTVSIFCL CATLFTFLTF LIDVRRFRYP ERPIIYYSVC 240
YSIVSLMYFI GFLLGDSTAC NKADEKLELG DTVVLGSQNK ACTVLFMLLY FFTMAGTVWW 300
VILTITWFLA AGRKWSCEAI EQKAVWFHAV AWGTPGFLTV MLLALNKVEG DNISGVCFVG 360
LYDLDASRYF VLLPLCLCVF VGLSLLLAGI ISLNHVRQVI QHDGRNQEKL KKFMIRIGVF 420
SGLYLVPLVT LLGCYVYEQV NRITWEITWV SDHCRQYHIP CPYQAKAKAR PELALFMIKY 480
LMTLIVGISA VFWVGSKKTC TEWAGFFKRN RKRDPISESR RVLQESCEFF LKHNSKVKHK 540
KKHYKPSSHK LKVISKSMGT STGATANHGT SAVAITSHDY LGQETLTEIQ TSPETSMREV 600
KADGASTPRL REQDCGEPAS PAASISRLSG EQVDGKGQAG SVSESARSEG RISPKSDITD 660
TGLAQSNNLQ VPSSSEPSSL KGSTSLLVHP VSGVRKEQGG GCHSDT

Seq ID NO: 652 DNA sequence

Nucleic Acid Accession #: NM_014791.1

Coding sequence: 171..2126



1 11 21 31 41 51

I I I I I I

TTGGCGGGCG GAAGCGGCCA CAACCCGGCG ATCGAAAAGA TTCTTAGGAA CGCCGTACCA 60
GCCGCGTCTC TCAGGACAGC AGGCCCCTGT CCTTCTGTCG GGCGCCGCTC AGCCGTGCCC 120
TCCGCCCCTC AGGTTCTTTT TCTAATTCCA AATAAACTTG CAAGAGGACT ATGAAAGATT 180
ATGATGAACT TCTCAAATAT TATGAATTAC ATGAAACTAT TGGGACAGGT GGCTTTGCAA 240
AGGTCAAACT TGCCTGCCAT ATCCTTACTG GAGAGATGGT AGCTATAAAA ATCATGGATA 300
AAAACACACT AGGGAGTGAT TTGCCCCGGA TCAAAACGGA GATTGAGGCC TTGAAGAACC 360
TGAGACATCA GCATATATGT CAACTCTACC ATGTGCTAGA GACAGCCAAC AAAATATTCA 420
TGGTTCTTGA GTACTGCCCT GGAGGAGAGC TGTTTGACTA TATAATTTCC CAGGATCGCC 480
TGTCAGAAGA GGAGACCCGG GTTGTCTTCC GTCAGATAGT ATCTGCTGTT GCTTATGTGC 540
ACAGCCAGGG CTATGCTCAC AGGGACCTCA AGCCAGAAAA TTTGCTGTTT GATGAATATC 600
ATAAATTAAA GCTGATTGAC NNNNNNNNNN GTGCAAAACC CAAGGGTAAC AAGGATTACC 660
ATCTACAGAC ATGCTGTGGG AGTCTGGCTT ATGCAGCACC TGAGTTAATA CAAGGCAAAT 720
CATATCTTGG ATCAGAGGCA GATGTTTGGA GCATGGGCAT ACTGTTATAT GTTCTTATGT 780
GTGGATTTCT ACCATTTGAT GATGATAATG TAATGGCTTT ATACAAGAAG ATTATGAGAG 840
GAAAATATGA TGTTCCCAAG TGGCTCTCTC CCAGTAGCAT TCTGCTTCTT CAACAAATGC 900
TGCAGGTGGA CCCAAAGAAA CGGATTTCTA TGAAAAATCT ATTGAACCAT CCCTGGATCA 960
TGCAAGATTA CAACTATCCT GTTGAGTGGC AAAGCAAGAA TCCTTTTATT CACCTCGATG 1020
ATGATTGCGT AACAGAACTT TCTGTACATC ACAGAAACAA CAGGCAAACA ATGGAGGATT 1080
TAATTTCACT GTGGCAGTAT GATCACCTCA CGGCTACCTA TCTTCTGCTT CTAGCCAAGA 1140
AGGCTCGGGG AAAACCAGTT CGTTTAAGGC TTTCTTCTTT CTCCTGTGGA CAAGCCAGTG 1200
CTACCCCATT CACAGACATC AAGTCAAATA ATTGGAGTCT GGAAGATGTG ACCGCAAGTG 1260
ATAAAAATTA TGTGGCGGGA TTAATAGACT ATGATTGGTG TGAAGATGAT TTATCAACAG 1320
GTGCTGCTAC TCCCCGAACA TCACAGTTTA CCAAGTACTG GACAGAATCA AATGGGGTGG 1380
AATCTAAATC ATTAACTCCA GCCTTATGCA GAACACCTGC AAATAAATTA AAGAACAAAG 1440
AAAATGTATA TACTCCTAAG TCTGCTGTAA AGAATGAAGA GTACTTTATG TTTCCTGAGC 1500
CAAAGACTCC AGTTAATAAG AACCAGCATA AGAGAGAAAT ACTCACTACG CCAAATCGTT 1560
ACACTACACC CTCAAAAGCT AGAAACCAGT GCCTGAAAGA AACTCCAATT AAAATACCAG 1620
TAAATTCAAC AGGAACAGAC AAGTTAATGA CAGGTGTCAT TAGCCCTGAG AGGCGGTGCC 1680
GCTCAGTGGA ATTGGATCTC AACCAAGCAC ATATGGAGGA GACTCCAAAA AGAAAGGGAG 1740
CCAAAGTGTT TGGGAGCCTT GAAAGGGGGT TGGATAAGGT TATCACTGTG CTCACCAGGA 1800
GCAAAAGGAA GGGTTCTGCC AGAGACGGGC CCAGAAGACT AAAGCTTCAC TATAATGTGA 1860
CTACAACTAG ATTAGTGAAT CCAGATCAAC TGTTGAATGA AATAATGTCT ATTCTTCCAA 1920
AGAAGCATGT TGACTTTGTA CAAAAGGGTT ATACACTGAA GTGTCAAACA CAGTCAGATT 1980
TTGGGAAAGT GACAATGCAA TTTGAATTAG AAGTGTGCCA GCTTCAAAAA CCCGATGTGG 2040
TGGGTATCAG GAGGCAGCGG CTTAAGGGCG ATGCCTGGGT TTACAAAAGA TTAGTGGAAG 2100
ACATCCTATC TAGCTGCAAG GTATAATTGA TGGATTCTTC CATCCTGCCG GATGAGTGTG 2160
GGTGTGATAC AGCCTACATA AAGACTGTTA TGATCGCTTT GATTTTAAAG TTCATTGGAA 2220
CTACCAACTT GTTTCTAAAG AGCTATCTTA AGACCAATAT CTCTTTGTTT TTAAACAAAA 2280
GATATTATTT TGTGTATGAA TCTAAATCAA GCCCATCTGT CATTATGTTA CTGTCTTTTT 2340
TAATCATGTG GTTTTGTATA TTAATAATTG TTGACTTTCT TAGATTCACT TCCATATGTG 2400
AATGTAAGCT CTTAACTATG TCTCTTTGTA ATGTGTAATT TCTTTCTGAA ATAAAACCAT 2460
TTGTGAATAT


Seq ID NO: 653 Protein sequence 
Protein Accession #: NP_055606.1



1 11 21 31 41 51

I I I I I I

MKDYDELLKY YELHETIGTG GFAKVKLACH ILTGEMVAIK IMDKNTLGSD LPRIKTEIEA 60
LKNLRHQHIC QLYHVLETAN KIFMVLEYCP GGELFDYIIS QDRLSEEETR VVFRQIVSAV 120
AYVHSQGYAH RDLKPENLLF DEYHKLKLID FGLCAKPKGN KDYHLQTCCG SLAYAAPELI 180
QGKSYLGSEA DVWSMGILLY VLMCGFLPFD DDNVMALYKK IMRGKYDVPK WLSPSSILLL 240
QQMLQVDPKK RISMKNLLNH PWIMQDYNYP VEWQSKNPFI HLDDDCVTEL SVHHRNNRQT 300
MEDLISLWQY DHLTATYLLL LAKKARGKPV RLRLSSFSCG QASATPFTDI KSNNWSLEDV 360
TASDKNYVAG LIDYDWCEDD LSTGAATPRT SQFTKYWTES NGVESKSLTP ALCRTPANKL 420
KNKENVYTPK SAVKNEEYFM FPEPKTPVNK NQHKREILTT PNRYTTPSKA RNQCLKETPI 480
KIPVNSTGTD KLMTGVISPE RRCRSVELDL NQAHMEETPK RKGAKVFGSL ERGLDKVITV 540
LTRSKRKGSA RDGPRRLKLH YNVTTTRLVN PDQLLNEIMS ILPKKHVDFV QKGYTLKCQT 600
QSDFGKVTMQ FELEVCQLQK PDVVGIRRQR LKGDAWVYKR LVEDILSSCK V



Seq ID NO: 654 DNA sequence 
Nucleic Acid Accession #: NM_000582 
Coding sequence: 88..990






1 11 21 31 41 51

I I I I I I

GCAGAGCACA GCATCGTCGG GACCAGACTC GTCTCAGGCC AGTTGCAGCC TTCTCAGCCA 60
AACGCCGACC AAGGAAAACT CACTACCATG AGAATTGCAG TGATTTGCTT TTGCCTCCTA 120
GGCATCACCT GTGCCATACC AGTTAAACAG GCTGATTCTG GAAGTTCTGA GGAAAAGCAG 180
CTTTACAACA AATACCCAGA TGCTGTGGCC ACATGGCTAA ACCCTGACCC ATCTCAGAAG 240
CAGAATCTCC TAGCCCCACA GACCCTTCCA AGTAAGTCCA ACGAAAGCCA TGACCACATG 300
GATGATATGG ATGATGAAGA TGATGATGAC CATGTGGACA GCCAGGACTC CATTGACTCG 360
AACGACTCTG ATGATGTAGA TGACACTGAT GATTCTCACC AGTCTGATGA GTCTCACCAT 420
TCTGATGAAT CTGATGAACT GGTCACTGAT TTTCCCACGG ACCTGCCAGC AACCGAAGTT 480
TTCACTCCAG TTGTCCCCAC AGTAGACACA TATGATGGCC GAGGTGATAG TGTGGTTTAT 540
GGACTGAGGT CAAAATCTAA GAAGTTTCGC AGACCTGACA TCCAGTACCC TGATGCTACA 600
GACGAGGACA TCACCTCACA CATGGAAAGC GAGGAGTTGA ATGGTGCATA CAAGGCCATC 660
CCCGTTGCCC AGGACCTGAA CGCGCCTTCT GATTGGGACA GCCGTGGGAA GGACAGTTAT 720
GAAACGAGTC AGCTGGATGA CCAGAGTGCT GAAACCCACA GCCACAAGCA GTCCAGATTA 780
TATAAGCGGA AAGCCAATGA TGAGAGCAAT GAGCATTCCG ATGTGATTGA TAGTCAGGAA 840
CTTTCCAAAG TCAGCCGTGA ATTCCACAGC CATGAATTTC ACAGCCATGA AGATATGCTG 900
GTTGTAGACC CCAAAAGTAA GGAAGAAGAT AAACACCTGA AATTTCGTAT TTCTCATGAA 960
TTAGATAGTG CATCTTCTGA GGTCAATTAA AAGGAGAAAA AATACAATTT CTCACTTTGC 1020
ATTTAGTCAA AAGAAAAAAT GCTTTATAGC AAAATGAAAG AGAACATGAA ATGCTTCTTT 1080
CTCAGTTTAT TGGTTGAATG TGTATCTATT TGAGTCTGGA AATAACTAAT GTGTTTGATA 1140
ATTAGTTTAG TTTGTGGCTT CATGGAAACT CCCTGTAAAC TAAAAGCTTC AGGGTTATGT 1200
CTATGTTCAT TCTATAGAAG AAATGCAAAC TATCACTGTA TTTTAATATT TGTTATTCTC 1260
TCATGAATAG AAATTTATGT AGAAGCAAAC AAAATACTTT TACCCACTTA AAAAGAGAAT 1320
ATAACATTTT ATGTCACTAT AATCTTTTGT TTTTTAAGTT AGTGTATATT TTGTTGTGAT 1380
TATCTTTTTG TGGTGTGAAT AAATCTTTTA TCTTGAATGT AATAAGAATT TGGTGGTGTC 1440
AATTGCTTAT TTGTTTTCCC ACGGTTGTCC AGCAATTAAT AAAACATAAC CTTTTTTACT 1500
GCCTAAAAAA AAAAAAAAAA AAAA

Seq ID NO: 655 Protein sequence
Protein Accession #: NP_000573

1 11 21 31 41 51

I I I I I I

MRIAVICFCL LGITCAIPVK QADSGSSEEK QLYNKYPDAV ATWLNPDPSQ KQNLLAPQTL 60
PSKSNESHDH MDDMDDEDDD DHVDSQDSID SNDSDDVDDT DDSHQSDESH HSDESDELVT 120
DFPTDLPATE VFTPVVPTVD TYDGRGDSVV YGLRSKSKKF RRPDIQYPDA TDEDITSHME 180
SEELNGAYKA IPVAQDLNAP SDWDSRGKDS YETSQLDDQS AETHSHKQSR LYKRKANDES 240
NEHSDVIDSQ ELSKVSREFH SHEFHSHEDM LVVDPKSKEE DKHLKFRISH ELDSASSEVN



Seq ID NO: 656 DNA sequence 

Nucleic Acid Accession #: NM_003108.1

Coding sequence: 76.. 1401 



1 11 21 31 41 51

I I I I I I

GGGGTGGGAG GGGGAGGGGG ACCTCCGCAC GAGACCCAGC GGCCCGGGTT GGAGCGTCCA 60
GCCCTGCAAC GGATCATGGT GCAGCAGGCG GAGAGCTTGG AAGCGGAGAG CAACCTGCCC 120
CGGGAGGCGC TGGACACGGA GGAGGGCGAA TTCATGGCTT GCAGCCCGGT GGCCCTGGAC 180
GAGAGCGACC CAGACTGGTG CAAGACGGCG TCGGGCCACA TCAAGCGGCC GATGAACGCG 240
TTCATGGTAT GGTCCAAGAT CGAACGCAGG AAGATCATGG AGCAGTCTCC GGACATGCAC 300
AACGCCGAGA TCTCCAAGAG GCTGGGCAAG CGCTGGAAAA TGCTGAAGGA CAGCGAGAAG 360
ATCCCGTTCA TCCGGGAGGC GGAGCGGCTG CGGCTCAAGC ACATGGCCGA CTACCCCGAC 420
TACAAGTACC GGCCCCGGAA AAAGCCCAAA ATGGACCCCT CGGCCAAGCC CAGCGCCAGC 480
CAGAGCCCAG AGAAGAGCGC GGCCGGCGGC GGCGGCGGGA GCGCGGGCGG AGGCGCGGGC 540
GGTGCCAAGA CCTCCAAGGG CTCCAGCAAG AAATGCGGCA AGCTCAAGGC CCCCGCGGCC 600
GCGGGCGCCA AGGCGGGCGC GGGCAAGGCG GCCCAGTCCG GGGACTACGG GGGCGCGGGC 660
GACGACTACG TGCTGGGCAG CCTGCGCGTG AGCGGCTCGG GCGGCGGCGG CGCGGGCAAG 720
ACGGTCAAGT GCGTGTTTCT GGATGAGGAC GACGACGACG ACGACGACGA CGACGAGCTG 780
CAGCTGCAGA TCAAACAGGA GCCGGACGAG GAGGACGAGG AACCACCGCA CCAGCAGCTC 840
CTGCAGCCGC CGGGGCAGCA GCCGTCGCAG CTGCTGAGAC GCTACAACGT CGCCAAAGTG 900
CCCGCCAGCC CTACGCTGAG CAGCTCGGCG GAGTCCCCCG AGGGAGCGAG CCTCTACGAC 960
GAGGTGCGGG CCGGCGCGAC CTCGGGCGCC GGGGGCGGCA GCCGCCTCTA CTACAGCTTC 1020
AAGAACATCA CCAAGCAGCA CCCGCCGCCG CTCGCGCAGC CCGCGCTGTC GCCCGCGTCC 1080
TCGCGCTCGG TGTCCACCTC CTCGTCCAGC AGCAGCGGCA GCAGCAGCGG CAGCAGCGGC 1140
GAGGACGCCG ACGACCTGAT GTTCGACCTG AGCTTGAATT TCTCTCAAAG CGCGCACAGC 1200
GCCAGCGAGC AGCAGCTGGG GGGCGGCGCG GCGGCCGGGA ACCTGTCCCT GTCGCTGGTG 1260
GATAAGGATT TGGATTCGTT CAGCGAGGGC AGCCTGGGCT CCCACTTCGA GTTCCCCGAC 1320
TACTGCACGC CGGAGCTGAG CGAGATGATC GCGGGGGACT GGCTGGAGGC GAACTTCTCC 1380
GACCTGGTGT TCACATATTG AAAGGCGCCC GCTGCTCGCT CTTTCTCTCG GAGGGTGCAG 1440
AGCTGGGTTC CTTGGGAGGA AGTTGTAGTG GTGATGATGA TGATGATGAT AATGATGATG 1500
ATGATGGTGG TGTTGATGGT GGCGGTGGTA GGGTGGAGGG GAGAGAAGAA GATGCTGATG 1560
ATATTGATAA GATGTCGTGA CGCAAAGAAA TTGGAAAACA TGATGAAAAT TTTGGTGGAG 1620
TTAAAGTGAA ATGAGTAGTT TTTAAACATT TTTCCTGTCC TTTTTTTGTC CCCCCTCCCT 1680
TCCTTTATCG TGTCTCAAGG TAGTTGCATA CCTAGTCTGG AGTTGTGATT ATTTTCCCAA 1740
AAAATGTGTT TTTGTAATTA CTATTTCTTT TTCCTGAAAT TCGTGATTGC AACAAAGGCA 1800
GAGGGGGCGG CGCGGCGGAG GGGAGGTAGG ACCCGCTCCG GAAGGCGCTG TTTGAAGCTT 1860
GTCGGTCTTT GAAGTCTGGA AGACGTCTGC AGAGGACCCT TTTGGCAGCA CAACTGTTAC 1920
TCTAGGGAGT TGGTGGAGAT ATTTTTTTTT CTTAAGAGAA CTTAAAGAAC TGGTGATTTT 1980
TTTTTAACAA AAAAAGGG



Seq ID NO: 657 Protein sequence 
Protein Accession #: NP_003099.1




1 11 21 31 41 51 

I I I I I I 

MVQQAESLEA ESNLPREALD TEEGEFMACS PVALDESDPD WCKTASGHIK RPMNAFMVWS 60
KIERRKIMEQ SPDMHNAEIS KRLGKRWKML KDSEKIPFIR EAERLRLKHM ADYPDYKYRP 120
RKKPKMDPSA KPSASQSPEK SAAGGGGGSA GGGAGGAKTS KGSSKKCGKL KAPAAAGAKA 180
GAGKAAQSGD YGGAGDDYVL GSLRVSGSGG GGAGKTVKCV FLDEDDDDDD DDDELQLQIK 240
QEPDEEDEEP PHQQLLQPPG QQPSQLLRRY NVAKVPASPT LSSSAESPEG ASLYDEVRAG 300
ATSGAGGGSR LYYSFKNITK QHPPPLAQPA LSPASSRSVS TSSSSSSGSS SGSSGEDADD 360
LMFDLSLNFS QSAHSASEQQ LGGGAAAGNL SLSLVDKDLD SFSEGSLGSH FEFPDYCTPE 420
LSEMIAGDWL EANFSDLVFT Y

Seq ID NO: 658 DNA sequence 
Nucleic Acid Accession #: NM_001719 
Coding sequence: 123.. 1418 






GGGCGCAGCG 
CTGCCACCTG 
CGATGCACGT 
CCCTGTTCCT 
GCTTCATCCA 
CCATTTTGGG 
CCATGTTCAT 
GCCAGGGCTT 
GCCTGCAAGA 
TGGAACATGA 
TTTCCAAGAT 
ACATCCGGGA 
AGCACTTGGG 
AGGAGGGCTG 
GGCACAACCT 
AGTTGGCGGG 
TCTTCAAGGC 
GCCAGAACCG 
AGAACAGCAG 
GAGACCTGGG 
AGGGGGAGTG 
AGACGCTGGT 
AGCTCAATGC 
ACAGAAACAT 
TTGGGGCCAA 
CTGCCTTTTG 
AAACATGAGC 
TCCTACAAGC 
GCCGGGCCAG 
TTATGAGCGC 
GGGCACATTG 
CAATAAAACG 



11 
I 

GGGCCCGTCT 
GGGCGGTGCG 
GCGCTCACTG 
GCTGCGCTCC 
CCGGCGCCTC 
CTTGCCCCAC 
GCTGGACCTG 
CTCCTACCCC 
TAGCCATTTC 
CAAGGAATTC 
CCCAGAAGGG 
ACGCTTCGAC 
CAGGGAATCG 
GCTGGTGTTT 
GGGCCTGCAG 
CCTGATTGGG 
CACGGAGGTC 
CTCCAAGACG 
CAGCGACCAG 
CTGGCAGGAC 
TGCCTTCCCT 
CCACTTCATC 
CATCTCCGTC 
GGTGGTCCGG 
GTTTTTCTGG 
TGAGACCTTC 
AGCATATGGC 
TGTGCAGGCA 
GTCATTGGCT 
CTACCAGCCA 
GTGTCTGTGC 
AATGAATG 



21 
I 

GCAGCAAGTG 
GGCCCGGAGC 
CGAGCTGCGG 
GCCCTGGCCG 
CGCAGCCAGG 
CGCCCGCGCC 
TACAACGCCA
TACAAGGCCG 
CTCACCGACG 
TTCCACCCAC 
GAAGCTGTCA 
AATGAGACGT 
GATCTCTTCC 
GACATCACAG 
CTCTCGGTGG 
CGGCACGGGC 
CACTTCCGCA 
CCCAAGAACC 
AGGCAGGCCT 
TGGATCATCG 
CTGAACTCCT 
AACCCGGAAA 
CTCTACTTCG 
GCCTGTGGCT 
ATCCTCCATT 
CCCTCCCTAT 
TTTTGATCAG 
AAACCTAGCA 
GGGAAGTCTC 
GGCCACCCAG 
GAAAGGAAAA 



31 

I

ACCGACGGCC 
CCGGAGCCCG 
CGCCGCACAG 
ACTTCAGCCT 
AGCGGCGGGA 
CGCACCTCCA 
TGGCGGTGGA
TCTTCAGTAC 
CCGACATGGT 
GCTACCACCA 
CGGCAGCCGA 
TCCGGATCAG 
TGCTCGACAG 
CCACCAGCAA 
AGACGCTGGA 
CCCAGAACAA 
GCATCCGGTC 
AGGAAGCCCT 
GTAAGAAGCA 
CGCCTGAAGG 
ACATGAACGC 
CGGTGCCCAA 
ATGACAGCTC 
GCCACTAGCT 
GCTCGCCTTG 
CCGCAACTTT 
TTTTTCAGTG 
GGAAAAAAAA 
AGCCATGCAC 
CCGTGGGAGG 
TTGACCCGGA 



41 
I 

GGGAOGGCCG 
GGTAGCGCGT 
CTTCGTGGCG 
GGACAACGAG 
GATGCAGCGC 
GGGCAAGCAC 
GGAGGGCGGC 
CCAGGGCCCC 
CATGAGCTTC 
TCGAGAGTTC 
ATTCCGGATC 
CGTTTATCAG 
CCGTACCCTC 
CCACTGGGTG 
TGGGCAGAGC 
GCAGCCCTTC 
CACGGGGAGC 
GCGGATGGCC 
CGAGCTGTAT 
CTACGCCGCC 
CACCAACCAC 
GCCCTGCTGT 
CAACGTCATC 
CCTCCGAGAA 
GCCAGGAACC 
AAAGGTGTGA 
GCAGCATCCA 
ACAACGCATA 
GGACTCGTTT 
AAGGGGGCGT 
AGTTCCTGTA 



51 
I 

CCTGCCCCCT 
AGAGCCGGCG 
CTCTGGGCAC 
GTGCACTCGA 
GAGATCCTCT 
AACTCGGCAC 
GGGCCCGGCG 
CCTCTGGCCA 
GTCAACCTCG 
CGGTTTGATC 
TACAAGGACT 
GTGCTCCAGG 
TGGGCCTCGG 
GTCAATCCGC 
ATCAACCCCA 
ATGGTGGCTT 
AAACAGCGCA 
AACGTGGCAG 
GTCAGCTTCC 
TACTACTGTG 
GCCATCGTGC 
GCGCCCACGC 
CTGAAGAAAT 
TTCAGACCCT 
AGCAGACCAA 
GAGTATTAGG 
ATGAACAAGA 
AAGAAAAATG 
CCAGAGGTAA
GGCAAGGGGT 
ATAAATGTCA 



Seq ID NO: 659 Protein sequence 
Protein Accession #: NP_001710 



MHVRSLRAAA 
ILGLPHRPRP 
LQDSHFLTDA 
IRERFDNETF 
HNLGLQLSVE 
QNRSKTPKNQ 
GECAFPLNSY 
RNMVVRACGC



11 
I 

PHSFVALWAP 
HLQGKHNSAP 
DMVMSFVNLV 
RISVYQVLQE 
TLDGQSINPK 
EALRMANVAE 
MNATNHAIVQ 
H 



21 
I 

LFLLRSALAD 
MFMLDLYNAM 
EHDKEFFHPR 
HLGRESDLFL 
LAGLIGRHGP 
NSSSDQRQAC 
TLVHFINPET 



31 
I 

FSLDNEVHSS 
AVEEGGGPGG 
YHHREFRFDL 
LDSRTLWASE 
QNKQPFMVAF 
KKHELYVSFR 
VPKPCCAPTQ 



41 
I 

FIHRRLRSQE 
QGFSYPYKAV 
SKIPEGEAVT 
EGWLVFDITA 
FKATEVHFRS 
DLGWQDWIIA 
LNAISVLYFD 



51 
I

RREMQREILS 
FSTQGPPLAS 
AAEFRIYKDY 
TSNHWVVNPR
IRSTGSKQRS 
PEGYAAYYCE 
DSSNVILKKY 



Seq ID NO: 660 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 211..1895



GGATCTGAGG 
GGGGCTTGGG 
GAGGAATTAT 
TGATTTTTTT 
GTGCTTTTTC 
CACAGGTTCC 
CTTGTGCTGA 
GAAGGTAATT 
AAAATATCGG 
TTCCGACACT 
GCCAATTATT 
TTCTTTGAAC 
GCTGTGGCTA 
CACATGCACT 
GTAGTCCATG 
CAAAATTCCA 
GTTGTGATGT 
TACCTGCATA 
ATCTTGATAG 
ACTCTGGCTG 
GCACCGATCT 
CTAGCTACCA 
AAACTGGCCA 



11 

I 

GGCGCCCAGT 
AGGCAGCCTG 
CTGATAAAAT 
CCCTCGAAAA 
TTTTCTCTTC 
TTGAACAGCT 
AAGCGAAAGT 
GTTTCCCTGA 
CTGTTCCATG 
GTAACCCCAA 
CAGACTGCCT 
GCCTCTATGT 
TTCTCATCAT 
TATTTGTGTC 
CTCACATAGG 
TTGAGGCAAC 
TTATTTACTT 
ATCTCATCTT 
GCTGGGGGTT 
ATGCGAGGTG 
TAGCAGCTAT 
AAATCTGGGA 
AATCGACACT 



21 
I 

CACTTCCTCC 
CTCTCCAGTC 
TCCTGGGTTA 
TGACCTTTTT 
TTTTTCTACG 
GGATTCTGAT 
ACAATGTGAA 
ATGGGATGGA 
CCCTCCTTAT 
TGGAACATGG 
TCGCTTTCTG 
AATGTATACC 
TGGTTACTTC 
TTTCATGCTG 
AGTAAAGGAG 
TTCTGTGGAC 
CCTGGCTACA 
TGTGGCTTTC 
TCCAGCAGCA 
CTGGGAACTT 
TGGGCTGAAT 
GACCAATGCA 
GGTCCTGGTC 



31 
I 

ACGTTCTCGT 
CCTATCCACC 
ATATTTTTAA 
ATGCTTCGAA 
ATAAATGAAA 
GGCACCATTA 
CTCAACATCA 
CTCATTTGTT 
ATTTATGACT 
GATTTTATGC 
CAGCCAGATA 
GTTGGCTACT 
AGACGATTGC 
AGAGCTACAA 
CTGGAGTCCC 
AAATCACAAT 
AATTATTATT 
TTTTCGGACA 
TTTGTTGCAG 
AGTGCTGGAG 
TTTATTCTGT 
GTTGGGCATG 
CTAGTCTTTG 



41 
I 

GCTGGGCGGG 
CACAGGTTTT 
AAACGGAGAG 
GCAGTTTGTC 
GCATTTCTTC 
CTATAGAGGA 
CAGCTCAACT 
GGCCCAGAGG 
TCAACCATAA 
ACAGCTTAAA 
TCAGCATAGG 
CCATCTCTTT 
ATTGCACTAG 
GCATCTTTGT 
TAATAATGCA 
ATATCGGGTG 
GGATCCTGGT 
CCAAATACCT 
CATGGGCTGT 
ACATCAAGTG 
TTCTGAATAC 
ACACAAGGAA 
GAGTGCATTA 



51 
I 

AGGAGCGGAT 
TTGGGTCGGA 
TTTTTAAAAA 
AACCAGCATA 
AAGAAAAAGG 
GCAGATTGTC 
CCAGGAGGGA 
AACAGTGGGG 
AGGAGTTGCT 
TAAAACATGG 
AAAGCAAGAA 



GAACTATATC 
CAAAGACAGA 
GGATGACCCA 
CAAGATTGCT 
GGAAGGTCTC 
GTGGGGCTTC 
GGCACGAGCA 
GATTTATCAA 
GGTTAGAGTT 
GCAATACAGG 
CATCGTGTTC 



60 
120 
180 
240 
300 
360 
420
540
600
780
900
960
60
120 
180 
240 
300 
360 
420 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 




GTATGCCTGC CTCACTCCTT CACTGGGCTC GGGTGGGAGA TCCGCATGCA CTGTGAGCTC 1440 

TTCTTCAACT CCTTTCAGGG TTTCTTTGTG TCTATCATCT ACTGCTACTG CAATGGAGAG 1500 

GTTCAGGCAG AGGTGAAGAA GATGTGGAGT CGGTGGAATC TCTCCGTGGA CTGGAAAAGG 1560 

ACACCGCCAT GTGGCAGCCG CAGATGCGGC TCAGTGCTCA CCACCGTGAC GCACAGCACC 1620 

AGCAGCCAGT CACAGGTGGC GGCCAGCACA CGCATGGTGC TTATCTCTGG CAAAGCTGCC 1680 

AAGATCGCCA GCAGACAGCC TGACAGCCAC ATCACTTTAC CTGGCTATGT CTGGAGTAAC 1740 

TCAGAGCAGG ACTGCCTGCC ACACTCTTTC CACGAGGAGA CCAAGGAAGA TAGTGGGAGG 1800 

CAGGGAGATG ATATTCTAAT GGAGAAGCCT TCCAGGCCTA TGGAATCTAA CCCAGACACT 1860 
GAAGGATGCC AAGGAGAAAC TGAGGATGTT CTCTGA 

Seq ID NO: 661 Protein sequence 
Protein Accession #: Eos sequence 

1 11 21 31 41 51 

I I I I I I 

MLRSSLSTSI VLFLFSSFST INESISSRKR HRFLEQLDSD GTITIEEQIV LVLKAKVQCE 60

LNITAQLQEG EGNCFPEWDG LICWPRGTVG KISAVPCPPY IYDFNHKGVA FRHCNPNGTW 120

DFMHSLNKTW ANYSDCLRFL QPDISIGKQE FFERLYVMYT VGYSISFGSL AVAILIIGYF 180

RRLHCTRNYI HMHLFVSFML RATSIFVKDR VVHAHIGVKE LESLIMQDDP QNSIEATSVD 240

KSQYIGCKIA VVMFIYFLAT NYYWILVEGL YLHNLIFVAF FSDTKYLWGF ILIGWGFPAA 300

FVAAWAVARA TLADARCWEL SAGDIKWIYQ APILAAIGLN FILFLNTVRV LATKIWETNA 360

VGHDTRKQYR KLAKSTLVLV LVFGVHYIVF VCLPHSFTGL GWEIRMHCEL FFNSFQGFFV 420

SIIYCYCNGE VQAEVKKMWS RWNLSVDWKR TPPCGSRRCG SVLTTVTHST SSQSQVAAST 480

RMVLISGKAA KIASRQPDSH ITLPGYVWSN SEQDCLPHSF HEETKEDSGR QGDDILMEKP 540
SRPMESNPDT EGCQGETEDV L 

Seq ID NO: 662 DNA sequence

Nucleic Acid Accession #: NM_005048

Coding sequence: 143..1795

1 11 21 31 41 51 

I I I I I I 

GGCCGGTGGC CCGGGCCCGA CCACCCCAGC TGCGCGTCGT TACTGGCCAC AAGTTTGCTC 60 

TGGGCCAGCC AAGTTGGCAA CTTGGAAGCT TCTCCCGGGC TCTGGAGGAG GGTCCCTGCT 120 

TCTTCCTACA GCCGTTCCGG GCATGGCCGG GCTGGGGGCG TCGCTCCACG TCTGGGGTTG 180 

GCTAATGCTC GGCAGCTGCC TCCTGGCCAG AGCCCAGCTG GATTCTGATG GCACCATTAC 240

TATAGAGGAG CAGATTGTCC TTGTGCTGAA AGCGAAAGTA CAATGTGAAC TCAACATCAC 300

AGCTCAACTC CAGGAGGGAG AAGGTAATTG TTTCCCTGAA TGGGATGGAC TCATTTGTTG 360 

GCCCAGAGGA ACAGTGGGGA AAATATCGGC TGTTCCATGC CCTCCTTATA TTTATGACTT 420 

CAACCATAAA GGAGTTGCTT TCCGACACTG TAACCCCAAT GGAACATGGG ATTTTATGCA 480 

CAGCTTAAAT AAAACATGGG CCAATTATTC AGACTGCCTT CGCTTTCTGC AGCCAGATAT 540 

CAGCATAGGA AAGCAAGAAT TCTTTGAACG CCTCTATGTA ATGTATACCG TTGGCTACTC 600 

CATCTCTTTT GGTTCCTTGG CTGTGGCTAT TCTCATCATT GGTTACTTCA GACGATTGCA 660 

TTGCACTAGG AACTATATCC ACATGCACTT ATTTGTGTCT TTCATGCTGA GAGCTACAAG 720 

CATCTTTGTC AAAGACAGAG TAGTCCATGC TCACATAGGA GTAAAGGAGC TGGAGTCCCT 780 

AATAATGCAG GATGACCCAC AAAATTCCAT TGAGGCAACT TCTGTGGACA AATCACAATA 840

TATCGGGTGC AAGATTGCTG TTGTGATGTT TATTTACTTC CTGGCTACAA ATTATTATTG 900 

GATCCTGGTG GAAGGTCTCT ACCTGCATAA TCTCATCTTT GTGGCTTTCT TTTCGGACAC 960 

CAAATACCTG TGGGGCTTCA TCTTGATAGG CTGGGGGTTT CCAGCAGCAT TTGTTGCAGC 1020 

ATGGGCTGTG GCACGAGCAA CTCTGGCTGA TGCGAGGTGC TGGGAACTTA GTGCTGGAGA 1080 

CATCAAGTGG ATTTATCAAG CACCGATCTT AGCAGCTATT GGGCTGAATT TTATTCTGTT 1140 

TCTGAATACG GTTAGAGTTC TAGCTACCAA AATCTGGGAG ACCAATGCAG TTGGGCATGA 1200 

CACAAGGAAG CAATACAGGA AACTGGCCAA ATCGACACTG GTCCTGGTCC TAGTCTTTGG 1260 

AGTGCATTAC ATCGTGTTCG TATGCCTGCC TCACTCCTTC ACTGGGCTCG GGTGGGAGAT 1320 

CCGCATGCAC TGTGAGCTCT TCTTCAACTC CTTTCAGGGT TTCTTTGTGT CTATCATCTA 1380 

CTGCTACTGC AATGGAGAGG TTCAGGCAGA GGTGAAGAAG ATGTGGAGTC GGTGGAATCT 1440 

CTCCGTGGAC TGGAAAAGGA CACCGCCATG TGGCAGCCGC AGATGCGGCT CAGTGCTCAC 1500 

CACCGTGACG CACAGCACCA GCAGCCAGTC ACAGGTGGCG GCCAGCACAC GCATGGTGCT 1560 

TATCTCTGGC AAAGCTGCCA AGATCGCCAG CAGACAGCCT GACAGCCACA TCACTTTACC 1620 

TGGCTATGTC TGGAGTAACT CAGAGCAGGA CTGCCTGCCA CACTCTTTCC ACGAGGAGAC 1680 

CAAGGAAGAT AGTGGGAGGC AGGGAGATGA TATTCTAATG GAGAAGCCTT CCAGGCCTAT 1740 

GGAATCTAAC CCAGACACTG AAGGATGCCA AGGAGAAACT GAGGATGTTC TCTGAATGGA 1800 

CATTTGTGGC TGACTTTCAT GGGCTGGTCC AATGGCTGGT TGTGTGAGAG GGCTTGGCTG 1860 

ATACTCCTAT GCTTGAGTTC AAAGGCTGAA AATTCAGTTA AGGTGTTACT TAATAATAGT 1920 

TTTTAGGCTC CATGAATTGG CTCCTGTAAA TACTAACGAC ATGAAAATGC AAGTGTCAAT 1980 

GGAGTAGTTT ATTACCTTCT ATTGGCATCA AGTTTTCCTC TAAATTAATG TATGGTATTT 2040 

GCTCTGTGAT TGTTCATTTT TTTCTGCTAC TTTTGGGTAG AAAAAAGATT CAATTGCTTG 2100 

GCTGTAGCTT TCTCTCATAT ATATCACCCT AAATATAATG AAGATCTTTT AGTGTGTATC 2160 

ATTTTCCTTT TAGAAACTAG TATTCTCTTA TTTCTTACTT TAATGTACTT CTATCACTGC 2220 

ATTTATTTTG CCTGTGCATA GGAGCAATTA GGATCTAAAA AAATATATGG GAAGATAAAA 2280 

GATCTAAGAA CAAGTACTTG CTGGAAAATT AGTTGGCTGG ACATTGATAA AATAATGCAT 2340 

TTATAACAAT TACATGTGTT TTTGGGAACA AGGAAAATTT CTCAAAAAAG AATATTTCAC 2400 

ACATCCCTTC TTTTGAATGG CCTCTTTGTG ACCAGCCAGA CCTCAGGTCT TCACTCTTTC 2460 

TTCTTTGTAA ACCATGTCAT GTGGAAAGAT TTCCTCAGTT AGTGAGCTTG TGTCTGCAAA 2520 

TTGATTTTGT TTGTAATGTA TTTTGATAGC AAATCATGCT GCATCTATAT CTTTTTCTTG 2580 

TTTGAGCTGT TACTACATTG TACATGGCAT GTGGGATCAA TTAAAAATTT GTTTTAAAAA 2640 
T 

Seq ID NO: 663 Protein sequence 
Protein Accession #: NP_005039

1 11 21 31 41 51 

I I I I I I 

MAGLGASLHV WGWLMLGSCL LARAQLDSDG TITIEEQIVL VLKAKVQCEL NITAQLQEGE 60

GNCFPEWDGL ICWPRGTVGK ISAVPCPPYI YDFNHKGVAF RHCNPNGTWD FMHSLNKTWA 120 

NYSDCLRFLQ PDISIGKQEF FERLYVMYTV GYSISFGSLA VAILIIGYFR RLHCTRNYIH 180 

MHLFVSFMLR ATSIFVKDRV VHAHIGVKEL ESLIMQDDPQ NSIEATSVDK SQYIGCKIAV 240 




VMFIYFLATN YYWILVEGLY LHNLIFVAFF SDTKYLWGFI LIGWGFPAAF VAAWAVARAT

LADARCWELS AGDIKWIYQA PILAAIGLNF ILFLNTVRVL ATKIWETNAV GHDTRKQYRK

LAKSTLVLVL VFGVHYIVFV CLPHSFTGLG WEIRMHCELF FNSFQGFFVS IIYCYCNGEV

QAEVKKMWSR WNLSVDWKRT PPCGSRRCGS VLTTVTHSTS SQSQVAASTR MVLISGKAAK

IASRQPDSHI TLPGYVWSNS EQDCLPHSFH EETKEDSGRQ GDDILMEKPS RPMESNPDTE
GCQGETEDVL

Seq ID NO: 664 DNA sequence
Nucleic Acid Accession #: NM_012152
Coding sequence: 43..1104



CTTCTTTAAA 
GACAAGCACA 
GGAACAAAGC 
TCTAATTCTC 
TACCTGTTGG 
ATGTTTAACA 
GGGCTTCTGG 
AGGCACATGT 
CTGCTCATTT 
TGGAATTGCC 
TACCTTGTTT 
CTGCGGATCT 
TCCATCAGCC 
GCGTTTGTGG 
AGGCAGTGTG 
GTCGTGAACC 
ATGATCTGCT 
GTCCTCAGCA 
GTCTGCAATA 
GTCTTAGG 



11 
I

TTTCTTTCTA 
TGGACTTTTT 
TTGTGATTGT 
TGGTCATCGC 
CTAATTTAGC 
CAGGCCCAGT 
ACAGTAGCTT 
CAATCATGAG 
TGCTTGTCTG 
TCTGCAACAT 
TCTGGACAGT 
ACGTGTACGT 
GCCGGAGGAC 
TATGCTGGAC 
GCGTGCAGCA 
CCATCATCTA 
GCTTCTCTCA 
GGAGTGACAC 
AAAGCACTTC 



21 
I 

GGATGTTCAC 
TTATAATAGG 
TTTGTGTGTT 
GGCAGTGATC 
TGCTGCCGAT 
TTCAAAAACT 
GACTGCTTCC 
GATGCGGGTC 
GGCCATCGCC 
CTCTGCCTGC 
GTCCAACCTC 
CAAGAGGAAA 
ACCCATGAAG 
CCCGGGCCTG 
TGTGAAAAGG 
CTCCTACAAG 
GGAGAACCCA 
AGGCAGCCAG 
CTAAACTCTG 



31 
I 

TTCTTCTCCA 
AGCAACACTG 
GGGACGTTTT 
AAAAACAGAA 
TTCTTCGCTG 
TTGACTGTCA 
CTCACCAACT 
CATAGCAACC 
ATTTTTATGG 

ATGGCCTTCC 
ACCAACGTCT 
CTAATGAAGA 
GTGGTTCTGC 
TGGTTCCTGC 
GACGAGGACA 
GAGAGGCGTC 
TACATAGAGG 
GATGCCTCTC 



41 

I 

CAATGAATGA 
ATACTGTOGA 
TCTGCCTGTT 
AATTTCATTT 
GAATTGCCTA 
ACCGCTGGTT 
TGCTGGTTAT 
TGACCAAAAA 
GGGCGGTCCC 
CCCCCATTTA 
TCATCATGGT 
TGTCTCCGCA 
CGGTGATGAC 
TCCTCGACGG 
TGCTGGCGCT 
TGTATGGCAC 
CCTCTCGCAT 
ATAGTATTAG 
GGCCCACCCA 



51 
I 

GTGTCACTAT 
TGACTGGACA 
TATTTTTTTT 
CCCCTTCTAC 
TGTATTCCTG 
TCTCCGTCAG 
CGCCGTGGAG 
GAGGGTGACA 
CACACTGGGC 
CAGCAGGAGT 
TGTGGTGTAC 
TACAAGTGGG 
TGTCTTAGGG 
CCTGAACTGC 
GCTCAACTCC 
CATGAAGAAG 
CCCCTCCACA 
CCAAGGTGCA 
GGTGATGACT 



Seq ID NO: 665 Protein sequence 
Protein Accession #: NP_036284



1
I 

MNECHYDKHM 
FHFPFYYLLA 
LVIAVERHMS 
PIYSRSYLVF 
VMTVLGAFVV
YGTMKKMICC 



11 

I

DFFYNRSNTD 
NLAAADFFAG 
IMRMRVHSNL 
WTVSNLMAFL 
CWTPGLVVLL
FSQENPERRP 



21 
I 

TVDDWTGTKL 
IAYVFLMFNT 
TKKRVTLLIL 
IMVVVYLRIY
LDGLNCRQCG 
SRIPSTVLSR 



31 
I 

VIVLCVGTFF 
GPVSKTLTVN 
LVWAIAIFMG 
VYVKRKTNVL 
VQHVKRWFLL 
SDTGSQYIED 



41 

I 

CLFIFFSNSL 
RWFLRQGLLD 
AVPTLGWNCL 
SPKTSGSISR 
LALLNSVVNP
SISQGAVCNK 



51 
I 

VIAAVIKNRK 
SSLTASLTNL 
CNISACSSLA 
RRTPMKLMKT 
IIYSYKDEDM 
STS 



Seq ID NO: 666 DNA sequence 
Nucleic Acid Accession #: NM_002821
Coding sequence: 150..3362



I 

AACTCCCGCC 
GCGCTCCGGT 
CCTCAGCTCC 
CCCGCCGGTT 
CCATTGTCTT 
TTCGCTGTGA 
CTGTCCAGGA 
ACCGGCTGCA 
AAGCCCGCAG 
TGAAGCATCC 
ACATTGATGG 
ATGGTCAGAG 
GTCCTGAGCA 
GCAGCCAGAA 
CCCAGGACGT 
AGCCACCCCC 
GCCCCCCACA 
TCCGGCCACG 
TCATCCTGGA 
GGGTGTTTAC 
AGCCCAGCGT 
AGAAGGGCCA 
GCCACGCGGC 
TGCCCTCCTG 
TGGATTGCCT 
TCATCTCAGA 
TGGAGGTGTA 
AGGCGCAAGC 
AGCAGTGCAT 
AGCCCACTAT 
ACGCTGGGAC 
TTGCCTCCAA 
TTATCACCTT 
TGCAGTGCGA 
TCCTGGACCC 



11 
I 

TCGGGACGCC 
GCGTCCGCCT 
TTTTCCTGAG 
GCCTCTGCTC 
CATCAAGCAG 
GGTTGAGGCT 
CACGGAGCGG 
GGACTCTGGC 
TGCCAACGCC 
AGCCTCGGAA 
GCACCCTCGG 
CAACCACACA 
TAGTGGGCTG 
CTTCACCTTG 
GGTAGTAGCG 
GAGCCTGCAG 
CCTCCGCAGA 
CAATGCAGGG 
AGCCACACTT 
AGCTGGCAGC 
GTGGTGGGAG 
CGAGCTGGTG 
CAACCTGGCT 
GCTGAAGAAG 
GACCCAGGCC 
GGACTCACGG 
TGATGGGACA 
CCGTGTCCAA 
GGAGTTTGAC 
TAAGTGGGAA 
CCTGCATTTT 
CGGGCCGCAG 
CAAAGTGGAA 
GGCCCAGGGG 
CACCAAGCTG 



21 
I 

TCGGGGTCGG 
CCTGTGCCCG 
CCCGCCGCGA 
AGCGTCCTGC 
CCGTCCTCCC 
CCGGGCCCGG 
CGTTTCGCCC 
ACCTTCCAGT 
TCCTTCAACA 
GCTGAGATCC 
CCCACCTACC 
GTCAGCAGCA 
TATTCCTGCT 
AGCATTGCTG 
AGGTATGAGG 
TGGCTCTTTG 
GCCACAGTGT 
ATCTACCGCT 
CACCTAGCAG 
GAGGAGCGTG 
CACGCGGGAG 
TTGGCCAATA 
GGTCAGCGGA 
CCCCAAGACA 
ACACCAAAAC 
TTCGAGGTCT 
TGGTACCGTT 
GTGCTGGAAA 
AAGGAGGCCA 
CGGGCAGATG 
GCCCGGGTGA 
GGCCAGATTC 
CCAGAGCGTA 
GACCCCAAGC 
GGACCCAGGA 



31 
I 

GCTCCGGCTG 
CCGCGGAGCA 
TGGGAGCTGC 
TGCTGCCGCT 
AGGATGCACT 
TACATGTGTA 
AGGGCAGCAG 
GTGTGGCTCG 
TCAAATGGAT 
AGCCACAGAC 
AATGGTTCCG 
AGGAGCGGAA 
GCGCCCACAG 
ATGAAAGCTT 
AGGCCATGTT 
AGGATGAGAC 
TTGCCAACGG 
GCATTGGCCA 
AGATTGAAGA 
TGACCTGCCT 
TCCGGCTGCC 
TTGCTGAAAG 
GACAGGATGT 
GCCAGCTGGA 
CTACAGTTGT 
TCAAGAATGG 
GTATGAGCAG 
AGCTCAAGTT 
CGGTGCCCTG 
GGAGCAGCCT 
CTCGAGATGA 
GTGCCCATGT 
CGACTGTGTA 
CGCTGATTCA 
TGCACATCTT 



41 
I 

CGGCTGCTGC 
GTCTGCGGCC 
GCGGGGATCC 
GCTGGGCGGT 
GCAGGGGCGC 
CTGGCTGCTC 
CCTGAGCTTT 
GGATGATGTC 
TGAGGCAGGT 
CCAGGTCACA 
AGATGGGACC 
CCTGACGCTC 
TGCTTTTGGC 
TGCCAGGGTG 
CCATTGCCAG 
TCCCATCACT 
GTCTCTGCTG 
GGGGCAGAGG 
CATGCCGCTA 
TCCCCCCAAG 
CACCCATGGC 
TGATGCTGGT 
CAACATCACT 
GGAGGGCAAA 
CTGGTACAGA 
GACCTTGCGC 
CACCCCAGCC 
CACACCACCA 
TTCAGCCACA 
CCCAGAGTGG 
CGCTGGCAAC 
CCAGCTCACT 
CCAGGGCCAC 
GTGGAAAGGC 
CCAGAATGGC 



51 
I 

TGCGGCGCCC 
CGCCGTGCGC 
CCGGCCAGAC 
ACCCAGACAG 
CGGGCGCTGC 
GATGGGGCCC 
GCAGCTGTGG 
ACTGGAGAAG 
CCTGTGGTCC 
CTTCGTTGCC 
CCCCTTTCTG 
CGGCCAGCTG 
CAGGCTTGCA 
GTGCTGGCAC 
TTCTCAGCCC 
AACCGCAGTC 
CTGACCCAGG 
GGCCCACCCA 
TTTGAGCCAC 
GGTCTGCCAG 
AGGGTCTACC 
GTCTACACCT 
GTGGCCACTG 
CCCGGCTACT 
AACCAGATGC 
ATCAACAGCG 
GGCAGCATCG 
CCCCAGCCAC 
GGCCGAGAGA 
GTGACAGACA 
TACACTTGCA 
GTGGCAGTTT 
ACAGCCCTAC 
AAGGACCGCA 
TCCCTGGTGA 



300 
360 
420 
480 
540 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 



60 
120 
180 
240 
300 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100
TCCATGACGT GGCCCCTGAG GACTCAGGCC GCTACACCTG CATTGCAGGC AACAGCTGCA 2160

ACATCAAGCA CACGGAGGCC CCCCTCTATG TCGTGGACAA GCCTGTGCCG GAGGAGTCGG 2220 

AGGGCCCTGG CAGCCCTCCC CCCTACAAGA TGATCCAGAC CATTGGGTTG TCGGTGGGTG 2280 

CCGCTGTGGC CTACATCATT GCOGTGCTGG GCCTCATGTT CTACTGCAAG AAGCGCTGCA 2340 

AAGCCAAGCG GCTGCAGAAG CAGCCCGAGG GCGAGGAGCC AGAGATGGAA TGCCTCAACG 2400 

GAGGGCCTTT GCAGAACGGG CAGCCCTCAG CAGAGATCCA AGAAGAAGTG GCCTTGACCA 2460 

GCTTGGGCTC CGGCCCCGCG GCCACCAACA AACGCCACAG CACAAGTGAT AAGATGCACT 2520 

TCCCACGGTC TAGCCTGCAG CCCATCACCA CGCTGGGGAA GAGTGAGTTT GGGGAGGTGT 2580 

TCCTGGCAAA GGCTCAGGGC TTGGAGGAGG GAGTGGCAGA GACCCTGGTA CTTGTGAAGA 2640 

GCCTGCAGAC GAAGGATGAG CAGCAGCAGC TGGACTTCCG GAGGGAGTTG GAGATGTTTG 2700 

GGAAGCTGAA CCACGCCAAC GTGGTGCGGC TCCTGGGGCT GTGCCGGGAG GCTGAGCCCC 2760 

ACTACATGGT GCTGGAATAT GTGGATCTGG GAGACCTCAA GCAGTTCCTG AGGATTTCCA 2820 

AGAGCAAGGA TGAAAAATTG AAGTCACAGC COCTCAGCAC CAAGCAGAAG GTGGCCCTAT 2880 

GCACCCAGGT AGCCCTGGGC ATGGAGCACC TGTCCAACAA CCGCTTTGTG CATAAGGACT 2940 

TGGCTGCGCG TAACTGCCTG GTCAGTGCCC AGAGACAAGT GAAGGTGTCT GCCCTGGGCC 3000 

TCAGCAAGGA TGTGTACAAC AGTGAGTACT ACCACTTCCG CCAGGCCTGG GTGCCGCTGC 3060 

GCTGGATGTC CCCCGAGGCC ATCCTGGAGG GTGACTTCTC TACCAAGTCT GATGTCTGGG 3120 

CCTTCGGTGT GCTGATGTGG GAAGTGTTTA CACATGGAGA GATGCCCCAT GGTGGGCAGG 3180 

CAGATGATGA AGTACTGGCA GATTTGCAGG CTGGGAAGGC TAGACTTCCT CAGCCCGAGG 3240 

GCTGCCCTTC CAAACTCTAT CGGCTGATGC AGCGCTGCTG GGCCCTCAGC CCCAAGGACC 3300 

GGCCCTCCTT CAGTGAGATT GCCAGCGCCC TGGGAGACAG CACCGTGGAC AGCAAGCCGT 3360 

GAGGAGGGAG CCCGCTCAGG ATGGCCTGGG CAGGGGAGGA CATCTCTAGA GGGAAGCTCA 3420 

CAGCATGATG GGCAAGATCC CTGTCCTCCT GGGCCCTGAG GTGCCCTAGT GCAACAGGCA 3480 

TTGCTGAGGT CTGAGCAGGG CCTGGCCTTT CCTCCTCTTC CTCACCCTCA TCCTTTGGGA 3540 

GGCTGACTTG GACCCAAACT GGGCGACTAG GGCTTTGAGC TGGGCAGTTT CCCCTGCCAC 3600 

CTCTTCCTCT ATCAGGGACA GTGTGGGTGC CACAGGTAAC CCCAATTTCT GGCCTTCAAC 3660 

TTCTCCCCTT GACCGGGTCC AACTCTGCCA CTCATCTGCC AACTTTGCCT GGGGAGGGCT 3720 

AGGCTTGGGA TGAGCTGGGT TTGTGGGGAG TTCCTTAATA TTCTCAAGTT CTGGGCACAC 3780 

AGGGTTAATG AGTCTCTTGC CCACTGGTCC ACTTGGGGGT CTAGACCAGG ATTATAGAGG 3840 

ACACAGCAAG TGAGTCCTCC CCACTCTGGG CTTGTGCACA CTGACCCAGA CCCACGTCTT 3900 

CCCCACCCTT CTCTCCTTTC CTCATCCTAA GTGCCTGGCA GATGAAGGAG TTTTCAGGAG 3960 

CTTTTGACAC TATATAAACC GCCCTTTTTG TATGCACCAC GGGCGGCTTT TATATGTAAT 4020 

TGCAGCGTGG GGTGGGTGGG CATGGGAGGT AGGGGTGGGC CCTGGAGATG AGGAGGGTGG 4080 

GCCATCCTTA CCCCACACTT TTATTGTTGT CGTTTTTTGT TTGTTTTGTT TTTTTGTTTT 4140 
TGTTTTTGTT TTTACACTCG CTGCTCTCAA TAAATAAGCC TTTTTTA 

Seq ID NO: 667 Protein sequence
Protein Accession #: NP_002812

1 11 21 31 41 51 

I I I I I I 

MGAARGSPAR PRRLPLLSVL LLPLLGGTQT AIVFIKQPSS QDALQGRRAL LRCEVEAPGP 60 

VHVYWLLDGA PVQDTERRFA QGSSLSFAAV DRLQDSGTFQ CVARDDVTGE EARSANASFN 120

IKWIEAGPVV LKHPASEAEI QPQTQVTLRC HIDGHPRPTY QWFRDGTPLS DGQSNHTVSS 180

KERNLTLRPA GPEHSGLYSC CAHSAFGQAC SSQNFTLSIA DESFARVVLA PQDVVVARYE 240

EAMFHCQFSA QPPPSLQWLF EDETPITNRS RPPHLRRATV FANGSLLLTQ VRPRNAGIYR 300 

CIGQGQRGPP IILEATLHLA EIEDMPLFEP RVFTAGSEER VTCLPPKGLP EPSVWWEHAG 360 

VRLPTHGRVY QKGHELVLAN IAESDAGVYT CHAANLAGQR RQDVNITVAT VPSWLKKPQD 420 

SQLEEGKPGY LDCLTQATPK PTVVWYRNQM LISEDSRFEV FKNGTLRINS VEVYDGTWYR 480

CMSSTPAGSI EAQARVQVLE KLKFTPPPQP QQCMEFDKEA TVPCSATGRE KPTIKWERAD 540 

GSSLPEWVTD NAGTLHFARV TRDDAGNYTC IASNGPQGQI RAHVQLTVAV FITFKVEPER 600 

TTVYQGHTAL LQCEAQGDPK PLIQWKGKDR ILDPTKLGPR MHIFQNGSLV IHDVAPEDSG 660 

RYTCIAGNSC NIKHTEAPLY VVDKPVPEES EGPGSPPPYK MIQTIGLSVG AAVAYIIAVL 720

GLMFYCKKRC KAKRLQKQPE GEEPEMECLN GGPLQNGQPS AEIQEEVALT SLGSGPAATN 780 

KRHSTSDKMH FPRSSLQPIT TLGKSEFGEV FLAKAQGLEE GVAETLVLVK SLQTKDEQQQ 840 

LDFRRELEMF GKLNHANVVR LLGLCREAEP HYMVLEYVDL GDLKQFLRIS KSKDEKLKSQ 900

PLSTKQKVAL CTQVALGMEH LSNNRFVHKD LAARNCLVSA QRQVKVSALG LSKDVYNSEY 960 

YHFRQAWVPL RWMSPEAILE GDFSTKSDVW AFGVLMWEVF THGEMPHGGQ ADDEVLADLQ 1020 
AGKARLPQPE GCPSKLYRLM QRCWALSPKD RPSFSEIASA LGDSTVDSKP 

Seq ID NO: 668 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 1..1389 

1 11 21 31 41 51 

I I I I I I 

ATGGGCTACC AGAGGCAGGA GCCTGTCATC CCGCCGCAGh GAGATTTAGA TGACAGAGAA 60 

ACCCTTGTTT CTGAACATGA GTATAAAGAG AAAACCTGTC AGTCTGCTGC TCTTTTTAAT 120 

GTTGTCAACT OGATTATAGG ATCTGGTATA ATAGGATTGC CTTATTCAAT GAAGCAAGCT 180 

GGGTTTCCTT TGGGAATATT GCTTTTATTC TGGGTTTCAT ATGTTACGGA CTTTTCCCTT 240

GTTTTATTGA TAAAAGGAGG GGCCCTCTCT GGAACAGATA CCTACCAGTC TTTGGTCAAT 300 

AAAACTTTCG GCTTTCCAGG GTATCTGCTC CTCTCTGTTC TTCAGTTTTT GTATCCTTTT 360 

ATAGCAATGA TAAGTTACAA TATAATAGCT GGAGATACTT TGAGCAAAGT TTTTCAAAGA 420 

ATCCCAGGAG TTGATCCTGA AAACGTGTTT ATTGGTCGCC ACTTCATTAT TGGACTTTCC 480 

ACAGTTACCT TTACTCTGCC TTTATCCTTG TACCGAAATA TAGCAAAGCT TGGAAAGGTC 540 

TCCCTCATCT CTACAGGTTT AACAACTCTG ATTCTTGGAA TTGTAATGGC AAGGGCAATT 600 

TCACTGGGTC CACACATACC AAAAACAGAA GACGCTTGGG TATTTGCAAA GCCCAATGCC 660 

ATTCAAGCGG TCGGGGTTAT GTCTTTTGCA TTTATTTGCC ACCATAACTC CTTCTTAGTT 720 

TACAGTTCTC TAGAAGAACC CACAGTAGCT AAGTGGTCCC GCCTTATCCA TATGTCCATC 780 

GTGATTTCTG TATTTATCTG TATATTCTTT GCTACATGTG GATACTTGAC ATTTACTGGC 840 

TTCACCCAAG GGGACTTATT TGAAAATTAC TGCAGAAATG ATGACCTGGT AACATTTGGA 900 

AGATTTTGTT ATGGTGTCAC TGTCATTTTG ACATACCCTA TGGAATGCTT TGTGACAAGA 960 

GAGGTAATTG CCAATGTGTT TTTTGGTGGG AATCTTTCAT CGGTTTTCCA CATTGTTGTA 1020 

ACAGTGATGG TCATCACTGT AGCCACGCTT GTGTCATTGC TGATTGATTG CCTCGGGATA 1080 

GTTCTAGAAC TCAATGGTGT GCTCTGTGCA ACTCCCCTCA TTTTTATCAT TCCATCAGCC 1140 

TGTTATCTGA AACTGTCTGA AGAACCAAGG ACACACTCCG ATAAGATTAT GTCTTGTGTC 1200 

ATGCTTCCCA TTGGTGCTGT GGTGATGGTT TTTGGATTCG TCATGGCTAT TACAAATACT 1260
CAAGACTGCA CCCATGGGCA GGAAATGTTC TACTGCTTTC CTGACAATTT CTCTCTCACA
AATACCTCAG AGTCTCATGT TCAGCAGACA ACACAACTTT CTACTTTAAA TATTAGTATC
TTTCAATGA

Seq ID NO: 669 Protein sequence
Protein Accession #: Eos sequence



MGYQRQEPVI 
GFPLGILLLF 
IAMISYNIIA 
SLISTGLTTL 
YSSLEEPTVA 
RFCYGVTVIL
VLELNGVLCA 
QDCTHGQEMF 



11 
I 

PPQRDLDDRE
WVSYVTDFSL 
GDTLSKVFQR 
ILGIVMARAI 
KWSRLIHMSI 
TYPMECFVTR 
TPLIFIIPSA 
YCFPDNFSLT 



21 
I 

TLVSEHEYKE 
VLLIKGGALS 
IPGVDPENVF 
SLGPHIPKTE 
VISVFICIFF
EVIANVFFGG 
CYLKLSEEPR 
NTSESHVQQT 



31 
I 

KTCQSAALFN 
GTDTYQSLVN 
IGRHFIIGLS 
DAWVFAKPNA 
ATCGYLTFTG 
NLSSVFHIVV
THSDKIMSCV 
TQLSTLNISI 



41 
I 

VVNSIIGSGI
KTFGFPGYLL
TVTFTLPLSL
IQAVGVMSFA
FTQGDLFENY
TVMVITVATL
MLPIGAVVMV
FQ 



51 
I 

IGLPYSMKQA 
LSVLQFLYPF 
YRNIAKLGKV 
FICHHNSFLV 
CRNDDLVTFG 
VSLLIDCLGI 
FGFVMAITNT 



Seq ID NO: 670 DNA sequence 

Nucleic Acid Accession #: Eos sequence

Coding sequence: 1..1284 



ATGGGCTACC 
AAGCAAGCTG 
TTTTCCCTTG 
TTGGTCAATA 
TATCCTTTTA 
TTTCAAAGAA 
GGACTTTCCA 
GGAAAGGTCT 
AGGGCAATTT 
CCCAATGCCA 
TTCTTAGTTT 
ATGTCCATCG 
TTTACTGGCT 
ACATTTGGAA 
GTGACAAGAG 
ATTGTTGTAA 
CTCGGGATAG 
CCATCAGCCT 
TCTTGTGTCA 
ACAAATACTC 
TCTCTCACAA 
ATTAGTATCT 



11 

I

AGAGGCAGGA 
GGTTTCCTTT 
TTTTATTGAT 
AAACTTTCGG 
TAGCAATGAT 
TCCCAGGAGT 
CAGTTACCTT 
CCCTCATCTC 
CACTGGGTCC 
TTCAAGCGGT 
ACAGTTCTCT 
TGATTTCTGT 
TCACCCAAGG 
GATTTTGTTA 
AGGTAATTGC 
CAGTGATGGT 
TTCTAGAACT 
GTTATCTGAA 
TGCTTCCCAT 
AAGACTGCAC 
ATACCTCAGA 
TTCAACTCGA 



21 
I 

GCCTGTCATC 
GGGAATATTG 
AAAAGGAGGG 
CTTTCCAGGG 
AAGTTACAAT 
TGATCCTGAA 
TACTCTGCCT 
TACAGGTTTA 
ACACATACCA 
CGGGGTTATG 
AGAAGAACCC 
ATTTATCTGT 
GGACTTATTT 
TGGTGTCACT 
CAATGTGTTT 
CATCACTGTA 
CAATGGTGTG 
ACTGTCTGAA 
TGGTGCTGTG 
CCATGGGCAG 
GTCTCATGTT 
GTAA 



31 
I 

CCGCCGCAGA 
CTTTTATTCT 
GCCCTCTCTG 
TATCTGCTCC 
ATAATAGCTG 
AACGTGTTTA 
TTATCCTTGT 
ACAACTCTGA 
AAAACAGAAG 
TCTTTTGCAT 
ACAGTAGCTA 
ATATTCTTTG 
GAAAATTACT 
GTCATTTTGA 
TTTGGTGGGA 
GCCACGCTTG 
CTCTGTGCAA 
GAACCAAGGA 
GTGATGGTTT 
GAAATGTTCT 
CAGCAGACAA 



41 

I 

GAGGATTGCC 
GGGTTTCATA 
GAACAGATAC 
TCTCTGTTCT 
GAGATACTTT 
TTGGTCGCCA 
ACCGAAATAT 
TTCTTGGAAT 
ACGCTTGGGT 
TTATTTGCCA 
AGTGGTCCCG 
CTACATGTGG 
GCAGAAATGA 
CATACCCTAT 
ATCTTTCATC 
TGTCATTGCT 
CTCCCCTCAT
CACACTCCGA 
TTGGATTCGT 
ACTGCTTTCC 
CACAACTTTC 



Seq ID NO: 671 Protein sequence 
Protein Accession #: Eos sequence



1 
I 

MGYQRQEPVI 
LVNKTFGFPG 
GLSTVTFTLP 
PNAIQAVGVM 
FTGFTQGDLF 
IVVTVMVITV
SCVMLPIGAV 
ISIFQLE 



11 
I 

PPQRGLPYSM 
YLLLSVLQFL 
LSLYRNIAKL 
SFAFICHHNS 
ENYCRNDDLV 
ATLVSLLIDC 
VMVFGFVMAI 



21 
I 

KQAGFPLGIL 
YPFIAMISYN 
GKVSLISTGL 
FLVYSSLEEP
TFGRFCYGVT 
LGIVLELNGV 
TNTQDCTHGQ 



31 
I 

LLFWVSYVTD 
IIAGDTLSKV
TTLILGIVMA 
TVAKWSRLIH 
VILTYPMECF 
LCATPLIFII 
EMFYCFPDNF 



41 
I

FSLVLLIKGG 
FQRIPGVDPE 
RAISLGPHIP 
MSIVISVFIC 
VTREVIANVF 
PSACYLKLSE 
SLTNTSESHV 



51 
I 

TTATTCAATG 
TGTTACAGAC 
CTACCAGTCT 
TCAGTTTTTG 
GAGCAAAGTT 
CTTCATTATT 
AGCAAAGCTT 
TGTAATGGCA 
ATTTGCAAAG 
CCATAACTCC 
CCTTATCCAT 
ATACTTGACA 
TGACCTGGTA 
GGAATGCTTT 
GGTTTTCCAC 
GATTGATTGC 
TTTTATCATT 
TAAGATTATG 
CATGGCTATT 
TGACAATTTC 
TACTTTAAAT 



51 
I 

ALSGTDTYQS 
NVFIGRHFII 
KTEDAWVFAK 
IFFATCGYLT 
FGGNLSSVFH 
EPRTHSDKIM 
QQTTQLSTLN



Seq ID NO: 672 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 1..1203 



1 
I 

ATGGGCTACC 
AAAGGAGGGG 
TTTCCAGGGT 
AGTTACAATA 
GATCCTGAAA 
ACTCTGCCTT 
ACAGGTTTAA 
CACATACCAA 
GGGGTTATGT 
GAAGAACCCA 
TTTATCTGTA 
GACTTATTTG 
GGTGTCACTG 
AATGTGTTTT 
ATCACTGTAG 
AATGGTGTGC 
CTGTCTGAAG 
GGTGCTGTGG 
CATGGGCAGG 
TCTCATGTTC 



11 
I 

AGAGGCAGGA 
CCCTCTCTGG 
ATCTGCTCCT 
TAATAGCTGG 
ACGTGTTTAT 
TATCCTTGTA 
CAACTCTGAT 
AAACAGAAGA 
CTTTTGCATT 
CAGTAGCTAA 
TATTCTTTGC 
AAAATTACTG 
TCATTTTGAC 
TTGGTGGGAA 
CCACGCTTGT 
TCTGTGCAAC 
AACCAAGGAC 
TGATGGTTTT 
AAATGTTCTA 
AGCAGACAAC 



21 
I 

GCCTGTCATC 
AACAGATACC 
CTCTGTTCTT 
AGATACTTTG 
TGGTCGCCAC 
CCGAAATATA 
TCTTGGAATT 
CGCTTGGGTA 
TATTTGCCAC 
GTGGTCCCGC 
TACATGTGGA 
CAGAAATGAT 
ATACCCTATG 
TCTTTCATCG 
GTCATTGCTG 
TCCCCTCATT 
ACACTCCGAT 
TGGATTCGTC 
CTGCTTTCCT 
ACAACTTTCT 



31 
I 

CCGCCGCAGT 
TACCAGTCTT 
CAGTTTTTGT 
AGCAAAGTTT 
TTCATTATTG 
GCAAAGCTTG 
GTAATGGCAA 
TTTGCAAAGC 
CATAACTCCT 
CTTATCCATA 
TACTTGACAT 
GACCTGGTAA 
GAATGCTTTG 
GTTTTCCACA 
ATTGATTGCC 
TTTATCATTC 
AAGATTATGT 
ATGGCTATTA 
GACAATTTCT 
ACTTTAAATA 



41 
I 

TTTCCCTTGT 
TGGTCAATAA 
ATCCTTTTAT 
TTCAAAGAAT 
GACTTTCCAC 
GAAAGGTCTC 
GGGCAATTTC 
CCAATGCCAT 
TCTTAGTTTA 
TGTCCATCGT 
TTACTGGCTT 
CATTTGGAAG 
TGACAAGAGA 
TTGTTGTAAC 
TCGGGATAGT 
CATCAGCCTG 
CTTGTGTCAT 
CAAATACTCA 
CTCTCACAAA 
TTAGTATCTT 



51 
I 

TTTATTGATA 
AACTTTCGGC 
AGCAATGATA 
CCCAGGAGTT
AGTTACCTTT 
CCTCATCTCT 
ACTGGGTCCA 
TCAAGCGGTC 
CAGTTCTCTA 
GATTTCTGTA 
CACCCAAGGG 
ATTTTGTTAT 
GGTAATTGCC 
AGTGATGGTC 
TCTAGAACTC 
TTATCTGAAA 
GCTTCCCATT 
AGACTGCACC 
TACCTCAGAG 
TCAACTCGAG 



60 
120 
180 
240 
300 
360 
420 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 



60 
120 
180 
240 
300 
360 
420 



60 
120 
180 
240 
300
420
480 
540 
600
720
780 
840 
900 
960 
1020 
1080 
1140 
1200

Seq ID NO: 673 Protein sequence
Protein Accession #: Eos sequence 

1 11 21 31 41 51 

I I I I I I

MGYQRQEPVI PPQFSLVLLI KGGALSGTDT YQSLVNKTFG FPGYLLLSVL QFLYPFIAMI 60 
SYNIIAGDTL SKVFQRIPGV DPENVFIGRH FIIGLSTVTF TLPLSLYRNI AKLGKVSLIS 120 
TGLTTLILGI VMARAISLGP HIPKTEDAWV FAKPNAIQAV GVMSFAFICH HNSFLVYSSL 180
EEPTVAKWSR LIHMSIVISV FICIFFATCG YLTFTGFTQG DLFENYCRND DLVTFGRFCY 240
GVTVILTYPM ECFVTREVIA NVFFGGNLSS VFHIVVTVMV ITVATLVSLL IDCLGIVLEL 300
NGVLCATPLI FIIPSACYLK LSEEPRTHSD KIMSCVMLPI GAVVMVFGFV MAITNTQDCT 360
HGQEMFYCFP DNFSLTNTSE SHVQQTTQLS TLNISIFQLE 

Seq ID NO: 674 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 1..1140 



1 11 21 31 41 51 

I I I I I I

ATGGGCTACC AGAGGCAGGA GCCTGTCATC CCGCCGCAGG TCAATAAAAC TTTCGGCTTT 60 

CCAGGGTATC TGCTCCTCTC TGTTCTTCAG TTTTTGTATC CTTTTATAGC AATGATAAGT 120 

TACAATATAA TAGCTGGAGA TACTTTGAGC AAAGTTTTTC AAAGAATCCC AGGAGTTGAT 180 

CCTGAAAACG TGTTTATTGG TCGCCACTTC ATTATTGGAC TTTCCACAGT TACCTTTACT 240

CTGCCTTTAT CCTTGTACCG AAATATAGCA AAGCTTGGAA AGGTCTCCCT CATCTCTACA 300 

GGTTTAACAA CTCTGATTCT TGGAATTGTA ATGGCAAGGG CAATTTCACT GGGTCCACAC 360

ATACCAAAAA CAGAAGACGC TTGGGTATTT GCAAAGCCCA ATGCCATTCA AGCGGTCGGG 420 

GTTATGTCTT TTGCATTTAT TTGCCACCAT AACTCCTTCT TAGTTTACAG TTCTCTAGAA 480 

GAACCCACAG TAGCTAAGTG GTCCCGCCTT ATCCATATGT CCATCGTGAT TTCTGTATTT 540

ATCTGTATAT TCTTTGCTAC ATGTGGATAC TTGACATTTA CTGGCTTCAC CCAAGGGGAC 600 

TTATTTGAAA ATTACTGCAG AAATGATGAC CTGGTAACAT TTGGAAGATT TTGTTATGGT 660 

GTCACTGTCA TTTTGACATA CCCTATGGAA TGCTTTGTGA CAAGAGAGGT AATTGCCAAT 720 

GTGTTTTTTG GTGGGAATCT TTCATCGGTT TTCCACATTG TTGTAACAGT GATGGTCATC 780 

ACTGTAGCCA CGCTTGTGTC ATTGCTGATT GATTGCCTCG GGATAGTTCT AGAACTCAAT 840

GGTGTGCTCT GTGCAACTCC CCTCATTTTT ATCATTCCAT CAGCCTGTTA TCTGAAACTG 900 

TCTGAAGAAC CAAGGACACA CTCCGATAAG ATTATGTCTT GTGTCATGCT TCCCATTGGT 960 

GCTGTGGTGA TGGTTTTTGG ATTCGTCATG GCTATTACAA ATACTCAAGA CTGCACCCAT 1020 

GGGCAGGAAA TGTTCTACTG CTTTCCTGAC AATTTCTCTC TCACAAATAC CTCAGAGTCT 1080 
CATGTTCAGC AGACAACACA ACTTTCTACT TTAAATATTA GTATCTTTCA ACTCGAGTAA

Seq ID NO: 675 Protein sequence 
Protein Accession #: Eos sequence 

1 11 21 31 41 51

I I I I I I

MGYQRQEPVI PPQVNKTFGF PGYLLLSVLQ FLYPFIAMIS YNIIAGDTLS KVFQRIPGVD 60 

PENVFIGRHF IIGLSTVTFT LPLSLYRNIA KLGKVSLIST GLTTLILGIV MARAISLGPH 120

IPKTEDAWVF AKPNAIQAVG VMSFAFICHH NSFLVYSSLE EPTVAKWSRL IHMSIVISVF 180 

ICIFFATCGY LTFTGFTQGD LFENYCRNDD LVTFGRFCYG VTVILTYPME CFVTREVIAN 240

VFFGGNLSSV FHIVVTVMVI TVATLVSLLI DCLGIVLELN GVLCATPLIF IIPSACYLKL 300

SEEPRTHSDK IMSCVMLPIG AVVMVFGFVM AITNTQDCTH GQEMFYCFPD NFSLTNTSES 360
HVQQTTQLST LNISIFQLE 

Seq ID NO: 676 DNA sequence

Nucleic Acid Accession #: NM_006853.1
Coding sequence: 26..874

1 11 21 31 41 51

I I I I I I

AGGAATCTGC GCTCGGGTTC CGCAGATGCA GAGGTTGAGG TGGCTGCGGG ACTGGAAGTC 60 

ATCGGGCAGA GGTCTCACAG CAGCCAAGGA ACCTGGGGCC CGCTCCTCCC CCCTCCAGGC 120 

CATGAGGATT CTGCAGTTAA TCCTGCTTGC TCTGGCAACA GGGCTTGTAG GGGGAGAGAC 180 

CAGGATCATC AAGGGGTTCG AGTGCAAGCC TCACTCCCAG CCCTGGCAGG CAGCCCTGTT 240 

CGAGAAGACG CGGCTACTCT GTGGGGCGAC GCTCATCGCC CCCAGATGGC TCCTGACAGC 300

AGCCCACTGC CTCAAGCCCC GCTACATAGT TCACCTGGGG CAGCACAACC TCCAGAAGGA 360 

GGAGGGCTGT GAGCAGACCC GGACAGCCAC TGAGTCCTTC CCCCACCCCG GCTTCAACAA 420 

CAGCCTCCCC AACAAAGACC ACCGCAATGA CATCATGCTG GTGAAGATGG CATCGCCAGT 480 

CTCCATCACC TGGGCTGTGC GACCCCTCAC CCTCTCCTCA CGCTGTGTCA CTGCTGGCAC 540 

CAGCTGCCTC ATTTCCGGCT GGGGCAGCAC GTCCAGCCCC CAGTTACGCC TGCCTCACAC 600

CTTGCGATGC GCCAACATCA CCATCATTGA GCACCAGAAG TGTGAGAACG CCTACCCCGG 660 

CAACATCACA GACACCATGG TGTGTGCCAG CGTGCAGGAA GGGGGCAAGG ACTCCTGCCA 720 

GGGTGACTCC GGGGGCCCTC TGGTCTGTAA CCAGTCTCTT CAAGGCATTA TCTCCTGGGG 780 

CCAGGATCCG TGTGCGATCA CCCGAAAGCC TGGTGTCTAC ACGAAAGTCT GCAAATATGT 840 

GGACTGGATC CAGGAGACGA TGAAGAACAA TTAGACTGGA CCCACCCACC ACAGCCCATC 900 

ACCCTCCATT TCCACTTGGT GTTTGGTTCC TGTTCACTCT GTTAATAAGA AACCCTAAGC 960 

CAAGACCCTC TACGAACATT CTTTGGGCCT CCTGGACTAC AGGAGATGCT GTCACTTAAT 1020 

AATCAACCTG GGGTTCGAAA TCAGTGAGAC CTGGATTCAA ATTCTGCCTT GAAATATTGT 1080 

GACTCTGGGA ATGACAACAC CTGGTTTGTT CTCTGTTGTA TCCCCAGCCC CAAAGACAGC 1140 
TCCTGGCCAT ATATCAAGGT TTCAATAAAT ATTTGCTAAA TGAGTG 



Seq ID NO: 677 Protein sequence 
Protein Accession #: NP_006844.1 

1 11 21 31 41 51

I I I I I I 

MRILQLILLA LATGLVGGET RIIKGFECKP HSQPWQAALF EKTRLLCGAT LIAPRWLLTA 60 




AHCLKPRYIV HLGQHNLQKE EGCEQTRTAT ESFPHPGFNN SLPNKDHRND IMLVKMASPV
SITWAVRPLT LSSRCVTAGT SCLISGWGST SSPQLRLPHT LRCANITIIE HQKCENAYPG
NITDTMVCAS VQEGGKDSCQ GDSGGPLVCN QSLQGIISWG QDPCAITRKP GVYTKVCKYV
DWIQETMKNN 

Seq ID NO: 678 DNA sequence 

Nucleic Acid Accession #: Eos sequence

Coding sequence: 1..933 






120 
180 
240 



ATGTGCAGCA 
TTCGACAAGA 
TTCCCCTGTG 
GACTGTCCCG 
GCCCGCTACC 
AATAACTGTC 
GGGCAGGTGT 
ATCATCGGCA 
CACCAGCGGA 
CTGCTGTCCC 
AATAATGGCA 
CCACCCTCCT 
CCGCCCTACT 
CGGTCCGGGA 
GACACCAGCC 
TCTGAGCCGA 



11 
I 

ATGGACGGTG 
GTGATGAGAA 
CCAGCGGCAT 
ATGGCAGCGA 
ACTGCAAGAA 
AAGACAACAG 
TTGTGACTTC 
GCTCOGTCAT 
AGCGGAACAA 
GCCTGGTGGT 
TCCAGTATGT 
ACTCCGAGGC 
CTTCTGACAC 
GTGCCAACAG 
ACAGCCCGGG 
GCCAGGGCAC 



21 
I 

CATCCCGGGC 
GGAGTGCCCC 
CCATTGCATC 
TGAAGAGAAC 
CGGCCTCTGT 
TGATGAGGAA 
AGAGAACCAA 
TTTTGTGCTG 
CCTCATGACG 
CCTGGACCAC 
GGCCAGCCAG 
CTTGCTGGAC 
GGAATCTCTG 
TGCCAGCTCC 
GCAGCCTGGC 
TGAAGAAGTA 



31 
I 

GCCTGGCAGT 
AAGGCTAAGT 
ATTGGTCGCT 
TGCACAGCAA 
ATTGACAAGA 
AGCTGTGAAA 
CTTGTGTATT 
GTGGTGGCCC 
CTGCCCGTGC 
CCCCACCACT 
GCGGAGCAGA 
CAGAGGCCTG 
AACCAAGCCG 
CAGGCAGCCA 
CCCCAGGAGG 
TAA 



41 

I 

GTGACGGGCT 
CGAAATGTGG 
TCCGGTGCAA 
ACCCTCTGCT 
GCTTCATCTG 
GTTCTCAAGA 
ACCCCAGCAT 
TGCTGGCACT 
ACCGGCTGCA 
GCAACGTCAC 
ATGCGTCGGA 
CGTGGTATGA 
ACCTGCCCCC 
GCAGCCTCCT 
GCACTGCTGA 



51 

GCCTGACTGC 
CCCGACCTTC 
TGGGTTTGAG 
TTGCTCCACC 
CGATGGACAG 
ACCCGGCAGT 
CACCTATGCC 
GGTCTTGCAC 
GCACCCTGTG 
CTACAACGTC 
AGTAGGCTCC 
CCTTCCTCCA 
CTACCGCTCC 
GAGCGTGGAA 
GCCCAGGGAC 



Seq ID NO: 679 Protein sequence
Protein Accession #: Eos sequence

1          11         21         31         41         51
|          |          |          |          |          |
MCSNGRCIPG AWQCDGLPDC PDKSDEKECP KAKSKCGPTF FPCASGIHCI IGRFRCNGFE 60
DCPDGSDEEN CTANPLLCST ARYHCKNGLC IDKSFICDGQ NNCQDNSDEE SCESSQEPGS 120
GQVFVTSENQ LVYYPSITYA IIGSSVIFVL VVALLALVLH HQRKRNNLMT LPVHRLQHPV 180
LLSRLVVLDH PHHCNVTYNV NNGIQYVASQ AEQNASEVGS PPSYSEALLD QRPAWYDLPP 240
PPYSSDTESL NQADLPPYRS RSGSANSASS QAASSLLSVE DTSHSPGQPG PQEGTAEPRD 300
SEPSQGTEEV



Seq ID NO: 680 DNA sequence
Nucleic Acid Accession #: S78203.1
Coding sequence: 1..2190

1          11         21         31         41         51
|          |          |          |          |          |
ATGAATCCTT TCCAGAAAAA TGAGTCCAAG GAAACTCTTT TTTCACCTGT CTCCATTGAA 60
GAGGTACCAC CTCGACCACC TAGCCCTCCA AAGAAGCCAT CTCCGACAAT CTGTGGCTCC 120
AACTATCCAC TGAGCATTGC CTTCATTGTG GTGAATGAAT TCTGCGAGCG CTTTTCCTAT 180
TATGGAATGA AAGCTGTGCT GATCCTGTAT TTCCTGTATT TCCTGCACTG GAATGAAGAT 240
ACCTCCACAT CTATATACCA TGCCTTCAGC AGCCTCTGTT ATTTTACTCC CATCCTGGGA 300
GCAGCCATTG CTGACTCGTG GTTGGGAAAA TTCAAGACAA TCATCTATCT CTCCTTGGTG 360
TATGTGCTTG GCCATGTGAT CAAGTCCTTG GGTGCCTTAC CAATACTGGG AGGACAAGTG 420
GTACACACAG TCCTATCATT GATCGGCCTG AGTCTAATAG CTTTGGGGAC AGGAGGCATC 480
AAACCCTGTG TGGCAGCTTT TGGTGGAGAC CAGTTTGAAG AAAAACATGC AGAGGAACGG 540
ACTAGATACT TCTCAGTCTT CTACCTGTCC ATCAATGCAG GGAGCTTGAT TTCTACATTT 600
ATCACACCCA TGCTGAGAGG AGATGTGCAA TGTTTTGGAG AAGACTGCTA TGCATTGGCT 660
TTTGGAGTTC CAGGACTGCT CATGGTAATT GCACTTGTTG TGTTTGCAAT GGGAAGCAAA 720
ATATACAATA AACCACCCCC TGAAGGAAAC ATAGTGGCTC AAGTTTTCAA ATGTATCTGG 780
TTTGCTATTT CCAATCGTTT CAAGAACCGT TCTGGAGACA TTCCAAAGCG ACAGCACTGG 840
CTAGACTGGG CAGCTGAGAA ATATCCAAAG CAGCTCATTA TGGATGTAAA GGCACTGACC 900
AGGGTACTAT TCCTTTATAT CCCATTGCCC ATGTTCTGGG CTCTTTTGGA TCAGCAGGGT 960
TCACGATGGA CTTTGCAAGC CATCAGGATG AATAGGAATT TGGGGTTTTT TGTGCTTCAG 1020
CCGGACCAGA TGCAGGTTCT AAATCCCTTT CTGGTTCTTA TCTTCATCCC GTTGTTTGAC 1080
TTTGTCATTT ATCGTCTGGT CTCCAAGTGT GGAATTAACT TCTCATCACT TAGGAAAATG 1140
GCTGTTGGTA TGATCCTAGC GTGCCTGGCA TTTGCAGTTG CGGCAGCTGT AGAGATAAAA 1200
ATAAATGAAA TGGCCCCAGC CCAGTCAGGT CCCCAGGAGG TTTTCCTACA AGTCTTGAAT 1260
CTGGCAGATG ATGAGGTGAA GGTGACAGTG GTGGGAAATG AAAACAATTC TCTGTTGATA 1320
GAGTCCATCA AATCCTTTCA GAAAACACCA CACTATTCCA AACTGCACCT GAAAACAAAA 1380
AGCCAGGATT TTCACTTCCA CCTGAAATAT CACAATTTGT CTCTCTACAC TGAGCATTCT 1440
GTGCAGGAGA AGAACTGGTA CAGTCTTGTC ATTCGTGAAG ATGGGAACAG TATCTCCAGC 1500
ATGATGGTAA AGGATACAGA AAGCAAAACA ACCAATGGGA TGACAACCGT GAGGTTTGTT 1560
AACACTTTGC ATAAAGATGT CAACATCTCC CTGAGTACAG ATACCTCTCT CAATGTTGGT 1620
GAAGACTATG GTGTGTCTGC TTATAGAACT GTGCAAAGAG GAGAATACCC TGCAGTGCAC 1680
TGTAGAACAG AAGATAAGAA CTTTTCTCTG AATTTGGGTC TTCTAGACTT TGGTGCAGCA 1740
TATCTGTTTG TTATTACTAA TAACACCAAT CAGGGTCTTC AGGCCTGGAA GATTGAAGAC 1800
ATTCCAGCCA ACAAAATGTC CATTGCGTGG CAGCTACCAC AATATGCCCT GGTTACAGCT 1860
GGGGAGGTCA TGTTCTCTGT CACAGGTCTT GAGTTTTCTT ATTCTCAGGC TCCCTCTAGC 1920
ATGAAATCTG TGCTCCAGGC AGCTTGGCTA TTGACAATTG CAGTTGGGAA TATCATCGTG 1980
CTTGTTGTGG CACAGTTCAG TGGCCTGGTA CAGTGGGCCG AATTCATTTT GTTTTCCTGC 2040
CTCCTGCTGG TGATCTGCCT GATCTTCTCC ATCATGGGCT ACTACTATGT TCCTGTAAAG 2100
ACAGAGGATA TGCGGGGTCC AGCAGATAAG CACATTCCTC ACATCCAGGG GAACATGATC 2160
AAACTAGAGA CCAAGAAGAC AAAACTCTGA



Seq ID NO: 681 Protein sequence
Protein Accession #: AAB34388.1

1          11         21         31         41         51
|          |          |          |          |          |
MNPFQKNESK ETLFSPVSIE EVPPRPPSPP KKPSPTICGS NYPLSIAFIV VNEFCERFSY 60
YGMKAVLILY FLYPLHWNED TSTSIYHAFS SLCYFTPILG AAXADSWLGK FKTIIYLSLV 120
YVLGHVIKSL GALPILGGQV VHTVLSLIGL SLIALGTGGI KPCVAAFGGD QFEEKHAEER 180
TRYFSVFYLS INAGSLISTF ITPMLRGDVQ CFGEDCYAIA FGVPGLLMVI ALVVFAMGSK 240
IYNKPPPEGN IVAQVFKCIW FAISNRFKNR SGDIPKRQHW LDWAAEKYPK QLIMDVKALT 300
RVLFLYIPLP MFWALLDQQG SRWTLQAIRM NRNLGFFVLQ PDQMQVLNPF LVLIFIPLFD 360
FVIYRLVSKC GINFSSLRKM AVGMILACLA FAVAAAVEIK INEMAPAQSG PQEVFLQVLN 420
LADDEVKVTV VGNENNSLLI ESIKSFQKTP HYSKLHLKTK SQDFHFHLKY HNLSLYTEHS 480
VQEKNWYSLV IREDGNSISS MMVKDTESKT TNGMTTVRFV NTLHKDVNIS LSTDTSLNVG 540
EDYGVSAYRT VQRGEYPAVH CRTEDKNFSL NLGLLDFGAA YLFVITNNTN QGLQAWKIED 600
IPANKMSIAW QLPQYALVTA GEVMFSVTGL EFSYSQAPSS MKSVLQAAWL LTIAVGNIIV 660
LVVAQFSGLV QWAEFILFSC LLLVICLIFS IMGYYYVPVK TEDMRGPADK HIPHIQGNMI 720
KLETKKTKL



Seq ID NO: 682 DNA sequence
Nucleic Acid Accession #: NM_016077.1
Coding sequence: 128..667

1          11         21         31         41         51
|          |          |          |          |          |
TCGCTTTGTG ATTCTTGATC CGGAACTTTG TCACCCAGGA ACCCCGGAAG AGGTAGCTCA 60
CGCGATAGAA ACGTGTTCGC TTGCCCAGAA GAAGGGAAGG CGCGAGTGAG GAAAGGAGGT 120
ACTGTAGATG CCCTCCAAAT CCTTGGTTAT GGAATATTTG GCTCATCCCA GTACACTCGG 180
CTTGGCTGTT GGAGTTGCTT GTGGCATGTG CCTGGGCTGG AGCCTTCGAG TATGCTTTGG 240
GATGCTCCCC AAAAGCAAGA CGAGCAAGAC ACACACAGAT ACTGAAAGTG AAGCAAGCAT 300
CTTGGGAGAC AGCGGGGAGT ACAAGATGAT TCTTGTGGTT CGAAATGACT TAAAGATGGG 360
AAAAGGGAAA GTGGCTGCCC AGTGCTCTCA TGCTGCTGTT TCAGCCTACA AGCAGATTCA 420
AAGAAGAAAT CCTGAAATGC TCAAACAATG GGAATACTGT GGCCAGCCCA AGGTGGTGGT 480
CAAAGCTCCT GATGAAGAAA CCCTGATTGC ATTATTGGCC CATGCAAAAA TGCTGGGACT 540
GACTGTAAGT TTAATTCAAG ATGCTGGACG TACTCAGATT GCACCAGGCT CTCAAACTGT 600
CCTAGGGATT GGGCCAGGAC CAGCAGACCT AATTGACAAA GTCACTGGTC ACGTAAAACT 660
TTACTAGGTG GACTTTGATA TGACAACAAC CCCTCCATCA CAAGTGTTTG AAGCCTGTCA 720
GATTCTAACA ACAAAAGCTG AATTTCTTCA CCCAACTTAA ATGTTCTTGA GATGAAAATA 780
AAACCTATTC CCATGTTCTA AAAAAA



Seq ID NO: 683 Protein sequence
Protein Accession #: NP_057161.1

1          11         21         31         41         51
|          |          |          |          |          |
MPSKSLVMEY LAHPSTLGLA VGVACGMCLG WSLRVCFGML PKSKTSKTHT DTESEASILG 60
DSGEYKMILV VRNDLKMGKG KVAAQCSHAA VSAYKQIQRR NPEMLKQWEY CGQPKVVVKA 120
PDEETLIALL AHAKMLGLTV SLIQDAGRTQ IAPGSQTVLG IGPGPADLID KVTGHLKLY

Seq ID NO: 684 DNA sequence
Nucleic Acid Accession #: NM_004864.1
Coding sequence: 26..952

1          11         21         31         41         51
|          |          |          |          |          |
CGGAACGAGG GCAACCTGCA CAGCCATGCC CGGGCAAGAA CTCAGGACGG TGAATGGCTC 60
TCAGATGCTC CTGGTGTTGC TGGTGCTCTC GTGGCTGCCG CATGGGGGCG CCCTGTCTCT 120
GGCCGAGGCG AGCCGCGCAA GTTTCCCGGG ACCCTCAGAG TTGCACTCCG AAGACTCCAG 180
ATTCCGAGAG TTGCGGAAAC GCTACGAGGA CCTGCTAACC AGGCTGCGGG CCAACCAGAG 240
CTGGGAAGAT TCGAACACCG ACCTCGTCCC GGCCCCTGCA GTCCGGATAC TCACGCCAGA 300
AGTGCGGCTG GGATCCGGCG GCCACCTGCA CCTGCGTATC TCTCGGGCCG CCCTTCCCGA 360
GGGGCTCCCC GAGGCCTCCC GCCTTCACCG GGCTCTGTTC CGGCTGTCCC CGACGGCGTC 420
AAGGTCGTGG GACGTGACAC GACCGCTGCG GCGTCAGCTC AGCCTTGCAA GACCCCAAGC 480
GCCCGCGCTG CACCTGCGAC TGTCGCCGCC GCCGTCGCAG TCGGACCAAC TGCTGGCAGA 540
ATCTTCGTCC GCACGGCCCC AGCTGGAGTT GCACTTGCGG CCGCAAGCCG CCAGGGGGCG 600
CCGCAGAGCG CGTGCGCGCA ACGGGGACGA CTGTCCGCTC GGGCCCGGGC GTTGCTGCCG 660
TCTGCACACG GTCCGCGCGT CGCTGGAAGA CCTGGGCTGG GCCGATTGGG TGCTGTCGCC 720
ACGGGAGGTG CAAGTGACCA TGTGCATCGG CGCGTGCCCG AGCCAGTTCC GGGCGGCAAA 780
CATGCACGCG CAGATCAAGA CGAGCCTGCA CCGCCTGAAG CCCGACACGG AGCCAGCGCC 840
CTGCTGCGTG CCCGCCAGCT ACAATCCCAT GGTGCTCATT CAAAAGACCG ACACCGGGGT 900
GTCGCTCCAG ACCTATGATG ACTTGTTAGC CAAAGACTGC CACTGCATAT GAGCAGTCCT 960
GGTCCTTCCA CTGTGCACCT GCGCGGGGGA GGCGACCTCA GTTGTCCTGC CCTGTGGAAT 1020
GGGCTCAAGG TTCCTGAGAC ACCCGATTCC TGCCCAAACA GCTGTATTTA TATAAGTCTG 1080
TTATTTATTA TTAATTTATT GGGGTGACCT TCTTGGGGAC TCGGGGGCTG GTCTGATGGA 1140
ACTGTGTATT TATTTAAAAC TCTGGTGATA AAAATAAAGC TGTCTGAACT GTTAAAAAAA 1200
AAAA



Seq ID NO: 685 Protein sequence
Protein Accession #: NP_004855.1

1          11         21         31         41         51
|          |          |          |          |          |
MPGQELRTVN GSQMLLVLLV LSWLPHGGAL SLAEASRASF PGPSELHSED SRFRELRKRY 60
EDLLTRLRAN QSWEDSNTDL VPAPAVRILT PEVRLGSGGH LHLRISRAAL PEGLPEASRL 120
HRALFRLSPT ASRSWDVTRP LRRQLSLARP QAPALHLRLS PPPSQSDQLL AESSSARPQL 180
ELHLRPQAAR GRRRARARNG DDCPLGPGRC CRLHTVRASL EDLGWADWVL SPREVQVTMC 240
IGACPSQFRA ANMHAQIKTS LHRLKPDTEP APCCVPASYN PMVLIQKTDT GVSLQTYDDL 300
LAKDCHCI



Seq ID NO: 686 DNA sequence 






Nucleic Acid Accession #: NM_002423.2
Coding sequence: 48..851

1          11         21         31         41         51
|          |          |          |          |          |

ACCAAATCAA CCATAGGTCC AAGAACAATT GTCTCTGGAC GGCAGCTATG CGACTCACCG 60 

TGCTGTGTGC TGTGTGCCTG CTGCCTGGCA GCCTGGCCCT GCCGCTGCCT CAGGAGGCGG 120 

GAGGCATGAG TGAGCTACAG TGGGAACAGG CTCAGGACTA TCTCAAGAGA TTTTATCTCT 180 

ATGACTCAGA AACAAAAAAT GCCAACAGTT TAGAAGCCAA ACTCAAGGAG ATGCAAAAAT 240 

TCTTTGGCCT ACCTATAACT GGAATGTTAA ACTCCCGCGT CATAGAAATA ATGCAGAAGC 300

CCAGATGTGG AGTGCCAGAT GTTGCAGAAT ACTCACTATT TCCAAATAGC CCAAAATGGA 360 

CTTCCAAAGT GGTCACCTAC AGGATCGTAT CATATACTOG AGACTTACCG CATATTACAG 420 

TGGATCGATT AGTGTCAAAG GCTTTAAACA TGTGGGGCAA AGAGATCCCC CTGCATTTCA 480 

GGAAAGTTGT ATGGGGAACT GCTGACATCA TGATTGGCTT TGCGCGAGGA GCTCATGGGG 540 

ACTCCTACCC ATTTGATGGG CCAGGAAACA CGCTGGCTCA TGCCTTTGCG CCTGGGACAG 600

GTCTCGGAGG AGATGCTCAC TTCGATGAGG ATGAACGCTG GACGGATGGT AGCAGTCTAG 660 

GGATTAACTT CCTGTATGCT GCAACTCATG AACTTGGCCA TTCTTTGGGT ATGGGACATT 720 

CCTCTGATCC TAATGCAGTG ATGTATCCAA CCTATGGAAA TGGAGATCCC CAAAATTTTA 780 

AACTTTCCCA GGATGATATT AAAGGCATTC AGAAACTATA TGGAAAGAGA AGTAATTCAA 840 

GAAAGAAATA GAAACTTCAG GCAGAACATC CATTCATTCA TTCATTGGAT TGTATATCAT 900

TGTTGCACAA TCAGAATTGA TAAGCACTGT TCCTCCACTC CATTTAGCAA TTATGTCACC 960 

CTTTTTTATT GCAGTTGGTT TTTGAATGTC TTTCACTCCT TTTATTGGTT AAACTCCTTT 1020 

ATGGTGTGAC TGTGTCTTAT TCCATCTATG AGCTTTGTCA GTGCGCGTAG ATGTCAATAA 1080 
ATGTTACATA CACAAATAAA TAAAATGTTT ATTCCATGGT AAATTTA 






Seq ID NO: 687 Protein sequence
Protein Accession #: NP_002414.1

1          11         21         31         41         51
|          |          |          |          |          |
MRLTVLCAVC LLPGSLALPL PQEAGGMSEL QWEQAQDYLK RPYLYDSETK NANSLEAKLK 60
EMQKFFGLPI TGMLNSRVIE IMQKPRCGVP DVAEYSLFPN SPKWTSKVVT YRIVSYTRDL 120
PHITVDRLVS KALNMWGKEI PLHFRKVVWG TADIMIGFAR GAHGDSYPFD GPGNTLAHAF 180
APGTGLGGDA HFDEDERWTD GSSLGINFLY AATHELGHSL GMGHSSDPNA VMYPTYGNGD 240
PQNFKLSQDD IKGIQKLYGK RSNSRKK



Seq ID NO: 688 DNA sequence 

Nucleic Acid Accession #: NM_005221.3
Coding sequence: 1..870

1          11         21         31         41         51
|          |          |          |          |          |

ATGACAGGAG TGTTTGACAG AAGGGTCCCC AGCATCCGAT CCGGCGACTT CCAAGCTCCG 60 

TTCCAGACGT CCGCAGCTAT GCACCATCCG TCTCAGGAAT CGCCAACTTT GCCCGAGTCT 120 

TCAGCTACCG ATTCTGACTA CTACAGCCCT ACGGGGGGAG CCCCGCACGG CTACTGCTCT 180 

CCTACCTCGG CTTCCTATGG CAAAGCTCTC AACCCCTACC AGTATCAGTA TCACGGCGTG 240 

AACGGCTCCG CCGGGAGCTA CCCAGCCAAA GCTTATGCCG ACTATAGCTA CGCTAGCTCC 300

TACCACCAGT ACGGCGGCGC CTACAACCGC GTCCCAAGCG CCACCAACCA GCCAGAGAAA 360 

GAAGTGACCG AGCCCGAGGT GAGAATGGTG AATGGCAAAC CAAAGAAAGT TCGTAAACCC 420 

AGGACTATTT ATTCCAGCTT TCAGCTGGCC GCATTACAGA GAAGGTTTCA GAAGACTCAG 480 

TACCTCGCCT TGCCGGAACG CGCCGAGCTG GCCGCCTCGC TGGGATTGAC ACAAACACAG 540 

GTGAAAATCT GGTTTCAGAA CAAAAGATCC AAGATCAAGA AGATCATGAA AAACGGGGAG 600 

ATGCCCCCGG AGCACAGTCC CAGCTCCAGC GACCCAATGG CGTGTAACTC GCCGCAGTCT 660 

CCAGCGGTGT GGGAGCCCCA GGGCTCGTCC CGCTCGCTCA GCCACCACCC TCATGCCCAC 720 

CCTCCGACCT CCAACCAGTC CCCAGCGTCC AGCTACCTGG AGAACTCTGC ATCCTGGTAC 780 

ACAAGTGCAG CCAGCTCAAT CAATTCCCAC CTGCCGCCGC CGGGCTCCTT ACAGCACCCG 840
CTGGCGCTGG CCTCCGGGAC ACTCTATTAG 

Seq ID NO: 689 Protein sequence 
Protein Accession #: NP_005212.1 

1          11         21         31         41         51
|          |          |          |          |          |

MTGVFDRRVP SIRSGDFQAP FQTSAAMHHP SQESPTLPES SATDSDYYSP TGGAPHGYCS 60 
PTSASYGKAL NPYQYQYHGV NGSAGSYPAK AYADYSYASS YHQYGGAYNR VPSATNQPEK 120 
EVTEPEVRMV NGKPKKVRKP RTIYSSFQLA ALQRRFQKTQ YLALPERAEL AASLGLTQTQ 180 
VKIWFQNKRS KIKKIMKNGE MPPEHSPSSS DPMACNSPQS PAVWEPQGSS RSLSHHPHAH 240 
PPTSNQSPAS SYLENSASWY TSAASSINSH LPPPGSLQHP LALASGTLY 
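
The "Coding sequence: M..N" entries in the listing above give 1-based, inclusive nucleotide positions, and each protein entry corresponds to the translation of that region of the preceding DNA entry. The following is a minimal sketch of that relationship, not part of the specification: it assumes the standard genetic code, and the helper name `translate_cds` is hypothetical. Codons containing an unreadable base are reported as X rather than guessed.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (an assumption;
# the listing itself does not name a translation table).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate_cds(dna: str, start: int, end: int) -> str:
    """Translate the 1-based, inclusive region start..end of a DNA entry."""
    seq = "".join(dna.split()).upper()  # drop the 10-base grouping spaces
    cds = seq[start - 1:end]
    protein = []
    for i in range(0, len(cds) - 2, 3):
        residue = CODON_TABLE.get(cds[i:i + 3], "X")  # X: unreadable codon
        if residue == "*":  # stop codon ends the protein
            break
        protein.append(residue)
    return "".join(protein)


# First row of Seq ID NO: 688 (Coding sequence: 1..870), positions 1..60.
first_row = "ATGACAGGAG TGTTTGACAG AAGGGTCCCC AGCATCCGAT CCGGCGACTT CCAAGCTCCG"
print(translate_cds(first_row, 1, 60))
```

Applied to the first row of Seq ID NO: 688, the sketch reproduces the first twenty residues listed for Seq ID NO: 689, which is one way to cross-check a DNA entry against its protein entry.
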

It is understood that the examples described above in no way serve to limit the true
scope of this invention, but rather are presented for illustrative purposes. All publications,
sequences of accession numbers, and patent applications cited in this specification are herein
incorporated by reference as if each individual publication or patent application were
specifically and individually indicated to be incorporated by reference.

WHAT IS CLAIMED IS:

1. A method of detecting a lung cancer-associated transcript in a cell
from a patient, the method comprising contacting a biological sample from the patient with a
polynucleotide that selectively hybridizes to a sequence at least 80% identical to a sequence
as shown in Tables 1A-16.

2. The method of claim 1, wherein the polynucleotide selectively
hybridizes to a sequence at least 95% identical to a sequence as shown in Tables 1A-16.

3. The method of claim 1, wherein the biological sample is a tissue
sample.

4. The method of claim 1, wherein the biological sample comprises
isolated nucleic acids.

5. The method of claim 4, wherein the nucleic acids are mRNA.

6. The method of claim 4, further comprising the step of amplifying
nucleic acids before the step of contacting the biological sample with the polynucleotide.

7. The method of claim 1, wherein the polynucleotide comprises a
sequence as shown in Tables 1A-16.

8. The method of claim 1, wherein the polynucleotide is labeled.

9. The method of claim 8, wherein the label is a fluorescent label.

10. The method of claim 1, wherein the polynucleotide is immobilized on
a solid surface.

11. The method of claim 1, wherein the patient is undergoing a therapeutic
regimen to treat lung cancer.

12. The method of claim 1, wherein the patient is suspected of having lung
cancer.

13. A method of monitoring the efficacy of a therapeutic treatment of lung
cancer, the method comprising the steps of:
(i) providing a biological sample from a patient undergoing the therapeutic
treatment; and
(ii) determining the level of a lung cancer-associated transcript in the
biological sample by contacting the biological sample with a polynucleotide that selectively
hybridizes to a sequence at least 80% identical to a sequence as shown in Tables 1A-16,
thereby monitoring the efficacy of the therapy.

14. The method of claim 13, further comprising the step of: (iii) comparing
the level of the lung cancer-associated transcript to a level of the lung cancer-associated
transcript in a biological sample from the patient prior to, or earlier in, the therapeutic
treatment.

15. The method of claim 13, wherein the patient is a human.

16. A method of monitoring the efficacy of a therapeutic treatment of lung
cancer, the method comprising the steps of:
(i) providing a biological sample from a patient undergoing the therapeutic
treatment; and
(ii) determining the level of a lung cancer-associated antibody in the biological
sample by contacting the biological sample with a polypeptide encoded by a polynucleotide
that selectively hybridizes to a sequence at least 80% identical to a sequence as shown in
Tables 1A-16, wherein the polypeptide specifically binds to the lung cancer-associated
antibody, thereby monitoring the efficacy of the therapy.

17. The method of claim 16, further comprising the step of: (iii) comparing
the level of the lung cancer-associated antibody to a level of the lung cancer-associated
antibody in a biological sample from the patient prior to, or earlier in, the therapeutic
treatment.

18. The method of claim 16, wherein the patient is a human.

19. A method of monitoring the efficacy of a therapeutic treatment of lung
cancer, the method comprising the steps of:
(i) providing a biological sample from a patient undergoing the therapeutic
treatment; and
(ii) determining the level of a lung cancer-associated polypeptide in the
biological sample by contacting the biological sample with an antibody, wherein the antibody
specifically binds to a polypeptide encoded by a polynucleotide that selectively hybridizes to
a sequence at least 80% identical to a sequence as shown in Tables 1A-16, thereby
monitoring the efficacy of the therapy.

20. The method of claim 19, further comprising the step of: (iii) comparing
the level of the lung cancer-associated polypeptide to a level of the lung cancer-associated
polypeptide in a biological sample from the patient prior to, or earlier in, the therapeutic
treatment.

21. The method of claim 19, wherein the patient is a human.

22. An isolated nucleic acid molecule consisting of a polynucleotide 
sequence as shown in Tables 1A-16. 

23. The nucleic acid molecule of claim 22, which is labeled. 

24. The nucleic acid of claim 23, wherein the label is a fluorescent label.

25. An expression vector comprising the nucleic acid of claim 22. 

26. A host cell comprising the expression vector of claim 25. 

27. An isolated polypeptide which is encoded by a nucleic acid molecule
having a polynucleotide sequence as shown in Tables 1A-16.

28. An antibody that specifically binds a polypeptide of claim 27. 

29. The antibody of claim 28, further conjugated to an effector component. 

30. The antibody of claim 29, wherein the effector component is a 
fluorescent label. 

31. The antibody of claim 29, wherein the effector component is a
radioisotope or a cytotoxic chemical. 

32. The antibody of claim 29, which is an antibody fragment. 
33. The antibody of claim 29, which is a humanized antibody.

34. A method of detecting a lung cancer cell in a biological sample from a
patient, the method comprising contacting the biological sample with an antibody of claim
28.

35. The method of claim 34, wherein the antibody is further conjugated to
an effector component.

36. The method of claim 35, wherein the effector component is a
fluorescent label.

37. A method of detecting antibodies specific to lung cancer in a patient,
the method comprising contacting a biological sample from the patient with a polypeptide
encoded by a nucleic acid that comprises a sequence from Tables 1A-16.

38. A method for identifying a compound that modulates a lung
cancer-associated polypeptide, the method comprising the steps of:
(i) contacting the compound with a lung cancer-associated polypeptide, the
polypeptide encoded by a polynucleotide that selectively hybridizes to a sequence at least
80% identical to a sequence as shown in Tables 1A-16; and
(ii) determining the functional effect of the compound upon the polypeptide.

39. The method of claim 38, wherein the functional effect is a physical
effect.

40. The method of claim 38, wherein the functional effect is a chemical
effect.

41. The method of claim 38, wherein the polypeptide is expressed in a
eukaryotic host cell or cell membrane.

42. The method of claim 38, wherein the functional effect is determined by
measuring ligand binding to the polypeptide.

43. The method of claim 38, wherein the polypeptide is recombinant.
1 43. The method of claim 38, wherein the polypeptide is recombinant. 

44. A method of inhibiting proliferation of a lung cancer-associated cell to
treat lung cancer in a patient, the method comprising the step of administering to the subject a
therapeutically effective amount of a compound identified using the method of claim 38.

45. The method of claim 44, wherein the compound is an antibody.

46. The method of claim 45, wherein the patient is a human.

47. A drug screening assay comprising the steps of:
(i) administering a test compound to a mammal having lung cancer or a cell
isolated therefrom;
(ii) comparing the level of gene expression of a polynucleotide that selectively
hybridizes to a sequence at least 80% identical to a sequence as shown in Tables 1A-16 in a
treated cell or mammal with the level of gene expression of the polynucleotide in a control
cell or mammal, wherein a test compound that modulates the level of expression of the
polynucleotide is a candidate for the treatment of lung cancer.

48. The assay of claim 47, wherein the control is a mammal with lung
cancer or a cell therefrom that has not been treated with the test compound.

49. The assay of claim 47, wherein the control is a normal cell or mammal.

50. A method for treating a mammal having lung cancer comprising
administering a compound identified by the assay of claim 47.

51. A pharmaceutical composition for treating a mammal having lung
cancer, the composition comprising a compound identified by the assay of claim 47 and a
physiologically acceptable excipient.
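
Several of the claims above recite a sequence "at least 80% identical" (or "at least 95% identical") to a listed sequence. As an illustration only, the sketch below counts matching columns in two sequences that have already been aligned to equal length, with "-" marking gaps. This column-count definition, and the function names `percent_identity` and `meets_threshold`, are assumptions made for the example; how identity is actually determined is governed by the definitions given in the description, which is not reproduced in this section.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent of aligned columns in which both sequences carry the same residue."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("sequences must be aligned to the same length")
    # Ignore columns that are gaps in both sequences.
    columns = [(a, b) for a, b in zip(aligned_a.upper(), aligned_b.upper())
               if not (a == "-" and b == "-")]
    if not columns:
        return 0.0
    matches = sum(1 for a, b in columns if a == b)
    return 100.0 * matches / len(columns)


def meets_threshold(aligned_a: str, aligned_b: str, threshold: float = 80.0) -> bool:
    """True when the pair reaches the stated percent-identity threshold."""
    return percent_identity(aligned_a, aligned_b) >= threshold


reference = "ATGACAGGAG"  # first ten bases of Seq ID NO: 688
candidate = "ATGACTGGAG"  # hypothetical variant with one substitution
print(percent_identity(reference, candidate))
print(meets_threshold(reference, candidate, 95.0))
```

In this toy case one mismatch in ten columns gives a pair that passes an 80% threshold but not a 95% threshold; real comparisons would first need an alignment step, which the sketch deliberately leaves out.
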